Construction of an Intelligent System for information Retrieval in an Environment of Information Network
信息网络环境下智能信息检索系统的构建
基本信息
- 批准号:09558041
- 负责人:
- 金额:$ 5.76万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (B)
- 财政年份:1998
- 资助国家:日本
- 起止时间:1998 至 1999
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The project aimed at realizing an intelligent system for information retrieval over the Internet, The main results obtained are as follows.(1)Basic principlesUse of 'key concepts', estimation of relevance of retrieved results, and spoken dialogue as the user interface, were adopted as basic principles.(2)Processing of unknown wordsIn order to conduct search based on key concepts, the system has to infer the concepts of key words that are not registered in the system's lexicon. On the basis of an extensive collection and classification of such 'unknown words', methods were developed for processing unknown words arising from variations of transcription as well as unknown compound words consisting of known morphemes.(3)Processing of polysemy and synonymyIn order to realize concept-based search, both polysemy and synonymy of keywords have to be coped with. As for polysemy, a method for disambiguation was developed on the basis of collocation information. As for synonymy, a method was developed to expand a keyword on the basis of its concept.(4)Relevance estimation of retrieved resultsA method was developed for estimating the degree of relevance of a document to a query on the basis of number and location of occurrence of keywords within a document. The estimation was optimized to maximize the correlation between the estimated relevance score and the actual score based on human judgment.(5)Dialogue management based on user and system modelingA method for dialogue management was developed on the basis of analysis of simulated dialogues between a user and the system. It adopts separate modeling of the user and the system, represented by two finite-state automata exchanging information mainly through their utterances.(6)Construction of a prototype systemA prototype system was constructed combining the above-mentioned results, and its validity was tested and confirmed experimentally.
本项目旨在实现一个智能化的Internet信息检索系统,取得的主要成果如下。(1)基本原则采用“关键概念”的使用、检索结果相关性的估计以及作为用户界面的口语对话作为基本原则。(2)未知词的处理为了进行基于关键概念的搜索,系统必须推断未在系统的词典中注册的关键词的概念。在广泛收集和分类的基础上,这些“未登录词”,方法被开发用于处理所产生的变化的转录以及未知的合成词由已知的语素。(3)一词多义和同义词的处理为了实现基于概念的检索,必须同时处理关键词的一词多义和同义词。对于一词多义,提出了一种基于搭配信息的排歧方法。对于同义关系,提出了一种在概念基础上扩展关键词的方法。(4)检索结果的相关性估计提出了一种基于关键词在文档中出现的数量和位置来估计文档与查询的相关程度的方法。对估计进行了优化,以最大化估计的相关性得分与基于人类判断的实际得分之间的相关性。(5)基于用户和系统建模的对话管理在分析用户与系统模拟对话的基础上,提出了一种对话管理方法。它采用了用户和系统的独立建模,由两个有限状态自动机表示,主要通过他们的话语交换信息。(6)原型系统的构建结合上述研究结果,构建了一个原型系统,并通过实验验证了其有效性。
项目成果
期刊论文数量(73)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
藤崎博也: "キー概念に基づく情報検索と検索結果の順位付けの検討"情報処理学会第57回全国大会講演論文集. 3. 249-250 (1998)
Hiroya Fujisaki:“基于关键概念和搜索结果排名的信息检索研究”第 57 届日本信息处理学会全国会议论文集 3. 249-250 (1998)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
阿部賢司: "学術情報検索における未知語処理"情報処理学会第58回全国大会講演論文集. 3. 135-136 (1999)
Kenji Abe:“学术信息检索中的未知字处理”日本信息处理学会第 58 届全国会议论文集 3. 135-136 (1999)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Hiroyuki Kameda: "A model of thought process in information retrieval and the design of intelligent information retrieval prototype system based on it"Technical report of IEICE. TL97-11. 1-8 (1997)
Hiroyuki Kameda:《信息检索思维过程模型及基于它的智能信息检索原型系统的设计》IEICE技术报告。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Hiroya Fujisaki: "Analysis of dialogues for the purpose of information retrieval"Proceedings of 5th Annual Convention of the Association for Natural Language Processing. 267-268 (1999)
Hiroya Fujisaki:“以信息检索为目的的对话分析”自然语言处理协会第五届年会论文集。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Kenji Abe: "A model of dialogue management for information retrieval and its evaluation"Proceedings of the 60th National Convention of the Information Processing Society of Japan. vol.2. 7-8 (2000)
Kenji Abe:《信息检索对话管理模型及其评估》第 60 届日本信息处理学会全国大会论文集。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
FUJISAKI Hiroya其他文献
FUJISAKI Hiroya的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('FUJISAKI Hiroya', 18)}}的其他基金
Automatic Estimation of Fundamental Frequency Contour Parameters and Automatic Acquisition of Generative rules
基频轮廓参数自动估计及生成规则自动获取
- 批准号:
11480090 - 财政年份:1999
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for Scientific Research (B).
A System for Rule Synthesis of Prosodic Features of Speech of Multiple Language Based on a Generative Model of Fundamental Frequency Contours
基于基频轮廓生成模型的多语言语音韵律特征规则综合系统
- 批准号:
08458090 - 财政年份:1996
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
International Coordination of Speech Databases, Prosodic Labeling, and Speech Input/Output Systems Assessment
语音数据库、韵律标记和语音输入/输出系统评估的国际协调
- 批准号:
08044173 - 财政年份:1996
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for international Scientific Research
Trial Construction of an Advanced Computer-readable Lexical Database Capable of Automatic Acquisition of Lexical Information
自动获取词汇信息的先进计算机可读词汇数据库的试建
- 批准号:
07558274 - 财政年份:1995
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
International Standardization of Spoken Language Detabases
口语数据库国际标准化
- 批准号:
05044112 - 财政年份:1993
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for international Scientific Research
Production of a Prototype Lexical Database Featuring High-speed, High-accuracy Access and Lexical Knowledge Acquisition
高速、高精度访问和词汇知识获取的原型词汇数据库的制作
- 批准号:
05558038 - 财政年份:1993
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for Developmental Scientific Research (B)
A scheme for continuous speech recognition in a large context based on the human process of spoken language recognition
基于人类口语识别过程的大上下文连续语音识别方案
- 批准号:
03452164 - 财政年份:1991
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for General Scientific Research (B)
Research on International Standardization of Spoken Language Database and Assessment Techniques for Speech Input/Output
口语数据库国际标准化及语音输入输出评估技术研究
- 批准号:
02044041 - 财政年份:1990
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for international Scientific Research
Co-operative Research on Modeling of Language Acquisition and Concept Formation Process in Engineering
工程中语言习得和概念形成过程建模的合作研究
- 批准号:
01300004 - 财政年份:1989
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for Co-operative Research (A)
Research on Synthesis Method for Spoken Sentences from Knowledge Representation
知识表示的口语句子合成方法研究
- 批准号:
63420051 - 财政年份:1988
- 资助金额:
$ 5.76万 - 项目类别:
Grant-in-Aid for General Scientific Research (A)