Production of a Prototype Lexical Database Featuring High-speed, High-accuracy Access and Lexical Knowledge Acquisition
高速、高精度访问和词汇知识获取的原型词汇数据库的制作
基本信息
- 批准号:05558038
- 负责人:
- 金额:$ 5.82万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Developmental Scientific Research (B)
- 财政年份:1993
- 资助国家:日本
- 起止时间:1993 至 1994
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
(1) Generation of lexical data : The data from the "Shin-Meikai Kokugo Jiten" (by Sanseido Publishing Co.) have been modified and expanded to generate the lexical data(approx. 170,000 words).(2) Construction of a word lexicon : A partial lexicon containing only the words used in a target domain has been constructed based on a model of the hierarchical structure of information.(3) Basic design of a database management system : The database management system consists of modules for the access, modification, expansion, acquisition, information structure management, and man-machine interface.(4) Implementation of the database management system : The database management system has been implemented on a workstation using the C language.(5) Design and implementation of detailed specifications of lexical data : Detailed specifications of the lexical data have been designed and the above-mentioned lexical data have been processed and implemented on a workstation.(6) Implementation and evaluation of the access module and the knowledge acquisition module : The access module and the knowledge acquisition module of the database management system have been implemented on a workstation using the C language and prolog.(7) Generation of the lexical database : The results of (5) and (6) have been integrated into a lexical database and its performance has been compared with that of a database using conventional access system both in the speed and the accuracy of access, confirming the advantages of the proposed system.(8) Test of usefulness of the lexical database : The proposed lexical database has been tested in the morpheme analysis of newspaper articles and weather reports, and the results have confirmed that the system has achieved the expected speed and accuracy of access as well as the capability of unknown word acquisition, demonstrating the validity of the proposed lexical database.
(1)词法数据的生成:对三生堂出版公司出版的《新meikai Kokugo Jiten》中的数据进行了修改和扩展,生成了约为1亿元的词法数据。170000个单词)。(2)词词典的构建:基于信息的层次结构模型,构建了只包含目标领域中使用的词的部分词典。(3)数据库管理系统的基本设计:数据库管理系统由访问、修改、扩展、采集、信息结构管理、人机界面等模块组成。(4)数据库管理系统的实现:使用C语言在一个工作站上实现了数据库管理系统。(5)词法数据详细规范的设计与实现:设计了词法数据的详细规范,并在工作站上对上述词法数据进行了处理与实现。(6)访问模块和知识获取模块的实现与评价:数据库管理系统的访问模块和知识获取模块在一个工作站上使用C语言和prolog实现。(7)词法数据库的生成:将(5)和(6)的结果集成到一个词法数据库中,并将其性能与使用传统访问系统的数据库在访问速度和准确性方面进行了比较,证实了所提出系统的优势。(8)词汇库的可用性测试:在报纸文章和天气预报的语素分析中对所提出的词汇库进行了测试,结果证实系统达到了预期的访问速度和准确性,以及未知词的获取能力,证明了所提出的词汇库的有效性。
项目成果
期刊论文数量(18)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
亀田弘之: "用例からの類推にもとづく知識の獲得と一般化について-未知複合語の獲得を中心にして-" 電子情報通信学会「言語・知識の獲得と運用」研究会資料. 1-8 (1993)
Hiroyuki Kameda:“论基于实例类比的知识获取和概括——关注未知复合词的获取——”IEICE《语言/知识获取与应用》研究组材料1-8(1993)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
YOKOTA,Kazuaki and FUJISAKI,Hiroya: "A Study on a Method of Text Analysis Based on Cognitive Units" Proceedings of 48th National Convention, the Information Processing Society of Japan. Vol.3. 69-70 (1994)
YOKOTA,Kazuaki 和 FUJISAKI,Hiroya:“基于认知单位的文本分析方法的研究”第 48 届全国代表大会论文集,日本信息处理学会。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
KAMEDA,Hiroyuki: "A Note on Acquisition and Generalization of Knowledge Based on Analogy Reasoning from Examples" Report of the Technical Committee on Acquisition and Utilization of Language and Knowledge, the Institute of Electronics, Information and Com
龟田弘之:“基于实例类比推理的知识获取和概括的注释”电子信息通信研究所语言和知识获取和利用技术委员会的报告
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
横田和幸: "認知単位を基本とする文解析手法の検討" 情報処理学会第48回(平成6年前期)全国大会講演論文集. 3. 60-70 (1994)
横田和之:《基于认知单位的句子分析方法的研究》第48届日本信息处理学会全国会议论文集(1994年上半年)3. 60-70(1994年)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
亀田 弘之: "統語解析処理にもとづく未知語獲得システムの試作" 電子情報通信学会総合大会基礎・境界部門講演論文集. 474 (1995)
Hiroyuki Kameda:“基于句法分析处理的未知词获取系统原型”IEICE大会基础/边界划分讲座论文集474(1995)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
FUJISAKI Hiroya其他文献
FUJISAKI Hiroya的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('FUJISAKI Hiroya', 18)}}的其他基金
Automatic Estimation of Fundamental Frequency Contour Parameters and Automatic Acquisition of Generative rules
基频轮廓参数自动估计及生成规则自动获取
- 批准号:
11480090 - 财政年份:1999
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for Scientific Research (B).
Construction of an Intelligent System for information Retrieval in an Environment of Information Network
信息网络环境下智能信息检索系统的构建
- 批准号:
09558041 - 财政年份:1998
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
A System for Rule Synthesis of Prosodic Features of Speech of Multiple Language Based on a Generative Model of Fundamental Frequency Contours
基于基频轮廓生成模型的多语言语音韵律特征规则综合系统
- 批准号:
08458090 - 财政年份:1996
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
International Coordination of Speech Databases, Prosodic Labeling, and Speech Input/Output Systems Assessment
语音数据库、韵律标记和语音输入/输出系统评估的国际协调
- 批准号:
08044173 - 财政年份:1996
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for international Scientific Research
Trial Construction of an Advanced Computer-readable Lexical Database Capable of Automatic Acquisition of Lexical Information
自动获取词汇信息的先进计算机可读词汇数据库的试建
- 批准号:
07558274 - 财政年份:1995
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
International Standardization of Spoken Language Detabases
口语数据库国际标准化
- 批准号:
05044112 - 财政年份:1993
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for international Scientific Research
A scheme for continuous speech recognition in a large context based on the human process of spoken language recognition
基于人类口语识别过程的大上下文连续语音识别方案
- 批准号:
03452164 - 财政年份:1991
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for General Scientific Research (B)
Research on International Standardization of Spoken Language Database and Assessment Techniques for Speech Input/Output
口语数据库国际标准化及语音输入输出评估技术研究
- 批准号:
02044041 - 财政年份:1990
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for international Scientific Research
Co-operative Research on Modeling of Language Acquisition and Concept Formation Process in Engineering
工程中语言习得和概念形成过程建模的合作研究
- 批准号:
01300004 - 财政年份:1989
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for Co-operative Research (A)
Research on Synthesis Method for Spoken Sentences from Knowledge Representation
知识表示的口语句子合成方法研究
- 批准号:
63420051 - 财政年份:1988
- 资助金额:
$ 5.82万 - 项目类别:
Grant-in-Aid for General Scientific Research (A)
相似海外基金
Incidental acquisition of lexical knowledge during reading in German as a foreign language
阅读德语作为外语时顺便获得词汇知识
- 批准号:
206828921 - 财政年份:2011
- 资助金额:
$ 5.82万 - 项目类别:
Research Grants
Using a word-learning paradigm to investigate three forms of generalisation in the acquisition of lexical knowledge
使用单词学习范式研究词汇知识获取中的三种泛化形式
- 批准号:
ES/H011730/1 - 财政年份:2010
- 资助金额:
$ 5.82万 - 项目类别:
Research Grant