Plain Japanese and Paraphrasing for Readability Enhancement
简单的日语和释义以增强可读性
基本信息
- 批准号:16200009
- 负责人:
- 金额:$ 31.12万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Scientific Research (A)
- 财政年份:2004
- 资助国家:日本
- 起止时间:2004 至 2007
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
1. We have compiled JC2.2, a basin vocabulary of Japanese, which contains 5,814 basic words and terms including 211 functional expressions and 177 idioms2. We have compiled a dictionary of Japanese functional expressions It contains 341 functional expressions and their 16,711 surface forms, which are hierarchically organized. By using this dictionary, we have implemented a paraphraser of Japanese functional expressions.3. We have compiled MUST1, an example database of Japanese compound functional expressions. Combining this database and a machine lemming method, we have implemented a detector of Japanese fimctional exressions4. We have compiled a textbook corpus which contains 1478 sample passages (about 1M characters) extracted from 127 textbooks of elementary school, junior high school, high school, and university. By using this textbook corpus as a criterion, we have implemented a readability analyzer of Japanese texts. For a given Japanese text, this analyzer produces one of 13 grades based on character unigram models constructed from the textbook corpus. The performance of this analyzer, which is measured by the correlation coefficient, is considerably high (R>0.9) .5. We studied a dynamic phrasal thesaurus, which generates phrasal synonyms for a given ph ram. We also studied application-oriented paraphrasing methods': paraphrasing of verbal noun phrases into compound nouns paraphrasing of verb phrases for easy understanding, transforming sentence ends into news headline style. We have proposed a new summarization method, summarization by analogy.6.We have released JC22, MUST1, and the readability analyzer to the public.
1.我们编制了日语词汇库JC2.2,它包含5,814个基本词汇和术语,其中包括211个功能表达和177个习语2。我们编制了一部日语函数式词典,它包含了341个函数式及其16,711个表面形式,它们是按层次组织的。利用该词典,我们实现了日语功能表达式的释义.我们已经编制了MUST1,日语复合函数表达式的示例数据库。结合这个数据库和机器学习的方法,我们已经实现了一个检测器的日本functional expressions4。本研究从小学、初中、高中、大学共127本教科书中抽取了1478个样本段落(约100万字),编制了一个教科书语料库。通过使用这个教科书语料库作为一个标准,我们已经实现了一个可读性分析器的日语文本。对于一个给定的日语文本,这个分析器产生的13个等级的基础上,从教科书语料库构建的字符单字模型。该分析仪的性能,这是衡量的相关系数,是相当高的(R>0.9). 5.我们研究了一个动态短语词库,它为给定的短语生成短语同义词。本文还研究了面向应用的释义方法:动词性名词短语释义为复合名词、动词性短语释义为便于理解、句尾转换为新闻标题风格。我们提出了一种新的摘要方法,类比摘要。6.我们已经发布了JC 22,MUST 1和可读性分析器。
项目成果
期刊论文数量(156)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Compilation of a Dictionary of Japanese Functional Expressions with Hierarchical Organization
日语层次结构功能用语辞典的编纂
- DOI:
- 发表时间:2006
- 期刊:
- 影响因子:0
- 作者:Suguru Matsuyoshi;Satoshi Sato;Takehito Utsuro
- 通讯作者:Takehito Utsuro
Automatic Detection of Japanese Compound Functional Expressions and Its Application to Statistical Dependency Analysis
日语复合功能表达的自动检测及其在统计依存分析中的应用
- DOI:
- 发表时间:2007
- 期刊:
- 影响因子:0
- 作者:Takao;Shime;Masatoshi;Tsuchiya;et. al.
- 通讯作者:et. al.
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
SATO Satoshi其他文献
SATO Satoshi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('SATO Satoshi', 18)}}的其他基金
Building a Japanese-language Class Model in Fukushima Following the Great East Japan Earthquake
东日本大地震后在福岛建立日语课堂模型
- 批准号:
15K04403 - 财政年份:2015
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Cross-Language Headword-Search by using Non-Productive Machine Translation
使用非生产性机器翻译进行跨语言词条搜索
- 批准号:
22650047 - 财政年份:2010
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Technology for Automatic Dictionary Compilation
词典自动编纂技术
- 批准号:
21300094 - 财政年份:2009
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Investigation of synthesis route of unsaturated alcohols in the selective dehydration of diols
二醇选择性脱水不饱和醇合成路线研究
- 批准号:
16360398 - 财政年份:2004
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
A Study of Automated Editing Technologies and their Application to Machine Translation
自动编辑技术及其在机器翻译中的应用研究
- 批准号:
13480097 - 财政年份:2001
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Thermal Decomposition, Organic-Inorganic Complex in Solid Catalyst Preparation
固体催化剂制备中的热分解、有机-无机复合物
- 批准号:
12650775 - 财政年份:2000
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Study on the buildup principle of the plasma membrane signaling domains.
质膜信号结构域构建原理的研究。
- 批准号:
11680658 - 财政年份:1999
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Example-Based Translation on Massivcly Parallel Computers
大规模并行计算机上基于示例的翻译
- 批准号:
05680292 - 财政年份:1993
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for General Scientific Research (C)
相似海外基金
A Historical and Quantitative Study of Compound Functional Expressions and Collocations
复合功能表达和搭配的历史和定量研究
- 批准号:
19K13204 - 财政年份:2019
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Acquisition and Usage Analysis of Compound Functional Expressions for a Japanese Writing Support System
日语写作支持系统复合函数式的获取与使用分析
- 批准号:
18K00703 - 财政年份:2018
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A Study of Compound Functional Expressions and Collocations in Medieval and Early Modern Japanese Based on the Corpus of Historical Japanese
基于历史日语语料库的中世纪和近代早期日语复合功能表达和搭配研究
- 批准号:
16K16850 - 财政年份:2016
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
Functional expressions of mammalian vomeronasal receptors for screening pheromones in transgenic Caenorhabditis elegans
哺乳动物犁鼻受体在转基因秀丽隐杆线虫信息素筛选中的功能表达
- 批准号:
26660255 - 财政年份:2014
- 资助金额:
$ 31.12万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research