Creation and Evaluation of Tacit Knowledge Based on Semantic Primes
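Basic information
- Grant number: 22K12160
- Principal investigator:
- Amount: $20,800
- Host institution:
- Host institution country: Japan
- Project category: Grant-in-Aid for Scientific Research (C)
- Fiscal year: 2022
- Funding country: Japan
- Project period: 2022-04-01 to 2025-03-31
- Project status: Ongoing
- Source:
- Keywords:
Project abstract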
During the first year, I concentrated on developing the first set of tacit knowledge for further experiments. I used the limited number of semantic primes to address the question "which types of cognitive functionality should an agent possess before exploring the environment and understanding or learning about the world?". Inspired by the exponents grouped into related categories of semantic primitives and their Japanese translations, I created a list of 23 types of perception-related prompts. After a series of preliminary experiments and annotator tests, I simplified, aggregated, and made some prompts more specific to make the annotation shorter and easier.
To prepare the final dataset, I used Japanese sentences from the previous project. Designed for detecting changes in danger level across slightly different contexts, that dataset comprises short sentence pairs such as "child eats a soap" and "child eats a soap-shaped candy". I created a program that generates prompts querying agents, patients, and acts in the context of semantic primes, and hired 66 annotators to choose answers about each sentence.
The final gold set for future experiments consists of 62,687 annotated sentence-prompt-choice triples and is described in detail in an international conference paper that is currently under review. Experimental results show that, although agreement between annotators was high, classic language models such as BERT and RoBERTa performed poorly on the task of recognizing cognitive perception-related questions based on semantic primes.
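The prompt-generation step described in the abstract can be pictured with a minimal Python sketch. It is not the project's actual program: the abstract does not list the 23 prompt types or the answer options, so the two templates, the role names, the `CHOICES` tuple, and the helpers `generate_prompts` and `to_triple` are all hypothetical placeholders. Only the two example sentences come from the abstract.

```python
from dataclasses import dataclass
from itertools import product

# Hypothetical templates: the real 23 prompt types are not given in the abstract.
PROMPT_TEMPLATES = {
    "SEE": "Can the {role} see something in this situation?",
    "FEEL": "Does the {role} feel something in this situation?",
}
ROLES = ("agent", "patient")             # assumed roles a prompt can ask about
CHOICES = ("yes", "no", "cannot tell")   # assumed answer options for annotators


@dataclass(frozen=True)
class PromptItem:
    sentence: str
    prime: str
    role: str
    prompt: str


def generate_prompts(sentences):
    """Build one prompt per (sentence, prime, role) combination."""
    items = []
    for sentence, (prime, template), role in product(
        sentences, PROMPT_TEMPLATES.items(), ROLES
    ):
        items.append(PromptItem(sentence, prime, role, template.format(role=role)))
    return items


def to_triple(item, choice):
    """Pair a prompt with an annotator's choice: a sentence-prompt-choice triple."""
    if choice not in CHOICES:
        raise ValueError(f"unknown choice: {choice}")
    return (item.sentence, item.prompt, choice)


# Example sentence pair quoted in the abstract.
sentences = ["child eats a soap", "child eats a soap-shaped candy"]
items = generate_prompts(sentences)
```

With two sentences, two templates, and two roles, `items` holds eight prompts; calling `to_triple(items[0], "yes")` would give one annotated triple of the kind the gold set is said to contain.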
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by RZEPKA Rafal
Similar overseas grants
Reflectance Estimation and Dataset construction using multiple light sources
- Grant number: 15K06076
- Fiscal year: 2015
- Funding amount: $20,800
- Project category: Grant-in-Aid for Scientific Research (C)