Segmentation of oral corpora
口腔语料库分割
基本信息
- 批准号:281693063
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:德国
- 项目类别:Research Grants
- 财政年份:2016
- 资助国家:德国
- 起止时间:2015-12-31 至 2019-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
A great variety of segmentation principles for oral language have been proposed since the beginning of research on talk-in-interaction. However, we still lack a segmentation system that is both theoretically well-founded and practically operationalizable for large and diverse corpora of spoken interaction, and this impairs the use of such corpora for linguistic analysis, for language teaching, for contrastive studies and for the development of language technology. The project has therefore set itself the aim to develop a method of segmentation that is adequate for the analysis of data from talk-in-interaction at different levels and for various communities of researchers. It evaluates and further develops approaches to segmentation put forward in the literature on conversation analysis, interactional linguistics, pragmatics and corpus linguistics by applying them to samples from three large collections of French and German audio and video recordings of various interaction types (the databases CLAPI, ESLO and FOLK, respectively). The project will result in a systematic segmentation guideline applicable across different interaction types and to French as well as German data.The project is the first approach to segmentation that is both based on comprehensive data treatment of a sufficiently large and diverse empirical basis and takes into account the cross-linguistic dimension. The results will improve the usability of the three databases, contribute to best practices for the work with oral corpora on a more general level, and enhance our understanding of structures of talk-in-interaction. The project will thus address current needs in conversation analysis, corpus-based language teaching, contrastive analysis of spoken German and French and in the development of language technology for interaction data.Methodologically, the project is based on two different perspectives: 1) a qualitative, multidimensional approach which takes into account segmentation indices, problems and criteria and leads to tested and improved segmentation guidelines and 2) a quantitative, unidimensional approach based on selected criteria where possible boundaries are automatically identified and classified by human annotators according to their relevance for segmentation. Both approaches initially use a pilot test corpus of 10 excerpts of around 10 minutes each for each language which represents the overall data diversity in terms of situation types. Over the course of the project, the corpus will be extended to 5 hours for each language and takes into account findings from the initial phase. From the beginning of the project, contrastive aspects will be considered particularly.
自交互式会话研究开始以来,人们提出了各种各样的口语切分原则。然而,我们仍然缺乏一个分割系统,这是既有理论依据和实践操作性的大型和多样化的语料库的口语互动,这妨碍了使用这样的语料库的语言分析,语言教学,对比研究和语言技术的发展。因此,该项目的目标是开发一种分割方法,适用于不同级别和不同研究人员社区的互动对话数据分析。它评估和进一步发展的方法,分段提出的文献中的会话分析,interminglinguistics,语用学和语料库语言学,通过将它们应用到样本从三个大集合的法国和德国的音频和视频记录的各种互动类型(数据库CLAPI,ESLO和FOLK,分别)。该项目将产生一个系统的分割准则,适用于不同的互动类型和法语以及德语数据,该项目是第一个分割方法,既基于对足够大和多样化的经验基础的全面数据处理,又考虑到跨语言层面。研究结果将提高三个数据库的可用性,有助于在更一般的水平上与口语语料库的工作的最佳实践,并提高我们的理解的结构的交谈互动。因此,该项目将满足当前在对话分析、基于语料库的语言教学、德语和法语口语对比分析以及开发交互数据语言技术方面的需求。1)考虑到细分指数的定性、多维方法,问题和标准,并导致测试和改进的分割准则和2)一个定量的,一维的方法,基于选定的标准,其中可能的边界自动识别和分类的人类注释根据其相关性的分割。这两种方法最初都使用一个试点测试语料库,每个语料库包含10个摘录,每个摘录大约10分钟,代表了情况类型的总体数据多样性。在项目实施过程中,每种语文的语料库将延长至5小时,并考虑到初始阶段的调查结果。从项目开始,对比方面将特别考虑。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Dr. Thomas Schmidt其他文献
Dr. Thomas Schmidt的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Dr. Thomas Schmidt', 18)}}的其他基金
Literatur und Leibesübungen - von Winckelmann bis zum Nachmärz. Untersuchungen zu einem abgedunkelten Kapitel Kulturgeschichte
文学和体育锻炼——从温克尔曼到三月后时期。
- 批准号:
5405550 - 财政年份:2003
- 资助金额:
-- - 项目类别:
Research Fellowships
The calendar and the consequences. Uwe Johnson´s novel "Jahrestage". On the problem of Collective Memory.
日历和后果。
- 批准号:
5238612 - 财政年份:2000
- 资助金额:
-- - 项目类别:
Publication Grants
相似国自然基金
牙周炎对腹主动脉瘤的作用和机制研究
- 批准号:82370953
- 批准年份:2023
- 资助金额:48.00 万元
- 项目类别:面上项目
紧密连接蛋白PARD3下调介导黏膜上皮屏障破坏激活STAT3/SNAI2通路促进口腔白斑病形成及进展的机制研究
- 批准号:82370954
- 批准年份:2023
- 资助金额:47.00 万元
- 项目类别:面上项目
肠上皮内γδT细胞诱导抗原特异性Treg的体内机制及其对肾移植慢性排斥的抑制作用研究
- 批准号:81170693
- 批准年份:2011
- 资助金额:60.0 万元
- 项目类别:面上项目
口服MHC肽诱导的受体Treg源性exosome干预肾移植排斥反应的机制研究
- 批准号:30872580
- 批准年份:2008
- 资助金额:31.0 万元
- 项目类别:面上项目
弓形虫MAG嵌合型类病毒颗粒转基因植物快速高效表达技术平台的建立及其动物口服免疫机制的探索
- 批准号:30872204
- 批准年份:2008
- 资助金额:33.0 万元
- 项目类别:面上项目
基于生命节律的数字化口服给药系统及方法的研究
- 批准号:30700160
- 批准年份:2007
- 资助金额:16.0 万元
- 项目类别:青年科学基金项目
相似海外基金
Phase Ib/II study of safety and efficacy of EZH2 inhibitor, tazemetostat, and PD-1 blockade for treatment of advanced non-small cell lung cancer
EZH2 抑制剂、他泽美司他和 PD-1 阻断治疗晚期非小细胞肺癌的安全性和有效性的 Ib/II 期研究
- 批准号:
10481965 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Oral bacterium Streptococcus mutans promotes tumor metastasis via thrombosis formation
口腔细菌变形链球菌通过血栓形成促进肿瘤转移
- 批准号:
24K19985 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Early-Career Scientists
Greatwall in replication stress/DNA damage responses and oral cancer resistance
长城在复制应激/DNA损伤反应和口腔癌抵抗中的作用
- 批准号:
10991546 - 财政年份:2024
- 资助金额:
-- - 项目类别:
ICF: The development of a chemotherapeutic containing mucoadhesive patch for the treatment of oral epithelial dysplasia
ICF:开发含有粘膜粘附贴剂的化疗药物,用于治疗口腔上皮发育不良
- 批准号:
MR/Y000234/1 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Fellowship
Mixed-methods Digital Oral History: Enfolding semantic web technologies and historical-interpretative analysis
混合方法数字口述历史:包含语义网络技术和历史解释分析
- 批准号:
AH/Y007557/1 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Research Grant
Understanding how exocrine-derived signals promote beta cell growth
了解外分泌信号如何促进 β 细胞生长
- 批准号:
10750765 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Development of a novel oral vaccine for fish: Synergy of chitosan nano particle and complement-mediated opsonization
新型鱼类口服疫苗的开发:壳聚糖纳米颗粒与补体介导的调理作用的协同作用
- 批准号:
24K17960 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Early-Career Scientists
EnamExcel: Iteratively develop the world's first varnish that regenerates and regrows human enamel and transforms global oral health.
EnamExcel:迭代开发世界上第一个能够再生和再生人类牙釉质并改变全球口腔健康的清漆。
- 批准号:
10094179 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Collaborative R&D
Engineering of Extracellular Vesicles for Oral Delivery of Nucleic Acid Therapies
用于核酸治疗口服递送的细胞外囊泡工程
- 批准号:
BB/Y008065/1 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Research Grant
I-Corps: Developing Oral Microbiome Transplantation to Transform Oral Health Therapies
I-Corps:开发口腔微生物组移植以改变口腔健康疗法
- 批准号:
2409330 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant