CAREER: Authorship Analysis in Cross-Domain Settings
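Basic Information
- Award number: 1462141
- Principal investigator:
- Amount: $469,600
- Host institution:
- Host institution country: United States
- Award type: Continuing Grant
- Fiscal year: 2014
- Funding country: United States
- Period: 2014-08-31 to 2019-12-31
- Status: Completed
- Source:
- Keywords:
Abstract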
Authorship Analysis (AA) is the task of extracting characteristics from written documents that can help to determine authorship of a document, generate a profile of the author, or identify cases of plagiarism. AA can be used for historical purposes, to settle disputes over the original creators of a given document, and to build a prosecution case against an online abuser. Most previous work in AA assumes the availability of samples with known authorship that closely match the domain of the documents of interest. A strong assumption like this one limits the applications of AA approaches. This program addresses this key outstanding challenge by designing robust frameworks for scenarios with different cross-domain degrees: cross-topic, cross-genre and cross-modality (text vs. transcribed speech). The project leverages the large amounts of free text available representing each cross-domain setting to learn general lexical and syntactic distributional correspondences. These correspondences are used to map the out-of-domain texts to a representation that is closer to the target domain. Direct contributions of this research include new approaches to extract and embed cross-domain prior knowledge into AA models in the form of distributional trajectories; and a solid understanding of the influence of topic, genre, and modality in the feature engineering process for AA that will also be helpful in other text processing tasks. This research will make direct contributions to the field of forensic linguistics, which is of major relevance for national security.
The PI will design an advanced seminar in computational approaches for forensic linguistics and will expand her ongoing educational and outreach activities for underrepresented groups in the STEM disciplines. The PI will integrate opportunities for international visits to key research labs for the graduate students involved in the program that will enrich their training and provide great networking opportunities.
Project Outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants by Thamar Solorio
IRES Track I: US-Mexico Collaboration on Multimodal Detection of Objectionable Content in Online Videos in Spanish and English
- Award number: 2106892
- Fiscal year: 2021
- Amount: $469,600
- Award type: Standard Grant
Workshop on desiderata for a multimodal dataset for objectionable content detection
- Award number: 2036368
- Fiscal year: 2020
- Amount: $469,600
- Award type: Standard Grant
RI: Small: Robust Models for Sequence Labelling in Social Media Data
- Award number: 1910192
- Fiscal year: 2019
- Amount: $469,600
- Award type: Standard Grant
CAREER: Authorship Analysis in Cross-Domain Settings
- Award number: 1350360
- Fiscal year: 2014
- Amount: $469,600
- Award type: Continuing Grant
HCC: Small: Collaborative Research: Analysis of Language Samples for Detecting Language Impairment in Monolingual and Bilingual Children
- Award number: 1462143
- Fiscal year: 2014
- Amount: $469,600
- Award type: Standard Grant
CI-ADDO-NEW: Collaborative Research: A Repository for Annotating Multilingual Code Switched Data
- Award number: 1462142
- Fiscal year: 2014
- Amount: $469,600
- Award type: Standard Grant
CI-ADDO-NEW: Collaborative Research: A Repository for Annotating Multilingual Code Switched Data
- Award number: 1205475
- Fiscal year: 2012
- Amount: $469,600
- Award type: Standard Grant
Collaborative Research: CI-P: Creation of an annotated repository of multilingual and multigenre code switched data for several language pairs
- Award number: 0958088
- Fiscal year: 2010
- Amount: $469,600
- Award type: Standard Grant
HCC: Small: Collaborative Research: Analysis of Language Samples for Detecting Language Impairment in Monolingual and Bilingual Children
- Award number: 1018124
- Fiscal year: 2010
- Amount: $469,600
- Award type: Standard Grant
Similar overseas grants
Development of Japanese Authorship Attribution System for Digital Forensics
- Award number: 23K11107
- Fiscal year: 2023
- Amount: $469,600
- Award type: Grant-in-Aid for Scientific Research (C)
Authorship and visual psychology of mammoth engravings from Magdalenian Gönnersdorf (Rhineland, Germany ~15,800 Before Present).
- Award number: 2884858
- Fiscal year: 2023
- Amount: $469,600
- Award type: Studentship
Approaching the historical background and authorship of "Sakuteiki", the oldest landscape book, using historical methods
- Award number: 23K13973
- Fiscal year: 2023
- Amount: $469,600
- Award type: Grant-in-Aid for Early-Career Scientists
Collaborative Research: HNDS-R: SBP: RUI: Differences in Co-authorship across a Global Landscape: The Role of Network Structure in Scientific Productivity
- Award number: 2318425
- Fiscal year: 2023
- Amount: $469,600
- Award type: Standard Grant
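大学生のSelf-authorshipを高めるサービスラーニング型野外教育カリキュラムの開発
Development of a Service-Learning Outdoor Education Curriculum to Enhance University Students' Self-Authorship
- Award number: 23K12802
- Fiscal year: 2023
- Amount: $469,600
- Award type: Grant-in-Aid for Early-Career Scientists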
Collaborative Research: HNDS-R: SBP: RUI: Differences in Co-authorship across a Global Landscape: The Role of Network Structure in Scientific Productivity
- Award number: 2318426
- Fiscal year: 2023
- Amount: $469,600
- Award type: Standard Grant
Working-class women: elegy, agency and authorship in narratives of the deindustrialised North.
- Award number: 2752491
- Fiscal year: 2023
- Amount: $469,600
- Award type: Studentship
Deep Learning for Cybersecurity: Assembly Code and Authorship Analytics
- Award number: RGPIN-2018-03872
- Fiscal year: 2022
- Amount: $469,600
- Award type: Discovery Grants Program - Individual
Integrated Ensemble Learning with Embedded Vectors in Authorship Attribution
- Award number: 22K12726
- Fiscal year: 2022
- Amount: $469,600
- Award type: Grant-in-Aid for Scientific Research (C)
Ethical Authorship: Decolonising the crew in British documentary practice, 2017-2020.
- Award number: 2745404
- Fiscal year: 2022
- Amount: $469,600
- Award type: Studentship