RI: Small: DaRE: Detection and Recognition of Euphemisms

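Basic information

Award number:
2226006
Principal investigator:
Anna Feldman
Amount:
$564,100
Institution:
Montclair State University
Institution country:
United States
Award type:
Standard Grant
Fiscal year:
2023
Funding country:
United States
Period:
2023-01-01 to 2025-12-31
Project status:
Not yet closed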

Source:
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2226006&HistoricalAwards=false
Keywords:
RI Small DaRE Detection Recognition

Project abstract

To fully understand human language, machines need to be able to recognize and interpret expressions that contain hidden meanings. This project concentrates on euphemisms: mild or indirect phrases used in place of harsher or more offensive ones. Euphemisms are often used to mask profanity or to refer politely to sensitive topics such as death, sex, religion, disability, or personal relationships. People use euphemisms all the time, e.g., 'negative patient outcome', 'between jobs', 'financially fortunate', 'correctional facility', 'friendly fire', or 'sunshine unit'. Different cultures and languages use different euphemisms, and euphemisms change over time. Machines that process human language do not yet understand euphemisms. This project is devoted to making machines understand euphemisms in different languages, thereby contributing to improving the capabilities of artificial intelligence. Additional benefits include interesting new generalizations about the nature of euphemisms and the training of a diverse cadre of undergraduate and graduate students in highly practical work on a difficult interdisciplinary problem. Montclair State University, a Hispanic Serving Institution, is known for its diverse student population and a large proportion of first-generation college students. Montclair State University puts great emphasis on justice and inclusivity in academia. This project is not an exception.

Detecting and interpreting figurative language is a rapidly growing area in Natural Language Processing (NLP). Unfortunately, the processing of euphemisms is lacking in NLP thus far. The project addresses the following: 1) algorithm design for detecting and interpreting euphemisms, and 2) interpretability of black-box neural models by creating a series of new datasets and tasks that explore the embedding space of transformer language models for euphemism recognition.

The key insights are 1) euphemistic expressions and their paraphrased counterparts differ in the strength of the sentiment they convey; 2) euphemistic and non-euphemistic interpretation is context-sensitive; 3) euphemisms are vaguer than the taboo expressions they substitute. The experiments test what linguistic properties of euphemisms the deep learning approaches capture and why. The algorithm developed can detect new euphemisms, not previously recorded in dictionaries, without human intervention. The computational work on euphemisms is important for understanding how strategic use of language can bias people's perceptions of important and highly contentious actions, and perhaps for finding ways to de-bias language models. This work on euphemisms helps understand what topics are controversial or sensitive in a specific culture. Applying the algorithm to diachronic data and detecting the change in euphemism usage leads to a better understanding of cultural change. The corpora produced are useful for answering questions at the intersection of AI, NLP, linguistics, cultural anthropology, and social psychology. The range of languages provides a natural way of making interesting linguistic observations about euphemisms. Since euphemisms are a form of verbal behavior, finding a way to detect and interpret euphemisms automatically may lead to a better understanding of human behavior in general.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
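The first key insight in the abstract (a euphemism conveys weaker sentiment than the expression it replaces) can be illustrated with a toy sketch. This is not the project's method: the lexicon scores, the literal counterparts 'unemployed' and 'prison', and the function names are all hypothetical assumptions chosen for illustration, and a real system would presumably use a trained sentiment model rather than a hand-written word list.

```python
# Toy illustration only: hypothetical sentiment-intensity scores in [-1, 1].
# None of these numbers come from the project or from a real lexicon.
TOY_LEXICON = {
    "unemployed": -0.7,
    "between": 0.0,
    "jobs": 0.1,
    "prison": -0.8,
    "correctional": -0.2,
    "facility": 0.0,
}


def sentiment_strength(phrase: str) -> float:
    """Return the largest absolute lexicon score among the words of a phrase."""
    words = phrase.lower().split()
    return max((abs(TOY_LEXICON.get(word, 0.0)) for word in words), default=0.0)


def strength_gap(euphemism: str, literal: str) -> float:
    """Positive when the literal phrase carries stronger sentiment than the euphemism."""
    return sentiment_strength(literal) - sentiment_strength(euphemism)


# Euphemisms are taken from the abstract; the literal counterparts are assumed.
PAIRS = [
    ("between jobs", "unemployed"),
    ("correctional facility", "prison"),
]

for euphemism, literal in PAIRS:
    gap = strength_gap(euphemism, literal)
    print(f"{euphemism!r} vs {literal!r}: strength gap = {gap:.2f}")
```

Under these assumed scores, both pairs yield a positive gap, which is the pattern the first insight describes. The sketch ignores context, so it says nothing about the second insight (context sensitivity).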

Project outcomes

Journal articles (6)

Monographs (0)

Research awards (0)

Conference papers (0)

Patents (0)

A Report on the Euphemisms Detection Shared Task

DOI:
10.48550/arxiv.2211.13327
Publication date:
2022-11
Journal:
ArXiv
Impact factor:
0
Authors:
Patrick Lee;Anna Feldman;J. Peng
Corresponding authors:
Patrick Lee;Anna Feldman;J. Peng

FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms.

DOI:
Publication date:
2023
Journal:
12th Joint Conference on Lexical and Computational Semantics (SEM 2023)
Impact factor:
0
Authors:
Lee P., Shode I.
Corresponding authors:
Lee P., Shode I.

CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms

DOI:
Publication date:
2022
Journal:
arXiv preprint arXiv:2205.02728.
Impact factor:
0
Authors:
Gavidia, M.;Lee, P.;Feldman, A.;Peng, J.
Corresponding author:
Peng, J.

NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification

DOI:
10.48550/arxiv.2305.10971
Publication date:
2023-05
Journal:
ArXiv
Impact factor:
0
Authors:
Iyanuoluwa Shode;David Ifeoluwa Adelani;J. Peng;Anna Feldman
Corresponding authors:
Iyanuoluwa Shode;David Ifeoluwa Adelani;J. Peng;Anna Feldman

Searching for PETs: Using Distributional and Sentiment-Based Methods to Find Potentially Euphemistic Terms

DOI:
Publication date:
2022
Journal:
USA
Impact factor:
0
Authors:
Gavidia, M.
Corresponding author:
Gavidia, M.

Other publications by Anna Feldman

WordPrep: Word-based Preposition Prediction Tool

DOI:
10.1109/bigdata47090.2019.9005608
Publication date:
2019
Journal:
2019 IEEE International Conference on Big Data (Big Data)
Impact factor:
0
Authors:
Pooja Bhagat;A. Varde;Anna Feldman
Corresponding author:
Anna Feldman

Experiments in Cross-Language Morphological Annotation Transfer

DOI:
10.1007/11671299_4
Publication date:
2006
Journal:
BioMed Research International
Impact factor:
0
Authors:
Anna Feldman;Jirka Hana;Chris Brew
Corresponding author:
Chris Brew

Legend at ArAIEval Shared Task: Persuasion Technique Detection using a Language-Agnostic Text Representation Model

DOI:
10.48550/arxiv.2310.09661
Publication date:
2023
Journal:
Impact factor:
0
Authors:
O. E. Ojo;O. O. Adebanji;Hiram Calvo;Damian O. Dieke;Olumuyiwa E. Ojo;S.E. Akinsanya;Tolulope O. Abiola;Anna Feldman
Corresponding author:
Anna Feldman