Sociolinguistics Tools in the Analysis of Language Use in Software Development

软件开发中语言使用分析的社会语言学工具

基本信息

  • 批准号:
    RGPIN-2017-03847
  • 负责人:
  • 金额:
    $ 1.46万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2018
  • 资助国家:
    加拿大
  • 起止时间:
    2018-01-01 至 2019-12-31
  • 项目状态:
    已结题

项目摘要

Companies must ensure that software is high quality and understandable, the latter because it is virtually guaranteed that another person will need to read and understand software written by someone else, whether for purposes of upgrading, fixing, or replacing the code. In the field of natural language researchers have used machine learning to determine an author's gender with up to 90% accuracy; this process also allowed the identification of how the two groups used the language differently. This is valuable information for software development teams, whether we look at (for instance) differences between men and women, junior and senior developers, or developers with different native natural languages. While there is a great deal of research on people's use of natural language, both written and spoken, there is very little on how people use artificial languages such as those used for writing software, and none at all on the sociolinguistics of artificial language use. I propose to address this gap with a view to using my discoveries to contribute to improving both software quality and software readability.******One high level motivation of this work lies in the quest for diversity. If different groups use language in different ways, then can tools be tweaked to guide programmers toward more stylistically “neutral” code, thus reducing potential sociolinguistic differences between diverse groups? Or is this defeating the purpose of encouraging a diverse population of contributors and their unique perspectives? When teaching students to code, are there particular styles that might be more “comfortable” for different groups, and should we be encouraging this? We don't know the answers to these questions, but evidence from the natural languages suggests that we should be considering them. ******However as we investigate how a diverse population creates software, the underlying motivation is that of the need for quality code that can be clearly understood. If different groups write code differently, is that contributing to misunderstanding? Are there ways that we can, for instance, “translate” code to make it easier for another reader to understand? If there are significant differences, do these correspond to code quality? Finally, what has larger impact; language use at the micro levels that we are proposing to start with (i.e. line by line), or larger, structural and conceptual differences that form the overall architecture and design of large pieces of software? My research program aims to examine these questions, and I believe that this novel approach will have a significant impact on the field of software engineering.
公司必须确保软件是高质量和可理解的,后者是因为几乎可以保证另一个人将需要阅读和理解其他人编写的软件,无论是为了升级、修复还是替换代码。在自然语言领域,研究人员已经使用机器学习来确定作者的性别,准确率高达90%;这一过程还允许识别两组人如何使用不同的语言。这对于软件开发团队来说是很有价值的信息,无论我们看(例如)男性和女性、初级开发人员和高级开发人员之间的差异,还是使用不同母语的开发人员之间的差异。虽然有大量关于人们使用自然语言的研究,无论是书面语言还是口头语言,但关于人们如何使用人工语言(如用于编写软件的语言)的研究很少,更没有关于人工语言使用的社会语言学的研究。我建议解决这一差距,以期利用我的发现为提高软件质量和软件可读性做出贡献。*这项工作的一个高层次动机在于寻求多样性。如果不同的群体以不同的方式使用语言,那么是否可以调整工具来引导程序员编写更具风格的“中立”代码,从而减少不同群体之间潜在的社会语言差异?或者,这是否违背了鼓励不同的贡献者及其独特观点的目的?在教学生编程时,有没有特定的风格可能更适合不同的群体,我们应该鼓励这样做吗?我们不知道这些问题的答案,但来自自然语言的证据表明,我们应该考虑这些问题。*然而,当我们调查不同群体如何创造软件时,潜在的动机是对能够清楚理解的高质量代码的需求。如果不同的组编写不同的代码,这是否会导致误解?例如,我们有没有办法可以“翻译”代码,让其他读者更容易理解?如果存在显著差异,这些差异是否对应于代码质量?最后,什么影响更大;是我们建议开始的微观层面的语言使用(即逐行),还是形成大型软件整体架构和设计的更大的结构和概念差异?我的研究计划旨在研究这些问题,我相信这种新颖的方法将对软件工程领域产生重大影响。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Rice, Jacqueline其他文献

Rice, Jacqueline的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Rice, Jacqueline', 18)}}的其他基金

Sociolinguistics Tools in the Analysis of Language Use in Software Development
软件开发中语言使用分析的社会语言学工具
  • 批准号:
    RGPIN-2017-03847
  • 财政年份:
    2022
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Sociolinguistics Tools in the Analysis of Language Use in Software Development
软件开发中语言使用分析的社会语言学工具
  • 批准号:
    RGPIN-2017-03847
  • 财政年份:
    2021
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Sociolinguistics Tools in the Analysis of Language Use in Software Development
软件开发中语言使用分析的社会语言学工具
  • 批准号:
    RGPIN-2017-03847
  • 财政年份:
    2020
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Sociolinguistics Tools in the Analysis of Language Use in Software Development
软件开发中语言使用分析的社会语言学工具
  • 批准号:
    RGPIN-2017-03847
  • 财政年份:
    2019
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Sociolinguistics Tools in the Analysis of Language Use in Software Development
软件开发中语言使用分析的社会语言学工具
  • 批准号:
    RGPIN-2017-03847
  • 财政年份:
    2017
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Logic Synthesis for Reversible and Traditional Logic Circuits
可逆和传统逻辑电路的逻辑综合
  • 批准号:
    261211-2012
  • 财政年份:
    2016
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Logic Synthesis for Reversible and Traditional Logic Circuits
可逆和传统逻辑电路的逻辑综合
  • 批准号:
    261211-2012
  • 财政年份:
    2015
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Logic Synthesis for Reversible and Traditional Logic Circuits
可逆和传统逻辑电路的逻辑综合
  • 批准号:
    261211-2012
  • 财政年份:
    2014
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Logic Synthesis for Reversible and Traditional Logic Circuits
可逆和传统逻辑电路的逻辑综合
  • 批准号:
    261211-2012
  • 财政年份:
    2013
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual
Logic Synthesis for Reversible and Traditional Logic Circuits
可逆和传统逻辑电路的逻辑综合
  • 批准号:
    261211-2012
  • 财政年份:
    2012
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Discovery Grants Program - Individual

相似海外基金

2022BBSRC-NSF/BIO Generating New Network Analysis Tools for Elucidating the Functional Logic of 3D Vision Circuits of the Drosophila Brain
2022BBSRC-NSF/BIO 生成新的网络分析工具来阐明果蝇大脑 3D 视觉电路的功能逻辑
  • 批准号:
    BB/Y000234/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Research Grant
Euclid Legacy Science Advanced analysis tools
Euclid Legacy Science 高级分析工具
  • 批准号:
    10093177
  • 财政年份:
    2024
  • 资助金额:
    $ 1.46万
  • 项目类别:
    EU-Funded
REU Site: University of North Carolina at Greensboro - Complex Data Analysis using Statistical and Machine Learning Tools
REU 站点:北卡罗来纳大学格林斯伯勒分校 - 使用统计和机器学习工具进行复杂数据分析
  • 批准号:
    2244160
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Standard Grant
Creating an advanced multi-ancestral resource and tools for short tandem repeat analysis in the AOURP researcher workbench
在 AOURP 研究人员工作台中创建先进的多祖先资源和工具,用于短串联重复分析
  • 批准号:
    10798717
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
Exploratory Analysis Tools for Developmental Studies of Brain Microstructure with Diffusion MRI
利用扩散 MRI 进行脑微结构发育研究的探索性分析工具
  • 批准号:
    10645844
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
EgypToolWear: Metalwork Wear Analysis of Ancient Egyptian Tools
EgypToolWear:古埃及工具的金属制品磨损分析
  • 批准号:
    EP/X026434/1
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Fellowship
MAGIC tools for genome-wide mosaic analysis with existing Drosophila resources (Equipment Supplement 2023)
利用现有果蝇资源进行全基因组嵌合分析的 MAGIC 工具(设备补充资料 2023)
  • 批准号:
    10808546
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
Scalable Bayesian regression: Analytical and numerical tools for efficient Bayesian analysis in the large data regime
可扩展贝叶斯回归:在大数据领域进行高效贝叶斯分析的分析和数值工具
  • 批准号:
    2311354
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Standard Grant
New software tools for differential analysis of single-cell genomics perturbation experiments
用于单细胞基因组扰动实验差异分析的新软件工具
  • 批准号:
    10735033
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
Robust approaches for the analysis of agreement between clinical measurements: development of guidance and software tools for researchers
分析临床测量之间一致性的稳健方法:为研究人员开发指南和软件工具
  • 批准号:
    MR/X029301/1
  • 财政年份:
    2023
  • 资助金额:
    $ 1.46万
  • 项目类别:
    Research Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了