Corpus-based grammar in contrast: the cross-linguistic distributional analysis of Nepali grammatical categories
基于语料库的语法对比:尼泊尔语语法类别的跨语言分布分析
基本信息
- 批准号:AH/E008976/1
- 负责人:
- 金额:$ 15.02万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2007
- 资助国家:英国
- 起止时间:2007 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The proposed research concerns the application of a new set of methods to the study of the grammar of nouns and verbs in the Nepali language.In grammar, certain types of word are associated with particular categories of grammatical markers. The grammatical categories associated with nouns, for example, include markers of case - in English these are mostly prepositions, e.g. in phrases such as 'in the bath', 'on Saturday', 'with a hammer'. Verbs, by contrast, are associated with grammatical elements indicating time (tense) or the probability/possibility of an event (mood) - examples of these categories in English include auxiliary verbs such as 'can', 'may', and 'shall', or forms such as 'have [done]' or 'be [doing]'.In Nepali, complex sequences of grammatical categories may found after both nouns and verbs. For example:(1) Example of a noun: 'manisharulai'manis-haru-laiman-PLRL-ACC/DAT'(to the) men'(2) Another example noun: 'kalamle'kalam-lepen-ERG/INSTR'with (a/the) pen'(3) Example of a verb: 'garirahancha'gar-i-rahan-chado-PASS-stay-be.3SG's/he is doing'(NB: in the examples above, the grammatical categories are labelled with abbreviations in capital letters of the terms usually used for them.)The nature of these categories, and the nature of the structures which they occur in, are the topics of this proposal.Grammatical categories have often been analysed by looking at their distribution: that is, in terms of what contexts they can and cannot occur in. For instance, in English, an important feature of nouns is that they may co-occur with the word 'the', to give one very basic example. Distribution is also important in corpus linguistics - the approach to studying language based on analysing large collections of text on computer. In corpus linguistics, distribution can be analysed statistically - that is, by looking at what words and what forms commonly occur together or in close proximity to one another. This link between words which occur frequently together in a large corpus of text is described as 'collocation'.The advantage of using corpus methods in the study of grammar is that findings from a corpus reflect grammar in texts, as it is used in everyday writing (and, in some cases, speech as well). It allows our knowledge of grammar to emerge from a wide view of the ways in which people actually use their language.This study will develop methods based on statistical measures of co-occurrence and collocation, and use them to look at the distribution of grammatical categories in Nepali. This will enable an investigation of the nature of these grammatical categories and the patterns in which they are used in language. This will extend our knowledge of the types of elements in the Nepali language seen in the examples above.It has long been established that some features of grammar are specific to particular languages, whereas others are more general and are found in many different languages. The final stage of the investigation proposed here will be to look at the contrast between Nepali and other languages, from the perspective of the quantitative methods outlined above. To do this, the same set of methods will be applied to corpora of two language - English and Russian - which are from the same family of languages as Nepali, although they come from different regions of the world. This will allow a finely-grained, closely focussed contrastive analysis. The patterns found in the three languages will be compared, allowing us to build up a picture of how the grammars of these languages use grammar differently in real texts.In all cases, the raw material for analysis will be the output of computerised searches of a corpus. However, this statistical information must be carefully analysed to establish what patterns can of interest can be shown to exist, in terms of the meaning or the grammar of the statistical links that the computer detects.
这项研究涉及一套新的方法来研究尼泊尔语中的名词和动词的语法。在语法中,某些类型的词与特定类别的语法标记相关联。例如,与名词相关的语法类别包括格的标记-在英语中,这些标记大多是介词,例如在短语中,如“在洗澡”,“在星期六”,“用锤子”。相比之下,动词与表示时间(时态)或事件(语气)的可能性/可能性的语法元素相关-这些类别的例子在英语中包括助动词,如'can','may'和'shall',或形式,如'have [done]'或'be [doing]'。在尼泊尔语中,名词和动词之后都可以找到复杂的语法类别序列。例如:(1)名词的例子:'manisharulai'manis-haru-laiman-PLRL-ACC/DAT '(to the)men'(2)另一个例子名词:'kalamle'kalam-lepen-ERG/EARR 'with(a/the)pen'(3)动词的例子:'garirahancha'gar-i-rahan-chado-PASS-stay-be.3SG's/he is doing '(注意:在上面的例子中,语法类别用通常用于它们的术语的大写字母缩写来标记。这些范畴的性质以及它们所出现的结构的性质是本建议的主题,语法范畴的分析通常是通过观察它们的分布来进行的,也就是说,根据它们可以和不可以出现在什么样的语境中。例如,在英语中,名词的一个重要特征是它们可以与单词“the”共同出现,这是一个非常基本的例子。分布在语料库语言学中也很重要--语料库语言学是一种基于在计算机上分析大量文本的语言研究方法。在语料库语言学中,分布可以通过统计学来分析-也就是说,通过观察哪些单词和哪些形式通常一起出现或彼此接近。这种在大量文本中频繁出现的词之间的联系被称为“搭配”。在语法研究中使用语料库方法的好处是,语料库的发现反映了文本中的语法,因为它在日常写作中使用(在某些情况下,也包括语音)。它使我们的语法知识从人们实际使用语言的方式的广泛视角中浮现出来。本研究将开发基于共现和搭配的统计措施的方法,并使用它们来研究尼泊尔语中语法类别的分布。这将使调查的性质,这些语法类别和模式,他们在语言中使用。这将扩展我们对上述例子中尼泊尔语元素类型的了解。长期以来,人们一直认为,语法的某些特征是特定语言所特有的,而其他特征则更普遍,在许多不同的语言中都可以找到。这里建议的调查的最后阶段将是从上面概述的定量方法的角度来研究尼泊尔语和其他语言之间的对比。为此,将对英语和俄语这两种语言的语料库采用同一套方法,这两种语言与尼泊尔语同属一个语系,尽管它们来自世界不同地区。这将允许细粒度的,密切关注的对比分析。在这三种语言中发现的模式将被比较,使我们能够建立一个图片,这些语言的语法如何在真实的文本中不同地使用语法。在所有情况下,分析的原材料将是语料库的计算机搜索的输出。然而,必须仔细分析这些统计信息,以确定计算机检测到的统计联系的意义或语法方面存在哪些感兴趣的模式。
项目成果
期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
(Provisional title) methodological paper
(暂定标题)方法论论文
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:Andrew Hardie (Author)
- 通讯作者:Andrew Hardie (Author)
Collocational patterning in cross-linguistic perspective: adpositions in English, Nepali, and Russian
跨语言视角的搭配模式:英语、尼泊尔语和俄语的介词
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:Andrew Hardie (Author)
- 通讯作者:Andrew Hardie (Author)
TBC: on Nepali grammatical categories in detail
TBC:详细了解尼泊尔语语法类别
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:Andrew Hardie (Author)
- 通讯作者:Andrew Hardie (Author)
(Provisional Title) on cross-linguistic comparison
(暂定标题)跨语言比较
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:Andrew Hardie (Author)
- 通讯作者:Andrew Hardie (Author)
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Andrew Hardie其他文献
The Were-Subjunctive in British Rural Dialects: Marrying Corpus and Questionnaire Data
- DOI:
10.1023/a:1022657227889 - 发表时间:
2003-05-01 - 期刊:
- 影响因子:1.800
- 作者:
Andrew Hardie;Tony McEnery - 通讯作者:
Tony McEnery
in the digital era Crossing Boundaries : Using GIS in Literary Studies , History and Beyond
数字时代跨越界限:在文学研究、历史及其他领域使用GIS
- DOI:
- 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Ian N. Gregory;Alistair Baron;David Cooper;Andrew Hardie;Patricia;Murrieta;Paul Rayson - 通讯作者:
Paul Rayson
MP54-12 SARCOPENIA AS MEASURED BY ILIOPSOAS AND ERECTOR SPINAE MUSCLE DENSITY IS ASSOCIATED WITH HIGHER INCIDENCE OF POSTOPERATIVE COMPLICATIONS FOLLOWING RADICAL CYSTECTOMY FOR BLADDER CANCER
- DOI:
10.1016/j.juro.2017.02.1684 - 发表时间:
2017-04-01 - 期刊:
- 影响因子:
- 作者:
William Rawls;Mark Currin;Austin DeRosa;James Rosoff;John Roebel;Matthew Jaenicke;Thomas Beckham;Andrew Hardie;Sandip Prasad - 通讯作者:
Sandip Prasad
Ziggurat : A new data model and indexing format for large annotated text corpora
Ziggurat:用于大型注释文本语料库的新数据模型和索引格式
- DOI:
- 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Stefan Evert;Andrew Hardie - 通讯作者:
Andrew Hardie
2204 IS MRI WITH DIFFUSION WEIGHTED IMAGING EFFECTIVE IN DETECTING PROSTATE CANCER IN MEN WITH PREVIOUS NEGATIVE BIOPSIES?
- DOI:
10.1016/j.juro.2013.02.2113 - 发表时间:
2013-04-01 - 期刊:
- 影响因子:
- 作者:
Matthew Young;Ryan Levey;James Rosoff;Josh Smith;George Ghareeb;Brian Lane;Andrew Hardie;Thomas Keane;Stephen Savage - 通讯作者:
Stephen Savage
Andrew Hardie的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Andrew Hardie', 18)}}的其他基金
Comparable and Parallel Corpus Approaches to the Third Code: English and Chinese Perspectives
第三代码的可比和平行语料库方法:英语和汉语视角
- 批准号:
ES/K010107/1 - 财政年份:2013
- 资助金额:
$ 15.02万 - 项目类别:
Research Grant
ESRC Centre for Corpus Approaches to Social Science (CASS)
ESRC 社会科学语料库方法中心 (CASS)
- 批准号:
ES/K002155/1 - 财政年份:2013
- 资助金额:
$ 15.02万 - 项目类别:
Research Grant
相似国自然基金
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
Incentive and governance schenism study of corporate green washing behavior in China: Based on an integiated view of econfiguration of environmental authority and decoupling logic
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国学者研究基金项目
Exploring the Intrinsic Mechanisms of CEO Turnover and Market Reaction: An Explanation Based on Information Asymmetry
- 批准号:W2433169
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国学者研究基金项目
含Re、Ru先进镍基单晶高温合金中TCP相成核—生长机理的原位动态研究
- 批准号:52301178
- 批准年份:2023
- 资助金额:30.00 万元
- 项目类别:青年科学基金项目
NbZrTi基多主元合金中化学不均匀性对辐照行为的影响研究
- 批准号:12305290
- 批准年份:2023
- 资助金额:30.00 万元
- 项目类别:青年科学基金项目
眼表菌群影响糖尿病患者干眼发生的人群流行病学研究
- 批准号:82371110
- 批准年份:2023
- 资助金额:49.00 万元
- 项目类别:面上项目
镍基UNS N10003合金辐照位错环演化机制及其对力学性能的影响研究
- 批准号:12375280
- 批准年份:2023
- 资助金额:53.00 万元
- 项目类别:面上项目
CuAgSe基热电材料的结构特性与构效关系研究
- 批准号:22375214
- 批准年份:2023
- 资助金额:50.00 万元
- 项目类别:面上项目
A study on prototype flexible multifunctional graphene foam-based sensing grid (柔性多功能石墨烯泡沫传感网格原型研究)
- 批准号:
- 批准年份:2020
- 资助金额:20 万元
- 项目类别:
基于大数据定量研究城市化对中国季节性流感传播的影响及其机理
- 批准号:82003509
- 批准年份:2020
- 资助金额:24.0 万元
- 项目类别:青年科学基金项目
相似海外基金
Selection of word-formation processes in naming of things in French: a research based on the approach of Construction Grammar
法语事物命名中构词过程的选择:基于构式语法方法的研究
- 批准号:
23K12184 - 财政年份:2023
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Collaborative construction of emergent grammar in Japanese university English language learning tasks: From graphic materials to spoken and written production
日本大学英语学习任务中生成语法的协同构建:从图形材料到口语和书面表达
- 批准号:
22K00692 - 财政年份:2022
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Minimalist-Grammar-based evaluation metrics: The case of head-final clause structure acquisition
基于极简语法的评价指标:中心词尾从句结构获取案例
- 批准号:
22K00502 - 财政年份:2022
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Latin-based vocabulary and grammar in the Wycliffite Bible and their historical development
威克里夫圣经中基于拉丁语的词汇和语法及其历史发展
- 批准号:
22K00619 - 财政年份:2022
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Developing Digital Materials for a Comprehension-Based Approach to English Grammar and Usage
开发基于理解的英语语法和用法的数字材料
- 批准号:
21K02854 - 财政年份:2021
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A Descriptive Grammar of Papiamentu — a Corpus-based Descriptive Research—
帕皮阿门图语描述性语法
- 批准号:
21J10486 - 财政年份:2021
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Grammar of present-day Spanish based on geographical variation
基于地理差异的当今西班牙语语法
- 批准号:
20K00585 - 财政年份:2020
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Inductive re-construction of Japanese grammar and its application to Japanese language education based on the large scale extraction of Japanese formulaic sequences and its structural analyses
基于大规模日语公式序列提取及其结构分析的日语语法归纳重构及其在日语教育中的应用
- 批准号:
20H00096 - 财政年份:2020
- 资助金额:
$ 15.02万 - 项目类别:
Grant-in-Aid for Scientific Research (A)