SGER: Discovery of Research Trends and Classification in Domain-Specific Text: Application to Nanoscale Science and Engineering Field
SGER:特定领域文本研究趋势和分类的发现:在纳米科学与工程领域的应用
基本信息
- 批准号:0417401
- 负责人:
- 金额:$ 9.97万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2004
- 资助国家:美国
- 起止时间:2004-02-15 至 2007-01-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
AbstractCTS-0417401A. Bellaachia, George Washington UniversityWith the explosion of the World Wide Web and the massive amount of data available from several research areas, the need for knowledge discovery techniques that help extract hidden information has increased. Analysis of these data may lead to the discovery of new trends and patterns hidden within the data that would be impossible to discover by just traditional query-based systems. These would improve forecasting, efficiency of resource allocations, and quality of the decision making process. In our initial analysis of this exploratory research project, we have queried the 250,000 NSF online awards for nanoscale science and engineering (NSE) abstracts. The retrieved data were analyzed, cleaned, and stored in a web-based database. This database can be accessed at http://128.164.158.138/bellaTest. We have started the exploration of data mining techniques to extract new research areas and new research topics. So far we have implemented the popular k-mean technique along with the singular value decomposition. Currently, we are in the process of analyzing the results of our initial experiments. In this request, we are planning to finish the analysis of our preliminary mining techniques and we would also like to explore two other clustering techniques: the Self-Organizing Oscillators Network (SOON) clustering algorithm and the E-CAST clustering techniques. Both of these techniques do not require any knowledge about the number of clusters. They have also been successfully used in other domains. In this exploratory research, we would like to request additional fund to conduct the following tasks: - Finish the analysis of our initial data mining techniques. - Conduct other experiments using the predefined set of existing seven concepts as seeds to the k-mean technique. - Since k-mean requires the specification of the number of clusters, we are also planning to explore two other data mining techniques that do not require the number of clusters: the Self-Organizing Oscillators Network (SOON) and the E-CAST clustering techniques. - Analyze all the above techniques and determine key terms that best describe new emerging research areas in the field of NSE field. In addition to mining new trends and research areas in the NSE awards, we are also planning to add new features to the NSF website. These include the display of details of awards in the form a US map with Hover features, and searching largest awards, top funded investigators, etc. The intellectual merit and new contributions of this request include a comprehensive understanding of how to process small, special-purpose collection of data in other domains. Students, researchers, and educators can also use our system to discover new research areas. The proposed system has broader impacts that can be applied to other applications such retrieval systems and digital library systems.
摘要CTS-0417401 A。贝拉契亚,乔治华盛顿大学随着万维网的爆炸和大量的数据可从几个研究领域,知识发现技术,帮助提取隐藏的信息的需求增加了。对这些数据的分析可能会发现隐藏在数据中的新趋势和模式,而这些趋势和模式是传统的基于查询的系统无法发现的。这将改善预测、资源分配的效率和决策过程的质量。 在我们对这个探索性研究项目的初步分析中,我们查询了250,000个NSF在线奖项的纳米级科学与工程(NSE)摘要。检索到的数据进行了分析,清理,并存储在基于Web的数据库。该数据库可在http://128.164.158.138/bellaTest上查阅。 我们已经开始探索数据挖掘技术,以提取新的研究领域和新的研究课题。到目前为止,我们已经实现了流行的k-mean技术沿着的奇异值分解。目前,我们正在分析初步实验的结果。 在这个请求中,我们计划完成对我们初步挖掘技术的分析,我们还想探索其他两种聚类技术:自组织振荡器网络(SOON)聚类算法和E-CAST聚类技术。这两种技术都不需要任何关于聚类数量的知识。它也被成功地应用于其他领域。 在这项探索性研究中,我们想请求额外的资金来进行以下任务:-完成我们最初的数据挖掘技术的分析。 - 使用现有的七个概念的预定义集合作为k-mean技术的种子进行其他实验。 - 由于k-mean需要指定簇的数量,我们还计划探索其他两种不需要簇数量的数据挖掘技术:自组织振荡器网络(SOON)和E-CAST聚类技术。 - 分析所有上述技术,并确定最能描述NSE领域新兴研究领域的关键术语。 除了挖掘NSE奖项中的新趋势和研究领域外,我们还计划为NSF网站添加新功能。这些包括以具有Hover功能的美国地图的形式显示奖项的详细信息,以及搜索最大的奖项,顶级资助的研究人员等。该请求的智力价值和新贡献包括全面了解如何处理其他领域的小型,特殊用途的数据收集。学生,研究人员和教育工作者也可以使用我们的系统来发现新的研究领域。所提出的系统具有更广泛的影响,可以应用到其他应用程序,如检索系统和数字图书馆系统。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Abdelghani Bellaachia其他文献
String Matching Over Compressed Text on Handheld Devices Using Tagged Sub-Optimal Code (TSC)
- DOI:
10.1007/s11241-005-6886-9 - 发表时间:
2005-03-01 - 期刊:
- 影响因子:1.300
- 作者:
Abdelghani Bellaachia;Iehab AL Rassan - 通讯作者:
Iehab AL Rassan
Communication capabilities of product networks
- DOI:
10.1023/a:1019183720873 - 发表时间:
2000-01-01 - 期刊:
- 影响因子:2.300
- 作者:
Abdelghani Bellaachia;Abdou Youssef - 通讯作者:
Abdou Youssef
Abdelghani Bellaachia的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Abdelghani Bellaachia', 18)}}的其他基金
SGER: Discovery of Research Trends Using Concept Extraction and Data Mining Techniques in domain-specific Text: Application to Nanoscale Science and Engineering Field.
SGER:在特定领域文本中使用概念提取和数据挖掘技术发现研究趋势:在纳米科学与工程领域的应用。
- 批准号:
0737961 - 财政年份:2007
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
SGER: Discovery of Research Trends and Classification in Domain-Specific Text: Application to Nanoscale Science and Engineering Field.
SGER:特定领域文本中研究趋势和分类的发现:在纳米科学与工程领域的应用。
- 批准号:
0243579 - 财政年份:2002
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
US-Morocco Workshop: Information Technology, June 2002
美国-摩洛哥研讨会:信息技术,2002 年 6 月
- 批准号:
0209514 - 财政年份:2002
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
ITR/AP: A Web-Based Scientific Analysis Facility for Nuclear & Particle Physics Data
ITR/AP:基于网络的核科学分析设施
- 批准号:
0113163 - 财政年份:2001
- 资助金额:
$ 9.97万 - 项目类别:
Continuing Grant
相似海外基金
Planning: Advancing Discovery on a Sustainable National Research Enterprise
规划:推进可持续国家研究企业的发现
- 批准号:
2412406 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
Collaborative Research: TRTech-PGR TRACK: Discovery and characterization of small CRISPR systems for virus-based delivery of heritable editing in plants.
合作研究:TRTech-PGR TRACK:小型 CRISPR 系统的发现和表征,用于基于病毒的植物遗传编辑传递。
- 批准号:
2334028 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
Collaborative Research: Road Information Discovery through Privacy-Preserved Collaborative Estimation in Connected Vehicles
协作研究:通过联网车辆中保护隐私的协作估计来发现道路信息
- 批准号:
2422579 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
Collaborative Research: Design and Discovery of Entropy-Stabilized Perovskite Halides for Optoelectronics
合作研究:用于光电子学的熵稳定钙钛矿卤化物的设计和发现
- 批准号:
2421149 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Continuing Grant
CLIMA/Collaborative Research: Discovery of Covalent Adaptable Networks for Sustainable Manufacturing and Recycling of Wind Turbine Blades
CLIMA/合作研究:发现用于风力涡轮机叶片可持续制造和回收的共价适应性网络
- 批准号:
2332276 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
From Data to Discovery: BlokBIO's Vision of Transforming Genomic Research with User-Centric Intelligence Solutions
从数据到发现:BlokBIO 通过以用户为中心的智能解决方案转变基因组研究的愿景
- 批准号:
10109374 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Launchpad
Collaborative Research: TRTech-PGR TRACK: Discovery and characterization of small CRISPR systems for virus-based delivery of heritable editing in plants.
合作研究:TRTech-PGR TRACK:小型 CRISPR 系统的发现和表征,用于基于病毒的植物遗传编辑传递。
- 批准号:
2334027 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
CLIMA/Collaborative Research: Discovery of Covalent Adaptable Networks for Sustainable Manufacturing and Recycling of Wind Turbine Blades
CLIMA/合作研究:发现用于风力涡轮机叶片可持续制造和回收的共价适应性网络
- 批准号:
2332275 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
Research Infrastructure: CC* Data Storage: Broadening UMBCs Data Storage footprint to Advance Scientific Research and Discovery
研究基础设施:CC* 数据存储:扩大 UMBC 数据存储足迹以推进科学研究和发现
- 批准号:
2346667 - 财政年份:2024
- 资助金额:
$ 9.97万 - 项目类别:
Standard Grant
Concurrent multi-organ responses to chronic physical activity and inactivity intervention to increase research discovery in human health and wellbeing
对慢性身体活动和不活动干预的并发多器官反应,以增加人类健康和福祉的研究发现
- 批准号:
BB/X015173/1 - 财政年份:2023
- 资助金额:
$ 9.97万 - 项目类别:
Research Grant