STRUCTURAL VISUALIZATION IN BIOINFORMATICS
生物信息学中的结构可视化
基本信息
- 批准号:7561561
- 负责人:
- 金额:$ 14.78万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2007
- 资助国家:美国
- 起止时间:2007-08-01 至 2008-07-31
- 项目状态:已结题
- 来源:
- 关键词:Accession NumberAlgorithmsApplications GrantsArchitectureAreaAwardBackBehaviorBioinformaticsBiologicalBiologyChicagoCollaborationsCollectionComputer Retrieval of Information on Scientific Projects DatabaseDataData FilesData SetDatabasesDevelopmentDoctor of PhilosophyEducational process of instructingElectrodesExtensible Markup LanguageFacultyFundingGenbankGene ExpressionGenesGoalsGrantHealthcareImageImageryIndividualInstitutionJavaLaboratoriesLearningLeftLiftingMediatingMemoryMeta-AnalysisMethodsMetricMicroarray AnalysisMinorityModelingMotor CortexNatureNeurosciencesPaperPhenotypePreparationProceduresProcessPublishingResearchResearch InfrastructureResearch PersonnelResourcesSeriesServicesSourceStandards of Weights and MeasuresStructureStudentsSystemTechniquesTechnologyTestingTexasTimeTranslatingTranslationsUnited States National Institutes of HealthUniversitiesUpdateVisitWorkWritingabstractingbasefallsinsightmemberprogramsprototyperesearch studysoftware developmentspatiotemporaltechnique developmenttooltwo-dimensional
项目摘要
This subproject is one of many research subprojects utilizing the
resources provided by a Center grant funded by NIH/NCRR. The subproject and
investigator (PI) may have received primary funding from another NIH source,
and thus could be represented in other CRISP entries. The institution listed is
for the Center, which is not necessarily the institution for the investigator.
A. Specific Aims
The specific aims of this grant are as stated in the original proposal.
B. Studies and Results
Year 3 of the grant was focused on tools to handle large data sets and multiple scales. We have made progress in three areas: 1) assembling and handling of microarray datasets, 2) analysis with workflows and hypothesis testing and 3) navigation and multiple scale analysis by lifting to abstract feature spaces. Progress on these efforts is described below. I also wrote a grant proposal for and was awarded a UTSA faculty development leave. This leave will allow me to work on research without teaching or service responsibilities during the fall semester of 2007.
1) Assembling and handling of microarray datasets. An essential step in performing structural visualization across large groups of microarray data sets is that the data be translated into a common format for comparison. The NCBI GEO (Gene Expression Omnibus) has gathered a large collection of microarray experiments in a single database. They provide the data in two formats: XML (MINiML) and SOFT. We have developed programs to download and parse the data into individual data files and to automatically assemble this data into comma separated spreadsheets for visualization and analysis.
A major logistical problem with comparing datasets across platforms is that different microarray platforms use different, and sometimes inconsistent, methods for identifying genes on the microarrays. We have decided to use the NCBI Gene ID as the standard identifier in our work. Unfortunately, some common microarray platforms use the GenBank Accession Number instead of the GeneID. These platforms require a circuitous translation to Gene ID, which Cory Burkhardt has semi-automated. He has downloaded, reformatted and identified most of the microarray experiments from the top 20 platforms in the NCBI Geo database.
We have also developed various programs for parsing the data from the XML specifications and have developed software for serializing the data that isnt being used during program execution using Java SoftReference technology. This allows us to deal with more data than will fit into memory in a clean, object-oriented way. Jason Edwards and James Packer developed a CacheManager Architecture for handling large microarray datasets. This architecture will be deployed in the microarray analysis tool that we have under development. We will also investigate converting Davis (our Java-based data visualization platform) to use this caching technique in the coming year.
2) Analysis with workflows and hypothesis testing. This year we focused on developing a workflow-based tool infrastructure based on wizards. Wizards are a familiar application interface in which the user navigates through a procedure in a step-by-step process using Next and Back buttons. In their undergraduate honors theses, Jason Edwards and James Packer developed the infrastructure to support a simple prototype workflow for comparing the behavior of two genes across many microarray platforms. Their prototype application is written in Java and is called MicroMetal (Microarray Meta Analysis). This is the starting point for more general workflow-hypothesis testing approach.
3) Navigation and multiple scale analysis by lifting to abstract feature spaces. Doctoral student Dragana Veljkovic and I continue to work on the development of techniques for abstracting features for comprehensive multi-scale navigation of datasets. The idea is that a time window of a spatiotemporal data set (e.g., a multi-electrode recording or a series of microarray experiments) is represented by a low-dimensional subspace (for example, the two-dimensional space spanned by its largest two principal components). The data in each time window then becomes a single point in feature space. The distance between two feature points is computed using a distance metric on the subspaces. We can then project this feature space into a plane using a manifold learning algorithm such as ISOMAP. We are developing navigation and summary techniques that describe the distribution of very large datasets by summarizing their abstract features. A paper describing a MATLAB tool that implements these navigation techniques is under preparation.
Other activities: We continue to develop and test Davis (DAta VIewing System). Our collaborators, particularly David Senseman and his students, are using Davis extensively for their research. We have documented Davis and created video tutorials to make learning to use Davis easier.
Update on collaborations formed because of this grant: Another aspect of this development grant is the formation of collaborations in biosciences. The following collaborations that were formed last year have proceeded:
1) Nicholas Hatsopoulos, University of Chicago, with Doug Rubino from his laboratory: Our paper entitled "Propagating waves mediate information transfer in the motor cortex" was published in Nature Neuroscience in December. Doug, who was an undergraduate when this collaboration started, has visited several times. He entered a PhD program in neuroscience at the University of San Diego in fall semester of 2006.
2) Colleen Witt, director of the RCMI imaging facility at UTSA and I are continuing to talk about potential integration of visualization and modeling with imaging. We are planning to revise and resubmit our Texas ARP research program grant proposal if the program is offered this year. Dr. Witt supervised minority student Alejandro Montelongos undergraduate honors thesis on this work (completed spring 2007). I am a member of this students thesis committee.
C. Significance
Accessible tools for analysis of microarray data are needed to fully realize the potential of high-throughput data-driven biology. In addition, other types of data (such as phenotype information) must be integrated in order to apply the results to health-care. Current tools tend to be simplistic in their data handling and present results that are difficult to relate to actual biological questions. The development of general data-handling infrastructure, meaningful visualizations, and fusion of diverse types of information is critical. The goal of this project is to build tools that allow hypothesis-driven inquiry of structure across multiple data sets at multiple scales in order to derive higher-level insight into fundamental mechanism. We have made some initial progress towards this type of deployment.
这个子项目是众多研究子项目之一
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
KAY A ROBBINS其他文献
KAY A ROBBINS的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('KAY A ROBBINS', 18)}}的其他基金
The Cancer Bioinformatics Initiative: A UTSA/UTHSCSA Partnership (2 of 2)
癌症生物信息学计划:UTSA/UTHSCSA 合作伙伴关系(2 of 2)
- 批准号:
8914545 - 财政年份:2012
- 资助金额:
$ 14.78万 - 项目类别:
The Cancer Bioinformatics Initiative: A UTSA/UTHSCSA Partnership (2 of 2)
癌症生物信息学计划:UTSA/UTHSCSA 合作伙伴关系(2 of 2)
- 批准号:
8729467 - 财政年份:2012
- 资助金额:
$ 14.78万 - 项目类别:
The Cancer Bioinformatics Initiative: A UTSA/UTHSCSA Partnership (2 of 2)
癌症生物信息学计划:UTSA/UTHSCSA 合作伙伴关系(2 of 2)
- 批准号:
8543664 - 财政年份:2012
- 资助金额:
$ 14.78万 - 项目类别:
The Cancer Bioinformatics Initiative: A UTSA/UTHSCSA Partnership (2 of 2)
癌症生物信息学计划:UTSA/UTHSCSA 合作伙伴关系(2 of 2)
- 批准号:
8461324 - 财政年份:2012
- 资助金额:
$ 14.78万 - 项目类别:
CORE B: NEUROCOMPUTATIONAL & NEUROVISUALIZATION FACILITY
核心 B:神经计算
- 批准号:
6973882 - 财政年份:2004
- 资助金额:
$ 14.78万 - 项目类别:
相似海外基金
CAREER: Blessing of Nonconvexity in Machine Learning - Landscape Analysis and Efficient Algorithms
职业:机器学习中非凸性的祝福 - 景观分析和高效算法
- 批准号:
2337776 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Continuing Grant
CAREER: From Dynamic Algorithms to Fast Optimization and Back
职业:从动态算法到快速优化并返回
- 批准号:
2338816 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Continuing Grant
CAREER: Structured Minimax Optimization: Theory, Algorithms, and Applications in Robust Learning
职业:结构化极小极大优化:稳健学习中的理论、算法和应用
- 批准号:
2338846 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Continuing Grant
CRII: SaTC: Reliable Hardware Architectures Against Side-Channel Attacks for Post-Quantum Cryptographic Algorithms
CRII:SaTC:针对后量子密码算法的侧通道攻击的可靠硬件架构
- 批准号:
2348261 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Standard Grant
CRII: AF: The Impact of Knowledge on the Performance of Distributed Algorithms
CRII:AF:知识对分布式算法性能的影响
- 批准号:
2348346 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Standard Grant
CRII: CSR: From Bloom Filters to Noise Reduction Streaming Algorithms
CRII:CSR:从布隆过滤器到降噪流算法
- 批准号:
2348457 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Standard Grant
EAGER: Search-Accelerated Markov Chain Monte Carlo Algorithms for Bayesian Neural Networks and Trillion-Dimensional Problems
EAGER:贝叶斯神经网络和万亿维问题的搜索加速马尔可夫链蒙特卡罗算法
- 批准号:
2404989 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Standard Grant
CAREER: Efficient Algorithms for Modern Computer Architecture
职业:现代计算机架构的高效算法
- 批准号:
2339310 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Continuing Grant
CAREER: Improving Real-world Performance of AI Biosignal Algorithms
职业:提高人工智能生物信号算法的实际性能
- 批准号:
2339669 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Continuing Grant
DMS-EPSRC: Asymptotic Analysis of Online Training Algorithms in Machine Learning: Recurrent, Graphical, and Deep Neural Networks
DMS-EPSRC:机器学习中在线训练算法的渐近分析:循环、图形和深度神经网络
- 批准号:
EP/Y029089/1 - 财政年份:2024
- 资助金额:
$ 14.78万 - 项目类别:
Research Grant