Comparative Analyses and Genomic Sequence-Only Prediction of DNA Breakpoints via Gradient Boosting Machines and Deep Learning
通过梯度增强机和深度学习对 DNA 断点进行比较分析和仅基因组序列预测
基本信息
- 批准号:2434544
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:英国
- 项目类别:Studentship
- 财政年份:2020
- 资助国家:英国
- 起止时间:2020 至 无数据
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Genomic insertion/deletion alterations, which occur through the formation of DNA strand breaks, are the second most significant DNA modifications after point mutations. However, unlike point mutations, the long- and short-range sequence-context dependence of DNA strand breakpoints, as well as the detailed regional variation of breakage propensities in the genome, have not been extensively interrogated through cutting edge computational means. This has prevalently been the case because of the relative sparseness of available data on such DNA alterations, coupled with complex multi-etiologic nature of DNA strand breaks that would further stratify already sparse data. Nevertheless, the importance of understanding DNA breakages led to a number of computational works outlining potential associations of CNV breakpoints vs. non-B DNA conformations, frequent SCNA breakpoints vs. cancer genes, induced DNA breaks vs. chromatin packing and abasic sites. Discrete-valued bivariate hidden Markov model developed on CNV data could reach a predictive resolution of ~300bp for about 400 breakpoints in the human genome, showing the promise for higher resolution sequence- based prediction of DNA breakpoints for, at least, CNVs. The past decade has witnessed the advent of specific sequencing methodologies for experimental genome-wide mapping of DNA breakpoints, and, cutting edge machine learning techniques started to become readily available with recent outstanding applications on biological big data. It is therefore the right time to apply the state-of-the-art machine learning techniques on the substantially expanded and well stratified DNA breakpoint data from tissues undergoing different physiological, spontaneous and pathological processes, in order to reveal their fine short- and long-range sequence dependences, with their commonalities and differences across different processes leading to DNA strand breaks. Throughout the project, the student will gain and implement skills in computational biology, advanced data analysis, machine learning and statistics ("Quantitative skills"), while working with genomics datasets and integrating many bottom-to-top developments from thermodynamic and chemical considerations of nucleic acids ("Interdisciplinary skills").
基因组插入/删除改变,通过DNA链断裂的形成而发生,是仅次于点突变的第二大DNA修饰。然而,与点突变不同,DNA链断点的长期和短期序列上下文依赖性,以及基因组中断裂倾向的详细区域差异,尚未通过尖端的计算手段得到广泛的研究。这种情况之所以普遍存在,是因为这种DNA改变的可用数据相对较少,再加上DNA链断裂的复杂多病因性质,将进一步使已经稀疏的数据分层。然而,理解DNA断裂的重要性导致了许多计算工作,概述了CNV断点与非b DNA构象、频繁的SCNA断点与癌症基因、诱导DNA断裂与染色质包装和基本位点的潜在关联。基于CNV数据开发的离散值二元隐马尔可夫模型对人类基因组中约400个断点的预测分辨率可达~300bp,这表明基于序列的DNA断点预测至少对CNV具有更高分辨率的前景。过去十年见证了用于DNA断点全基因组实验测绘的特定测序方法的出现,并且随着最近在生物大数据上的杰出应用,尖端机器学习技术开始变得容易获得。因此,现在是时候将最先进的机器学习技术应用于从经历不同生理、自发和病理过程的组织中大量扩展和分层良好的DNA断点数据,以揭示其精细的短期和长期序列依赖性,以及它们在导致DNA链断裂的不同过程中的共性和差异。在整个项目中,学生将获得并实施计算生物学、高级数据分析、机器学习和统计学方面的技能(“定量技能”),同时使用基因组学数据集,并整合从热力学和核酸化学考虑的许多自下而上的发展(“跨学科技能”)。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
其他文献
吉治仁志 他: "トランスジェニックマウスによるTIMP-1の線維化促進機序"最新医学. 55. 1781-1787 (2000)
Hitoshi Yoshiji 等:“转基因小鼠中 TIMP-1 的促纤维化机制”现代医学 55. 1781-1787 (2000)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
LiDAR Implementations for Autonomous Vehicle Applications
- DOI:
- 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
吉治仁志 他: "イラスト医学&サイエンスシリーズ血管の分子医学"羊土社(渋谷正史編). 125 (2000)
Hitoshi Yoshiji 等人:“血管医学与科学系列分子医学图解”Yodosha(涉谷正志编辑)125(2000)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Effect of manidipine hydrochloride,a calcium antagonist,on isoproterenol-induced left ventricular hypertrophy: "Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,K.,Teragaki,M.,Iwao,H.and Yoshikawa,J." Jpn Circ J. 62(1). 47-52 (1998)
钙拮抗剂盐酸马尼地平对异丙肾上腺素引起的左心室肥厚的影响:“Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('', 18)}}的其他基金
An implantable biosensor microsystem for real-time measurement of circulating biomarkers
用于实时测量循环生物标志物的植入式生物传感器微系统
- 批准号:
2901954 - 财政年份:2028
- 资助金额:
-- - 项目类别:
Studentship
Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions
利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案
- 批准号:
2896097 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
A Robot that Swims Through Granular Materials
可以在颗粒材料中游动的机器人
- 批准号:
2780268 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.
严重空间天气事件对核电和保障监督的恢复力的可能性和影响。
- 批准号:
2908918 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface
质子、α 和 γ 辐照辅助应力腐蚀开裂:了解燃料-不锈钢界面
- 批准号:
2908693 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Field Assisted Sintering of Nuclear Fuel Simulants
核燃料模拟物的现场辅助烧结
- 批准号:
2908917 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Assessment of new fatigue capable titanium alloys for aerospace applications
评估用于航空航天应用的新型抗疲劳钛合金
- 批准号:
2879438 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in
使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型,以分析白细胞介素 17 抑制剂的细胞和表观遗传效应
- 批准号:
2890513 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds
了解野生鸟类肠道微生物组、行为和城市化之间的相互作用
- 批准号:
2876993 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
相似海外基金
Comparative genomic analyses of vertebrate non-coding regions to study convergence of limbless-ness in snakes and caecilia.
脊椎动物非编码区的比较基因组分析,以研究蛇和蚓螈无肢的趋同。
- 批准号:
2432823 - 财政年份:2020
- 资助金额:
-- - 项目类别:
Studentship
Comparative genomic analyses of cichlids and othe organisms in Lake Victoria to understand molecular mechanism of adaptive ratiation
对维多利亚湖慈鲷和其他生物进行比较基因组分析,以了解适应性配比的分子机制
- 批准号:
17H04606 - 财政年份:2017
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Scientific Research (B)
Understanding Biomineralization in Coccolithophores by Comparative Genomic, Transcriptomic and Epigenetic Analyses
通过比较基因组、转录组和表观遗传学分析了解颗石藻的生物矿化
- 批准号:
9976534 - 财政年份:2017
- 资助金额:
-- - 项目类别:
Understanding Biomineralization in Coccolithophores by Comparative Genomic, Transcriptomic and Epigenetic Analyses
通过比较基因组、转录组和表观遗传学分析了解颗石藻的生物矿化
- 批准号:
9279616 - 财政年份:2017
- 资助金额:
-- - 项目类别:
Comparative genomic and population genomic analyses of disease regions
疾病区域的比较基因组和群体基因组分析
- 批准号:
16K18470 - 财政年份:2016
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Young Scientists (B)
Bridging observational, experimental and genomic insights on species and speciation through comparative analyses in a new vertebrate model system
通过在新的脊椎动物模型系统中进行比较分析,将关于物种和物种形成的观察、实验和基因组见解联系起来
- 批准号:
1457774 - 财政年份:2015
- 资助金额:
-- - 项目类别:
Continuing Grant
Comparative Genomic and Transcriptomic Analyses of High Arctic Microorganisms
高北极微生物的比较基因组和转录组分析
- 批准号:
426465-2012 - 财政年份:2012
- 资助金额:
-- - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Master's
DISSERTATION RESEARCH: Systematics of the Order Brassicales and Family Brassicaceae, and comparative genomic analyses (nuclear and organellar genomes): a phylogenomic approach
论文研究:芸苔目和十字花科的系统学,以及比较基因组分析(核和细胞器基因组):系统发育学方法
- 批准号:
1110443 - 财政年份:2011
- 资助金额:
-- - 项目类别:
Standard Grant
Comparative genomic analyses of firmicutes (low GC gram-positive bacteria
厚壁菌门(低 GC 革兰氏阳性菌)的比较基因组分析
- 批准号:
368561-2008 - 财政年份:2008
- 资助金额:
-- - 项目类别:
University Undergraduate Student Research Awards
An Xserve computer cluster for phlogenetic and comparative genomic analyses
用于系统发育和比较基因组分析的 Xserve 计算机集群
- 批准号:
349478-2007 - 财政年份:2006
- 资助金额:
-- - 项目类别:
Research Tools and Instruments - Category 1 (<$150,000)