Dense Monocular Reconstruction and Semantic Segmentation of 3D Environments

3D 环境的密集单目重建和语义分割

基本信息

  • 批准号:
    2116531
  • 负责人:
  • 金额:
    --
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Studentship
  • 财政年份:
    2018
  • 资助国家:
    英国
  • 起止时间:
    2018 至 无数据
  • 项目状态:
    已结题

项目摘要

Most applications of robotics and augmented reality (AR) rely on some form of a 3D model of the environment in which they operate in order to interact with the environment. At the most basic level, these models provide information about where things are in the environment, allowing systems to accurately and safely interact with the environment. Conventionally there has been a trade-off between the quality and the density of the model (i.e. accurate models typically consist of a cloud of points with no information about the region between the points). This sparseness typically arises from the use of 3D scanners, such as lasers. There are scanners that are able to produce dense information, however, these scanners are either limited to indoors operation, or have an extremely short range (typically 1-5m). If more meaningful interaction is desired then semantic information, which describes the contents of the environment, is required.The ability to quickly generate accurate models of 3D spaces has a variety of impacts, from increasing the ease with which automated systems can navigate and explore the world, to improving the interaction of visuals generated by AR with the real world. Further, there are potential applications for disability and access, whereby buildings or areas can be easily and quickly mapped, and then scale models printed, to aid those with impaired vision to navigate ares. Semantic information allows systems to understand the world and answer questions about it. For example, with semantic information, we can ask questions like "Where are the chairs in this room?"We aim to investigate systems for both reconstructing dense 3D models of environments as well as generating semantic segmentations of those models. We hope to be able to develop a system that is capable of generating these models in real time, and ultimately on embedded and mobile devices, such as an iPhone. Further, we aim to be able to generate these models in places where current sensor based systems cannot, i.e. outdoors and over ranges greater than 5m. To improve on the existing techniques for reconstructing 3D models from monocular images, we plan to utilize convolutional neural networks (CNNs), alongside existing geometric methods. However, rather than computing a depth image, we propose to directly compute a full 3D model from the network. Although this process requires significantly more memory, we believe that this will allow for better integration of the available information. We suspect that the direct use of 3D information will be of particular importance for semantic segmentation, where certain viewing angles of objects can be misleading (e.g. a chair from above looks a lot like a table). We also plan to investigate the potential of recurrent neural networks to improve the quality of the reconstructions over a sequence of images, as this input pipeline mimics those that you would most likely see in real world data acquisition scenarios.This project falls within the EPSRC Information and Communication Technologies theme, specifically the Image and Vision Computing research area.
机器人和增强现实(AR)的大多数应用都依赖于某种形式的环境的3D模型,它们在其中操作,以便与环境进行交互。在最基本的层面上,这些模型提供了关于事物在环境中位置的信息,允许系统准确安全地与环境交互。传统上,在模型的质量和密度之间存在折衷(即,精确模型通常由点云组成,而没有关于点之间的区域的信息)。这种稀疏性通常是由于使用3D扫描仪,如激光。有些扫描仪能够产生密集的信息,然而,这些扫描仪要么限于室内操作,要么具有极短的范围(通常为1- 5米)。如果需要更有意义的交互,则需要描述环境内容的语义信息。快速生成精确的3D空间模型的能力具有各种影响,从增加自动化系统导航和探索世界的轻松性,到改善AR生成的视觉效果与真实的世界的交互。此外,还有针对残疾和无障碍的潜在应用,可以轻松快速地绘制建筑物或区域的地图,然后打印缩放模型,以帮助视力受损的人在战神中导航。语义信息使系统能够理解世界并回答有关世界的问题。例如,有了语义信息,我们可以问“这个房间里的椅子在哪里?“我们的目标是研究用于重建环境的密集3D模型以及生成这些模型的语义分割的系统。我们希望能够开发出一个能够在真实的时间内生成这些模型的系统,并最终在嵌入式和移动的设备上,如iPhone。此外,我们的目标是能够在当前基于传感器的系统无法生成这些模型的地方,即户外和超过5米的范围。为了改进现有的从单目图像重建3D模型的技术,我们计划利用卷积神经网络(CNN)以及现有的几何方法。然而,我们建议直接从网络中计算完整的3D模型,而不是计算深度图像。虽然这一过程需要更多的内存,但我们相信这将有助于更好地整合现有信息。我们怀疑直接使用3D信息对于语义分割特别重要,因为对象的某些视角可能会产生误导(例如,从上面看椅子很像桌子)。我们还计划研究递归神经网络的潜力,以提高图像序列的重建质量,因为这个输入管道模仿那些你最有可能在真实的世界的数据采集scenaries.This项目的福尔斯落在EPSRC信息和通信技术的主题,特别是图像和视觉计算研究领域。

项目成果

期刊论文数量(3)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Approximating Continuous Convolutions for Deep Network Compression
  • DOI:
    10.48550/arxiv.2210.08951
  • 发表时间:
    2022-10
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Theo W. Costain;V. Prisacariu
  • 通讯作者:
    Theo W. Costain;V. Prisacariu
Towards Generalising Neural Implicit Representations
  • DOI:
  • 发表时间:
    2021-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Theo W. Costain;V. Prisacariu
  • 通讯作者:
    Theo W. Costain;V. Prisacariu
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

其他文献

吉治仁志 他: "トランスジェニックマウスによるTIMP-1の線維化促進機序"最新医学. 55. 1781-1787 (2000)
Hitoshi Yoshiji 等:“转基因小鼠中 TIMP-1 的促纤维化机制”现代医学 55. 1781-1787 (2000)。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
LiDAR Implementations for Autonomous Vehicle Applications
  • DOI:
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
生命分子工学・海洋生命工学研究室
生物分子工程/海洋生物技术实验室
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
吉治仁志 他: "イラスト医学&サイエンスシリーズ血管の分子医学"羊土社(渋谷正史編). 125 (2000)
Hitoshi Yoshiji 等人:“血管医学与科学系列分子医学图解”Yodosha(涉谷正志编辑)125(2000)。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
Effect of manidipine hydrochloride,a calcium antagonist,on isoproterenol-induced left ventricular hypertrophy: "Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,K.,Teragaki,M.,Iwao,H.and Yoshikawa,J." Jpn Circ J. 62(1). 47-52 (1998)
钙拮抗剂盐酸马尼地平对异丙肾上腺素引起的左心室肥厚的影响:“Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:

的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('', 18)}}的其他基金

An implantable biosensor microsystem for real-time measurement of circulating biomarkers
用于实时测量循环生物标志物的植入式生物传感器微系统
  • 批准号:
    2901954
  • 财政年份:
    2028
  • 资助金额:
    --
  • 项目类别:
    Studentship
Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions
利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案
  • 批准号:
    2896097
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
A Robot that Swims Through Granular Materials
可以在颗粒材料中游动的机器人
  • 批准号:
    2780268
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.
严重空间天气事件对核电和保障监督的恢复力的可能性和影响。
  • 批准号:
    2908918
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface
质子、α 和 γ 辐照辅助应力腐蚀开裂:了解燃料-不锈钢界面
  • 批准号:
    2908693
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Field Assisted Sintering of Nuclear Fuel Simulants
核燃料模拟物的现场辅助烧结
  • 批准号:
    2908917
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Assessment of new fatigue capable titanium alloys for aerospace applications
评估用于航空航天应用的新型抗疲劳钛合金
  • 批准号:
    2879438
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in
使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型,以分析白细胞介素 17 抑制剂的细胞和表观遗传效应
  • 批准号:
    2890513
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
CDT year 1 so TBC in Oct 2024
CDT 第 1 年,预计 2024 年 10 月
  • 批准号:
    2879865
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship
Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds
了解野生鸟类肠道微生物组、行为和城市化之间的相互作用
  • 批准号:
    2876993
  • 财政年份:
    2027
  • 资助金额:
    --
  • 项目类别:
    Studentship

相似海外基金

Elucidating the Role of Dorsal Lateral Geniculate Nucleus Burst-Mode Firing in Retinal Inactivation Induced Recovery from Monocular Deprivation
阐明背外侧膝状核爆发模式放电在视网膜失活诱导的单眼剥夺恢复中的作用
  • 批准号:
    10464250
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
Self-supervised Monocular Depth Estimation
自监督单目深度估计
  • 批准号:
    2747408
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Studentship
Visual-Inertial Monocular Dynamic SLAM for Autonomous Driving
用于自动驾驶的视觉惯性单目动态 SLAM
  • 批准号:
    559121-2021
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Elucidating the Role of Dorsal Lateral Geniculate Nucleus Burst-Mode Firing in Retinal Inactivation Induced Recovery from Monocular Deprivation
阐明背外侧膝状核爆发模式放电在视网膜失活诱导的单眼剥夺恢复中的作用
  • 批准号:
    10609435
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
Audio-visual object-based dynamic scene representation from monocular video
单目视频中基于视听对象的动态场景表示
  • 批准号:
    2701695
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Studentship
Monocular Vision System Development for Object Detection
用于物体检测的单目视觉系统开发
  • 批准号:
    573228-2022
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    University Undergraduate Student Research Awards
SBIR Phase II: Three Dimensional Monocular Thermal Ranging Camera
SBIR二期:三维单目热测距相机
  • 批准号:
    2128439
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Cooperative Agreement
Monocular Camera Localization with Depth Estimation
带深度估计的单目相机定位
  • 批准号:
    567510-2021
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    University Undergraduate Student Research Awards
Visual-Inertial Monocular Dynamic SLAM for Autonomous Driving
用于自动驾驶的视觉惯性单目动态 SLAM
  • 批准号:
    559121-2021
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Monocular Visual Confusion for Field Expansion
用于视野扩展的单眼视觉混乱
  • 批准号:
    10474347
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了