Cross-modal egocentric activity recognition and zero-shot learning
跨模式自我中心活动识别和零样本学习
基本信息
- 批准号:1971464
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:英国
- 项目类别:Studentship
- 财政年份:2017
- 资助国家:英国
- 起止时间:2017 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The availability of low-cost wearable cameras has renewed the interest of first-person human activity analysis. The recognition of first-person activities has important challenges to be addressed such as, rapid changes in illuminations, significant camera motion and complex hand-object manipulations. In recent years, the advances in deep learning have influenced significantly the computer vision community, as convolutional networks gave impressive results in tasks such as, object recognition and detection, scene understanding and image segmentation. Convolutional networks have been used with success in first-person activity recognition as well. Before the emergence of deep learning the community of first-person computer vision was focused on the engineering of important egocentric features that capture properties of the first-person point of view, such as hand-object interactions and gaze. Convolutional networks allow the learning of such features automatically using big amounts of data, eliminating the need of hand-designed features. In this work, we focus on activity recognition with convolutional networks. Influenced by the recent success of multi-stream architectures, we are investigating their applicability in egocentric videos, by employing multiple modalities for training the models. An important observation that motivates us is that humans combine their senses to understand concepts of the world, such as acoustic and visual information. To this end, we propose the employment of both videos and sounds towards more accurate activity recognition. Specifically, we will investigate how shared aligned representations can be learnt using the multi-stream paradigm. Moreover, we are interested in temporal feature pooling methods to leverage information that spans over the whole video, as in many cases the whole video should be observed in order to be able to discriminate between similar actions. Our final goal is to employ these ideas in zero-shot learning. Zero-shot learning is being able to solve a task despite not having received any training examples of that task. An example is to recognize activities without having seen any video of these activities during training. This can be done by using the knowledge of trained classifiers (trained in other classes and not in the ones to be predicted by the zero-shot paradigm) and additional knowledge about the new classes.
低成本可穿戴摄像头的出现重新激发了人们对第一人称人类活动分析的兴趣。第一人称活动的识别有重要的挑战要解决,如照明的快速变化,显着的相机运动和复杂的手对象操作。近年来,深度学习的进步对计算机视觉社区产生了重大影响,因为卷积网络在对象识别和检测、场景理解和图像分割等任务中取得了令人印象深刻的结果。卷积网络也已成功用于第一人称活动识别。在深度学习出现之前,第一人称计算机视觉社区专注于重要的自我中心特征的工程设计,这些特征捕获第一人称视角的属性,例如手-物体交互和凝视。卷积网络允许使用大量数据自动学习这些特征,无需手工设计特征。在这项工作中,我们专注于卷积网络的活动识别。受多流架构最近成功的影响,我们正在研究它们在以自我为中心的视频中的适用性,通过采用多种模式来训练模型。激励我们的一个重要观察是,人类联合收割机结合他们的感官来理解世界的概念,例如听觉和视觉信息。为此,我们建议使用视频和声音来进行更准确的活动识别。具体来说,我们将研究如何共享对齐表示可以使用多流范式学习。此外,我们对时间特征池方法感兴趣,以利用跨越整个视频的信息,因为在许多情况下,应该观察整个视频,以便能够区分类似的动作。我们的最终目标是将这些想法应用于零射击学习。零触发学习是指即使没有接受过任何任务的训练示例,也能够解决该任务。一个例子是在培训期间识别活动,而无需查看这些活动的任何视频。这可以通过使用训练的分类器的知识(在其他类中训练,而不是在由零触发范例预测的类中训练)和关于新类的附加知识来完成。
项目成果
期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Slow-Fast Auditory Streams for Audio Recognition
- DOI:10.1109/icassp39728.2021.9413376
- 发表时间:2021-03
- 期刊:
- 影响因子:0
- 作者:E. Kazakos;Arsha Nagrani;Andrew Zisserman;D. Damen
- 通讯作者:E. Kazakos;Arsha Nagrani;Andrew Zisserman;D. Damen
The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines
- DOI:10.1109/tpami.2020.2991965
- 发表时间:2021-11-01
- 期刊:
- 影响因子:23.6
- 作者:Damen, Dima;Doughty, Hazel;Wray, Michael
- 通讯作者:Wray, Michael
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
其他文献
吉治仁志 他: "トランスジェニックマウスによるTIMP-1の線維化促進機序"最新医学. 55. 1781-1787 (2000)
Hitoshi Yoshiji 等:“转基因小鼠中 TIMP-1 的促纤维化机制”现代医学 55. 1781-1787 (2000)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
LiDAR Implementations for Autonomous Vehicle Applications
- DOI:
- 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
吉治仁志 他: "イラスト医学&サイエンスシリーズ血管の分子医学"羊土社(渋谷正史編). 125 (2000)
Hitoshi Yoshiji 等人:“血管医学与科学系列分子医学图解”Yodosha(涉谷正志编辑)125(2000)。
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
Effect of manidipine hydrochloride,a calcium antagonist,on isoproterenol-induced left ventricular hypertrophy: "Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,K.,Teragaki,M.,Iwao,H.and Yoshikawa,J." Jpn Circ J. 62(1). 47-52 (1998)
钙拮抗剂盐酸马尼地平对异丙肾上腺素引起的左心室肥厚的影响:“Yoshiyama,M.,Takeuchi,K.,Kim,S.,Hanatani,A.,Omura,T.,Toda,I.,Akioka,
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
- 通讯作者:
的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('', 18)}}的其他基金
An implantable biosensor microsystem for real-time measurement of circulating biomarkers
用于实时测量循环生物标志物的植入式生物传感器微系统
- 批准号:
2901954 - 财政年份:2028
- 资助金额:
-- - 项目类别:
Studentship
Exploiting the polysaccharide breakdown capacity of the human gut microbiome to develop environmentally sustainable dishwashing solutions
利用人类肠道微生物群的多糖分解能力来开发环境可持续的洗碗解决方案
- 批准号:
2896097 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
A Robot that Swims Through Granular Materials
可以在颗粒材料中游动的机器人
- 批准号:
2780268 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Likelihood and impact of severe space weather events on the resilience of nuclear power and safeguards monitoring.
严重空间天气事件对核电和保障监督的恢复力的可能性和影响。
- 批准号:
2908918 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Proton, alpha and gamma irradiation assisted stress corrosion cracking: understanding the fuel-stainless steel interface
质子、α 和 γ 辐照辅助应力腐蚀开裂:了解燃料-不锈钢界面
- 批准号:
2908693 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Field Assisted Sintering of Nuclear Fuel Simulants
核燃料模拟物的现场辅助烧结
- 批准号:
2908917 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Assessment of new fatigue capable titanium alloys for aerospace applications
评估用于航空航天应用的新型抗疲劳钛合金
- 批准号:
2879438 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Developing a 3D printed skin model using a Dextran - Collagen hydrogel to analyse the cellular and epigenetic effects of interleukin-17 inhibitors in
使用右旋糖酐-胶原蛋白水凝胶开发 3D 打印皮肤模型,以分析白细胞介素 17 抑制剂的细胞和表观遗传效应
- 批准号:
2890513 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
Understanding the interplay between the gut microbiome, behavior and urbanisation in wild birds
了解野生鸟类肠道微生物组、行为和城市化之间的相互作用
- 批准号:
2876993 - 财政年份:2027
- 资助金额:
-- - 项目类别:
Studentship
相似国自然基金
基于异构医学影像数据的深度挖掘技术及中枢神经系统重大疾病的精准预测
- 批准号:61672236
- 批准年份:2016
- 资助金额:64.0 万元
- 项目类别:面上项目
相似海外基金
Flexible fMRI-Compatible Neural Probes with Organic Semiconductor based Multi-modal Sensors for Closed Loop Neuromodulation
灵活的 fMRI 兼容神经探针,带有基于有机半导体的多模态传感器,用于闭环神经调节
- 批准号:
2336525 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
Collaborative Research: NCS-FR: Individual variability in auditory learning characterized using multi-scale and multi-modal physiology and neuromodulation
合作研究:NCS-FR:利用多尺度、多模式生理学和神经调节表征听觉学习的个体差异
- 批准号:
2409652 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
Imaging for Multi-scale Multi-modal and Multi-disciplinary Analysis for EnGineering and Environmental Sustainability (IM3AGES)
工程和环境可持续性多尺度、多模式和多学科分析成像 (IM3AGES)
- 批准号:
EP/Z531133/1 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Research Grant
MUSE: Multi-Modal Software Evolution
MUSE:多模式软件演进
- 批准号:
EP/W015927/2 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Research Grant
High speed multi modal in-situ Transmission Electron Microscopy platform
高速多模态原位透射电子显微镜平台
- 批准号:
LE240100060 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Linkage Infrastructure, Equipment and Facilities
Multi-scale, multi-modal X-ray imaging using speckle
使用散斑的多尺度、多模态 X 射线成像
- 批准号:
DE220101402 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Discovery Early Career Researcher Award
Multi-modal electron microscopy of 3D racetrack memory
3D 赛道记忆的多模态电子显微镜
- 批准号:
EP/X025632/1 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Research Grant
NSF-SNSF: Rapid Beamforming for Massive MIMO using Machine Learning on RF-only and Multi-modal Sensor Data
NSF-SNSF:在纯射频和多模态传感器数据上使用机器学习实现大规模 MIMO 的快速波束成形
- 批准号:
2401047 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Standard Grant
FDG-PET in combination with proton (1H) and sodium (23Na) MRI: a di-modal metabolic imaging approach
FDG-PET 结合质子 (1H) 和钠 (23Na) MRI:双模态代谢成像方法
- 批准号:
24K15805 - 财政年份:2024
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Scientific Research (C)
Enhancing STEM Success: A Multi-modal Investigating of Spatial Reasoning and Training in Undergraduate Education
促进 STEM 成功:本科教育空间推理和培训的多模式研究
- 批准号:
2300785 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Continuing Grant