Digitisation / Cataloguing of non-textual objects: A standardised and optimised process for data acquisition from digital images of herbarium specimens

非文本对象的数字化/编目:从植物标本馆标本的数字图像中获取数据的标准化和优化的过程

基本信息

项目摘要

The project will develop and document a software-driven standard process for extracting metadata from images of herbarium specimens (i.e. dried pressed plants or plant parts mounted on cardboard and stored in natural history collections). We will address a large proportion of science collections: approximately 22 million herbarium specimens exist as botanical reference objects in Germany, about 500 million worldwide. Metadata like plant name, collection site and date, collector, accession numbers, etc. are also glued flat on the sheet and thus visible on the specimen image. Up to now most of the data capture is manually fed into collection databases, but increasingly, imaging techniques are employed (also to ensure that the on-line metadata can be verified). The standard process shall replace or add to the manual data input as much as possible. Image processing software detects objects on the digitized record and classifies them. Text objects will be transformed into structured information using text mining algorithms. For handwriting, author identification is attempted. The project will evaluate and enhance existing software to conform to standard interfaces and integrate it into an overall open software architecture on the basis of established IT standards. Finally, the requirements for the process will be formulated as a standard and the actual application will be documented.
该项目将开发和记录一个软件驱动的标准流程,用于从植物标本馆标本(即干压植物或植物部分安装在纸板上并储存在自然历史收藏品中)的图像中提取元数据。我们将解决大部分的科学收藏:大约2200万植物标本馆标本存在作为植物参考对象在德国,约5亿世界各地。植物名称、采集地点和日期、采集者、登录号等元数据也被粘贴在纸张上,因此在标本图像上可见。到目前为止,大多数数据采集都是人工输入收集数据库,但越来越多地采用成像技术(也是为了确保可以核实在线元数据)。标准过程应尽可能取代或增加人工数据输入。图像处理软件检测数字化记录上的对象并对其进行分类。使用文本挖掘算法将文本对象转换为结构化信息。对于笔迹,试图鉴定作者。该项目将评价和加强现有软件,使其符合标准接口,并根据既定的信息技术标准将其纳入一个全面开放的软件结构。最后,该过程的要求将被制定为标准,并将实际应用记录在案。

项目成果

期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Professor Dr. Walter Berendsohn其他文献

Professor Dr. Walter Berendsohn的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Professor Dr. Walter Berendsohn', 18)}}的其他基金

Digitisation / Cataloguing of non-textual objects: Access and format of existing digital object data, alignment of established database systems and development of a joint data portal - Biodiversity Network of the Humboldt-Ring (BiNHum)
非文本对象的数字化/编目:现有数字对象数据的访问和格式、已建立的数据库系统的调整以及联合数据门户的开发 - 洪堡环生物多样性网络 (BiNHum)
  • 批准号:
    203096210
  • 财政年份:
    2012
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Internationalisation and realisation of the interdisciplinary potential of the online annotation system - AnnoSys and its extension to further structured data formats.
在线注释系统 AnnoSys 的国际化和跨学科潜力的实现及其对进一步结构化数据格式的扩展。
  • 批准号:
    187927094
  • 财政年份:
    2011
  • 资助金额:
    --
  • 项目类别:
    Research data and software (Scientific Library Services and Information Systems)

相似海外基金

Digitisation / Cataloguing of non-textual objects: Creation of a Standard for the 3D computed tomography of musical instruments and enhancement of the MIMO Digitisation Standard (DFG-MUSICES)
非文本对象的数字化/编目:创建乐器 3D 计算机断层扫描标准并增强 MIMO 数字化标准 (DFG-MUSICES)
  • 批准号:
    248476191
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Metadata standardization for digitized recordings in scientific collections based on the ASR2-METS/MODS profile and compilation of a discographic authority file
非文本对象的数字化/编目:基于 ASR2-METS/MODS 配置文件的科学收藏中数字化记录的元数据标准化和唱片记录权威文件的编译
  • 批准号:
    247988593
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: eScience-compliant standards for morphology
非文本对象的数字化/编目:符合 eScience 的形态标准
  • 批准号:
    248394582
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Human skeletal collections. Development of Standards for the access to historical anthropological / anatomical research collections using the Freiburg Alexander Ecker collection as example
非文本对象的数字化/编目:人类骨骼收藏。
  • 批准号:
    248492242
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: A community platform for the development and documentation of the ABCD standard for natural history collections
非文本对象的数字化/编目:用于开发和记录自然历史收藏品 ABCD 标准的社区平台
  • 批准号:
    248067007
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Description and Digitization of Precious Book Covers as Independent Works of Art
非文本对象的数字化/编目:作为独立艺术作品的珍贵书籍封面的描述和数字化
  • 批准号:
    248356741
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Development of new digitization standards for the large-scale assessment of leaf venation traits from herbarium collections using microradiographic imaging
非文本对象的数字化/编目:开发新的数字化标准,用于使用显微放射成像对植物标本馆收藏的叶脉特征进行大规模评估
  • 批准号:
    247987558
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Development of interoperable metadata standards for the contextualization of heterogeneous collection objects,using the 18th century scholarly collection compiled by Georg Thomas v. Asch as an example
非文本对象的数字化/编目:使用 Georg Thomas v. 编译的 18 世纪学术馆藏,开发用于异构馆藏对象情境化的可互操作元数据标准。
  • 批准号:
    248069826
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Development of standards for the photographic documentation of permanent microscope slide mounts in precarious mounting media
非文本对象的数字化/编目:制定不稳定安装介质中永久显微镜载玻片安装的摄影记录标准
  • 批准号:
    248331536
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
Digitisation / Cataloguing of non-textual objects: Towards an integrative and comprehensive standard for meta-omics data of collection objects (MOD-CO)
非文本对象的数字化/编目:建立集合对象元组学数据的综合综合标准(MOD-CO)
  • 批准号:
    248069971
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Cataloguing and Digitisation (Scientific Library Services and Information Systems)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了