Table 2

Structure of a data paper and its mapping to GBIF IPT Metadata Profile elements

Section/sub-section heading

Mapping to GBIF IPT Metadata Profile elements


Title

Derived from the 'title' element. The title is a centered sentence without a full stop at the end.


Authors

Derived from the 'creator', 'metadataProvider' and 'associatedParty' elements. From these elements, combinations of 'first name' and 'last name' are derived and separated by commas. Corresponding affiliations of the authors are denoted with superscript numbers (1, 2, 3,…) at the end of each last name. Text is centered.


Affiliations

Derived from the 'creator', 'metadataProvider' and 'associatedParty' elements. From these elements, combinations of 'organization name', 'address', 'postal code', 'city', 'country' and 'email' constitute the address. If two or more authors share the same address, it is denoted by the same number.

Corresponding authors

Derived from the 'creator' and 'metadataProvider' elements. From these elements, 'first name', 'last name' and 'email' are derived. Emails are written in parentheses. If there is more than one corresponding author, they are separated by commas. If the creator and the metadataProvider are the same, the creator is listed as the corresponding author. Text is centered.

Received, Revised, Accepted and Published dates

These are to be manually inserted by the publisher of the data paper to indicate the dates of original manuscript submission, revised manuscript submission, acceptance of manuscript and publication of the manuscript as a data paper in the journal.


Citation

This is to be manually inserted by the publisher of the data paper. It is a combination of authors, year of data paper publication (in parentheses), title, journal name, volume, issue number (in parentheses), and DOI of the data paper.


Abstract

Derived from the 'abstract' element. Text is indented on both sides.


Keywords

Derived from the 'keyword' element. Keywords are separated by commas.


Taxonomic Coverage

Derived from the taxonomic coverage elements: 'taxonomicCoverage', 'taxonomicRankName', 'taxonomicRankValue' and 'commonName'. 'taxonomicRankName' and 'taxonomicRankValue' are derived together.

Spatial Coverage

Derived from the spatial coverage elements: 'geographicDescription', 'westBoundingCoordinate', 'eastBoundingCoordinate', 'northBoundingCoordinate' and 'southBoundingCoordinate'.

Temporal Coverage

Derived from the temporal coverage elements: 'beginDate' and 'endDate'.

Project Description

Derived from project elements: 'title', 'personnel', 'funding', 'studyAreaDescription' and 'designDescription'.

Natural Collections Description

Derived from project NCD elements: 'parentCollectionIdentifier', 'collectionName', 'collectionIdentifier', 'formationPeriod', 'livingTimePeriod', 'specimenPreservationMethod' and 'jgtiCuratorialUnit'.


Methods

Derived from methods elements: 'methodStep', 'studyExtent', 'samplingDescription' and 'qualityControl'.

Dataset Descriptions

Derived from physical and other elements: 'objectName', 'characterEncoding', 'formatName', 'formatVersion', 'online/URL', 'pubDate', 'language' and 'intellectualRights'.

Additional Information

Derived from the 'additionalInfo' element.


Derived from the 'citation' element.

Chavan and Penev BMC Bioinformatics 2011 12(Suppl 15):S2   doi:10.1186/1471-2105-12-S15-S2
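
The author, affiliation and corresponding-author rows of Table 2 describe a small derivation that can be sketched in code. The following is a minimal illustration only, not the IPT's actual implementation: the Agent class, its field names and the build_author_block function are hypothetical, the address is reduced to organization, city and country, and the superscript affiliation number is written as "^n" in plain text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Agent:
    """Hypothetical stand-in for one 'creator', 'metadataProvider'
    or 'associatedParty' entry of the metadata profile."""
    first_name: str
    last_name: str
    organization: str
    city: str
    country: str
    email: str


def affiliation_of(agent):
    # Simplified address: the table also lists street address,
    # postal code and email, which are omitted here for brevity.
    return ", ".join([agent.organization, agent.city, agent.country])


def build_author_block(creators, providers, associated):
    """Return (author line, numbered affiliations, corresponding authors)."""
    # Collect each person once, keeping creator -> provider -> associated order.
    people = []
    for agent in [*creators, *providers, *associated]:
        if agent not in people:
            people.append(agent)

    # Authors sharing an address share the same affiliation number.
    addresses = []
    names = []
    for agent in people:
        address = affiliation_of(agent)
        if address not in addresses:
            addresses.append(address)
        number = addresses.index(address) + 1
        names.append(f"{agent.first_name} {agent.last_name}^{number}")

    # Corresponding authors come from creators and metadata providers only;
    # a person who is both is listed once.
    corresponding = []
    for agent in [*creators, *providers]:
        entry = f"{agent.first_name} {agent.last_name} ({agent.email})"
        if entry not in corresponding:
            corresponding.append(entry)

    affiliations = [f"{i} {a}" for i, a in enumerate(addresses, start=1)]
    return ", ".join(names), affiliations, ", ".join(corresponding)
```

Calling build_author_block with one person who is both creator and metadata provider, plus two associated parties, yields a single corresponding author and reuses one affiliation number for any authors who share an address.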
