Citing a Data Package

Along with ensuring proper attribution is paid to the original data author, citing data in publications and other published data increases findability, encourages data reuse, and enables reliable tracking of data citation metrics.[1]

Components of a data citation: (1) Data Author or Creator, (2) Public Release Date, (3) Title, (4) Version, (5) Repository, (6) Persistent Identifier (DOI), (7) Access Date.

Citing in an article

Copy the citation from a data package landing page

Every data package landing page provides a citation, formatted to community adopted practices, that can be easily copied into a document.

Managing data citations

The data package citation can be automatically imported into a reference manager application using web browser plugins (e.g. Zotero Connector and Mendeley Web Importer) which allow users to copy the dataset DOI into a specified field. Once in the reference manager, the DOI will be used to look-up and download the citation metadata, and the citation can be exported to a bibliography.

Programmatically create data package citations

Use the EDIutils read_data_package_citation to generate data package citations formatted for different communities of practice.

This operation can also be performed using the EDI Cite web service. Watch a video walkthrough demonstrating the use of Cite.

Citing in a derived data package

For citing data in a data package derived from one or more source data packages, see the page on provenance metadata.


[1]Renée F Brown, The Importance of Data Citation, BioScience, Volume 71, Issue 3, March 2021, Page 211,

ESIP Data Preservation and Stewardship Committee (2019): Data Citation Guidelines for Earth Science Data
, Version 2. ESIP. Online resource.