Accessing Data

Data packages and their contents can be accessed through the EDI Data Portal, EDIutils R Package, and the REST API. These options support scientific workflows driven by manual human effort and automated machine routines.

Data Portal

The EDI Data Portal provides data access through a user-friendly web browser interface.

Point and click

Single data objects or the entire data package can be downloaded as files:

  1. From a data package landing page, navigate to the Resources section.
  2. Download individual resources by clicking the links under Download Data.
  3. Use the Download Zip Archive button to download the entire data package.

Data download scripts

Data download scripts, read data directly from the EDI Repository into common programming language workspaces. This option supports reproducible research by accessing the data directly and reducing the possibility of errors propagating through local files.

  1. Navigate to the Code Generation section of a data package landing page.
  2. Select the language of interest (R, Python, MATLAB, etc).
  3. Download the script that is generated as a file, or copy and paste the text into the development environment of the language (e.g. RStudio).

EDIutils

The EDIutils R package allows data to be read directly from the EDI Repository into the R environment. This option serves a similar use case as the data download scripts but provides full access to the REST API, which is useful when constructing automated workflows.

  1. Download the metadata of a data package using the read_metadata() function.
  2. Download individual resources from a data package using the read_data_entity() function.
  3. Download an entire data package as a zip archive using the create_data_package_archive() functions.

See the Search and Access Data vignette for examples.

For a language-agnostic solution, see the REST API documentation on Read Metadata, Read Data Entity, Create Data Package Archive, and Read Data Package Archive.

Accessing embargoed data

Embargoed data can be accessed by users who are authorized according to the <access> element of the EML file. Users should reach out directly to the contact listed on the data package for access to embargoed data. If the contact is unresponsive, contact EDI for assistance.