Downloading NMDC Data via Globus

Globus provides a mechanism to download NMDC data using high-bandwidth managed transfers. Its automated point-and-click interface lets you schedule a bulk transfer to your own machine or to another compute center. To learn more about transferring data with Globus, see the Globus documentation.

Globus Collections

NMDC collections are publicly visible to anyone who has a Globus ID.

NERSC

To access NMDC data housed at NERSC, you can use the “NMDC” collection.

In that collection, NMDC data are organized by NMDC identifier. The collection serves the same contents as https://data.microbiomedata.org/data/, so any file path under that base URL maps to the equivalent file in Globus.
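The URL-to-path mapping can be sketched in a few lines of Python. This is a minimal illustration, not an official NMDC tool: it assumes the root of the collection corresponds exactly to the base URL, and the function name, the example identifier, and the example file name are all hypothetical.

```python
from urllib.parse import urlparse, unquote

# Base URL taken from the text above.
NERSC_BASE_URL = "https://data.microbiomedata.org/data/"


def url_to_collection_path(url: str, base_url: str = NERSC_BASE_URL) -> str:
    """Return the path inside the collection for a file URL under base_url.

    Assumption: the collection root corresponds to base_url.
    """
    base = urlparse(base_url)
    target = urlparse(url)

    # The file must live on the same host as the base URL.
    if (target.scheme, target.netloc) != (base.scheme, base.netloc):
        raise ValueError(f"{url!r} is not hosted at {base_url!r}")

    # Normalize the base path so that it always ends with a slash.
    base_path = base.path if base.path.endswith("/") else base.path + "/"
    if not target.path.startswith(base_path):
        raise ValueError(f"{url!r} is not underneath {base_url!r}")

    # Drop the base prefix and decode any percent-encoded characters.
    relative = unquote(target.path[len(base_path):])
    return "/" + relative


# Hypothetical identifier and file name, for illustration only.
print(url_to_collection_path(
    "https://data.microbiomedata.org/data/example-id/example-file.txt"
))
```

The resulting path is what you would browse to in the collection. If the collection turns out to be rooted somewhere else, only the final `return` line needs to change.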

EMSL

To access NMDC data housed at EMSL, you can use the “NMDC Bulk Data Cache” collection.

In that collection, NMDC data are organized by omics type. The collection serves the same contents as https://nmdcdemo.emsl.pnnl.gov/, so any file path under that base URL maps to the equivalent file in Globus.
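For a bulk download, it can help to turn a list of file URLs into source and destination path pairs before scheduling the transfer. The sketch below is one way to do that, under the assumption that the collection root corresponds to the base URL. The function name, the example file paths, and the destination directory are hypothetical.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse, unquote

# Base URL taken from the text above.
EMSL_BASE_URL = "https://nmdcdemo.emsl.pnnl.gov/"


def plan_transfers(urls, dest_dir):
    """Return (source_path, destination_path) pairs for a list of file URLs.

    Assumption: the collection root corresponds to EMSL_BASE_URL, so the
    URL path is also the path inside the collection.
    """
    base_host = urlparse(EMSL_BASE_URL).netloc
    pairs = []
    for url in urls:
        parsed = urlparse(url)
        if parsed.netloc != base_host:
            raise ValueError(f"{url!r} is not hosted at {EMSL_BASE_URL!r}")

        # Path relative to the collection root, percent-decoded.
        relative = unquote(parsed.path).lstrip("/")
        source = "/" + relative

        # Mirror the same directory layout under the destination directory.
        destination = str(PurePosixPath(dest_dir) / relative)
        pairs.append((source, destination))
    return pairs


# Hypothetical file paths and destination, for illustration only.
for source, destination in plan_transfers(
    [
        "https://nmdcdemo.emsl.pnnl.gov/example-omics/sample-1.raw",
        "https://nmdcdemo.emsl.pnnl.gov/example-omics/sample-2.raw",
    ],
    "/home/me/nmdc",
):
    print(source, destination)
```

Mirroring the source layout under the destination keeps files with the same name in different directories from overwriting one another.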