Data Volume

SDSS Science Archive Server (SAS)

The SAS is a hierarchical directory structure with named data products, and can be navigated directly:

Individual data products are either replacements from previous data releases, and add incrementally to the data volume, or contain links to previous data releases, and add cumulatively to the data volume. Whether you are interested in the incremental volume or the cumulative volume depends on whether you want to maintain the archive’s directory structure including the links, or if you prefer to expand the data volume by traversing the links. Although it is recommended to do so, copying the incremental volume may risk broken links unless you also copy the data from the previous releases, whereas traversing links to copy the full cumulative volume may risk obtaining duplicate files.

SDSS Access is the recommended python package to transfer each species of file within a data product, and copies the increments for each specified release keeping links intact without duplication.

In addition, SciServer Compute provides Jupyter notebooks in Python or R. You can create a SciServer account and create containers for your notebooks, as described here.

Incremental Data Volume for Data Release 19

Incremental volume for DR19:

  • 316,182 directories
  • 55,177,486 files
  • 104.06 TB

Previous releases have a combined incremental volume:

  • 5,238,520 directories
  • 340,027,065 files
  • 651.84 TB

Spectroscopic Pipeline Output

Data ProductSAS DirectoryDir CountFile CountSize
APOGEE_REDUXspectro/boss/redux168,9887,600,64827.41 TB
BOSS_SPECTRO_REDUXspectro/boss/redux48,09421,361,17716.36 TB
LVM_SPECTRO_REDUXspectro/boss/redux41484 MB
MWM_ASTRAspectro/astra95,3568,485,5587.28 TB

Raw Observatory Data (Northern only)

Data ProductSAS DirectoryDir CountFile CountSize
APOGEE_DATA_Ndata/apogee/spectro/apo919159,04822.51 TB
BOSS_SPECTRO_DATA_Ndata/boss/spectro/apo79973,4421.03 TB
FCAM_DATA_Ndata/fcam/apo527136,9082.23 TB
GCAM_DATA_Ndata/gcam/apo1,50117,359,35411.01 TB

Target Data

Data ProductSAS DirectoryDir CountFile CountSize
MOS_TARGET_DIRmos/target/1.0.201,287417.04 GB

VAC Data

Data ProductSAS DirectoryDir CountFile CountSize
APMADGICSvac/mwm/apMADGICS/v2024_03_16211416.23 TB
APOGEE_OCCAMvac/mwm/apogee-occam02345.6 KB
APOGEE_STARHORSEvac/mwm/apogee-starhorse01160.2 MB
BHM_QSOPROPvac/bhm/qso-properties/1.0.101352.7 MB
DL1_SDSS_EROSITAvac/mos/DL1_SDSS_eROSITA/v1_0_20216.67 MB
MWM_MDWARFvac/mwm/m-dwarf/elemental_abundances013.57 MB
MWM_MINESWEEPERvac/mwm/minesweeper0111.82 MB
MWM_STARFLOWvac/mwm/starflow/v1_0_0034.29 GB
MWM_WHITEDWARFvac/mwm/white-dwarf/da_white_dwarf_properties/1.0.30283.78 MB

Data volumes for earlier data releases, such as the final data release for SDSS-IV (DR17) can be found here; and the initial data release for SDSS-V (DR18) can be found here.

Back to Top