SDSS Data Archive Server

This is the SDSS Data Archive Server (DAS) main page. For general information on the SDSS, please consult the SDSS homepage. For more information on using SDSS data, please consult the SDSS DR7 web site.

The SDSS Data Archive Server (DAS) serves files produced and consumed by the SDSS data processing pipelines. See this overview of the pipelines for a description of each pipeline, references to more detailed information, and tables of which files are available in the DAS and where to find them. For the online SQL database and more advanced interactive data exploration tools, see the Catalog Archive Server (CAS).

The DAS itself provides direct access to the directory tree that holds the data, interactive forms that allow users to upload tables of data of interest for exploration or bulk download, and web pages for browsing Data Release 7 and older data releases.

Interactive download tools

Each of these forms can be used either to browse a user-specified data set or to generate a list of URLs of the files corresponding to that data set. You can then save this list to disk and use wget or any other bulk HTTP download utility (e.g. curl) to download the whole list. If you saved the URL list in a file named sdss-wget.lis, you can use wget to download the files into a directory structure corresponding to that of the DAS with a command that looks like this:

bash$ wget -x -nH -i sdss-wget.lis 

The -c and --cut-dirs options are also likely to be of interest to DAS users; consult the wget documentation.
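To get a feel for what --cut-dirs does before starting a large transfer, the following sketch prints the local path that wget -x -nH --cut-dirs=N would be expected to use for each URL it reads. The function name preview_wget_paths and the example file name are made up for illustration; the sketch only mimics wget's documented path handling, ignores query strings, and does not contact the DAS.

```shell
# preview_wget_paths is a hypothetical helper, not part of wget or the DAS.
# It reads URLs on standard input and prints the local path expected from
# `wget -x -nH --cut-dirs=N`, where N is the first argument (default 0).
preview_wget_paths() {
    cut_dirs="${1:-0}"
    while IFS= read -r url; do
        [ -n "$url" ] || continue          # skip blank lines
        path="${url#*://}"                 # drop the scheme, if present
        path="${path#*/}"                  # drop the host name, as -nH does
        i=0
        while [ "$i" -lt "$cut_dirs" ]; do
            case "$path" in
                */*) path="${path#*/}" ;;  # remove one leading directory
                *) break ;;                # only the file name is left
            esac
            i=$((i + 1))
        done
        printf '%s\n' "$path"
    done
}

# Illustrative URL; the file name example.fit is an assumption.
printf '%s\n' 'http://das.sdss.org/imaging/3836/41/example.fit' | preview_wget_paths 2
```

Running the same function over a saved list, for instance preview_wget_paths 1 < sdss-wget.lis, shows where each file would land without downloading anything.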

Bulk download through rsync is also available. The command to use a DAS-generated file list with rsync looks like this:

bash$ rsync -vtHP --files-from=sdss-rsync.lis rsync://user@rsync.sdss.org/DAS . 

The password is sdss.
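For unattended transfers it can be convenient to avoid the password prompt. rsync reads the daemon password from the RSYNC_PASSWORD environment variable when it is set, so a small wrapper along the following lines may help. The function name das_rsync_list and its two arguments are assumptions made for this sketch; the block only defines the function and does not start a transfer.

```shell
# das_rsync_list is a hypothetical wrapper around the rsync command above.
#   $1  file list generated by the DAS (default: sdss-rsync.lis)
#   $2  local destination directory    (default: current directory)
das_rsync_list() {
    list="${1:-sdss-rsync.lis}"
    dest="${2:-.}"
    # rsync takes the daemon password from RSYNC_PASSWORD when it is set,
    # so no interactive prompt appears.
    RSYNC_PASSWORD=sdss rsync -vtHP --files-from="$list" \
        rsync://user@rsync.sdss.org/DAS "$dest"
}
```

Calling das_rsync_list sdss-rsync.lis . reproduces the command shown above without asking for the password.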

Browse Data Release 7

The DAS also includes Steve Kent's SDSS mosaic interface.

You can also browse older data releases, or all the data organized by night.

Working directly with the directories and files

You can also browse the raw directory structures:

das.sdss.org/imaging
holds the files generated by the imaging pipelines. The base URL points to a directory of all imaging runs included in the DAS.
das.sdss.org/spectro
holds the files generated by the spectroscopic pipelines.
das.sdss.org/pt
holds the files generated by the photometric telescope pipeline, mtpipe, including photometric solutions and object catalogs.
das.sdss.org/nightly
holds the files generated at the observatory by MJD. These files include the text night log, cloud camera images, and other observing data.
das.sdss.org/software
holds source code for some of the software products used in the survey. The directory holds the source code of the final (for SDSS-II) versions of several key processing pipelines. The CVSrepository subdirectory holds tar files of the contents of the CVS repository for most key products.
das.sdss.org/misc
holds a handful of static datasets used in data processing. For example, the astrometric reference catalogs can be found here.
das.sdss.org/va
holds value added catalogs stored in the DAS.
das.sdss.org/unprocessed
holds the raw data files as they came from the mountain.

From these directories, you can download files using rsync or an http downloader such as wget. The total volumes of these directories can be found here.

For example, to download all files for run 3836, rerun 41 using wget:

bash$ wget -r -nH -np -N -l inf das.sdss.org/imaging/3836/41/

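where

-r retrieves the directory recursively,
-nH omits the host name (das.sdss.org) from the local directory names,
-np prevents wget from ascending into the parent directory,
-N turns on time-stamping, so files that are not newer than the local copy are skipped, and
-l inf removes the limit on recursion depth.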

You will probably want to explore the --cut-dirs and --directory-prefix options to refine where wget stores the files it retrieves. If you started a download which was then interrupted, the -c option can be used to tell it to continue partially downloaded files.

If you want a directory but not other files or subdirectories in its parent, you must use the trailing slash in the URL!
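When several runs are needed, it is easy to forget the trailing slash on one of them. The sketch below builds the directory URL for each run and rerun pair, always with the trailing slash, and prints the corresponding wget command instead of running it. The helper names das_imaging_url and print_wget_commands are hypothetical, and the input format (one "run rerun" pair per line) is an assumption of this example.

```shell
# das_imaging_url is a hypothetical helper: it builds the directory URL for
# one imaging run ($1) and rerun ($2), always ending in a trailing slash.
das_imaging_url() {
    printf 'das.sdss.org/imaging/%s/%s/\n' "$1" "$2"
}

# print_wget_commands is also hypothetical: it reads "run rerun" pairs on
# standard input and prints, without running, one wget command per pair.
print_wget_commands() {
    while read -r run rerun; do
        # skip blank or incomplete lines
        [ -n "$run" ] && [ -n "$rerun" ] || continue
        printf 'wget -r -nH -np -N -l inf %s\n' "$(das_imaging_url "$run" "$rerun")"
    done
}

# The pair 3836 41 is the run and rerun used in the example above.
printf '%s\n' '3836 41' | print_wget_commands
```

Once the printed commands look right, they can be saved to a file and executed with sh, or the printf in print_wget_commands can be replaced by a direct call to wget.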

You can accomplish something very similar using rsync:

bash$ rsync -rvtHP rsync://user@rsync.sdss.org/DAS/imaging/3836/41 .

The password is sdss.

The -r option asks rsync to download the files recursively.

Older Data Releases
