Skip to main content
Skip table of contents

Understanding Accession Numbers

Overview

NIAGADS utilizes 4 sets of accession numbers. These unique identifiers help researchers identify and navigate the data and metadata available:

  • Dataset Accession Number - NGXXXXX

    • Dataset accession numbers correspond to the data for a particular set of experiments. A dataset can be made up of 1 or more studies or sample sets. When submitting a data access request in DSS, researchers apply for access to data using the dataset accession number.

  • Study Accession Number - saXXXXXX

    • Study accession numbers correspond to the overarching study that datasets are part of. A study can be associated with one or more datasets. Datasets can be linked to more than one study if multiple studies were involved in generating the data.

  • Sample Set Accession Number - sndXXXXX

    • A sample set accession number is used as an identifier for a particular set of samples within a dataset. Often it corresponds to a set of samples run on a particular platform in a particular batch. More than one sample set can be included within a dataset if different assays or platforms were used to generate the data in the dataset.

A dataset may not have any sample sets if it does not have any sample-level data (i.e. all files are aggregate-level data or summary statistics).

  • Fileset Accession Number - fsaXXXXXX

    • A fileset accession number is a way to group related files together, and can be used to:

      • Differentiate between files that are part of different experiments or analysis in the same dataset.

      • Differentiate between public and private access files.

    • Use the fileset accession number to narrow down the files you are looking for in the DSS.

What Can I Find on Each Accession Page?

Each accession type, excluding fileset, corresponds to its own summary page that will give information about the dataset, sample set, or study. See a summary below of the type of information that can be found on each page type.

Dataset Page
  • A description of data, including the sample set and filesets associated with the dataset as well as:

    • Number of subjects and links to demographic information, if available

    • A summary of changes made in each data release, if applicable

    • Related studies, which will link you to the study page

    • Cohorts, which will link you to the cohort page

    • Research use limitations for the participants used in the dataset

    • Acknowledgment, which should be used when publishing work derived from the dataset. If multiple studies are associated with the dataset, each studies’ acknowledgment is collated on the dataset acknowledgment page.

    • Related publications, if applicable

    • Approved data access requests by other researchers

    • Release notes for the current version, which details changes to the dataset, the dataset description, the list of all associated accession numbers, subject count and consent levels

    • File manifest for the current version, which details the files available within each fileset and their consent levels. File set accession numbers can be found here and can be used to identify files

    • Digital Object Identifier (DOI)

Study Page
  • A description of the study and link to related datasets and sample sets

    • Principal Investigator (PI) information

    • Grant funding

    • Acknowledgement for the study

    • Cohorts, which will link you to the cohort page

    • Related publications

Sample Set Page
  • A description of the subjects included in the sample set and links to the related studies and datasets

    • Links to cohort/s pages

    • A breakdown of the sex distribution in the sample set

    • A breakdown of the diagnosis status in the sample set (disease type, case and control)

    • A breakdown of demographics

    • A breakdown of APOE genotypes

How Do I Find an Accession Number?

Below are screenshots and instructions on how to find the different accession numbers.

All accession numbers associated with a dataset can always be found in the dataset release notes.

image-20240222-155817.png

Click the link to download the release notes.

image-20240222-160218.png

Open the release note file, then scroll down in the release notes to the “Dataset Accession Numbers Available in NGXXXXX.vX” to see all accession numbers associated with the dataset.

Dataset

Dataset accession numbers can be found in the following places:

All Dataset Accession Numbers

To see a full list of datasets, visit Available Datasets | NIAGADS

image-20240222-181135.png

All datasets will be transitioned to the DSS database. To see all datasets in the DSS database, visit Datasets Archive - DSS NIAGADS

image-20240301-165545.png
On a Dataset Page

At the top of the page

image-20240222-181318.png
On a Sample Set Page

In the metadata on the left-hand side of the page

image-20240222-181603.png
On a Study Page
image-20240222-181925.png
On a Cohort Page

On the left-hand side of the page

image-20240222-182450.png
In Release Notes
image-20240222-155817.png

Click the link to download the release notes.

image-20240222-160218.png

Open the release note file, then scroll down in the release notes to the “Dataset Accession Numbers Available in NGXXXXX.vX” to see all accession numbers associated with the dataset.

Study

Study accession numbers can be found in the following places:

All study accession number

Can be found by visiting Studies Archive - DSS NIAGADS

image-20240222-184603.png
On a Dataset Page
image-20240222-184804.png
On a Study Page
image-20240222-185005.png
In Release Notes
image-20240222-155817.png

Click the link to download the release notes.

image-20240222-160218.png

Open the release note file, then scroll down in the release notes to the “Dataset Accession Numbers Available in NGXXXXX.vX” to see all accession numbers associated with the dataset.

Sample

Sample set accession numbers can be found in the following places:

On a Dataset Page
image-20240222-185823.png

OR

image-20240222-185936.png
On a Study Page
image-20240222-203232.png
On a Sample Set Page
image-20240222-203704.png
On a Cohort Page
image-20240222-203521.png
In Release Notes
image-20240222-155817.png

Click the link to download the release notes.

image-20240222-160218.png

Open the release note file, then scroll down in the release notes to the “Dataset Accession Numbers Available in NGXXXXX.vX” to see all accession numbers associated with the dataset.

Fileset

Fileset accession numbers can be found in the following places:

On a Dataset Page
image-20240223-220846.png
In Release Notes
image-20240222-155817.png

Click the link to download the release notes.

image-20240222-160218.png

Open the release note file, then scroll down in the release notes to the “Dataset Accession Numbers Available in NGXXXXX.vX” to see all accession numbers associated with the dataset.

In the File Manifest
image-20240223-221110.png

Click the link to download the file manifest. Open the file manifest in excel.

image-20240223-221232.png

How Do I Use Accession Numbers in the DSS?

Once you have identified an accession number you are interested in downloading the data for, the accession number can then be used in the different tabs of the DSS to help you narrow down and select the files you are interested in downloading. The screenshots below show how to use the different accession numbers in the subject information, gVCF, CRAMs, phenotype files, genotype files, and miscellaneous files tabs.

Subject Information Tab

image-20240326-163844.png

Single Sample Tab

Contains individual level files (CRAMs, CRAM Indexes, VCF, gVCF, VCF indexes, gVCF indexes).

image-20240326-164247.png

Multi-Sample and Metadata Tab

Contains files that cover multiple samples or metadata for datasets (pVCF companion files, QC metrics, phenotype files, genotype files, WES target regions, ect).

image-20240326-164611.png

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.