Center for Alzheimer's and Related Dementias (CARD) Data Release 2 on the AnVIL Platform

August 25, 2025

The second release of long-read DNA sequencing of brain tissue samples from the NIH Center for Alzheimer's and Related Dementias (CARD) Long Read Sequencing group is now available on AnVIL for controlled access through dgGaP. This data release includes 206 samples from a North American Brain Expression Consortium study (NABEC, dbGaP:phs001300) and 155 from the DIRP NIMH Human Brain Collection Core (HBCC:phs000979).

Access requests for these controlled datasets are available through dbGaP under accessions phs001300.v5.p1 and phs000979.v4.p2 and the data is available at https://explore.anvilproject.org/datasets or https://duos.org/datalibrary/anvil.

Nanopore sequencing data is provided in mapped BAM format that includes read phasing assignment and 5-methylcytosine (5mC) calls. In addition to sequencing data, this data release includes: haplotype-resolved assemblies, structural and small variant calls, as well as methylation calls for all 361 neurologically ‘normal’ prefrontal cortex ( and cortex ) brain tissue samples. All of these data were produced on Terra using Nanopore Analysis Pipeline (NAPU) workflows which can be found here, also see this publication for more information about the NAPU workflow.

CARD aims to build a framework for the large-scale application of Oxford Nanopore sequencing while filling knowledge gaps about genomic variation in Alzheimer’s disease and other neurological disorders. CARD makes its long-read sequencing data, protocols, and pipelines open access for other researchers. To this end, CARD has also provided a catalogue of haplotype-resolved small and structural variants and aggregated methylation for each cohort in this data release. These data are used to investigate how genomic variants are associated with gene expression and epigenetic modifications in brain tissue in this preprint Long-read sequencing of hundreds of diverse brains provides insight into the impact of structural variation on gene expression and DNA methylation.

Release 1Release 2
Brain Tissue Samples14361
Cell Lines30
Long-read phased genomes17361
Small variant callsets17361
Structural variant callsets17361
Methylation callsets haplotype-resolved17361

CARD is a collaboration between the National Institute of Neurological Disorders and Stroke (NINDS) and the National Institute on Aging (NIA). Learn more about CARD’s long-read sequencing efforts for AD/ADRD here and here.

The CARD long-read consortium includes researchers from National Institute on Aging, UC Santa Cruz Genomics Institute, National Cancer Institute, National Human Genome Research Institute, Baylor College of Medicine, Johns Hopkins University, University of Southern California and Northeastern University College of engineering.


Help us make these docs great!
All AnVIL docs are open source. See something that’s wrong or unclear? Submit a pull request.
Make a contribution
NHGRINIHHHSUSA.GOV
HelpPrivacy
v2.16.0-a835fa9