AnVIL Data

AnVIL hosts high-value genomic datasets relevant to human health and disease.

Current Consortia

A summary of the currently ingested data is listed below:

Consortium     Cohorts  Subjects  Samples    Files  Size (TB)
CCDG                68    61,821   61,821   81,107     911.34
CMG                 25     6,353    6,353   10,853      18.23
GTEx (v8)            1       979   17,382   41,844     157.64
1000 Genomes         1     2,504    2,504    2,504      41.29
eMERGE               1       277        1      277       5.52
Total               96    71,934   88,061  136,585    1134.03
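
The Total row above can be cross-checked by summing the per-consortium rows. The following is a minimal Python sketch, not part of any AnVIL tooling: the consortium labels are an assumption inferred from the workspace listing later on this page, and the names SUMMARY, LISTED_TOTAL, column_totals, and matches_listed_total are hypothetical.

```python
# Per-consortium rows from the summary table:
# (cohorts, subjects, samples, files, size_tb).
# Consortium labels are an assumption, inferred from the workspace listing.
SUMMARY = {
    "CCDG": (68, 61_821, 61_821, 81_107, 911.34),
    "CMG": (25, 6_353, 6_353, 10_853, 18.23),
    "GTEx (v8)": (1, 979, 17_382, 41_844, 157.64),
    "1000 Genomes": (1, 2_504, 2_504, 2_504, 41.29),
    "eMERGE": (1, 277, 1, 277, 5.52),
}

# The Total row exactly as listed on the page.
LISTED_TOTAL = (96, 71_934, 88_061, 136_585, 1134.03)


def column_totals(rows):
    """Sum each column across all consortium rows."""
    return tuple(sum(column) for column in zip(*rows.values()))


def matches_listed_total(rows, listed, size_tolerance_tb=0.05):
    """Compare counts exactly and the size within a small tolerance."""
    *counts, size = column_totals(rows)
    *listed_counts, listed_size = listed
    return counts == listed_counts and abs(size - listed_size) <= size_tolerance_tb


if __name__ == "__main__":
    print(column_totals(SUMMARY))
    print(matches_listed_total(SUMMARY, LISTED_TOTAL))
```

With the figures listed here, the count columns add up exactly, while the summed size comes to 1134.02 TB against the listed 1134.03 TB. That gap is presumably due to per-row rounding, which is why the sketch compares sizes within a tolerance rather than exactly.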

Planned Consortia

The following consortia are planned for data ingestion. Additional consortia are under consideration and will be listed as they are approved.

  • Covid19hg - COVID-19 Host Genetics Initiative
  • CSER - Clinical Sequencing Evidence-Generating Research
  • GTExv9 - Genotype-Tissue Expression Project
  • HPP - Human Pangenome Project
  • NIA - National Institute on Aging
  • NIMH - National Institute of Mental Health

Accessing AnVIL Data via Pre-Built Terra Workspaces

Currently, AnVIL data can be accessed through the pre-built Terra workspaces listed below. In the near future, the Gen3 Data Explorer will allow for selection of datasets and the creation of virtual cohorts that can then be analyzed in a user's personal or shared Terra workspace.

To access data:

  1. Create an account in AnVIL Terra and link your eRA Commons / NIH account.
  2. Request access to the study through dbGaP's Authorized Access Portal, using the study's phsId. See NCBI's Starting Point to Applying for dbGaP Data for more information.
  3. Once your access is approved, your data will appear as one or more workspaces in your Terra account.

Please see the Requesting Data Access and Once Your Access is Granted guides for more details.

Pre-Built Terra Workspaces

Pre-built Terra workspaces are listed below. Please note that:

  • For workspaces with no dbGaP Id, the dbGaP study registration is in progress. The dbGaP Ids will be displayed once assigned by dbGaP.
  • For workspaces where the dbGaP Id is not selectable, the dbGaP study registration is also in progress: the dbGaP Id has been assigned, but the study is not yet listed in dbGaP.

Consortium    Terra Workspace Name  dbGaP Id   Data Type  Access   Subjects  Samples   Files  Size (TB)
CCDG                                phs001624  WGS        Private     1,122    1,122   1,122      26.79
CCDG                                           WGS        Private       639      639     639      13.92
CCDG                                --         WGS        Private     1,201    1,201   1,201      26.23
CCDG                                           WGS        Private     3,944    3,944   3,944     102.66
CCDG                                --         WGS        Private     1,358    1,358   1,358      27.44
CCDG                                phs001642  WGS        Private       177      177     354       2.28
CCDG                                phs001642  WGS        Private       904      904   1,808      11.94
CCDG                                phs001642  WGS        Private       253      253     506       4.08
CCDG                                phs001642  WGS        Private     1,348    1,348   2,696      16.54
CCDG                                phs001642  WGS        Private     1,087    1,087   2,174      12.62
CCDG                                phs001642  WGS        Private     1,565    1,565   3,130      21.61
CCDG                                phs001642  WGS        Private        25       25      50       0.29
CCDG                                phs001543  WGS        Private       248      248     496       4.93
CCDG                                phs001600  WGS        Private       123      123     246       2.56
CCDG                                phs001547  WGS        Private        90       90     180       1.87
CCDG                                phs001545  WGS        Private       464      464     928       9.07
CCDG                                           WGS        Private       290      290     580       5.45
CCDG                                           WGS        Private       105      105     210       2.32
CCDG                                phs001544  WGS        Private       118      118     236       2.55
CCDG                                phs001601  WGS        Private       418      418     836       8.56
CCDG                                --         WGS        Private       112      112     224       2.41
CCDG                                           WGS        Private         2        2       4       0.04
CCDG                                phs001569  NA         Private     1,136    1,136   2,272      15.58
CCDG                                           WGS        Private       773      773   1,546      14.48
CCDG                                           WGS        Private     2,158    2,158   4,318      37.67
CCDG                                           WGS        Private       496      496     992       6.17
CCDG                                --         WGS        Private       534      534   1,068       8.50
CCDG                                --         WES        Private       356      356     712       0.55
CCDG                                --         WES        Private       380      380     760       0.63
CCDG                                --         WES        Private       124      124     248       0.21
CCDG                                --         WES        Private        47       47      94       0.04
CCDG                                --         WES        Private       148      148     296       0.26
CCDG                                --         WES        Private        16       16      32       0.03
CCDG                                --         WES        Private     2,337    2,337   4,674       3.60
CCDG                                --         WES        Private       194      194     388       0.36
CCDG                                --         WES        Private        90       90     180       0.15
CCDG                                --         WES        Private       112      112     224       0.19
CCDG                                --         WES        Private       787      787   1,574       1.26
CCDG                                --         WES        Private        92       92     184       0.16
CCDG                                --         WES        Private       235      235     470       0.32
CCDG                                --         WES        Private        36       36      72       0.06
CCDG                                --         WES        Private       870      870   1,740       1.46
CCDG                                --         WES        Private       244      244     488       0.40
CCDG                                --         WES        Private       790      790   1,580       1.23
CCDG                                           WGS        Private     1,171    1,171   1,171      28.15
CCDG                                           WGS        Private     1,177    1,177   1,177      27.93
CCDG                                           WGS        Private     1,049    1,049   1,049      24.49
CCDG                                           WGS        Private       148      148     148       3.65
CCDG                                --         WGS        Private        69       69      69       1.56
CCDG                                --         WGS        Private       426      426     426      10.33
CCDG                                phs001766  WGS        Private     4,601    4,601   4,601     106.07
CCDG                                phs001894  WGS        Private       724      724     724      17.22
CCDG                                           WGS        Private       580      580     580      13.91
CCDG                                phs001676  WGS        Private     9,201    9,201   9,201       0.00
CCDG                                           WGS        Private       905      905     905      22.82
CCDG                                phs001222  WGS        Private     3,166    3,166   3,166      53.46
CCDG                                phs001155  WGS        Private       647      647     647       9.86
CCDG                                phs001624  WGS        Private       624      624     624      11.95
CCDG                                           WGS        Private       348      348     348       6.89
CCDG                                phs001506  WGS        Private     1,051    1,051   1,051      21.83
CCDG                                phs001913  WGS        Private       277      277     277       5.52
CCDG                                           WGS        Private       429      429     429       8.77
CCDG                                --         WGS        Private       155      155     155       3.09
CCDG                                --         WGS        Private     2,790    2,790   2,790      39.81
CCDG                                phs001579  WGS        Private     3,121    3,121   3,121      64.70
CCDG                                           WGS        Private     1,356    1,356   1,356      27.00
CCDG                                           WGS        Private       146      146     146       2.84
CCDG                                --         WGS        Private       112      112     112       0.00
CMG                                            WGS        Private       277      277     554       0.52
CMG                                            --         Private     1,181    1,181   2,362       3.06
CMG                                            WGS        Private       767      767   1,534       1.69
CMG                                            WGS        Private       602      602   1,204       1.50
CMG                                            WGS        Private        35       35      70       0.64
CMG                                            WGS        Private        79       79     158       1.26
CMG                                            WGS        Private       129      129     258       0.22
CMG                                            WGS        Private        10       10      20       0.02
CMG                                            WGS        Private     1,195    1,195   2,444       2.41
CMG                                            WGS        Private        27       27      54       0.52
CMG                                            WGS        Private       109      109     218       0.19
CMG                                            WGS        Private       253      253      48       0.04
CMG                                            --         Private        79       79      64       0.56
CMG                                            WGS        Private        49       49      98       0.10
CMG                                            WGS        Private         2        2       4       0.00
CMG                                            WGS        Private        44       44      88       0.12
CMG                                            WGS        Private        10       10      20       0.19
CMG                                            WGS        Private        73       73     146       0.11
CMG                                            WGS        Private        35       35      70       0.05
CMG                                            WES        Private       723      723      --       0.00
CMG                                            WGS        Private        31       31      62       0.08
CMG                                            WGS        Private       115      115     255       2.05
CMG                                            WGS        Private       419      419     904       1.03
CMG                                            WGS        Private        43       43      86       0.76
CMG                                            WGS        Private        66       66     132       1.10
eMERGE                              phs001913  WGS        Private        --        1     277       5.52
GTEx (v8)                                      --         Private       979   17,382  41,844     157.64
1000 Genomes                        --         WGS, VCF   Public         --    2,504   2,504      41.29