AnVIL Portal

Step 1 - Register Study/Obtain Approvals

NIH/NHGRI Data Sharing Policy Compliance

Institutions are responsible for assuring, through an Institutional Certification, that plans for the submission of large-scale human genomic data meet the expectations of the Genomic Data Sharing Policy.

To deposit large-scale, individual-level data into AnVIL, data submitters must follow the steps below.

1.1. Obtain approval
1.2. Register study
1.3. Apply to AnVIL

1.1. Obtain approval

Working from an NIH grant?

If your Funding Opportunity Announcement (FOA) explicitly states that grantees are expected to share data in AnVIL, approval is covered by default.

NHGRI-funded researcher?

For other work, including grants supported by Parent FOAs that do not name a specific repository for data deposition, discuss submission to AnVIL with your Program Director (PD). If you are an Intramural Researcher, discuss it with your Scientific Director (SD). If the PD or SD agrees to the submission, work with the NHGRI Genomic Program Administrator (GPA) to determine the appropriate place to register your study (see more on registration below).

NIH-funded researcher?

If your research is funded by another NIH Institute (not NHGRI), submit a request for AnVIL to host your dataset by completing the AnVIL Dataset Onboarding Application form. Once AnVIL has agreed to host your study, contact your NIH Institute's GPA (if you don't know your GPA, refer to this list) to determine the appropriate place to register your study (see more on registration below).

Not working from an NIH grant?

If your research is not funded by the NIH, identify the NIH Institute whose mission most closely aligns with your research and that is willing to sponsor your data submission.

At the same time, submit a request for AnVIL to host your dataset by completing the AnVIL Dataset Onboarding Application form. Once the Institute has agreed to sponsor your submission and AnVIL has agreed to host your study, work with the GPA of the sponsoring NIH Institute to determine the appropriate place to register your study (see more on registration below).
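
The four funding situations above amount to a small decision table. The sketch below restates them in Python as a checklist helper. The names `Funding` and `approval_steps` are hypothetical and exist only for illustration; they are not part of any AnVIL tool, and the step wording paraphrases this page.

```python
from enum import Enum


class Funding(Enum):
    """Funding situations from step 1.1 (illustrative names, not an AnVIL API)."""
    FOA_NAMES_ANVIL = "NIH grant whose FOA states that data is shared in AnVIL"
    NHGRI_OTHER = "other NHGRI-funded work, e.g. under a Parent FOA"
    OTHER_NIH_INSTITUTE = "funded by an NIH Institute other than NHGRI"
    NOT_NIH = "not funded by the NIH"


def approval_steps(funding: Funding) -> list[str]:
    """Return the approval actions for a funding situation, in order.

    An empty list means no extra approval step is needed.
    """
    onboarding = "Complete the AnVIL Dataset Onboarding Application form"
    if funding is Funding.FOA_NAMES_ANVIL:
        return []  # covered by default through the FOA
    if funding is Funding.NHGRI_OTHER:
        return [
            "Discuss submission with your Program Director "
            "(or Scientific Director if intramural)",
            "Work with the NHGRI GPA to decide where to register the study",
        ]
    if funding is Funding.OTHER_NIH_INSTITUTE:
        return [
            onboarding,
            "Once AnVIL agrees to host, contact your NIH Institute's GPA "
            "to decide where to register the study",
        ]
    # Remaining case: Funding.NOT_NIH
    return [
        "Identify an NIH Institute willing to sponsor the submission",
        onboarding,
        "Once both agree, work with the sponsoring Institute's GPA "
        "to decide where to register the study",
    ]


# Print the checklist for a submitter without NIH funding.
for step in approval_steps(Funding.NOT_NIH):
    print("-", step)
```

In every case except the first, the path ends with a GPA deciding where the study is registered, which leads into step 1.2.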

1.2. Register study

Where you register your data and who will advise you of next steps depend on what kind of data you are generating and how your study is funded.

Where to register different data types.

Generating human data? Register with dbGaP

See this link for more details about dbGaP and how to register your study data.

NHGRI-funded researcher?

NIH-funded researcher?

  • Contact your Institute or Center's GPA (if you don't know your GPA, refer to this list).
  • The GPA will request information from you to register your study.
    • Ask the GPA to select "AnVIL" for the External Data Repository field during registration.

Non-NIH-funded?

  • Work with the GPA of your sponsoring NIH Institute.
  • The GPA will lead you through the study registration process.
    • Ask the GPA to select "AnVIL" for the External Data Repository field during registration.

Non-human data?

If you are working with non-human data, you will need to register with an NCBI repository such as GEO or SRA.
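
As a rough summary of step 1.2, the registration choice depends on two facts: whether the data is human, and whether the study is NIH-funded. The sketch below encodes that in a hypothetical `registration_plan` helper. The function name and dictionary keys are assumptions made for illustration, and the NHGRI-funded case is not modeled separately.

```python
def registration_plan(human_data: bool, nih_funded: bool) -> dict[str, str]:
    """Summarize where to register and whom to contact (illustrative only)."""
    if not human_data:
        # Non-human data goes to an NCBI repository rather than dbGaP.
        return {"register_with": "an NCBI repository such as GEO or SRA"}
    contact = (
        "your Institute or Center's GPA"
        if nih_funded
        else "the GPA of your sponsoring NIH Institute"
    )
    return {
        "register_with": "dbGaP",
        "contact": contact,
        # Value the GPA is asked to select during registration.
        "external_data_repository": "AnVIL",
    }


plan = registration_plan(human_data=True, nih_funded=False)
print(plan["register_with"], "via", plan["contact"])
```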

1.3. Apply to AnVIL

Prospective AnVIL data submitters should complete the AnVIL Dataset Onboarding Application for review by the AnVIL leadership committee.

Before you apply

Note that you will need the phsID from dbGaP (step 1.2 above) and your Access Restrictions (i.e., Data Use Limitations/consent groups) to complete the application.

Access Restrictions

Known Data Use Limitations (DUL), the list of requirements for gaining access to and using the data, must be clearly defined by the data depositor. This is done via the NIH Institutional Certification form, which is signed by the submitter's institutional official and provided directly to the Genomic Program Administrator for the IC (i.e., NHGRI).

You should include your DULs as part of the AnVIL onboarding application. Consent information, along with other information listed in this spreadsheet, is documented in DUOS. Once your data is ingested into TDR, DUOS will handle all protocols for gaining access.
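
Before opening the application, it can help to confirm that both prerequisites named under "Before you apply" are in hand. The following is a minimal sketch of such a check. `OnboardingDraft` and `missing_items` are hypothetical names, the `phs` + six digits pattern (with an optional version suffix) is an assumption about how dbGaP accessions usually look, and `"GRU"` is only an example consent-group label.

```python
import re
from dataclasses import dataclass, field

# Assumed accession shape: "phs" + six digits, optionally ".v1" or ".v1.p1".
PHS_PATTERN = re.compile(r"phs\d{6}(\.v\d+(\.p\d+)?)?")


@dataclass
class OnboardingDraft:
    """Hypothetical container for the two prerequisites of step 1.3."""
    phs_id: str = ""
    data_use_limitations: list[str] = field(default_factory=list)


def missing_items(draft: OnboardingDraft) -> list[str]:
    """List prerequisites that still need to be collected."""
    missing = []
    if not PHS_PATTERN.fullmatch(draft.phs_id):
        missing.append("phsID from dbGaP (step 1.2)")
    if not draft.data_use_limitations:
        missing.append("Access Restrictions (Data Use Limitations/consent groups)")
    return missing


draft = OnboardingDraft(phs_id="phs000123", data_use_limitations=["GRU"])
print(missing_items(draft) or "Ready to apply")
```

This checks only that the items are present and plausibly formatted. It does not validate them against dbGaP or DUOS.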

