Platform

AnVIL is a cloud based ecosystem that colocates high value data sets with commonly used bioinformatics tools in a secure environment.

By providing access to data and tools in a secure environment that can scale to meet the computational needs of researchers, AnVIL eliminates logistical concerns and allows researchers to focus on biological problems.

Providing access to such a platform with tens of thousands of human genome sequences will allow the genomics community to ask new questions that were not logistically feasible before.

Platform Components

AnVIL leverages the Terra computing platform to manage access and billing of cloud projects, launch analysis environments, and manage data.

Terra is integrated with the Dockstore workflow repository, enabling AnVIL users to utilize the library of published workflows in Dockstore.

Gen3 is leveraged by AnVIL as the data management system, enabling users to associate complex phenotypes, metadata, and clinical data with sequencing based data with sophisticated data models.

Terra

Terra is an analysis platform that allows users to access data, run analysis tools, and collaborate. Terra is powered by Google Cloud Platform, enabling the user to scale and manage billing of their own projects.

Gen3

Gen3 is a cloud-based software platform for managing, analyzing, harmonizing, and sharing large datasets. Gen3 is an open source platform for developing data commons.

Dockstore

Dockstore is an open platform used by the GA4GH for sharing Docker-based tools described with the Common Workflow Language (CWL), the Workflow Description Language (WDL), or Nextflow (NFL).

Improve this page