NHGRI Analysis Visualization and Informatics Lab-space



AnVIL at Virtual Applied Data Science Institute 2023

Wednesday, April 5, 2023 11:00 AM to 2:00 PM EDT
Thursday, April 6, 2023 11:00 AM to 2:00 PM EDT

AnVIL Module: Tools for Applied Data Science Using Cloud-Based Platforms

AnVIL is a cloud-based platform that supports the management, analysis and sharing of biomedical data for the NHGRI research community. It aims to advance our basic understanding of the genetic basis of complex traits and accelerate discovery and development of therapies, diagnostic tests, and other technologies for diseases like cardiovascular disease or autism spectrum disorders. The platform currently hosts more than 75,000 whole human genome data sets, and offers a variety of analysis capabilities including: Terra for large scale batch computing and interactive computing; Gen3 for managing, analyzing, harmonizing, and sharing large datasets; DockStore for sharing Docker-based analysis workflows; Jupyter notebooks for organizing live code, equations, visualizations and narrative text into a single document; R Studio for interactive machine learning, statistical computing, and visualizations; Bioconductor for community-driven interactive genomics with R; and Galaxy, for accessible, reproducible, and transparent genomic science. In this module, you will be introduced to the platform, tools and functionality for data science projects.


The Howard University Research Centers in Minority Institutions, the AIM-AHEAD program, and the Public Health Informatics Technology for DC (PHIT4DC) program is pleased to announce VADSTI 2.0, Spring 2023 Training Series to the Howard University community of researchers and beyond. The goal is to enhance data science capability and application by providing training in the foundations of programming and critical data analytic skills for planning and conducting research involving big data pertinent to minority health and health equities. The Spring Training Series is project-based and will cover topics including Foundations of Data Science, Python, Data Preparation, Exploration and Visualization, and Cloud Computing, among others.


Technological advancements and efficient use of computational tools have made it possible to generate and store large amounts of heterogeneous and complex datasets in many disciplines, including public health, clinical, biomedical, and genomics. There is therefore increased demand for data analytics capabilities to look at trends, predict outcomes, and make better clinical and health policy decisions. Skill sets in data science are particularly critical for advancing the science of minority health and health disparities.


Register for the AnVIL module at Eventbrite.

No prerequisite for research knowledge topics. Basic undergraduate knowledge of algebra and probability recommended for content knowledge topics.

Event Details

AnVIL Demos: Using the Human Pangenome on AnVILAnVIL Demos: Introduction to AnVIL
Improve this pageContent guide