NHGRI Analysis Visualizationand Informatics Lab-space


IntroductionData AnalystsInvestigatorsData Submitters

Data Analysts - Guides and Tutorials

This section lists guides, tutorials, and other resources to help data analysts find and process data and share results in the AnVIL cloud.

Bioconductor / RStudio

2021 Bioconductor Popup Workshops

Recordings and related materials for the 2021 Bioconductor popup workshops are listed below:

  • Week 1: Using R / Bioconductor in AnVIL - An introduction to the AnVIL cloud computing environment. We learn how to create a Google account to use in AnVIL. We explore key concepts related to workspaces and billing projects. We explore creating a Jupyter notebooks-based cloud environment, and an RStudio cloud environment.
  • Week 2: The R / Bioconductor AnVIL Package - An exploration of how workspaces provide a framework for managing data and large-scale analyses using the HCA Optimus Pipeline and 1000G-high-coverage-2019 workspaces and R using the AnVIL package.
  • Week 3: Running a Workflow - How to configure and run a workflow, based on the Bioconductor-Workflow-DESeq2 workspace. The workflow starts with FASTQ files and transforms them using salmon to the inputs required for Bioconductor DESeq2 analysis of differential expression.
  • Week 4: Single-cell RNASeq with 'Orchestrating Single Cell Analysis' in R / Bioconductor - An introduction to a resource, developed primarily by Aaron Lun of Genentech, Inc., that employs Bioconductor resources for many aspects of the analysis of single-cell RNA-seq data. The resource is a "computable book" written in R Markdown, published at
  • Week 5: Using AnVIL for Teaching R - A case study of using AnVIL to teach R for a Biostatistics course and provides essentials for using AnVIL for other instructional efforts.
  • Week 6: Reproducible Research with AnVILPublish - An exploration of elements of reproducible research with the AnVILPublish package. We will illustrate how to make a docker container tailored publishing AnVIL packages and then emphasize the merits of an R package structure for organizing research activities in a manner that emphasizes provenance and reproducibility.
  • Week 7: Participant Stories - Participants share their own use of AnVIL.


  • Getting Started with Galaxy - A step-by-step tutorial demonstrating how to compute quality metrics of unaligned reads, align the reads to a reference genome using bowtie2, plot a coverage histogram, call variants using FreeBayes, and then summarize the variant calls using bcftools.
  • Starting Galaxy on the AnVIL Platform - A video introduction to using Galaxy on the AnVIL platform.


Cross Platform Data Access with GA4GH DRS in TerraUsing R / Bioconductor in AnVIL
Improve this pageContent guide