ASHG 2021 - Structural variant discovery from long-read sequencing data on the cloud with Galaxy in Terra
Description
In this workshop, we will guide you through an end-to-end SV identification journey using Galaxy, a platform designed to facilitate access to computational methods for researchers without a programming background. Specifically, we will use Galaxy in Terra, in the context of the NHGRI Genomic Data Science Analysis, Visualization and Informatics Lab-space (AnVIL). This cloud-based environment enables you to securely analyze large genomic datasets with familiar tools and reproducible workflows.
Through live demonstrations and interactive exercises, you will learn how to:
- Bring data into a project workspace in Terra
- Combine data (your own or controlled-access) with an open-access dataset
- Launch a Galaxy instance in Terra and run a complete workflow to identify SVs
- Visualize results and identify potentially pathogenic variants
The skills you will learn in this workshop will extend to other scientific use cases, datasets and tools beyond the examples shown.
Background
Growing evidence that structural variants (SVs) are responsible for many types of diseases and traits is fueling interest in taking a fresh look at different disease types using long-read sequencing. Although short-read technologies have long been cheaper and more readily available, long-read sequencing produces data that can yield significantly more accurate results for identifying SVs.
However, the large amounts of data and complexity of the computational methods involved can make it difficult for newcomers to access this exciting area of research, particularly in the context of the traditional computing environments that are provided by default to academic researchers.
Audience
Researchers and clinicians interested in exploring SV calling with long-read sequencing data. This workshop will also appeal to anyone more broadly interested in practical ways to access and analyze data in the cloud - with or without advanced computing training.
Prerequisites
Attendees should have basic familiarity with genomics terminology and standard high-throughput sequencing data formats.
Agenda
12:00 pm - 12:05 pm | Welcome and overview of workshop agenda and logistics
View the presentation (slides).
12:05 pm - 12:15 pm | Overview of structural variants and methods for SV discovery
12:15 pm - 12:25 pm | AnVIL Powered by Terra Overview
An overview of the AnVIL Powered by Terra platform and its capabilities, focusing on the Cloud Environment and its architecture.
12:25 pm - 12:55 pm | Hands-on with workspaces and data in AnVIL Powered by Terra
Cloning a Terra workspace for downstream analysis in Galaxy.
12:55 pm - 01:25 pm | Structural variant discovery with Galaxy in AnVIL Powered by Terra
Walk through an end-to-end SV identification workflow in Galaxy, then visualize the results and identify potentially pathogenic variants.
Step-by-step written instructions (doc)
Step-by-step screenshot instructions (slides)
01:25 pm - 01:30 pm | Closing Remarks, Q&A
Questions? Ask us any time on the AnVIL Support Forum!
Registration and Costs
More Info
https://www.ashg.org/meetings/2021meeting/
Contact
Annual Meeting, general inquiries: meetings@ashg.org