NHGRI Analysis Visualization and Informatics Lab-space



WGSPD Project 1: Whole Genome Sequencing for Schizophrenia and Bipolar Disorder

WGSPD1phs002041.v1.p1dbGapdbGap FHIR


The Center for Genomic Psychiatry at the University of Southern California (USC) and an extensive network of academic medical centers have created the Genomic Psychiatry Cohort (GPC). The GPC consists of a large clinical cohort of patients with schizophrenia, bipolar disorder, and healthy controls. The pilot phase of whole genome sequencing of the GPC has been done in collaboration with the USC and the Broad Institute of MIT and Harvard.

Whole genome sequencing (20X) and analysis of 9,033 well-phenotyped individuals from the Genomic Psychiatry Cohort (GPC), divided among schizophrenia cases, bipolar disorder cases, and psychiatrically normal controls, and comprised of European-Ancestry (EA), African-Ancestry (AA), and Latino individuals. The combination of clinically well-characterized GPC participants and whole genome sequencing with rich functional annotation aims to identify functional variants associated with schizophrenia and bipolar disorder risk. Over half of the sequenced samples are patients with a diagnosis of either schizophrenia or bipolar disorder. The majority of the study participants are of African American ancestry, increasing the diversity of sampling for these important diseases in traditionally understudied populations.

This data set also contains 545 samples that have undergone 10x Genomics Linked-Read Sequencing (169 participants overlapping with the 9,033 participant WGS 20X dataset). In addition to the benefits due to disease phenotyping and ancestry selection that apply to all of the WGSPD Project 1 samples, the 10x Genomics Linked-Read samples, through the use of barcodes that can identify the long input DNA fragment that each sequencing read was generated from, can be used to 1) identify structural variants that are difficult to call with standard short read data, 2) map short reads to repetitive regions of the genome that cannot be assayed with standard short reads, and provide highly accurate phasing information about genetic variants, including rare variants that are more difficult to phase using statistical phasing techniques.


177.36 TBSize


DiseasesSchizophrenia cases and controls, Schizophrenia and Bipolar Disorder cases and controls
AccessControlled Access
Study DesignCase-control
Data TypesWhole Genome

Applying For Access

dbGaP FAQdbGaP Access Request Video Tutorial

Terra Workspaces

This study has been divided into the following workspaces by consent codes and optionally the originating laboratory.

Terra Workspace NameConsent CodeDiseaseAccessStudy DesignData TypeSamplesParticipantsSize (TB)
AnVIL_NIMH_Broad_WGSPD1_McCarroll_Braff_DS_10XLRGenomesDS-SZRD-MDSSchizophrenia cases and controlsCase-controlWhole Genome18718711.59
AnVIL_NIMH_Broad_WGSPD1_McCarroll_Braff_DS_WGSDS-SZRD-MDSSchizophrenia cases and controlsCase-controlWhole Genome86486420.12
AnVIL_NIMH_Broad_WGSPD1_McCarroll_Escamilla_DS_WGSDS-MLHLTH-MDSSchizophrenia cases and controlsCase-controlWhole Genome85853.45
AnVIL_NIMH_Broad_WGSPD1_McCarroll_Pato_GRU_10XLRGenomesGRUSchizophrenia and Bipolar Disorder cases and co...Case-controlWhole Genome36835521.73
AnVIL_NIMH_Broad_WGSPD1_McCarroll_Pato_GRU_WGSGRUSchizophrenia and Bipolar Disorder cases and co...Case-controlWhole Genome8,0848,084120.47