StudyType	PubMedID	Author	Title	Journal	PublishDate	Chromosome	Disease	Technology	Species	CaseID	Platform	CNA	Connection	Gene	Affiliation	Abstract	GenomeAssembly	GEO	dbGaP	ENA	IsCancer	FusionGene
Research	28260531	Collins RL, Brand H, Redin CE, Hanscom C, Antolik C, Stone MR, Glessner JT, Mason T, Pregno G, Dorrani N, Mandrile G, Giachino D, Perrin D, Walsh C, Cipicchio M, Costello M, Stortchevoi A, An JY, Currall BB, Seabra CM, Ragavendran A, Margolin L, Martinez-Agosto JA, Lucente D, Levy B, Sanders SJ, Wapner RJ, Quintero-Rivera F, Kloosterman W, Talkowski ME	Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome	Genome Biol	2017 Mar	1,12,14,15	Autism spectrum disorder	Next Generation Sequencing	Homo sapiens	TL010	Illumina HiSeq 2000 /2500 	chr14:0-77704851:0;chr14:77704852-77739092:-1;chr14:77739093-107349540:0;chr6:0-99267814:0;chr6:102567331-102657383:-1;chr6:102657384-103737414:0;chr6:103737415-103763292:-1;chr6:103763293-116305817:0;chr6:116305818-116423965:-1;chr6:116423966-171115067:0;chr6:99267815-99840246:-1;chr6:99840247-102567330:0		LOC101927314;ASCC3;NT5DC1;LOC643770;NUMB;SLC41A2;ELMSAN1;DLST;RAD51B;ESRRB;COQ3;PNISR;AMD1;LAMA4;RGS6;TMEM63C;POMT2;SV2B;DNAL1;LAMA2;NCOA7;GPHN;FRK;ACOT2;POU3F2;FBXL4;FAXC;NGB	Molecular Neurogenetics Unit and Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, and Department of Neurology, Massachusetts General Hospital, Boston, MA, 02114, USA	BACKGROUND: Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. RESULTS: We sequenced 689 participants with autism spectrum disorder (ASD) and other developmental abnormalities to construct a genome-wide map of large SV. Using long-insert jumping libraries at 105X mean physical coverage and linked-read whole-genome sequencing from 10X Genomics, we document seven major SV classes at ~5 kb SV resolution. Our results encompass 11,735 distinct large SV sites, 38.1% of which are novel and 16.8% of which are balanced or complex. We characterize 16 recurrent subclasses of complex SV (cxSV), revealing that: (1) cxSV are larger and rarer than canonical SV; (2) each genome harbors 14 large cxSV on average; (3) 84.4% of large cxSVs involve inversion; and (4) most large cxSV (93.8%) have not been delineated in previous studies. Rare SVs are more likely to disrupt coding and regulatory non-coding loci, particularly when truncating constrained and disease-associated genes. We also identify multiple cases of catastrophic chromosomal rearrangements known as chromoanagenesis, including somatic chromoanasynthesis, and extreme balanced germline chromothripsis events involving up to 65 breakpoints and 60.6 Mb across four chromosomes, further defining rare categories of extreme cxSV. CONCLUSIONS: These data provide a foundational map of large SV in the morbid human genome and demonstrate a previously underappreciated abundance and diversity of cxSV that should be considered in genomic studies of human disease.	GRCh37/hg19				No	
Research	28260531	Collins RL, Brand H, Redin CE, Hanscom C, Antolik C, Stone MR, Glessner JT, Mason T, Pregno G, Dorrani N, Mandrile G, Giachino D, Perrin D, Walsh C, Cipicchio M, Costello M, Stortchevoi A, An JY, Currall BB, Seabra CM, Ragavendran A, Margolin L, Martinez-Agosto JA, Lucente D, Levy B, Sanders SJ, Wapner RJ, Quintero-Rivera F, Kloosterman W, Talkowski ME	Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome	Genome Biol	2017 Mar	2,7,8,9	Autism spectrum disorder	Next Generation Sequencing	Homo sapiens	UTR22	Illumina HiSeq 2000 /2500 	chr2:0-167032587:0;chr2:167032588-167393745:-1;chr2:167393746-243199373:0		AC0101273;WIPF1;XIRP2;ZNF385B;ACO1;NSMAF;FRMPD1;KIAA1715;TOX;UBAP2;SCN9A;SCN7A;LOC101929680	Molecular Neurogenetics Unit and Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, and Department of Neurology, Massachusetts General Hospital, Boston, MA, 02114, USA	BACKGROUND: Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. RESULTS: We sequenced 689 participants with autism spectrum disorder (ASD) and other developmental abnormalities to construct a genome-wide map of large SV. Using long-insert jumping libraries at 105X mean physical coverage and linked-read whole-genome sequencing from 10X Genomics, we document seven major SV classes at ~5 kb SV resolution. Our results encompass 11,735 distinct large SV sites, 38.1% of which are novel and 16.8% of which are balanced or complex. We characterize 16 recurrent subclasses of complex SV (cxSV), revealing that: (1) cxSV are larger and rarer than canonical SV; (2) each genome harbors 14 large cxSV on average; (3) 84.4% of large cxSVs involve inversion; and (4) most large cxSV (93.8%) have not been delineated in previous studies. Rare SVs are more likely to disrupt coding and regulatory non-coding loci, particularly when truncating constrained and disease-associated genes. We also identify multiple cases of catastrophic chromosomal rearrangements known as chromoanagenesis, including somatic chromoanasynthesis, and extreme balanced germline chromothripsis events involving up to 65 breakpoints and 60.6 Mb across four chromosomes, further defining rare categories of extreme cxSV. CONCLUSIONS: These data provide a foundational map of large SV in the morbid human genome and demonstrate a previously underappreciated abundance and diversity of cxSV that should be considered in genomic studies of human disease.	GRCh37/hg19				No	
Research	28260531	Collins RL, Brand H, Redin CE, Hanscom C, Antolik C, Stone MR, Glessner JT, Mason T, Pregno G, Dorrani N, Mandrile G, Giachino D, Perrin D, Walsh C, Cipicchio M, Costello M, Stortchevoi A, An JY, Currall BB, Seabra CM, Ragavendran A, Margolin L, Martinez-Agosto JA, Lucente D, Levy B, Sanders SJ, Wapner RJ, Quintero-Rivera F, Kloosterman W, Talkowski ME	Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome	Genome Biol	2017 Mar	19	Autism spectrum disorder	Next Generation Sequencing	Homo sapiens	TL009	Illumina HiSeq 2000 /2500 	chr19:0-16868159:0;chr19:16868160-22316097:-1;chr19:22316098-22477874:0;chr19:22477875-23580214:-1;chr19:23580215-29843543:0;chr19:29843544-30967207:-1;chr19:30967208-32841723:0;chr19:32841724-33908877:-1;chr19:33908878-35162171:0;chr19:35162172-35774420:-1;chr19:35774421-38262470:0;chr19:38262471-44025989:-1;chr19:44025990-51937752:0;chr19:51937753-52788019:-1;chr19:52788020-53006099:0;chr19:53006100-56225088:-1;chr19:56225089-58145137:0;chr19:58145138-58410428:1;chr19:58410429-59128983:0		ABHD8;ANKLE1;ANO8;ARMC6;ARRDC2;ATP13A1;B3GNT3;BABAM1;BST2;C19orf60;CCDC124;CERS1;CILP2;COLGALT1;COMP;COPE;CPAMD8;CRLF1;CRTC1;DDA1;DDX49;ELL;F2RL3;FAM129C;FCHO1;FKBP8;GATAD2A;GDF1;GDF15;GMIP;GTPBP3;HAPLN4;HAUS8;HOMER3;IFI30;IL12RB1;INSL3;ISYNA1;JAK3;JUND;KCNN1;KIAA1683;KLHL26;KXD1;LINC00663;LINC00664;LOC641367;LOC729966;LPAR2;LRRC25;LSM4;MAP1S;MAST3;MAU2;MEF2B;MEF2BNB;MEF2BNB-MEF2B;MIR1270-1;MIR1270-2;MIR3188;MIR3189;MPV17L2;MRPL34;MVB12A;MYO9B;NCAN;NDUFA13;NR2C2AP;NR2F6;NXNL1;OCEL1;PBX4;PDE4C;PGLS;PGPEP1;PIK3R2;PLVAP;RAB3A;RFXANK;RPL18A;SIN3B;SLC25A42;SLC27A1;SLC5A5;SNORA68;SSBP4;SUGP1;SUGP2;TM6SF2;TMEM161A;TMEM221;TMEM59L;TSSK6;UBA52;UNC13A;UPF1;USE1;USHBP1;YJEFN3;ZNF100;ZNF101;ZNF14;ZNF208;ZNF253;ZNF257;ZNF429;ZNF43;ZNF430;ZNF431;ZNF486;ZNF493;ZNF506;ZNF626;ZNF682;ZNF708;ZNF714;ZNF737;ZNF738;ZNF826P;ZNF85;ZNF90;ZNF93;IPO5P1;LINC01233;LOC100996349;LOC101929124;LOC101929144;LOC101929164;LOC440518;ZNF492;ZNF724P;ZNF728;ZNF730;ZNF91;ZNF98;ZNF99;C19orf12;CCNE1;PLEKHF1;POP4;URI1;VSTM2B;AC007773.2;ANKRD27;C19orf40;CEBPA;CEBPA-AS1;CEBPG;CEP89;DPY19L3;GPATCH1;LRP3;NUDT19;PDCD5;RGS9BP;RHPN2;SLC7A10;SLC7A9;TDRD12;WDR88;FAM187B;FXYD1;FXYD3;FXYD5;FXYD7;GRAMD1A;HPN;HPN-AS1;LGI4;LINC00904;LOC102723513;LOC400685;LSR;MIR6887;SCN1B;USF2;ZNF181;ZNF30;ZNF302;ZNF599;ZNF792;AC004603.4;AC006129.4;AC016582.2;ACTN4;ADCK4;AKT2;ARHGEF1;ATP1A3;ATP5SL;AXL;B3GNT8;B9D2;BCKDHA;BLVRB;C19orf33;C19orf47;C19orf54;CAPN12;CATSPERG;CCDC97;CCER2;CD177;CD79A;CEACAM1;CEACAM21;CEACAM3;CEACAM4;CEACAM5;CEACAM6;CEACAM7;CEACAM8;CIC;CLC;CNFN;CNTD2;CXCL17;CYP2A13;CYP2A6;CYP2A7;CYP2B6;CYP2B7P;CYP2F1;CYP2G1P;CYP2S1;DEDD2;DLL3;DMRTC2;DPF1;DYRK1B;ECH1;EGLN2;EID2;EID2B;EIF3K;ERF;ERICH4;EXOSC5;FAM98C;FBL;FBXO17;FBXO27;FCGBP;GGN;GMFG;GRIK5;GSK3A;HIPK4;HNRNPL;HNRNPUL1;IFNL1;IFNL2;IFNL3;IFNL4;ITPKC;KCNK6;LEUTX;LGALS13;LGALS14;LGALS16;LGALS17A;LGALS4;LGALS7;LGALS7B;LIPE;LIPE-AS1;LOC100129935;LOC100505622;LOC284344;LOC644554;LRFN1;LTBP4;LYPD3;LYPD4;MAP3K10;MAP4K1;MED29;MEGF8;MIA;MIA-RAB4B;MIR4323;MIR4530;MIR641;MIR6719;MIR6796;MIR6797;MIR8077;MRPS12;NCCRP1;NFKBIB;NUMBL;PAF1;PAFAH1B3;PAK4;PAPL;PCAT19;PHLDB3;PLD3;PLEKHG2;POU2F2;PPP1R14A;PRG1;PRR19;PRX;PSG1;PSG10P;PSG11;PSG2;PSG3;PSG4;PSG5;PSG6;PSG7;PSG8;PSG9;PSMC4;PSMD8;RAB4B;RAB4B-EGLN2;RABAC1;RASGRP4;RINL;RPS16;RPS19;RYR1;SAMD4B;SARS2;SELV;SERTAD1;SERTAD3;SHKBP1;SIPA1L3;SIRT2;SNRPA;SPINT2;SPRED3;SPTBN4;SUPT5H;SYCN;TEX101;TGFB1;TIMM50;TMEM145;TMEM91;TTC9B;WDR87;YIF1B;ZFP36;ZNF526;ZNF546;ZNF574;ZNF780A;ZNF780B;CEACAM18;FLJ30403;FPR1;FPR2;FPR3;HAS1;HCCAT3;LOC101928571;MIR125A;MIR643;MIR6801;MIR99B;MIRLET7E;PPP2R1A;SIGLEC12;SIGLEC14;SIGLEC5;SIGLEC6;SIGLEC8;SPACA6P;SPACA6P-AS;ZNF175;ZNF350;ZNF432;ZNF577;ZNF613;ZNF614;ZNF615;ZNF616;ZNF649;ZNF836;ZNF841;BIRC8;BRSK1;CACNG6;CACNG7;CACNG8;CCDC106;CDC42EP5;CNOT3;COX6B2;DNAAF3;DPRX;EPN1;EPS8L1;ERVV-1;ERVV-2;FAM71E2;FAM90A27P;FCAR;FIZ1;GP6;HSPBP1;IL11;ISOC2;KIR2DL1;KIR2DL3;KIR2DL4;KIR2DS4;KIR3DL1;KIR3DL2;KIR3DL3;KIR3DX1;LAIR1;LAIR2;LENG1;LENG8;LENG9;LILRA1;LILRA2;LILRA3;LILRA4;LILRA5;LILRA6;LILRB1;LILRB2;LILRB3;LILRB4;LILRB5;LILRP2;LOC100287534;LOC101928804;LOC284379;MBOAT7;MIR1283-1;MIR1283-2;MIR1323;MIR371A;MIR371B;MIR372;MIR373;MIR4752;MIR498;MIR512-1;MIR512-2;MIR515-1;MIR515-2;MIR516A1;MIR516A2;MIR516B1;MIR516B2;MIR517A;MIR517B;MIR517C;MIR518A1;MIR518A2;MIR518B;MIR518C;MIR518D;MIR518E;MIR518F;MIR519A1;MIR519A2;MIR519B;MIR519C;MIR519D;MIR519E;MIR520A;MIR520B;MIR520C;MIR520D;MIR520E;MIR520F;MIR520G;MIR520H;MIR521-1;MIR521-2;MIR522;MIR523;MIR524;MIR525;MIR526A1;MIR526A2;MIR526B;MIR527;MIR6802;MIR6803;MIR6804;MIR6805;MIR8061;MIR935;MYADM;NAT14;NCR1;NDUFA3;NLRP12;NLRP2;NLRP7;OSCAR;PPP1R12C;PPP6R1;PRKCG;PRPF31;PTPRH;RDH13;RNU6-35P;RNU6-64P;RPL28;RPS9;SBK2;SBK3;SHISA7;SSC5D;SUV420H2;SYT5;TARM1;TFPT;TMC4;TMEM150B;TMEM190;TMEM238;TMEM86B;TNNI3;TNNT1;TPM3P9;TSEN34;TTYH1;U2AF2;UBE2S;VN1R2;VN1R4;VSTM1;ZNF137P;ZNF160;ZNF28;ZNF320;ZNF321P;ZNF331;ZNF347;ZNF415;ZNF468;ZNF524;ZNF525;ZNF579;ZNF580;ZNF581;ZNF600;ZNF611;ZNF628;ZNF665;ZNF677;ZNF701;ZNF702P;ZNF761;ZNF765;ZNF784;ZNF808;ZNF813;ZNF816;ZNF816-ZNF321P;ZNF818P;ZNF83;ZNF845;ZNF865;FKBP1AP1;ZNF154;ZNF211;ZNF551;ZNF552;ZNF586;ZNF587;ZNF587B;ZNF671;ZNF776;ZNF814;ZSCAN4	Molecular Neurogenetics Unit and Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, and Department of Neurology, Massachusetts General Hospital, Boston, MA, 02114, USA	BACKGROUND: Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. RESULTS: We sequenced 689 participants with autism spectrum disorder (ASD) and other developmental abnormalities to construct a genome-wide map of large SV. Using long-insert jumping libraries at 105X mean physical coverage and linked-read whole-genome sequencing from 10X Genomics, we document seven major SV classes at ~5 kb SV resolution. Our results encompass 11,735 distinct large SV sites, 38.1% of which are novel and 16.8% of which are balanced or complex. We characterize 16 recurrent subclasses of complex SV (cxSV), revealing that: (1) cxSV are larger and rarer than canonical SV; (2) each genome harbors 14 large cxSV on average; (3) 84.4% of large cxSVs involve inversion; and (4) most large cxSV (93.8%) have not been delineated in previous studies. Rare SVs are more likely to disrupt coding and regulatory non-coding loci, particularly when truncating constrained and disease-associated genes. We also identify multiple cases of catastrophic chromosomal rearrangements known as chromoanagenesis, including somatic chromoanasynthesis, and extreme balanced germline chromothripsis events involving up to 65 breakpoints and 60.6 Mb across four chromosomes, further defining rare categories of extreme cxSV. CONCLUSIONS: These data provide a foundational map of large SV in the morbid human genome and demonstrate a previously underappreciated abundance and diversity of cxSV that should be considered in genomic studies of human disease.	GRCh37/hg19				No	
