Series GSE74070
Status Public on Oct 21, 2015
Title Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences
Organisms Homo sapiens; synthetic construct
Experiment type Other
Summary Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. To better model the sequence determinants of alternative splicing, we measured the splicing patterns of nearly 2 million (M) synthetic mini-genes, which include degenerate subsequences totaling nearly 100M bases of variation. The massive size of these training data allowed us to improve upon current models of splicing and to gain new mechanistic insights. Our results show that the vast majority of hexamer sequence motifs measurably influence splice site selection when positioned within alternative exons, with multiple motifs acting additively rather than cooperatively. Intriguingly, motifs that enhance (suppress) exon inclusion in alternative 5’ splicing also enhance (suppress) exon inclusion in alternative 3’ or cassette exon splicing, suggesting a universal mechanism for alternative exon recognition. Finally, our empirically trained models are highly predictive of the effects of naturally occurring variants on alternative splicing in vivo.
 
Overall design HEK293 cells were transfected with two alternatively spliced plasmid libraries. Spliced reads were sequenced to determine isoform counts for each library sequence.
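
The step from isoform counts to a per-sequence splicing pattern can be illustrated with a minimal Python sketch. It assumes that each library sequence has a mapping from isoform label to spliced read count; the labels and counts below are hypothetical and are not taken from the deposited data, and the record does not state how the authors normalized their counts.

```python
def isoform_fractions(isoform_counts):
    """Convert raw read counts per isoform into fractions that sum to 1.

    isoform_counts: dict mapping an isoform label to its read count
    (an assumed layout, not the format of the deposited files).
    """
    total = sum(isoform_counts.values())
    if total == 0:
        # No spliced reads observed for this library sequence.
        return {}
    return {isoform: n / total for isoform, n in isoform_counts.items()}


# Hypothetical counts for one library sequence.
counts = {"site_1": 30, "site_2": 10, "unspliced": 0}
fractions = isoform_fractions(counts)
print(fractions)
```

Returning an empty mapping for sequences with no reads keeps zero-coverage sequences distinguishable from sequences where one isoform was truly absent.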
 
Contributor(s) Rosenberg AB, Patwardhan RP, Shendure J, Seelig G
Citation missing
Submission date Oct 15, 2015
Last update date May 15, 2019
Contact name Alexander B Rosenberg
E-mail(s) abros@uw.edu
Organization name University of Washington
Department Electrical Engineering
Lab Seelig Lab
Street address 4000 15th Ave NE
City Seattle
State/province WA
ZIP/Postal code 98195
Country USA
 
Platforms (3)
GPL11154 Illumina HiSeq 2000 (Homo sapiens)
GPL15228 Illumina HiSeq 2000 (synthetic construct)
GPL17769 Illumina MiSeq (synthetic construct)
Samples (4)
GSM1911083 A3SS_DNA
GSM1911084 A3SS_RNA
GSM1911085 A5SS_DNA
Relations
BioProject PRJNA299151
SRA SRP064967

Download family Format
SOFT formatted family file(s) SOFT
MINiML formatted family file(s) MINiML
Series Matrix File(s) TXT

Supplementary file Size Download File type/resource
GSE74070_RAW.tar 94.3 Mb (http)(custom) TAR (of TXT)
Raw data are available in SRA
Processed data provided as supplementary file
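
As a rough sketch of how the supplementary archive might be inspected after download, the following uses Python's standard `tarfile` module to list the files inside a TAR archive. The local path is an assumption (the file is presumed to have been saved under its listed name in the working directory), and the names and layout of the member files are not described in this record.

```python
import os
import tarfile


def list_archive_members(tar_path):
    """Return (name, size in bytes) for each regular file in a TAR archive."""
    with tarfile.open(tar_path, "r") as archive:
        return [(m.name, m.size) for m in archive.getmembers() if m.isfile()]


# Assumed local path: the archive saved under its listed name.
local_path = "GSE74070_RAW.tar"
if os.path.exists(local_path):
    for name, size in list_archive_members(local_path):
        print(f"{name}\t{size} bytes")
```

Listing members before extracting makes it possible to check file names and sizes without unpacking the full archive.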
