NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE173861 Query DataSets for GSE173861
Status Public on Sep 13, 2021
Title Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution  
Organism Saccharomyces cerevisiae
Experiment type Expression profiling by high throughput sequencing
Summary The noncoding genome plays an important role in de novo gene birth and the emergence of genetic novelty. Nevertheless, how the properties of noncoding sequences could promote the birth of novel genes and shape the structural diversity and evolution of proteins remains unclear. Here, we investigated the potential of the noncoding genome of yeast to produce novel protein bricks that can give rise to novel genes or be integrated in pre-existing proteins, thus participating in protein structure evolution and diversity. Combining different bioinformatics approaches, we showed that intergenic ORFs of yeast encompass the large structural diversity of canonical proteins with the majority encoding peptides predicted as foldable. Then, we investigated the early stages of de novo gene birth with Ribosome Profiling and systematic reconstruction of yeast de novo gene ancestral sequences. We highlighted sequence and structural factors determining de novo gene birth and protein evolution. Finally, we showed a strong correlation between the fold potential of de novo genes and their ancestral ORFs reflecting the relationship between the noncoding genome and the protein structure universe.
 
Overall design RiboSeq of two replicates of the same BY 4742 yeast strain, to identify for alternative CDS in unanotated regions, such CDS are then called IGORFs if RiboSeq signal is detected.
 
Contributor(s) Papadopoulos C, Callebaut I, Gelly J, Hatin I, Namy O, Renard M, Lespinet O, Lopes A
Citation missing Has this study been published? Please login to update or notify GEO.
Submission date May 04, 2021
Last update date Sep 14, 2021
Contact name Olivier Namy
E-mail(s) olivier.namy@i2bc.paris-saclay.fr
Organization name CNRS, Université Paris-Sud
Department I2BC
Lab Genomic, structure and Translation
Street address Rue Gregor Mendel, bat 400
City Orsay
ZIP/Postal code 91400
Country France
 
Platforms (1)
GPL13821 Illumina HiSeq 2000 (Saccharomyces cerevisiae)
Samples (2)
GSM5282046 BY4742 rep 1
GSM5282047 BY4742 rep 2
Relations
BioProject PRJNA727323
SRA SRP318462

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE173861_RAW.tar 1.2 Mb (http)(custom) TAR (of TAB)
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap