NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE33213 Query DataSets for GSE33213
Status Public on Oct 25, 2011
Title Open Chromatin TFBS by ChIP-seq from ENCODE/Open Chrom(UT Austin)
Project ENCODE
Organism Homo sapiens
Experiment type Genome binding/occupancy profiling by high throughput sequencing
Summary This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Terry Furey mailto:tsfurey@duke.edu). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu).

These tracks display a synthesis of evidence from different assays as part of the four Open Chromatin track sets. This track displays open chromatin regions and/or transcription factor binding sites identified in multiple cell types by one or more complementary methodologies, DNaseI hypersensitivity (HS) (Duke DNaseI HS), Formaldehyde-Assisted Isolation of Regulatory Elements (FAIRE) (UNC FAIRE), and chromatin immunoprecipitation (ChIP) for select regulatory factors (UTA TFBS). Each methodology was performed on the same cell type using identical growth conditions. (Note: Data for some or all ChIP experiments may not be available for all cell types). Regions that overlap between methodologies identify regulatory elements that are cross-validated indicating high confidence regions. In addition, multiple lines of evidence suggest that regions detected by a single assay (e.g., DNase-only or FAIRE-only) are also biologically relevant (Song et al., submitted).
DNaseI HS data: DNaseI is an enzyme that has long been used to map general chromatin accessibility, and DNaseI "hypersensitivity" is a feature of active cis-regulatory sequences.The use of this method has led to the discovery of functional regulatory elements that include promoters, enhancers, silencers, insulators, locus control regions, and novel elements. DNaseI hypersensitivity signifies chromatin accessibility following binding of trans-acting factors in place of a canonical nucleosome.
FAIRE data: FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements) is a method to isolate and identify nucleosome-depleted regions of the genome. FAIRE was initially discovered in yeast and subsequently shown to identify active regulatory elements in human cells (Giresi et al., 2007). Similar to DNaseI HS, FAIRE appears to identify functional regulatory elements that include promoters, enhancers, silencers, insulators, locus control regions and novel elements.
ChIP data: ChIP (Chromatin Immunoprecipitation) is a method to identify the specific location of proteins that are directly or indirectly bound to genomic DNA. By identifying the binding location of sequence-specific transcription factors, general transcription machinery components, and chromatin factors, ChIP can help in the functional annotation of the open chromatin regions identified by DNaseI HS mapping and FAIRE.
Input data: As a background control experiment, we sequenced the input genomic DNA sample that was used for ChIP. Crosslinked chromatin is sheared and the crosslinks are reversed without carrying out the immunoprecipitation step. This sample is otherwise processed in a manner identical to the ChIP sample as described below. The input track is useful in revealing potential artifacts arising from the sequence alignment process such as copy number differences between the reference genome and the sequenced samples, as well as regions of poor sequence alignability.

For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf
 
Overall design For each site, the maximum F-Seq Density Signal value has been calculated for each assay that was performed in that cell type. F-Seq employs Parzen kernel density estimation to create base pair scores (Boyle et al., 2008b). Significant regions, or peaks, were determined by fitting the data to a gamma distribution to calculate p-values. Contiguous regions where p-values were below a 0.05 (DNaseI HS, ChIP) or 0.1 (FAIRE) threshold were considered significant. See assay specific description pages ( Duke DNaseI HS, UNC FAIRE and UTA TFBS) for more details.
A Fisher's Combined P-value for DNaseI HS and FAIRE was calculated using Fisher's combined probability test. First, a test statistic is calculated using the formula x^2 = -2*sum(ln(pi)) where pi are the p-values calculated for DNaseI HS and FAIRE. X2 follows a chi-squared distribution, thus a combined p-value can be assigned to this test statistic.
Web link http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=wgEncodeOpenChromChip
 
Contributor(s) Furey T, Zhang Z, Song L, Crawford G, Giresi P, Lieb J, Liu Z, McDaniell R, Lee B, Iyer V, Flicek P, Keefe D, Birney E, Graf S
Citation missing Has this study been published? Please login to update or notify GEO.
BioProject PRJNA63443
Submission date Oct 25, 2011
Last update date May 15, 2019
Contact name ENCODE DCC
E-mail(s) encode-help@lists.stanford.edu
Organization name ENCODE DCC
Street address 300 Pasteur Dr
City Stanford
State/province CA
ZIP/Postal code 94305-5120
Country USA
 
Platforms (1)
GPL9052 Illumina Genome Analyzer (Homo sapiens)
Samples (79)
GSM822267 UT-A_ChipSeq_Fibrobl_Input
GSM822268 UT-A_ChipSeq_Gliobla_Input
GSM822269 UT-A_ChipSeq_ProgFib_Input
Relations
SRA SRP009080

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE33213_RAW.tar 308.3 Gb (http)(custom) TAR (of BIGWIG, NARROWPEAK)
GSE33213_run_info.txt.gz 2.4 Kb (ftp)(http) TXT
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap