The MAQC-II Project: A comprehensive study of common practices for the development and validation of microarray-based predictive models
Effect of training-sample size and classification difficulty on the accuracy of genomic predictors.
Sex, Age, Specimen part, Race, Compound
View SamplesThe Hamner data set (endpoint A) was provided by The Hamner Institutes for Health Sciences (Research Triangle Park, NC, USA). The study objective was to apply microarray gene expression data from the lung of female B6C3F1 mice exposed to a 13-week treatment of chemicals to predict increased lung tumor incidence in the 2-year rodent cancer bioassays of the National Toxicology Program. If successful, the results may form the basis of a more efficient and economical approach for evaluating the carcinogenic activity of chemicals. Microarray analysis was performed using Affymetrix Mouse Genome 430 2.0 arrays on three to four mice per treatment group, and a total of 70 mice were analyzed and used as the MAQC-II's training set (GEO Series GSE6116). Additional data from another set of 88 mice were collected later and provided as the MAQC-II's external validation set (this Series). The training dataset had already been deposited in GEO by its provider and its accession number is GSE6116.
Effect of training-sample size and classification difficulty on the accuracy of genomic predictors.
Specimen part, Compound
View SamplesThis SuperSeries is composed of the SubSeries listed below.
No associated publication
Specimen part, Cell line, Treatment
View SamplesDNA methylation of C5-cytosine (5mC) in the mammalian genome is a key epigenetic event that is critical for various cellular processes. However, how the genome-wide 5mC pattern is dynamically regulated remains a fundamental question in epigenetic biology. The TET family of 5mC hydroxylases, which convert 5mC to 5-hydroxymethylcytosine (5hmC), have provided a new potential mechanism for the dynamic regulation of DNA methylation. The extent to which individual Tet family members contribute to the genome-wide 5mC and 5hmC patterns and associated gene network remains largely unknown. Here we report genome-wide mapping of Tet1 and 5hmC in mESCs and reveal a mechanism of action by which Tet1 controls 5hmC and 5mC levels in mESCs. In combination with microarray and mRNA-seq expression profiling, we identify a comprehensive yet intricate gene network influenced by Tet1. We propose a model whereby Tet1 controls DNA methylation both by binding to CpG-rich regions to prevent unwanted DNA methyltransferase activity, and by converting the existing 5mC to 5hmC through its enzymatic activity. This Tet1-mediated antagonism of CpG methylation imparts differential maintenance of DNA methylation status at Tet1 target loci, thereby providing a new regulatory mechanism for establishing the epigenetic landscape of mESCs, which ultimately contributes to mESC differentiation and the onset of embryonic development.
Genome-wide regulation of 5hmC, 5mC, and gene expression by Tet1 hydroxylase in mouse embryonic stem cells.
Specimen part, Treatment
View Samples