The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but a similar reference has lacked for epigenomic studies. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection to-date of human epigenomes for primary cells and tissues. Here, we describe the integrative analysis of 111 reference human epigenomes generated as part of the program, profiled for histone modification patterns, DNA accessibility, DNA methylation, and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically-relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation, and human disease.
DNA methylation is a defining feature of mammalian cellular identity and essential for normal development1,2. Most cell types, except germ cells and pre-implantation embryos3–5, display relatively stable DNA methylation patterns with 70–80% of all CpGs being methylated6. Despite recent advances we still have a too limited understanding of when, where and how many CpGs participate in genomic regulation. Here we report the in depth analysis of 42 whole genome bisulfite sequencing (WGBS) data sets across 30 diverse human cell and tissue types. We observe dynamic regulation for only 21.8% of autosomal CpGs within a normal developmental context, a majority of which are distal to transcription start sites. These dynamic CpGs co-localize with gene regulatory elements, particularly enhancers and transcription factor binding sites (TFBS), which allow identification of key lineage specific regulators. In addition, differentially methylated regions (DMRs) often harbor SNPs associated with cell type related diseases as determined by GWAS. The results also highlight the general inefficiency of WGBS as 70–80% of the sequencing reads across these data sets provided little or no relevant information regarding CpG methylation. To further demonstrate the utility of our DMR set, we use it to classify unknown samples and identify representative signature regions that recapitulate major DNA methylation dynamics. In summary, although in theory every CpG can change its methylation state, our results suggest that only a fraction does so as part of coordinated regulatory programs. Therefore our selected DMRs can serve as a starting point to help guide novel, more effective reduced representation approaches to capture the most informative fraction of CpGs as well as further pinpoint putative regulatory elements.
SUMMARY The developmental potential of human pluripotent stem cells suggests that they can produce disease-relevant cell types for biomedical research. However, substantial variation has been reported among pluripotent cell lines, which could affect their utility and clinical safety. Such cell-line-specific differences must be better understood before one can confidently use embryonic stem (ES) or induced pluripotent stem (iPS) cells in translational research. Toward this goal we have established genome-wide reference maps of DNA methylation and gene expression for 20 previously derived human ES lines and 12 human iPS cell lines, and we have measured the in vitro differentiation propensity of these cell lines. This resource enabled us to assess the epigenetic and transcriptional similarity of ES and iPS cells and to predict the differentiation efficiency of individual cell lines. The combination of assays yields a scorecard for quick and comprehensive characterization of pluripotent cell lines.
Differentiation of human embryonic stem cells (hESCs) provides a unique opportunity to study the regulatory mechanisms that facilitate cellular transitions in a human context. To that end, we performed comprehensive transcriptional and epigenetic profiling of populations derived through directed differentiation of hESCs representing each of the three embryonic germ layers. Integration of whole genome bisulfite sequencing, chromatin immunoprecipitation-sequencing and RNA-sequencing reveals unique events associated with specification towards each lineage. Dynamic alterations in DNA methylation and H3K4me1 are evident at putative distal regulatory elements bound by pluripotency factors or activated in specific lineages. In addition, we identified germ layer-specific H3K27me3 enrichment at sites exhibiting high DNA methylation in the undifferentiated state. A better understanding of these initial specification events will facilitate identification of deficiencies in current approaches leading to more faithful differentiation strategies as well as provide insights into the rewiring of human regulatory programs during cellular transitions.
Summary Although many distinct mutations in a variety of genes are known to cause Amyotrophic Lateral Sclerosis (ALS), it remains poorly understood how they selectively impact motor neuron biology and whether they converge on common pathways to cause neuronal degeneration. Here, we have combined reprogramming and stem cell differentiation approaches with genome engineering and RNA sequencing to define the transcriptional and functional changes that are induced in human motor neurons by mutant SOD1. Mutant SOD1 protein induced a transcriptional signature indicative of increased oxidative stress, reduced mitochondrial function, altered sub-cellular transport as well as activation of the ER stress and unfolded protein response pathways. Functional studies demonstrated that these pathways were perturbed in a manner dependent on the SOD1 mutation. Finally, interrogation of stem cell-derived motor neurons produced from ALS patients harboring a repeat expansion in C9orf72 indicates at least a subset of these changes are more broadly conserved in ALS.
SUMMARY Intra-tumoral heterogeneity plays a critical role in tumor evolution. To define the contribution of DNA methylation to heterogeneity within tumors, we performed genome-scale bisulfite sequencing of 104 primary chronic lymphocytic leukemias (CLL). Compared to 26 normal B cell samples, CLLs consistently displayed higher intra-sample variability of DNA methylation patterns across the genome, which appears to arise from stochastically disordered methylation in malignant cells. Transcriptome analysis of bulk and single CLL cells revealed that methylation disorder was linked to low-level expression. Disordered methylation was further associated with adverse clinical outcome. We therefore propose that disordered methylation plays a similar role to genetic instability, enhancing the ability of cancer cells to search for superior evolutionary trajectories.
DNA methylation is a key epigenetic modification involved in regulating gene expression and maintaining genomic integrity. Here we inactivated all three catalytically active DNA methyltransferases in human embryonic stem cells (ESCs) using CRISPR/Cas9 genome editing to further investigate their roles and genomic targets. Disruption of DNMT3A or DNMT3B individually, as well as of both enzymes in tandem, creates viable, pluripotent cell lines with distinct effects on their DNA methylation landscape as assessed by whole-genome bisulfite sequencing. Surprisingly, in contrast to mouse, deletion of DNMT1 resulted in rapid cell death in human ESCs. To overcome the immediate lethality, we generated a doxycycline (DOX) responsive tTA-DNMT1* rescue line and readily obtained homozygous DNMT1 mutant lines. However, DOX-mediated repression of the exogenous DNMT1* initiates rapid, global loss of DNA methylation, followed by extensive cell death. Our data provide a comprehensive characterization of DNMT mutant ESCs, including single base genome-wide maps of their targets.
Summary Pluripotent stem cells provide a powerful system to dissect the underlying molecular dynamics that regulate cell fate changes during mammalian development. Here we report the integrative analysis of genome wide binding data for 38 transcription factors with extensive epigenome and transcriptional data across the differentiation of human embryonic stem cells to the three germ layers. We describe core regulatory dynamics and show the lineage specific behavior of selected factors. In addition to the orchestrated remodeling of the chromatin landscape, we find that the binding of several transcription factors is strongly associated with specific loss of DNA methylation in one germ layer and in many cases a reciprocal gain in the other layers. Taken together, our work shows context-dependent rewiring of transcription factor binding, downstream signaling effectors, and the epigenome during human embryonic stem cell differentiation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.