These authors contributed equally.
DNA methylation is a key epigenetic modification involved in regulating gene expression and maintaining genomic integrity. Here we inactivated all three catalytically active DNA methyltransferases in human embryonic stem cells (ESCs) using CRISPR/Cas9 genome editing to further investigate their roles and genomic targets. Disruption of
DNA methylation is a heritable and reversible enzyme-mediated modification to DNA that is implicated in a diverse range of biological processes
DNA methylation has been extensively mapped in somatic cells as well as ESCs in human and mouse
While deletion of
The limited number of human studies can likely be attributed to long-standing difficulties of genetic manipulation in the human ESC system, rather than lack of interest. However, new classes of genome-editing tools, transcription activator-like effector nucleases (TALENs) and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) systems have been applied for genomic editing in mammalian cells
Here we report the inactivation of all three DNMT
Of the three catalytically active DNMTs,
To better understand the role of the three
In order to avoid hypomorphic effects
All of our
To characterize the effects of the
We then analyzed the WGBS data from the later passage samples, which captured 14.8 million CpGs covered at 5x coverage in all four samples, to see if specific genomic features were particularly sensitive to the loss of
We next divided the genome into 1kb tiles and compared the distribution of CpG methylation values within each tile across the samples (
The individual knockouts show a mild global reduction, an effect that becomes slightly more pronounced in the double knockouts (
We continued passaging of the
Lastly, we investigated the global level of non-CpG methylation, which is known to be present in mouse
As noted above, we found a strong enrichment of CpG islands in the set of redundant DNMT3A/3B targets (
Using our sequencing based data allows us to investigate the DNA methylation state of neighboring CpGs and classifying them into concordant or discordant patterns (
To further assess the functional significance of the DMRs found above, we calculated the degree of overlap of the DMRs in the three knockout lines with published transcription factor binding sites
To further investigate the knockout effects during differentiation, we derived early endodermal cells from the
As described above, we were unable to obtain homozygous mutant lines for
We therefore designed a rescue strategy that would enable the generation of homozygous mutant lines without the immediate lethality of DNMT1 loss (
Both qPCR and Western blot analysis confirm tight control of expression for the exogenous
Despite G1 arrest and cell death, our inducible system enabled us to assess the effect on global DNA methylation after acute loss of
DNA methylation is essential for normal development. Over the past three decades, major advances in understanding its role have included the deletion of the responsible enzymes in the mouse model as well as the more recent emergence of comprehensive mapping tools. Here, we have applied both to the human ESC model system to study their function, providing an unprecedented view of the respective genomic targets (shared and unique) of these enzymes.
All of the
Our
It is well established that mouse and human ESC represent different pluripotent states, which may explain the sensitivity of the human ESCs to loss of DNA methylation. All somatic cells require DNA methylation and the more mature epiblast state modeled by current human ESCs may represent the first developmental period where maintenance becomes essential. In fact, although no data were shown, a similar effect has been suggested for mouse epiblast cells
Our study provides a comprehensive, high-resolution view of the largely conserved roles that the three catalytically active DNMTs play in human cells. The mutant lines represent powerful tools to study their function in various differentiation models and will therefore greatly accelerate our understanding of DNA methylation in human development and disease.
Human embryonic stem cells were expanded on murine embryonic fibroblasts CF1 (Global Stem) in KO-DMEM (Life Technologies) containing 20% Knockout Serum Replace (Life Technologies) and FGF2 (10 ng/ml) (Millipore). Cells were passaged by enzymatic dissociation using Collagenase IV (1 mg/ml) (Life Technologies). For feeder free culture condition, plates were coated with Geltrex (Life Technologies) and the culture medium was mTeSR1 (Stemcell Technologies). HUES64 was derived in the Melton/Eggan labs
Human ESCs were harvested using 1mg/mL Collagenase Type IV (invitrogen) first and then dissociated into single cells using TrypLE™ (invitrogen). 1 × 107 cells resuspended in buffered saline (PBS) were electroporated with 15μg Cas9 and 15μg gRNA plasmids (Gene Pulser Xcell System, Bio-Rad: 250 V, 500μF, 0.4 cm cuvettes). Cas9 plasmid and gRNA backbone have been described previously
GIPZ
TRIPZ DOX inducible
plenti-ef1a-
plenti-ef1a-
plenti-ef1α-
plenti-ef1α-
Forward: GACTCCATCACGGTGGGCATGG;
Reverse: TGTCCCTCTTGTCACTAACGCC.
Forward: TCCACTGTGAATGATAAGCTG;
Reverse: GGAAACCAAATACCCTTTCCA.
Forward: GAGTCCATTGCTGTTGGAACCG;
Reverse: ATGTCCCTCTTGTCGCCAACCT.
Forward: GACTGCTTGGAATACAATAGGA;
Reverse: AAAGCCAAAGATCCTTTCGAG.
Forward: GGTTTCCTTCCTCAGCTACTGCGA;
Reverse: CACTGATAGCCCATGCGGACCA.
ACTIN:
Forward: TTTGAGACCTTCAACACCCCAGCC;
Reverse: AATGTCACGCACGATTTCCCGC.
For
tgtgggccatcgagatgtGGGACCCTGCGGCCCAggcgttccggctgaacaa
For
tcttctttgagttctaccGCCTCCTGCATGATGCgcggcccaaggagggaga
Capital letters are the spacer region. Lower case letters are the TALEN binding sites.
For virus production, virus was produced by co-transfecting lenti plasmid with delta 8.91 and VSVG on 293 cell by using FuGENE® HD (Promega). 24 hours after transfection, medium was changes to mTeSR1 for virus collection. Supernatant of virus was collected twice 48h and 72h after transfection separately. Virus was stored in 4°C. Before infection, virus was filtered through 0.45μm filter.
For virus infection on human ESCs, cells were plated as small clumps on geltrex pre-coated plate. 24 hour after plating, cells were infected with virus supernatant for 3 hours in incubator. Medium was changed back to mTeSR1. Repeat the 3 hours infection the next day. In this case, cells were maintained well in pluripotent state.
Cells in each well were about 90% confluent. Washed twice with PBS and then lysed with 50μl/well of lysis buffer (10mM Tris PH7.5, 10mM EDTA, 10mM NaCl, 0.5% Sarcosyl or SDS, 1mg/ml Proteinase K) at 37°C overnight. Genomic DNA was precipitated by adding 100μl/well of cold NaCl/EtOH (1.5% vol 5M NaCl in 100% cold EtOH) at room temperature (RT) for 2–4 hrs. Genomic DNA precipitate was washed 3 times with 70% EtOH. Each time discarding the alcohol by inversion. Air-dry DNA for 15–20 min at RT. DNA was re-suspended in 30–100 μl of DNase-free, RNase-free ddH2O.
For immunostaining:
OCT4: BD Transduction Laboratories, 611202
NANOG: R&D, AF1997
TRA-1-60: Stemgent 090068
ANNEXIN V: Abcam, ab54775
FOXA2: R&D, AF2400
HNF4A: Abcam, Ab41898
DAPI: Hoechst 33342, Invitrogen, H3570.
γ-H2A.X (phospho S139): Abcam, ab2893.
For western blotting:
DNMT3A: Cell signaling, 2160S
DNMT3B: santa cruz biotechnology, sc-20704
DNMT1: Activemotif, 39204
ACTIN: Abcam, ab8226
Human ESCs were passaged on Geltrex coated plates and cell were cultured in mTeSR1 medium. Then the cells were fixed in 4% paraformaldehyde (PFA) for 15 minutes at room temperature. Permeabilization and blocking was performed in 5% bovine serum albumin (BSA) and 1% Triton X-100 in PBS for 30minutes. Cells were stain with primary antibody in 4°C overnight. The secondary antibody was applied for 2 hours at room temperature. Nucleuses were stained with Hoechst 33342 (Life Technologies). Images were obtained using Olympus IX71 microscope and MetaMorph Advanced software.
RNA was isolated from hESCs using TRIzol (Invitrogen, 15596-026), further purification was carried out with RNeasy columns (QIAGEN, 74104) and DNase treatment.
Human ESCs were expanded in 10cm2 dishes on irradiated CF-1 MEFs (Global Stem). Cells were harvested using 1mg/ml Collagenase IV (Life Technologies) once they were confluent. After three times washes with 1 × phosphate-buffered saline (PBS), cell clumps were plated into 6-well low attachment dishes (Corning) in 20% Knockout Serum Replacement (KOSR) (Life Technologies), 200mM Glutamax (Life Technologies), MEM Non-Essental Amino Acid, in KO DMEM (Life Technologies). Medium was changed every 2–3 days.
hESCs were plated as clumps on 6-well plates coated with Geltrex (Life Technologies) in mTeSR1 culture medium (Stem Cell Technologies). On day 3, directed differentiation was induced and medium was changed every day.
Ectoderm differentiation medium contains 2μM A83-01 TGFb inhibitor (Tocris), 2μM PNU-74654 WNT3A inhibitor (Tocris), and 2μM Dorsomorphin BMP inhibitor (Tocris) in 15% KOSR (Life Technologies), MEM Non-Essential Amino (Life Technologies), and 55μM 2-mercaptoethanol.
Mesoderm differentiation medium contains 100ng/ml Activin A (Life Technologies), 10 ng/ml bFGF (Millipore), 100ng/ml BMP4 (Life Technologies), 100ng/ml VEGF 100ng/ml (Life Technologies) in 0.5% FBS (Hyclone), MEM Non-Essential Amino Acid (Life Technologies), 55μM 2-mercaptoethanol and Glutamax (Life Technologies) in DMEM/F12 medium (Life Technologies) for the first 24 hrs of differentiation. For the rest 4 days, the medium contains 10 ng/ml bFGF (Millipore), 100ng/ml BMP4 (Life Technologies), 100ng/ml VEGF 100ng/ml (Life Technologies) in 0.5% FBS (Hyclone), MEM Non-Essental Amino Acid (Life Technologies), 55μM 2-mercaptoethanol and glutamax (Life Technologies) in DMEM/F12 medium (Life Technologies).
Endoderm differentiation medium contains 100ng/ml Activin A (Life Technologies) and 2μM/ml Lithium Chloride (Sigma) in 0.5% FBS (Hyclone), 200mM GlutaMax (Life Technologies), MEM Non-Essential Amino Acid (Life Technologies), 55μM 2-mercaptoethanol in RPMI medium (Life Technologies).
Teratoma formation assay was performed as a service by Harvard Genome Modification Facilities. Briefly, hESCs were injected into the kidney capsules of 3 immuno-suppressed mice as 1 million cells per animal to form teratomas. The formed teratomas were fixed in 4% paraformaldehyde (PFA) pH7.4, embedded in paraffin, sectioned 10–12μm thickness, and stained with Hematoxylin and Eosin (H&E) for examination. Histology examination was performed by histology core facility at Harvard Stem Cell Institute.
Differentiation was performed as described previously
Alive cells were incubated with Hoechst 33342 in the incubator for 15min. Then cells were trypsinized into single cells, washed with PBS once and analyzed by FACs directly.
Isolated genomic DNA was denatured in 0.4M NaCl and 10mM EDTA for 10 min at 99°C. The denatured DNA was spotted in a 2-fold serial dilution on a Zeta Membrane (BioRad) using a Bio-Dot apparatus (BioRad). The blotted membrane was rinsed in 2xSSC, air-dried and UV-crosslinked at 120,000 μJ cm−2. The membrane was blocked in 1xTBS containing 5% nonfat dry milk for 30min. Anti-5-methylcytosine monoclonal antibody (Diagenode, #33D3) was diluted 1:250 in TTBS, containing 0.3% Tween 20 and 5% nonfat dry milk, and was incubated overnight at 4°C. The membrane was washed in 3x 5min in TBS and was incubated for 1 hour at room temperature with HRP-conjugated goat anti-mouse IgG (Jackson Lab, #115-025-174). The membrane was washed 3x in TBS and visualized by chemiluminescence using the Gel Doc XR (BioRad). The same membrane was incubated 30min with SYBR Gold Nucleic Acid stain (Invitrogen, #S-11494) in 1xTAE buffer, total DNA loading was visualized by UV transillumination.
Experiment was carried out using TaqMan® hPSC Scorecard™Kit (Life Technologies, A15872). Data could be analyzed through TaqMan® hPSC Scorecard™ Analysis Software through Life Technologies website.
RRBS library construction was described in Ref
WGBS raw sequencing reads were aligned using MAQ in bisulfite mode against human genome version hg19/GRCh37, discarding duplicate reads. DNA methylation calling was performed based on an extended custom software pipeline published previously for RRBS
To ensure comparability of region DNA methylation levels across all samples, only CpGs covered by ≥ 5x in all samples qualified for the computation of region DNA methylation levels. We defined the average methylation for a genomic region as the coverage-weighted mean of the methylation levels of the individual CpGs within the region. Subsequently, we averaged a region’s DNA methylation level over replicates. Differentially methylated regions were identified by using a two sample weighted t-test using the methylation values of the CpGs within each regions as individual sampling events (minimum number of CpGs in region for each sample is 2). We performed multiple testing correction using a method defined in Ref
We used two sets of CpG island annotations. The first uses a bioinformatics definition and designates a region as a CpG island if it has a GC content higher than 0.5 and a ratio greater than 0.6 of observed CpG dinucleotides to what would be expected based on the GC content over a length of 700 bp. The second is a set of experimentally verified CpG islands
We defined promoters as the regions 2000 bp upstream and 500 bp downstream of the Refseq TSS set. These were divided into high CpG promoters (HCP), intermediate CpG promoters (ICP), and low CpG promoters (LCP) as published previously
We used GREAT (Genomic Region Enrichment of Annotation Tool)
Fidelity of DNMT1 was calculated by fitting an exponential model to the mean DNA methylation of different genomic features using the nls function in R. The exponential model has the form y=ax, where “a” is the fidelity and “x” is the estimated number of population doublings based on passage number, days per passage (8) and doubling time (28.8 hours).
The proportion of discordant reads (PDR) was calculated for each CpG by examining the individual reads covering that CpG and classifying them into concordant (all CpGs methylated or all CpGs unmethylated), filtering out reads with less than 3 CpGs covered.
We downloaded the conservative set of uniform peaks called by the ENCODE project
Human imprinted control regions were taken from Ref.
Mouse
We would like to thank all members of the Meissner lab, in particular Camille Sindhu for helpful discussion and Zachary D. Smith for critical feedback on the manuscript. We would like to thank Kiran Musunuru for providing the CRIPSR/Cas9 plasmids and thank Qiurong Ding from Musunuru lab for technical support. J.L. was supported by a postdoctoral fellowship from the Human Frontiers Science Program. J.K.J is supported by NIH Director’s Pioneer Award DP1 GM105378. A.M. is a New York Stem Cell Foundation Robertson Investigator. The work was funded by the US National Institutes of Health (NIGMS) grant (P01GM099117) and The New York Stem Cell Foundation.
All data have been deposited in GEO under
JKJ is a consultant for Horizon Discovery. JKJ has financial interests in Editas Medicine and Transposagen Biopharmaceuticals. JKJ’s interests were reviewed and are managed by Massachusetts General Hospital and Partners HealthCare in accordance with their conflict of interest policies.
J.L. and A.M. designed and conceived the study, J.L. generated all the cell lines and performed the experiments, R.K. performed the analysis, H.G. generated the WGBS libraries, A.G and A.M supervised the DNA methylation experiments, M.J.Z. and K.C performed some analysis and assisted in the general data processing, A.M.T and V.A performed with the Scorecard assay, C.A.G. and J.D. assisted in the endoderm and hepatocyte differentiation, C.G performed the dot blot assay and R.P supported the RNA profiling and FACS, D.R, S.Q.T and J.K.J designed and generated the TALENs, W.M. and J.L.R performed expression analysis, J.L, R.K. and A.M. interpreted the data and wrote the paper.
(a) Expression levels of
(e) Western blot analysis for DNMT3A, 3B and DNMT1 in HUES64,
(a) Overview of clones and passage numbers used for the various assays. HUES64 passage 24 (P24) was used to generate
(b) Wild-type and
(c) TaqMan hPSC Scorecard analysis of direct differentiation potential for WT,
(d) Top: Schematic overview for the embryonic body (EB) formation. Bottom: Representative immunostaining images for TUJ1 (ectoderm), BRACHYURY (mesoderm) and FOXA2 (endoderm). 10X magnification is shown. Scale bars, 100μm.
(e) Top: Summary of injection and teratoma formation results. Bottom: Images of the nine teratomas collected. Ruler on top provides a size reference.
(f) Representative images of Hematoxylin and Eosin (H&E) staining of the teratoma from
(a) Overview of the samples used for whole genome bisulfite sequencing (WGBS).
(b) Left: Hierarchical clustering using Euclidean distance based on mean DNA methylation levels of 1 kb tiles across the human genome. Right: Principal component analysis (PCA) based on mean CpG methylation levels for 1 kb tiles.
(c) Fraction of CpGs with high (≥ 0.8, red), intermediate (inter, >0.2 and <0.8, green) and low (≤ 0.2, blue) methylation values. Top left panel shows all CpGs while the other panels show CpG methylation distribution within different genomic features, including high CpG Promoters (HCP), intermediate CpG promoters (ICP), low CpG promoters (LCP), CpG islands (CGI), CpG island shores, and satellite repeats. The total number of CpGs associated with each feature is shown above each bar plot.
(d) Heatmap of non-repetitive differentially methylated 1kb tiles (q-value < 0.05 and methylation difference greater than 0.4).
(e) Enrichment [−log10(hypergeometric p-value)] of genomic features in the target classes identified in Fig. 3d.
(f) Genome browser tracks covering approximately 160kb in the different lines.
(g) Higher resolution view of the
(h) CpG methylation levels over several passages of the
(i) Global mean CpA methylation levels as assayed by WGBS and over several passages of the
(a) Composite plot of methylation around CpG islands in WT and knockouts. The solid lines show the mean methylation of differentially methylated CpG islands, while the dotted lines show the mean methylation of all other CpG islands.
(b) Schematic for the definition of concordant and discordant reads.
(c) Proportion of discordant reads (PDR) in WT,
(d) Two-dimensional density plots of PDR vs. methylation for CpGs with low (<= 0.2), medium(> 0.2 and <= 0.6) and high(> 0.6) methylation in WT cells.
(e) Enrichment of experimentally determined transcription factor binding sites in H1 ESCs generated by the ENCODE project for DMRs in
(a) Schematic for the generation of CD184+ endodermal progenitor cells (dEN).
(b) Global analysis for the WGBS data of undifferentiated and dEN cells from WT (P23 and P26) and
(c) Heat map of methylation levels (black, 0; red, 1) in WT, WT dEN,
(d) Classification of endoderm DMRs into promoters, gene bodies and distal regions as defined in the legend.
(e) Genome browser tracks of 1kb and 10kb regions highlight global and local methylation level for
(f) Hepatoblast differentiation of WT (P39) and
(a) Strategy to obtain
(b) Representative bright field (BR), mCherry expression and pluripotent markers NANOG, TRA-1-60 immunostaining images for rescue line tTA+
(c)
(a) Representative bright field (BR) and NANOG/TRA-1-60 immunostaining for
(b) RT-qPCR analysis after
(c)
(d) Violin plots of the mean methylation (measured by RRBS) of 1 kb tiles across the human genome after
(e) Global reduction of DNA methylation in different genomic features. Overall CpGs (1kb tile) and selected features including CpG Islands (CGI), Shores, HCPs, ICPs, LCPs, LINEs, SINEs and satellite repeats are shown.
(f) Exponential model of methylation decay for 1kb tiles. Methylation at day 2 was normalized to 1 and methylation levels on subsequent days were calculated as a percentage of the day 2 value. An exponential model was fitted starting at day 2 and is shown by the red line. The dashed grey lines mark the timepoints when 50% and 25% of the day 2 methylation levels remain. Days 0 and 1 were excluded from this model to avoid noise from any remaining DNMT1 protein.
(g) Mean CpA methylation levels after