Genome-wide histone modification profiling of inner cell mass and trophectoderm of bovine blastocysts by RAT-ChIP

Authors: Tõnis Org ^aff001; Kati Hensen ^aff001; Rita Kreevan ^aff001; Elina Mark ^aff002; Olav Sarv ^aff003; Reidar Andreson ^aff004; Ülle Jaakma ^aff002; Andres Salumets ^aff003; Ants Kurg ^aff001
Authors place of work: Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia ^aff001; Chair of Animal Breeding and Biotechnology, Estonian University of Life Sciences, Tartu, Estonia ^aff002; Competence Centre on Health Technologies, Tartu, Estonia ^aff003; Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia ^aff004; Institute of Genomics, University of Tartu, Tartu, Estonia ^aff005; Department of Obstetrics and Gynaecology, Institute of Clinical Medicine, University of Tartu, Tartu, Estonia ^aff006; Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia ^aff007; Department of Obstetrics and Gynaecology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland ^aff008
Published in the journal: PLoS ONE 14(11)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0225801

Summary

Chromatin immunoprecipitation coupled with next-generation sequencing (ChIP-seq) has revolutionized our understanding of chromatin-related biological processes. The method, however, requires thousands of cells and has therefore limited applications in situations where cell numbers are limited. Here we describe a novel method called Restriction Assisted Tagmentation Chromatin Immunoprecipitation (RAT-ChIP) that enables global histone modification profiling from as few as 100 cells. The method is simple, cost-effective and takes a single day to complete. We demonstrate the sensitivity of the method by deriving the first genome-wide maps of histone H3K4me3 and H3K27me3 modifications of inner cell mass and trophectoderm of bovine blastocyst stage embryos.

Keywords:

Chromatin – Gene expression – Gene regulation – Embryos – Histones – Immunoprecipitation – Blastocysts – Histone modification

Introduction

The development and functioning of a multicellular organism are determined by how the genetic information, encoded in its genomic DNA sequence, is utilized by individual cells. In eukaryotic cells, DNA is packed into chromatin using proteins of which histones are the most abundant. Each cell type has its specific chromatin structure, which is dynamic and can be remodeled to regulate gene expression, DNA repair and cell division. Our understanding of these chromatin-related processes has improved vastly over the past years thanks to the advances in DNA sequencing technologies. Chromatin immunoprecipitation (ChIP) has been the method of choice to study the location of DNA bound proteins for years [1]. Coupling ChIP with deep sequencing (ChIP-seq) has enabled to determine the localization of chromatin-bound proteins at a genome-wide level [2]. For example, it has helped to identify that different histone post-translational modifications are associated with different genomic features and transcriptional states, helping to explain how cell type specific gene regulation is achieved [3,4].

One of the limitations of ChIP-seq method is that it usually requires a large number of cells. In a typical experiment, several million cells are used, which can be a limiting factor when working with samples where cell numbers are restricted, such as highly purified rare cell populations or early developmental stages. Recent advances in technologies have made it possible to develop more sensitive ChIP-seq methods (S1 Table) [5–19], however, many of these methods are complex, laborious, require specific apparatus or reagents, or are not sensitive enough. With these limitations in mind, we aimed to develop a novel ChIP-method for a limited number of cells that would be sensitive but also robust so it could be used without the need for special equipment and readily available reagents.

A typical ChIP-seq experiment consists of several experimental steps to produce a library that can be sequenced using massively parallel sequencing. These steps include fixing, cell lysis, chromatin fragmentation, immunoprecipitation, decrosslinking, DNA purification, and sequencing library preparation followed by sequencing. There are many issues that arise when working with a low number of cells of which the loss of material is the most prominent. In the published protocols the reduction of material loss has been achieved through the use of different carriers that mimic more material [7,12][19] or with indexing first and then pool the samples technique to obtain more material for subsequent steps [9,14,15]. Another option is to minimize the number of experimental steps, such as centrifugations and material transfer from one tube to another, where a material loss is expected. In addition, it is desirable to downscale the sample volumes to account for the reduction in the amount of input material. This can be relatively easily achieved when the sonication step is replaced with enzymatic DNA digestion, as using standard equipment, sonication cannot be done effectively in a small volume.

There are many instances where more sensitive ChIP-seq methods could be useful but the most obvious is studying early development, like mammalian embryo preimplantation development, as the number of available cells in these study samples is especially low. The cellular changes that take place during development are remarkable both at the molecular and phenotypic level. Understanding the molecular basis for differentiation it is not only fascinating but it also holds great promise to advance reproductive medicine and the generation of desired cell types for regenerative medicine. Recently, the first histone modification landscapes of early mouse development were reported [16,17,20]. In addition to using more sensitive ChIP assays, a pool from large number of embryos was still needed in order to obtain a sufficient amount of cells. Therefore, although substantial advancements have been made both in sensitivity and simplicity of the ChIP-seq methods, there is still room for improvement. This is in particular important in the context of studying human embryonic development as many legal, ethical and technical issues come to play. To overcome some of these issues, bovine can be used as a model as its embryogenesis is more similar to the human compared to other common model organisms. Previously, we have used bovine to study chromosomal instability in early embryos and shown it to be a good model [21]. Here, by using a novel RAT-ChIP method we derive the first genome-wide histone H3K4me3 and H3K27me3 profiles of inner cell mass (ICM) and trophectoderm (TE) of blastocyst stage bovine embryos, being the two major cell lineages that develop into embryonic and extraembryonic tissues, respectively. Combining this epigenetics data with published expression profiling data serves as a resource to provide insights into the gene expression regulation of early bovine development and paves a way to new studies, for example with a somatic cell nuclear transfer (SCNT), e.g. cloned embryos where epigenetics has thought to act as a major roadblock of nuclear reprogramming.

Materials and methods

Cell lines

Human K562 and H1299 cells were grown in IMDM and DMEM (both from Naxo) respectively, supplemented with 10% of FBS and penicillin/streptomycin (Naxo) in the presence of 5% CO₂ at 37°C.

Oocyte collection and in vitro maturation

All chemicals used for in vitro embryo production were purchased from IVF Limited T/A IVF Bioscience. Slaughterhouse derived ovaries were transported to the laboratory in a 0.9% sterile NaCl solution within 4 h after slaughter at approximately 32–37°C and washed twice in a 0.9% NaCl solution. Using a vacuum pump (Minitüb GmbH), cumulus oocyte complexes (COC) from follicles with a diameter of 2-8mm were aspirated. Grade 1 COCs were washed and matured in groups of 50 in 500μl of in vitro maturation medium in four-well plates (Nunc). Oocytes were incubated at 38.5°C with humidified 5% CO₂ in air for 22–24 h.

In vitro fertilization

Frozen-thawed semen from a Holstein bull ZIARD (id EE 13993023) was used to fertilize the matured oocytes. Oocytes and sperm were co-incubated in groups of 50 in 500μl of BO-IVM media in four-well plates (Nunc) at 38.5°C with humidified 5% CO₂ in air for 22–24 h.

In vitro cultivation

Zygotes were individually cultured in 60μl droplets BO-IVC media for 8 days at 38.5°C, 5% CO₂, 5% O₂ and 90% N₂ with maximum humidity. Embryos having reached blastocyst stage by day 8 were collected and used for laser-assisted microdissection.

Laser-assisted microdissection to obtain ICM and TE fractions

Integra 3 micromanipulator (Research Instruments Limited) equipped with Saturn 5 Active^™ laser system was used to manually separate bovine blastocysts into ICM and TE fractions (of note–while manual dissection achieves to get pure populations of TE, small fraction of TE cells remain associated with the separated ICM mass). Separated fractions from three blastocysts were pooled and used for subsequent RAT-ChIP-seq experiments.

RAT-ChIP-(seq)

For a single immunoprecipitation 1μl ProtG Dynabeads (Thermo Fisher Scientific) were bound with 0.25μg of corresponding antibody (H3K4me3 (07–473, Millipore), H3K27me3 (07–449, Millipore)) or H3 (C15200011, Diagenode) in 5μl of complete immunoprecipitation (IP) buffer (20mM Tris HCl pH 7.4, 2mM EDTA, 150mM NaCl, 0.1% Triton X-100) at RT using standard 0.2ml Eppendorf tubes with end-over end mixing (30 rpm) for 2 h followed by two washes using 50μl of IP buffer. All washes were performed by gently pipetting the beads 10 times up and down. Magnetic beads were captured by 1-minute incubation on a magnetic stand (Diagenode). Finally, beads were suspended in the original amount of IP buffer (1μl per IP).

The density of cultured K562 or H1299 cells was determined using haemocytometer. Cells were spun down and resuspended in PBS at a density 100 or 1,000 cells per 0.5μl. Subsequently 0.5μl of lysis-restriction mix (1μl FD (FastDigest) buffer combined with 3.75μl of 2x nuclear lysis buffer (20mM Tris HCl, pH 7.4, 20mM NaCl, 6mM MgCl₂, 0.2% NP-40)) and 0.25μl 4x restriction enzyme mix (AluI #FD0014, SaqAI#FD2174, HinfI#FD0804, MvaI#FD0554, all FastDigest enzymes from Thermo Scientific in equal amounts) was added to the cells and incubated 15 min on ice and thereafter 5 min at 37°C. Next, 1μl of 0.2% NaDOC, 0.2% TritonX-100 with protease inhibitors was added to the samples and incubated 15 min on ice, vortexed for 30 sec, after which 8μl of IP buffer (20mM Tris HCl, pH 7.4, 150mM NaCl, 2mM EDTA and 1% TritonX-100) and 1μl of ProtG Dynabeads (Thermo Fisher Scientific) prebound with corresponding antibody was added to the samples. Chromatin immunoprecipitation was performed at 4°C for four hours with end-over end mixing (30 rpm). After IP, beads were washed twice with 100μl of following buffers: low salt washing buffer (0.1% SDS, 1% Triton X-100, 2mM EDTA, 20mM Tris HCl (pH 8.0), 150mM NaCl), high salt washing buffer (0.1% SDS, 1% Triton X-100, 2mM EDTA, 20mM Tris HCl (pH 8.0), 500mM NaCl), IP buffer and 20mM Tris HCl, pH 7.4, as described above. To reduce background the beads were carried over to a new tube during the last wash. Multichannel pipet was used for processing multiple samples simultaneously.

Tagging was performed by resuspending the beads in 2.5μl of transposase mix (prepared by mixing 5μl of 2x DNA tagment buffer with 4μl mQ and 1μl Transposase) (Illumina Nextera kit) and incubation of 1 min at 37°C. Beads were washed once with 100μl of low salt washing buffer, once with 20mM Tris HCl pH 7.4, as described above, and resuspended in 5μl of 20mM Tris HCl, pH 7.4. 16 cycles of PCR were performed using bead bound DNA as a template by mixing 5μl of beads with 2,5μl of 5μM forward and reverse primers from the ATAC-seq protocol [22] and 10μl of 2x NEBNext PCR master mix (New England Biolabs). The following PCR program was used: (72°C 5 min, 98°C 2 min, 98°C 10 sec, 63°C 10 sec, 72°C 1 min, repeat steps 3–5 15 times, hold at 4°C). PCR products were purified with Agencourt RNA XP magnetic beads (Beckman Coulter), eluted in 10μl of Tris HCl pH 7.4 followed by quality control and DNA quantification using NanoDrop, Qubit (Thermo Fisher Scientific) and TapeStation (Agilent). The resulting library was subjected to 50 or 75bp paired-end Illumina sequencing using either HiSeq2500 or NextSeq platforms. Alternatively, the library was analyzed with q-RT-PCR using Applied Biosystems 7900HT real-time qPCR machine, HOT FIREPol^® EvaGreen^® qPCR Mix Plus (Solis BioDyne) and following primers: GAPDH_F CCCGTCCTTGACTCCCTAG, GAPDH_R CTGGTTCAACTGGGCACG; VPS29_F TCGCTACTTCCTGTTCTGCA, VPS29_R GATAGGGGCACGGTCCTC; ZNF7_F TACTGTTTCCTCGCCAGCTC, ZNF7_R GAGGCAAAGGAGACAAAGCA; Neg_cntrl_F CAAATGTGGTCACTAAGGCAAC, Neg_cntrl_R GTGACTCTCCTGGACCAACA.

RAT-ChIP-seq with bovine blastocysts was performed as described above except that after dissection the cells were collected in 3μl of embryo medium. Lysis/restriction buffer was prepared by combining 4.75μl of 10x NL (100mM Tris HCl, pH 7.4, 100mM NaCl, 30mM MgCl₂) buffer, 4.75μl of 10x FastDigest buffer and 0.5μl of mix of 4x restriction endonucleases. 0.75μl of lysis/restriction buffer was added to 3μl of cells and incubated 15 min on ice and 5 min at 37°C. Next, 1μl of mix of 0.5% NaDOC, 0.5% TritonX100 with protease inhibitor cocktail was added to the samples and incubated 15 min on after which 15μl of IP buffer (20mM Tris HCl pH 7.4, 2mM EDTA, 150mM NaCl, 0.1% Triton X-100) was added and sample was divided into 2 tubes, 10μl each for subsequent IP with Dynabeads bound with corresponding antibodies.

Two types of input samples were prepared. First, the samples were treated according to the RAT-ChIP protocol, but instead of immunoprecipitation, DNA was extracted using DNA Clean & Concentrator kit from Zymo Research, followed by tagging with Tn5 transposase according to manufacturer’s protocol (Illumina) and sequencing library construction using PCR. Second, samples were initially treated according to the RAT-ChIP protocol and tagging of chromatin was performed in cell nuclei right after treatment with restriction enzymes followed by DNA extraction and library generation using PCR as described above. Detailed RAT-ChIP protocol is available in S1 File and at protocols.io website (dx.doi.org/10.17504/protocols.io.69qhh5w).

In silico analysis of restriction enzyme cutting sites

Human genome hg19 version GRCh37.p13 from EnsEMBL website (http://grch37.ensembl.org/) was used for the in silico analysis. The consensus sequence (‘AGCT’, ‘TTAA’, ‘CCWGG’—where ‘W* can be either ‘A’ or ‘T’, and ‘GANTC’—where ‘N’ can be any nucleotide) for each restriction enzyme (AluI, SaqAI, HinfI, MvaI) respectively, was mapped onto each chromosome sequence. The final list contained over 50 million restriction site positions. The recorded coordinates were used to determine the average number of restriction enzyme recognition sites per 1kb in chromHMM K562 chromatin state regions [23] (repeat containing states 14 and 15 were excluded from the analysis) using custom written Perl scripts followed by average fragment length calculation. Custom Perl scripts are available from https://github.com/reidar-andreson/nucleosomes. Coordinates for gap, repeat and chromHMM K562 chromatin state regions were downloaded from UCSC Table browser [24]. Qualimap [25] was used to get the quality metrics of different ChIP-seq datasets in S4 Table and for the calculation of median read length in K562 cell input sample in different chromatin state regions.

Used publicly available data

Raw data from following K562 ChIP-seq datasets from GEO database were downloaded and reprocessed as described below: GSM945165, GSM945228, GSM733680, GSM733658, GSM1782695, GSM1782755, GSM1782693, GSM1782739, GSM1918612-GSM1918616, GSM1918602-GSM1918606, GSM1918592-GSM1918596, GSM1918582-GSM1918586, GSM1141671 and GSM1141672. bESC ChIP-seq data is form GSE110039 and K562 RNA-seq data from GSM1940168. For S13C Fig, bigwig files were downloaded from the following datasets—GSM2082690, GSM2082693, GSM2082696, GSM2082698, GSM2082701, GSM2082703. For ICM and TE gene expression comparisons gene lists from the following publications were used [26–30].

RAT-ChIP-seq and expression data analysis

Sequencing reads were mapped to hg19 or bosTau8 genome using Bowtie2 (version 2.3.3.1) [31] using following parameters -k 2 -N 1. Next, bam files were sorted and indexed using SAMtools (version 1.6) [32] and bigwig files were created using deepTools 2.0 [33]. We used blacklist regions for hg19 genome annotation created during ENCODE project to exclude them from further analysis [34]. Bigwig files were visualized in UCSC genome browser as custom tracks [24]. Peak calling was done using SICER-rb.sh script (version 1.1) [35] with following parameters: (100 200 200 0.74 600 50). For the final list of peaks regions that overlapped with hg19 blacklisted regions were discarded. Differential peak calling was done using SICER-df-rb.sh script (version 1.1) [35] with following parameters: (100 200 200 0.74 600 50). In addition to the FDR threshold, arbitrary 4-fold cut-off was used for H3K27me3 signal to restrict the number regions for further analysis. Manipulation with genomic regions such as intersection or subtraction was done using bedtools (version 2.26.0) [36] or various operate on genomic intervals tools in the Cistrome Galaxy server [37,38]. Global pairwise Pearson correlation analysis, clustering and heatmap generation was performed in 5kb windows using multiple wiggle files correlation tool in Cistrome Galaxy server [37,38]. Global H3K4me3 heatmaps in 10kb regions around TSS (in 100bp windows) were generated using heatmap tool in Cistrome Galaxy server [37,38]. Average H3K4me3 enrichment profiles in 10kb regions around TSS (in 10bp windows) were generated using Sitepro tool in Cistrome Galaxy server [37,38]. Locus specific average metagene (3kb up and downstream of the gene and 6kb metagene body) enrichments of H3K27me3 were calculated and visualized using bwtool [39]. GO enrichment analysis of cell-type enriched histone modification regions (identified using SICER and sorted based on fold change) was performed using GREAT (version 3.0.0) [40] with proximal: 5 kb upstream, 1 kb downstream, plus distal: up to 100 kb, parameters. Venn diagrams were created using InteractiVenn [41]. For data visualization and statistical analysis MS Excel and GraphPad were used. Paired T-Test was used to calculate if average signals between pairs of corresponding gene regions (TE or ICM upregulated genes) in TE and ICM were significantly (p<0.05) different.

The datasets supporting the conclusions of this article are included within the article, its supplementary materials and in the Gene Expression Omnibus (GEO) repository, [GSE103734].

Results

Development of the RAT-ChIP method

The outline of the RAT-ChIP method is depicted in Fig 1A. The whole protocol is designed to work as a single tube–one-day assay, reducing the number of necessary experimental steps, thus minimizing the loss of material. When designing the assay we aimed to use Tn5 transposase for sequencing library preparation as is used in ATAC-seq [42] and ChIPmentation method [43]. We therefore needed a method for chromatin fragmentation that would be compatible with both tagmentation and low cell numbers. Sonication is the most commonly used method chromatin fragmentation but it was not a good option in our case, as it cannot be done in small volumes using the standard equipment available in most labs. Larger volumes, in turn, lead to dilution of the material, which is undesirable when working with a small number of cells. Moreover, we wanted to avoid crosslinking, which due to harsh treatment needs to be done when using sonication, as it adds several additional steps to the protocol resulting in loss of material. We therefore turned to enzymatic digestion methods, which can be performed in considerably smaller volumes without the need for crosslinking (at least when working with histones). Micrococcal nuclease (MNase) digestion that is used in native ChIP protocols for chromatin fragmentation was in our case not optimal, as MNase digests the entire free DNA between nucleosomes resulting in inefficient adapter insertion when these nucleosomes will be used for tagging. Thus, we looked for alternative enzymatic means for chromatin digestion.

**Fig. 1. RAT-ChIP enables genome wide histone modification profiling from 100 cells.**

Restriction endonucleases have long been used for DNA footprinting to identify the locations of DNA bound proteins. They only cut at specific recognition sequences, preferably in between nucleosomes, and do not have exonuclease activity. Theoretically, even a single 4bp recognizing restriction enzyme should cut on average every 256bp but the actual cutting frequency depends on the DNA sequence and position of the nucleosomes. Nevertheless, a combination of frequently cutting restriction enzymes should allow achieving relatively even fragmentation coverage across the genome and fine enough resolution needed for histone modification profiling. We therefore tested an array of frequently cutting restriction endonucleases (S2 Table) for their ability to cut DNA in the chromatin context. All used enzymes were from Thermo Scientific FastDigest lineup, which allows for rapid 5-minute digestions. We used buffer conditions containing a low amount of non-ionic detergents that can disrupt cell and nuclear membranes but do not interfere with the enzymatic activity. Out of the 10 tested restriction endonucleases, 5 were able to cut chromatin with various efficiencies, as evidenced by the appearance of nucleosomal ladders on an agarose gel (S1A Fig). AluI and SaqAI were the most efficient cutters followed by MvaI, HinfI and BsuRI. Surprisingly, the other enzymes were not able to visibly cut the chromatin even if they were able to cut naked DNA under the same conditions (data not shown). Although the best cutting single enzymes relatively efficiently fragmented chromatin, a majority of the DNA was still too large for ChIP. We therefore cut chromatin using combinations of restriction enzymes and observed a decrease in average DNA size when more restriction endonucleases were used simultaneously (S1B Fig). Next, we tested if incubation time has an effect on fragmentation efficiency but did not see a substantial variation in DNA size between 5-, 10 -⁠ and 15-minute incubation times (S1C Fig), indicating that 5 minutes is enough to digest majority of the DNA.

Since restriction enzymes recognize specific DNA sequences we used in silico analysis to identify the genome-wide cutting sites of the used restriction endonucleases based on hg19 genome build. Using a combination of 4 restriction endonucleases (AluI, SaqAI, MvaI and HinfI), 87% of the genome was predicted to be cut into smaller pieces than 1,000bp (S2A Fig). In total there were 2,465 regions that based on the in silico analysis remained larger than 1,000 bp. Out of these 273 overlapped with gap regions (regions with no annotated sequence in the hg19 genome build), 290 overlapped with blacklisted regions (regions that have abnormally high read counts in next-generation sequencing based studies) identified by ENCODE project and 2,065 overlapped with repeat regions downloaded from UCSC table browser RepeatMasker track (S3A Table). This left with only 299 regions that did not overlap with any of the three lists (S2B Fig and S3B Table), showing that at least in silico there are only a handful of unique genomic regions that cannot be effectively fragmented using a combination of restriction endonucleases.

We also compared in silico DNA fragment sizes that would be created by cutting with the 4 restriction enzymes with experimentally determined fragment sizes after cutting with the 4 restriction enzymes followed by tagmentation and sequencing. The analysis showed that regardless of the studied genomic regions with different chromatin states defined by chromHMM, the average/median size of DNA fragments was always smaller than 176bp–indicative of even and sufficient fragmentation for ChIP (S2C Fig).

Having identified that restriction enzymes could be used for chromatin digestion we proceeded with testing the RAT-ChIP protocol. The use of enzymatic digestion enabled us to considerably downscale the sample volume so that the digestion was performed in 1μl and immunoprecipitation (IP) was performed in 11μl final volume using 1μl of magnetic beads pre-bound with an antibody against the histone modification of interest. After IP, beads were washed and tagging was performed directly on the beads. After another round of washes, the beads were used directly in PCR reaction. Skipping the decrosslinking, proteinase K treatment and DNA purification steps minimizes the loss of DNA and allows working with very low amounts of material. Even when starting with only 100 cells, we easily obtained enough material for sequencing after 16 rounds of PCR (S3A Fig). Moreover, tagging on beads further decreases the fragment size, so that the bulk RAT-ChIP library was between 200 and 500bp in size (Fig 1B, S3B Fig), which is ideal for sequencing. Initial RAT-ChIP tests using 100 and 1,000 erythroleukemic K562 cells and H3K4me3 antibody followed by qPCR showed enrichment at the promoters of housekeeping genes GAPDH and VSP29 compared to negative control regions (4^th exon of ZNF7 gene and an intergenic region on chr17 (neg ctrl)) (S3C Fig) showing that using qPCR the method is capable of detecting histone modification enrichments from only 100 cells.

RAT-ChIP enables high quality genome-wide histone modification profiling from 100 cells

We next coupled our RAT-ChIP protocol with Illumina sequencing for genome-wide histone modification profiling. We used human K562 cells derived from a chronic myelogenous leukemia for which many publicly available datasets exist allowing us to compare our method with others. We used 100 and 1,000 cells and antibodies that recognize H3K4me3 or H3K27me3 to see if RAT-ChIP can be used to study both active and inactive histone marks. After paired-end sequencing the reads were mapped to hg19 genome assembly, enrichment profiles were created and visualized as custom tracks in the UCSC genome browser. Visual inspection and comparison to the corresponding ENCODE data suggested that RAT-ChIP can produce high quality profiles that look similar to ENCODE data for both histone H3K4me3 and H3K7me3 modifications (Fig 1C).

To further assess the quality of the RAT-ChIP data we compared it to several other published ChIP-seq experiments that had data available with K562 cells. These included two datasets from ENCODE, a native ChIP-seq dataset (NCHIP), as well as three datasets from low input methods (see S4 Table) [15,34,43–46]. We downloaded raw sequencing data and processed all the datasets in the same way. Comparing various parameters such as % of mapped reads, GC% and fragment size for paired-end data showed that although there is variability between the compared datasets, RAT-ChIP performed comparably to other methods (S4 Table). One parameter that, as expected, is clearly dependent on the number of cells used is the percentage of duplicated reads. In here, the low input methods had more duplicated reads ranging from 30–64% (S4 Table).

Visualization in the UCSC genome browser showed that RAT-ChIP produces similar enrichment profiles with all the studied datasets (S4 Fig). To further assess how does RAT-ChIP compare to other methods we conducted clustering based on global correlations in 5kb windows between all the datasets. Two main clusters formed according to the studied histone H3 modification (Fig 1D). Overall, the following correlations were found between H3K4me3 (r2 = 0.71–1.00) and H3K27me3 datasets (r2 = 0.43–0.93). Within the modifications, unsurprisingly, datasets from the same lab showed higher correlations and usually clustered together. Global heatmaps of H3K4me3 signals around + -⁠ 5kb region of TSSs also showed similar profiles between RAT-ChIP and published datasets (S5A Fig).

To identify how RAT-ChIP signal intensities correlate with gene expression we divided genes into 3 groups based on RPKM values of published K562 cell-line RNA-seq data [47]. Average signal intensities around + -⁠ 5kb region of TSS correlated with gene expression status in all profiled datasets, expectedly, genes with higher expression had more H3K4me3 around their TSS. One striking difference that appeared with the average signal intensity profiles around TSS was that while majority datasets displayed a lower signal around TSS, known to harbour a nucleosome free-region, RAT-ChIP data, recently published CUT&Tag datasets [46] and to a lower extent MINT-ChIP [15] showed a clear signal in this region. This difference might be caused by both biological and technical reasons. The signal at the TSS in RAT-ChIP data could be reduced if fragments in between 120–420 bp were analyzed or if input sample signal was subtracted from the IP signal but not when subtracting H3 RAT-ChIP signal (S5B Fig).

Average H3K27me3 levels in metagene plots, opposite to H3K4me3 data, negatively correlated with gene expression—in all datasets, the promoter and whole gene body had higher levels of H3K27me3 signal in genes with lower expression (S6 Fig). However, the shape of the average profiles varied from dataset to dataset. With RAT-ChIP data, the average signal was very little influenced by exclusion of small (<120) and large (>420) fragments but was markedly influenced by subtracting H3 signal and clearly over compensated when subtracting input signal (S6 Fig). These data collectively show that RAT-ChIP produces comparable H3K4me3 and H3K27me3 profiles to published data, and manages to capture the known properties of these two histone marks.

To assess the reproducibility of the method we created additional 3 replicates using 100 and 1000 K562 cells and H3K4me3 and H3K27me3 antibodies (S7 Fig). Pairwise comparison of the replicates ChIP signal in 5kb windows showed that, expectedly, there is more variation between samples with 100 cells (Pearson correlation coefficients 0.69–0.81 for H3K4me3 samples and 0.67–0.76 for H3K27me3 samples) compared to 1000 cells (Pearson correlation coefficients between 0.93–0.97 for H3K4me3 samples and 0.84–0.95 for H3K27me3 samples) (S8 Fig).

We next determined regions enriched for histone H3K4me3 using SICER and overlapped the regions between different samples (S5 Table). When one of the ENCODE datasets (UW1) was used as a reference RAT-ChIP H3K4me3 peaks overlapped with 72–73% of the reference peaks which was in the same range with Mint-ChIP (68–73%), CUT&Tag (66–74%) and NChIP (71%) but lower than with other methods with more cells (82–90%) and a replicate from the same lab (93%) (S9A Fig). This analysis shows that despite of lower overlap, low input methods are still capable of identifying majority of the enriched regions. Moreover, the regions not identified, have on average much lower signal in the original ENCODE data, suggesting that RAT-ChIP misses regions with low H3K4me3 enrichment (S9B Fig).

RAT-ChIP can identify differences in histone modification profiles between cell-lines

Having identified that RAT-ChIP-seq data from K562 cells are comparable to other published datasets we next studied if it is capable of identifying cell type specific differences in histone modification profiles. To this end we performed RAT-ChIP-seq with 100 and 1,000 cells in human non-small cell lung carcinoma cell line H1299, for which no published ChIP-seq data exist, using histone H3K4me3 and H3K27me3 antibodies. After alignment and filtering, bigwig files were created and visualized in UCSC genome browser. Visual inspection and comparison of the RAT-ChIP tracks from H1299 and K562 cells showed similar enrichment profiles at the transcriptional start sites (TSS) of genes for H3K4me3 and broad H3K27me3 domains (S10A Fig). Inspecting the loci of known hematopoietic transcription factors such as GFI1b (Fig 2A), GATA1 (S10B Fig), LMO2 (S10C Fig), ETO2 (data not shown) and entire globin locus (S11A Fig), revealed clear differences between the two cell lines. For example, in GFI1b locus there was enrichment of H3K4me3 around the TSS in K562 cells but the modification was completely absent in H1299 cells (Fig 2A). The opposite was seen with H3K27me3, where a region around GFI1b gene had clearly higher signal of H3K27me3 in H1299 cells compared to K562 cells (Fig 2A). Similar examples could be found for H1299 cell-line—several genes involved in epithelial to mesenchymal transition (EMT) such as TWIST2 and SIX1, showed elevated histone modification profile of active genes in H1299 cells (S11B and S11C Fig). Pairwise correlation and clustering analysis showed that samples clustered first according to the profiled histone modifications and within the modifications according to cell-lines (Fig 2B). To gain a more global view of the differences we identified differential H3K4me3 peaks between the two cell lines. Fig 2C shows heatmap of the signal intensities around TSS of 300 genes, which were differentially modified in one of the cell lines in contrast with 300 random genes that were not differentially modified. To see if the differentially modified regions are near functionally relevant genes we performed a GO enrichment analysis using GREAT [40]. Analysis of 500 top H3K4me3 peaks that were more enriched in one of the cell lines compared to the other revealed enrichment of hematopoiesis related terms for K562 cells and signalling related terms for H1299 cells in biological processes category (Fig 2D). Similar analysis was performed with H3K27me3 mark (S12A and S12B Fig). Regions with higher signal (2840 regions) in K562 cell line were enriched in genes associated with cellular movement associated GO terms and regions with higher signal in H1299 cells (2350 regions) were enriched in hematopoiesis related GO terms. This analysis shows that RAT-ChIP-seq can identify hundreds of tissue specific genes with different histone modification profiles between K562 and H1299 cells.

**Fig. 2. RAT-ChIP can identify differences in histone modifications between cell-lines.**

RAT-ChIP enables histone profiling of blastocyst stage bovine embryos

Recently, our group has used bovine as a model to study the molecular events that take place during of early embryogenesis of large mammals–chromosomal instability in particular [21]. We therefore aimed to put RAT-ChIP-seq to test and profile histone H3K4me3 and H3K27me3 modifications in blastocyst stage embryos. Thus far, mouse is the only mammal, which embryos have been used for genome-wide histone profiling at such an early stage of development [16,17,20]. Using in vitro fertilized embryos, we used micromanipulator in combination with laser microdissection to separate blastocysts into inner cell mass (ICM) and trophectoderm (TE) fractions. Pooled material of three embryos was subsequently used for RAT-ChIP-seq experiments. After alignment to bosTau8 genome, bigwig tracks with enrichment profiles were created and visualized in UCSC genome browser next to recently published histone modification data from bovine embryonic stem cells (bESCs) [48]. As expected, histone H3K4me3 was enriched mostly around promoter regions while histone H3K27me3 had broad domains of enrichment as exemplified by looking at the locus centred around housekeeping gene GAPDH (S13A Fig), showing that RAT-ChIP can be used to obtain genome-wide histone modification profiles from early developmental stage embryos.

To gain more global view of how these two histone marks act in regulation of gene expression we intersected our histone modification data with published gene expression data from ICM and TE of bovine blastocysts. We found five studies where gene expression profiles of ICM and TE were compared [26–30]. Two of them used RNA–seq and two others Affymetrix microarrays. In addition, one of the studies compared in vivo and in vitro derived blastocysts, making it in total six datasets. We obtained the published lists of differentially expressed genes between ICM and TE for all the studies and intersected them. Overall, the overlap was relatively modest–there were only 6 and 0 gene(s) that were consistently upregulated in ICM and TE, respectively, in all 6 datasets. The same numbers for at least 5 overlapping datasets were 28 and 5 and for at least 3 overlapping datasets 210 and 221 for ICM and TE, respectively (S13B Fig, S6 Table). This analysis shows that there is a lot of variability and that the changes between TE and ICM at the transcriptome level are not huge at this early stage. In order to link gene expression to histone modification profiles we took the genes that were differentially expressed between ICM and TE at least in 3 datasets and profiled the average histone H3K4me3 profiles around 5kb regions around TSS and created metagene plots for H3K27me3 encompassing gene body and 3kb up and downstream of TSS and TES, respectively (Fig 3A). In ICM, both groups of genes had similar levels of H3K4me3 and H3K27me3. In trophectoderm, TE upregulated genes had higher average levels of H3K4me3 around their promoters and lower levels of H3K27me3 levels around the whole gene region compared to genes upregulated in ICM (Fig 3A), suggesting for a more pronounced epigenetic regulation. The differences seen between ICM and TE are not due to major differences in immunoprecipitation quality as both H3K4me3 and H3K27me3 average signal was clearly associated with gene expression levels in both datasets (S14A Fig).

**Fig. 3. Histone H3K4me3 and H3K27me3 modification profiles of ICM and TE of blastocyst stage bovine embryos.**

The involvement of epigenetic regulation in ICM and TE specific gene expression is further supported by the analysis of average signals between pairs of corresponding gene regions (TE or ICM upregulated genes) in TE and ICM. In all cases there were statistically significant differences—ICM upregulated genes had more H3K4me3 and less H3K27me3 signal in ICM compared to TE and, vice versa, TE upregulated genes had more H3K4me3 and less H3K27me3 signal in TE compared to ICM (S14B Fig).

Although between samples the changes in histone modifications are in the expected direction, the changes are relatively modest–on a single gene level SICER managed to identify higher H3K4me3 on the TSS of 77 genes out of 210 (37%) ICM upregulated genes in ICM and on the TSS of 82 genes out of 221 (37%) for TE upregulated genes in TE. The same numbers for TSS where H3K4me3 levels were upregulated at least 2x were 17 (8%) and 12 (5.4%) for ICM and TE upregulated genes respectively (S6C and S6D Table). To identify genes, which are potentially polycomb regulated, we calculated average H3K27me3 levels for ICM and TE upregulated genes for regions spanning the whole gene plus 2kb upstream and for a region +-2kb of TSS. We identified 6 (gene+2kb upstream) and 17 (+-2kb of TSS) genes where the changes in H3K27me 3 levels were at least 4 times higher in TE compared to ICM for ICM upregulated genes and 23 (gene +2kb upstream) and 24 (+-2kb of TSS) genes where the changes in H3K27me 3 levels were at least 4 times higher in ICM compared to TE for TE upregulated genes (S6C and S6D Table).

To understand how the data can be used to learn about the epigenetic regulation of individual genes, we focused on genes known to be important in either ICM or TE specification and function. The promoter region of NANOG, a well-known pluripotency gene in embryonic stem (ES) cells was in our combined list of ICM upregulated genes and had a H3K4me3 peak in ICM but not in TE (Fig 3B). This is different from a recently published data from mouse where Nanog promoter region is enriched for H3K4me3 in both ICM and TE (S15A Fig) [20]. Another good example where we detected difference in H3K4me3 levels around the promoter region is DPPA3, although it was differentially expressed between TE and ICM in only one of the five published transcriptome studies (S15B Fig, S6B Table). In contrast, enrichment of H3K4me3 seen in ICM sample was absent from the promoter of DPPA3 gene in a recently published data with bovine embryonic stem cells (S15B Fig) [48] and was present in both ICM and TE of mouse blastocysts (S15C Fig).

Similar to DPPA3, the master regulator of TE development, CDX2 [49], was in the list of TE upregulated genes in only one of the expression datasets (S6B Table) and was enriched for H3K4me3 in both cell types. Interestingly, there was an enrichment of H3K27me3 upstream of CDX2 gene specifically in ICM. The same region came up as the first hit when BLAT alignment was performed using a sequence of the recently characterized mouse trophectoderm specific enhancer upstream of CDX2 gene [50], suggesting that CDX2 might be down regulated in ICM through Polycomb-mediated enhancer repression. These examples show that RAT-ChIP-seq data can be used for hypotheses generation to identify molecular mechanisms driving gene regulation in early embryonic development.

Discussion

We have developed a novel low input ChIP method called RAT-ChIP that can be used to create genome-wide histone modification profiles from only 100 cells. There are several important modifications to the standard protocol that enabled us to achieve successful results with such a low number of cells.

First, the use of restriction enzymes for chromatin fragmentation enabled us to keep the sample volumes small, which is essential when working with low amount of starting material. The volumes used in other published protocols, except for these that use dedicated equipment [13] not readily available, use much higher volumes. In addition, due to small volumes the reagent costs are reduced significantly. For example, in a typical ChIP experiment 30μl of magnetic beads are used compared to 1μl in RAT-ChIP. Similar to MNase, restriction enzymes only cut in between nucleosomes but in contrast to MNase they leave DNA overhangs that can be used for sequencing library generation by tagmentation using chromatin as a template. Restriction endonucleases have been used in DNA fingerprinting studies for years. More recently they have been successfully used in different chromosome conformation capture (3C) based methods to assess the 3D chromatin architecture [51]. The drawback of using restriction enzymes is that the cutting is not random. However, combining several frequently cutting enzymes enables to optimize the coverage and desired fragment size. As restriction endonucleases are sequence-specific and only cut in between nucleosomes there is no problem of over digestion. Moreover, due to the sequence specificity it is possible to predict genome-wide cutting sites. In combination of the four restriction endonucleases used in this study based on in silico analysis most of the genome is fragmented to the size suitable for ChIP. The larger fragments that remain often overlap with gaps, ENCODE project identified black regions (regions that have abnormally high number of reads in next generation sequencing data) or repeats that are difficult to analyze. Moreover, the larger regions that can cause false positive signals can be identified in silico and removed from further analysis. A recently published method called RELACS also used a similar approach to us showing that restriction endonucleases can be used for chromatin fragmentation in ChIP assays [52].

Second–minimization of steps where material could be lost, such as centrifugations and DNA extractions. This was achieved by omitting several steps in regular ChIP protocols such as crosslinking and proteinase K treatment. All steps in a protocol are carried out in a single tube so that the first DNA purification occurs only after PCR, when loss of material is not anymore an issue.

Third—simple, one step library preparation. We adapted the first step of library generation step from the ChIPmentation method. In addition, being extremely simple and cost effective, due to the random nature of tagmentation it allows to further decrease DNA size, so that majority of the fragments in the final library come from single nucleosomes. In contrast to ChIPmentation, we performed PCR directly on magnetic beads using the bound chromatin as a template, similar to recently published high-throughput ChIPmentation [53] and lobChIP [54]. Skipping DNA purification avoids loss of material. The method is also very fast taking less than a day to complete. Recently, several methods–CUT&RUN, CUT&Tag, ChIL–seq, scChIC-seq and CoBATCH have been published that look really promising, as they have been shown to be able to obtain data from single cells [46,55–59].

Using K562 cells, we showed that the histone H3K4me3 and H3K27me3 profiles created using RAT-ChIP compare well to other published datasets, demonstrating that it can be used to profile chromatin marks associated with both active and inactive genes. One difference we observed in RAT-ChIP H3K4me3 data compared to regular ChIP datasets was a prominent signal exactly at TSS. Even higher signal in this region was also seen in CUT&Tag H3K4me3 data, but not in ChIPmentation data both of which use Tn5 transposase. To a lesser extent, the signal was also present in H3K4me3 Mint-ChIP data. All the three methods that display the peak do not use crosslinking and use mild conditions for chromatin fragmentation. In case of expressed genes, the region surrounding TSS is more accessible and usually considered to be depleted of nucleosomes [60]. Therefore, part of the signal definitely comes from higher accessibility. We do see a signal in this region if we prepare input samples by tagging restriction enzyme digested chromatin within cell nuclei. However, the input prepared this way resembles ATAC-seq and is not ideal control as in case of RAT-ChIP the tagging takes place on beads after chromatin precipitation. The fact that the signal is still present after exclusion of small fragments (<120bp) suggests that this region might contain histone proteins that are not detectable with regular ChIP assays. Indeed, several papers have reported for the presence of MNase sensitive -1 nucleosome [61–64]. Our data agree with these reports and suggests that this nucleosome is H3K4me3 methylated.

One issue that arises with ChIP methods that use on-bead tagmentation is what kind of control samples to use, as input cannot be treated the same manner as immunoprecipitation samples. In our case, tagging input samples in intact nuclei right after restriction enzyme treatment creates a profile that resembles ATAC-seq data and tagging after DNA extraction misses the accessibility created bias. Tagmentation method, has opted for the use of IgG controls, however, IgG samples are not considered the best controls for sequencing based approaches [65]. Because of the low amount of starting material we failed to obtain good quality libraries with IgG samples. Another option is to use immunoprecipitation with H3 antibody as a control since these samples are treated identical to the samples of interest and give more even coverage compared to IgG. It has been shown that using INPUT or H3 pull-down as a reference has small differences with a negligible impact on the quality of a standard analysis [66]. Moreover, several reports have shown that using an input has a little if any advantage over not using an input [67–70]. Some studies imply that they do even worse and should be used only for peak prioritization [71]. The benefits or caveats of using input are probably case specific. For example, for pairwise comparison of samples to find regions with differential enrichments, input is not crucial, as biases present in both samples should cancel each other out. More care should be taken when quantitative statements are being made about the amounts of modifications. In this case even proper input might not be sufficient and using spike-ins should be considered if global changes in histone modification patterns are excepted [72,73]. Considering the above, we recommend using immunoprecipitation with H3 as the first choice for a control sample. Moreover, analysis should be done with and without using a control and attention should be paid when the regions of interest show differences depending on the analysis. In this case, an alternative method should be used to verify the results.

Replication experiments showed that RAT-ChIP can constantly obtain data from as few as 100 cells, however there is more experimental variation with such a low number of cells and thus replicates are especially important to make biologically valid conclusions.

By profiling histone H3K4me3 and H3K27me3 modifications in H1299 cells and comparing it to the data from K562 cells we showed that RAT-ChIP can identify epigenetic differences between cell lines.

We also showed that RAT-ChIP works on small numbers of primary cells by applying it to blastocyst stage bovine embryos. In addition to the vast potential bovine has in farming and biomedicine, it also serves as a good model system to study the molecular events that take place during early embryogenesis as its development is more similar to human compared to other common model organisms such as mouse [74]. Majority of the epigenetics experiments, genome-wide studies in particular, have been performed in mouse. Using RAT-ChIP we have created the first genome wide histone H3K4me3 and H3K27me3 modification profiles of ICM and TE of blastocyst stage bovine embryos. Considering that day 7–8 embryos consist on average about 125 cells (80 in TE and 45 in ICM) [45,46] and we used material from 3 pooled embryos for 2 ChIPs, only 70–120 cells were used per one immunoprecipitation.

Combined analysis of our histone modification data with lists of ICM and TE upregulated genes from published papers showed that gene expression changes are on average reflected by expected changes in histone modifications. The average differences in H3K4me3 levels are more pronounced in regions adjacent to TSS, especially downstream. This is in agreement with several recent reports that have associated breadth of H3K4me3 domains with cell identity and transcriptional activity [20,75]. Due to several reasons, however, such as low levels of enrichment in some regions, not completely pure cell populations, cellular heterogeneity and biological similarity, the changes at epigenetic level are not huge between ICM and TE. Nevertheless, looking at the histone modification profiles of factors with known importance either in ICM or TE function we could see evidence for the involvement of epigenetics in gene regulation. For example, a well-known pluripotency factor NANOG TSS had H3K4me3 only in ICM but not in TE. Similarly, it was shown recently that there is loss of H3K4me3 on NANOG gene upon human embryonal carcinoma NT2/D1 cell differentiation towards neural progenitors [76]. This is different from mouse blastocysts where the whole Nanog gene is covered with H3K4me3 both in ICM and TE (S15A Fig) [20]. Similarly, there were no differences in H3K4me3 signal at the promoter of DPP3A gene between ICM and TE in the mouse data, while our bovine data showed a signal only in ICM. This is again different from a recently published bES cell data where the promoter region of DPP3A was devoid of H3K4me3. DPP3A has a known conserved function of protecting the female genome from TET3 activity, it has role in pluripotency maintenance and reprogramming [77] and has been shown to be essential for bovine embryonic development [78]. The observed differences might be explained by locus specificity, timing, cell-type, cell purity used in ChIP experiments, or interspecies differences. Indeed, it was shown recently that there are interspecies differences upon ablation of OCT4 gene between mouse and human, including the regulation of NANOG gene expression [79]. There are also several examples of differentially regulated genes between ICM and ES cells [80]. These examples show that there are important cell type and interspecies differences that need to be taken into account when drawing conclusions about the regulation of specific genes.

At the moment we have tested RAT-ChIP only with histone modifications. It remains to be studied if it could be also used to profile other chromatin bound proteins, including transcription factors. As the interaction of transcription factors is in general more labile, crosslinking step is usually used in ChIP. However, there is a recent protocol called ORGANIC ChIP, which demonstrated that transcription factors can be also immunoprecipitated without the need for crosslinking [81]. Moreover, recent CUT&RUN and CUT&Tag methods do not use crosslinking and work with transcription factors [46,55,56].

With the publication of several novel ultra-low input ChIP methods it is important to note that each method has its own characteristics, using small amount of material inevitably creates more variability and thus direct comparison of results obtained with different methods can be complicated. For example, Tn5 transposase based methods can be influenced by chromatin accessibility and correction for this bias with input is not always straightforward. Therefore, to confirm biological significance, replicate experiments are crucial and results obtained with one method should be repeatable by a different method.

In summary, we have developed a novel simple yet sensitive RAT-ChIP method that it can be used to study genome-wide histone modifications from less than 100 cells. Using the new method we have created the first genome wide histone H3K4me3 and H3K27me3 profiles in blastocyst stage bovine embryos that serve as a resource for further studies.

Supporting information

S1 Fig [pdf]
Restriction enzymes can be used for chromatin fragmentation.

S2 Fig [pdf]
analysis of fragment sizes produced by 4 restriction enzymes in hg19 genome assembly.

S3 Fig [pdf]
RAT-ChIP can identify histone H3K4me3 modification enrichments from 100 cells.

S4 Fig [pdf]
RAT-ChIP enrichment profiles compared to other publicly available datasets.

S5 Fig [pdf]
RAT-ChIP enrichment profiles compared to other publicly available datasets.

S6 Fig [pdf]
RAT-ChIP enrichment profiles compared to other publicly available datasets.

S7 Fig [pdf]
RAT-ChIP enrichment profiles of 4 replicate experiments.

S8 Fig [a]
Correlation analysis of replicate RAT-ChIP experiments.

S9 Fig [pdf]
RAT-ChIP H3K4me3 peak comparison with published datasets.

S10 Fig [a]
RAT-ChIP can identify cell type specific histone profile differences.

S11 Fig [a]
RAT-ChIP can identify cell type specific histone profile differences.

S12 Fig [pdf]
RAT-ChIP can identify differences in histone modifications between cell-lines.

S13 Fig [left]
RAT-ChIP can identify histone H3K4me3 and H3K27me3 modification profiles from bovine blastocysts.

S14 Fig [pdf]
RAT-ChIP H3K4me3 and H3K27me3 enrichment profiles in bovine ICM and TE correlate with gene expression.

S15 Fig [pdf]
RAT-ChIP data from bovine ICM and TE enables identification of cell type and species-specific differences in histone modifications.

S1 File [pdf]
RAT-ChIP-seq protocol.

S1 Table [xlsx]
List of selected low input ChIP-seq methods.

S2 Table [xlsx]
List of tested Thermo Scientific FastDigest restriction endonucleases.

S3 Table [xlsx]
List of regions larger than 1kb after restriction.

S4 Table [xlsx]
Statistics and quality parameters of RAT-ChIP-seq and reanalyzed ChIP-seq datasets in K562 cells.

S5 Table [xlsx]
Number and overlap of H3K4me3 peaks of RAT-ChIP-seq and reanalyzed ChIP-seq datasets in K562 cells.

S6 Table [xlsx]
Integration of RAT-ChIP and published expression data of bovine ICM and TE.

Zdroje

1. Park PJ. ChIP-seq: advantages and challenges of a maturing technology. Nat. Rev. Genet. 2009;10 : 669–80. doi: 10.1038/nrg2641 19736561

2. Schmidt D, Wilson MD, Spyrou C, Brown GD, Hadfield J, Odom DT. ChIP-seq: Using high-throughput sequencing to discover protein?DNA interactions. Methods. 2009;48 : 240–8. doi: 10.1016/j.ymeth.2009.03.001 19275939

3. Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007;448 : 553–60. doi: 10.1038/nature06008 17603471

4. Heintzman ND, Hon GC, Hawkins RD, Kheradpour P, Stark A, Harp LF, et al. Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature. 2009;459 : 108–12. doi: 10.1038/nature07829 19295514

5. Adli M, Zhu J, Bernstein BE. Genome-wide chromatin maps derived from limited numbers of hematopoietic progenitors. Nat. Methods. 2010;7 : 615–8. doi: 10.1038/nmeth.1478 20622861

6. Shankaranarayanan P, Mendoza-Parra M-A, Walia M, Wang L, Li N, Trindade LM, et al. Single-tube linear DNA amplification (LinDA) for robust ChIP-seq. Nat. Methods. 2011;8 : 565–7. doi: 10.1038/nmeth.1626 21642965

7. Zwart W, Koornstra R, Wesseling J, Rutgers E, Linn S, Carroll JS. A carrier-assisted ChIP-seq method for estrogen receptor-chromatin interactions from breast cancer core needle biopsy samples. BMC Genomics. 2013;14 : 232. doi: 10.1186/1471-2164-14-232 23565824

8. Ng J-H, Kumar V, Muratani M, Kraus P, Yeo J-C, Yaw L-P, et al. In Vivo Epigenomic Profiling of Germ Cells Reveals Germ Cell Molecular Signatures. Dev. Cell. 2013;24 : 324–33. doi: 10.1016/j.devcel.2012.12.011 23352811

9. Lara-Astiaso D, Weiner A, Lorenzo-Vivas E, Zaretsky I, Jaitin DA, David E, et al. Chromatin state dynamics during blood formation. Science (80-). 2014;345 : 943–9.

10. Brind’Amour J, Liu S, Hudson M, Chen C, Karimi MM, Lorincz MC. An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations. Nat. Commun. 2015;6 : 6033. doi: 10.1038/ncomms7033 25607992

11. Schmidl C, Rendeiro AF, Sheffield NC, Bock C. ChIPmentation: fast, robust, low-input ChIP-seq for histones and transcription factors. Nat. Methods. 2015;12 : 963–5. doi: 10.1038/nmeth.3542 26280331

12. Zheng X, Yue S, Chen H, Weber B, Jia J, Zheng Y. Low-Cell-Number Epigenome Profiling Aids the Study of Lens Aging and Hematopoiesis. Cell Rep. 2015;13 : 1505–18. doi: 10.1016/j.celrep.2015.10.004 26549448

13. Cao Z, Chen C, He B, Tan K, Lu C. A microfluidic device for epigenomic profiling using 100 cells. Nat. Methods. 2015;12 : 959–62. doi: 10.1038/nmeth.3488 26214128

14. Rotem A, Ram O, Shoresh N, Sperling RA, Goren A, Weitz DA, et al. Single-cell ChIP-seq reveals cell subpopulations defined by chromatin state. Nat. Biotechnol. 2015;33 : 1165–72. doi: 10.1038/nbt.3383 26458175

15. van Galen P, Viny AD, Ram O, Ryan RJH, Cotton MJ, Donohue L, et al. A Multiplexed System for Quantitative Comparisons of Chromatin Landscapes. Mol. Cell. 2016;61 : 170–80. doi: 10.1016/j.molcel.2015.11.003 26687680

16. Zhang B, Zheng H, Huang B, Li W, Xiang Y, Peng X, et al. Allelic reprogramming of the histone modification H3K4me3 in early mammalian development. Nature. 2016;537 : 553–7. doi: 10.1038/nature19361 27626382

17. Dahl JA, Jung I, Aanes H, Greggains GD, Manaf A, Lerdrup M, et al. Broad histone H3K4me3 domains in mouse oocytes modulate maternal-to-zygotic transition. Nature. 2016;537 : 548–52. doi: 10.1038/nature19360 27626377

18. Skene PJ, Henikoff JG, Henikoff S. Targeted in situ genome-wide profiling with high efficiency for low cell numbers. Nat. Protoc. Nature Publishing Group; 2018;13 : 1006–19. doi: 10.1038/nprot.2018.015 29651053

19. Valensisi C, Liao JL, Andrus C, Battle SL, Hawkins RD. cChIP-seq: a robust small-scale method for investigation of histone modifications. BMC Genomics. BioMed Central; 2015;16 : 1083. doi: 10.1186/s12864-015-2285-7 26692029

20. Liu X, Wang C, Liu W, Li J, Li C, Kou X, et al. Distinct features of H3K4me3 and H3K27me3 chromatin domains in pre-implantation embryos. Nature. 2016;537 : 558–62. doi: 10.1038/nature19362 27626379

21. Destouni A, Zamani Esteki M, Catteeuw M, Tšuiko O, Dimitriadou E, Smits K, et al. Zygotes segregate entire parental genomes in distinct blastomere lineages causing cleavage-stage chimerism and mixoploidy. Genome Res. 2016;26 : 567–78. doi: 10.1101/gr.200527.115 27197242

22. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods. 2013;10 : 1213–8. doi: 10.1038/nmeth.2688 24097267

23. Ernst J, Kheradpour P, Mikkelsen TS, Shoresh N, Ward LD, Epstein CB, et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature. 2011;473 : 43–9. doi: 10.1038/nature09906 21441907

24. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The human genome browser at UCSC. Genome Res. 2002;12 : 996–1006. doi: 10.1101/gr.229102 12045153

25. García-Alcalde F, Okonechnikov K, Carbonell J, Cruz LM, Götz S, Tarazona S, et al. Qualimap: evaluating next-generation sequencing alignment data. Bioinformatics. 2012;28 : 2678–9. doi: 10.1093/bioinformatics/bts503 22914218

26. Ozawa M, Sakatani M, Yao J, Shanker S, Yu F, Yamashita R, et al. Global gene expression of the inner cell mass and trophectoderm of the bovine blastocyst. BMC Dev. Biol. 2012;12 : 33. doi: 10.1186/1471-213X-12-33 23126590

27. Brinkhof B, van Tol HT, Groot Koerkamp MJ, Riemers FM, IJzer SG, Mashayekhi K, et al. A mRNA landscape of bovine embryos after standard and MAPK-inhibited culture conditions: a comparative analysis. BMC Genomics. 2015;16 : 277. doi: 10.1186/s12864-015-1448-x 25888366

28. Nagatomo H, Akizawa H, Sada A, Kishi Y, Yamanaka K, Takuma T, et al. Comparing spatial expression dynamics of bovine blastocyst under three different procedures: in-vivo, in-vitro derived, and somatic cell nuclear transfer embryos. Jpn. J. Vet. Res. 2015;63 : 159–71. 26753242

29. Zhao X-M, Cui L-S, Hao H-S, Wang H-Y, Zhao S-J, Du W-H, et al. Transcriptome analyses of inner cell mass and trophectoderm cells isolated by magnetic-activated cell sorting from bovine blastocysts using single cell RNA-seq. Reprod. Domest. Anim. 2016;51 : 726–35. doi: 10.1111/rda.12737 27440443

30. Hosseini SM, Dufort I, Caballero J, Moulavi F, Ghanaei HR, Sirard MA. Transcriptome profiling of bovine inner cell mass and trophectoderm derived from in vivo generated blastocysts. BMC Dev. Biol. 2015;15 : 49. doi: 10.1186/s12861-015-0096-3 26681441

31. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9 : 357–9. doi: 10.1038/nmeth.1923 22388286

32. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25 : 2078–9. doi: 10.1093/bioinformatics/btp352 19505943

33. Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–5. doi: 10.1093/nar/gkw257 27079975

34. Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489 : 57–74. doi: 10.1038/nature11247 22955616

35. Xu S, Grullon S, Ge K, Peng W. Spatial clustering for identification of ChIP-enriched regions (SICER) to map regions of histone methylation patterns in embryonic stem cells. Methods Mol. Biol. 2014;1150 : 97–111. doi: 10.1007/978-1-4939-0512-6_5 24743992

36. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. Narnia; 2010;26 : 841–2. doi: 10.1093/bioinformatics/btq033 20110278

37. Liu T, Ortiz JA, Taing L, Meyer CA, Lee B, Zhang Y, et al. Cistrome: an integrative platform for transcriptional regulation studies. Genome Biol. 2011;12:R83. doi: 10.1186/gb-2011-12-8-r83 21859476

38. Blankenberg D, Von Kuster G, Coraor N, Ananda G, Lazarus R, Mangan M, et al. Galaxy: a web-based genome analysis tool for experimentalists. Curr. Protoc. Mol. Biol. 2010;Chapter 19:Unit 19.10.1–21.

39. Pohl A, Beato M. bwtool: a tool for bigWig files. Bioinformatics. Oxford University Press; 2014;30 : 1618–9. doi: 10.1093/bioinformatics/btu056 24489365

40. McLean CY, Bristor D, Hiller M, Clarke SL, Schaar BT, Lowe CB, et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 2010;28 : 495–501. doi: 10.1038/nbt.1630 20436461

41. Heberle H, Meirelles GV, da Silva FR, Telles GP, Minghim R. InteractiVenn: a web-based tool for the analysis of sets through Venn diagrams. BMC Bioinformatics. BioMed Central; 2015;16 : 169. doi: 10.1186/s12859-015-0611-3 25994840

42. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods. 2013;10 : 1213–8. doi: 10.1038/nmeth.2688 24097267

43. Schmidl C, Rendeiro AF, Sheffield NC, Bock C. ChIPmentation: fast, robust, low-input ChIP-seq for histones and transcription factors. Nat. Methods. 2015;

44. Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, et al. The accessible chromatin landscape of the human genome. Nature. 2012;489 : 75–82. doi: 10.1038/nature11232 22955617

45. Mercer TR, Edwards SL, Clark MB, Neph SJ, Wang H, Stergachis AB, et al. DNase I-hypersensitive exons colocalize with promoters and distal regulatory elements. Nat. Genet. 2013;45 : 852–9. doi: 10.1038/ng.2677 23793028

46. Kaya-Okur HS, Wu SJ, Codomo CA, Pledger ES, Bryson TD, Henikoff JG, et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat. Commun. Nature Publishing Group; 2019;10 : 1930. doi: 10.1038/s41467-019-09982-5 31036827

47. Frank CL, Manandhar D, Gordân R, Crawford GE. HDAC inhibitors cause site-specific chromatin remodeling at PU.1-bound enhancers in K562 cells. Epigenetics Chromatin. 2016;9 : 15. doi: 10.1186/s13072-016-0065-5 27087856

48. Bogliotti YS, Wu J, Vilarino M, Okamura D, Soto DA, Zhong C, et al. Efficient derivation of stable primed pluripotent embryonic stem cells from bovine blastocysts. Proc. Natl. Acad. Sci. 2018;115 : 2090–5. doi: 10.1073/pnas.1716161115 29440377

49. Strumpf D, Mao C-A, Yamanaka Y, Ralston A, Chawengsaksophak K, Beck F, et al. Cdx2 is required for correct cell fate specification and differentiation of trophectoderm in the mouse blastocyst. Development. 2005;132 : 2093–102. doi: 10.1242/dev.01801 15788452

50. Rayon T, Menchero S, Nieto A, Xenopoulos P, Crespo M, Cockburn K, et al. Notch and hippo converge on Cdx2 to specify the trophectoderm lineage in the mouse blastocyst. Dev. Cell. NIH Public Access; 2014;30 : 410–22.

51. Davies JOJ, Oudelaar AM, Higgs DR, Hughes JR. How best to identify chromosomal interactions: a comparison of approaches. Nat. Methods. 2017;14 : 125–34. doi: 10.1038/nmeth.4146 28139673

52. Arrigoni L, Al-Hasani H, Ramírez F, Panzeri I, Ryan DP, Santacruz D, et al. RELACS nuclei barcoding enables high-throughput ChIP-seq. Commun. Biol. Nature Publishing Group; 2018;1 : 214. doi: 10.1038/s42003-018-0219-z 30534606

53. Gustafsson C, De Paepe A, Schmidl C, Månsson R. High-throughput ChIPmentation: freely scalable, single day ChIPseq data generation from very low cell-numbers. BMC Genomics. 2019;20 : 59. doi: 10.1186/s12864-018-5299-0 30658577

54. Wallerman O, Nord H, Bysani M, Borghini L, Wadelius C. lobChIP: from cells to sequencing ready ChIP libraries in a single day. Epigenetics Chromatin. BioMed Central; 2015;8 : 25. doi: 10.1186/s13072-015-0017-5 26195988

55. Hainer SJ, Fazzio TG. High-Resolution Chromatin Profiling Using CUT&RUN. Curr. Protoc. Mol. Biol. 2019;e85. doi: 10.1002/cpmb.85 30688406

56. Hainer SJ, Bošković A, McCannell KN, Rando OJ, Fazzio TG. Profiling of Pluripotency Factors in Single Cells and Early Embryos. Cell. Cell Press; 2019;177 : 1319–1329.e11.

57. Ku WL, Nakamura K, Gao W, Cui K, Hu G, Tang Q, et al. Single-cell chromatin immunocleavage sequencing (scChIC-seq) to profile histone modification. Nat. Methods. 2019;16 : 323–5. doi: 10.1038/s41592-019-0361-7 30923384

58. Wang Q, Xiong H, Ai S, Yu X, Liu Y, Zhang J, et al. CoBATCH for High-Throughput Single-Cell Epigenomic Profiling. Mol. Cell. Cell Press; 2019;

59. Harada A, Maehara K, Handa T, Arimura Y, Nogami J, Hayashi-Takanaka Y, et al. A chromatin integration labelling method enables epigenomic profiling with lower input. Nat. Cell Biol. 2019;21 : 287–96. doi: 10.1038/s41556-018-0248-3 30532068

60. Hughes AL, Rando OJ. Mechanisms Underlying Nucleosome Positioning In Vivo. Annu. Rev. Biophys. 2014;43 : 41–63. doi: 10.1146/annurev-biophys-051013-023114 24702039

61. Kubik S, Bruzzone MJ, Jacquet P, Falcone J-L, Rougemont J, Shore D. Nucleosome Stability Distinguishes Two Different Promoter Types at All Protein-Coding Genes in Yeast. Mol. Cell. 2015;60 : 422–34. doi: 10.1016/j.molcel.2015.10.002 26545077

62. Voong LN, Xi L, Sebeson AC, Xiong B, Wang J-P, Wang X. Insights into Nucleosome Organization in Mouse Embryonic Stem Cells through Chemical Mapping. Cell. Cell Press; 2016;167 : 1555–1570.e15.

63. Ramani V, Qiu R, Shendure J. High Sensitivity Profiling of Chromatin Structure by MNase-SSP. Cell Rep. Cell Press; 2019;26 : 2465–2476.e4. doi: 10.1016/j.celrep.2019.02.007 30811994

64. Brahma S, Henikoff S. RSC-Associated Subnucleosomes Define MNase-Sensitive Promoters in Yeast. Mol. Cell. Elsevier; 2019;73 : 238–249.e3. doi: 10.1016/j.molcel.2018.10.046 30554944

65. Kidder BL, Hu G, Zhao K. ChIP-Seq: technical considerations for obtaining high-quality data. Nat. Immunol. 2011;12 : 918–22. doi: 10.1038/ni.2117 21934668

66. Flensburg C, Kinkel SA, Keniry A, Blewitt ME, Oshlack A. A comparison of control samples for ChIP-seq of histone modifications. Front. Genet. 2014;5.

67. Zhang Y-F, Su B. Peak identification for ChIP-seq data with no controls. Dong wu xue yan jiu = Zool. Res. 2012;33:E121–8. doi: 10.3724/SP.J.1141.2012.E120-06E121 23266983

68. Cheung M-S, Down TA, Latorre I, Ahringer J. Systematic bias in high-throughput sequencing data and its correction by BEADS. Nucleic Acids Res. Oxford University Press; 2011;39:e103–e103. doi: 10.1093/nar/gkr425 21646344

69. de Boer BA, van Duijvenboden K, van den Boogaard M, Christoffels VM, Barnett P, Ruijter JM. OccuPeak: ChIP-Seq Peak Calling Based on Internal Background Modelling. Langmann T, editor. PLoS One. Public Library of Science; 2014;9:e99844. doi: 10.1371/journal.pone.0099844 24936875

70. Szalkowski AM, Schmid CD. Rapid innovation in ChIP-seq peak-calling algorithms is outdistancing benchmarking efforts. Brief. Bioinform. Oxford University Press; 2011;12 : 626–33. doi: 10.1093/bib/bbq068 21059603

71. Thomas R, Thomas S, Holloway AK, Pollard KS. Features that define the best ChIP-seq peak calling algorithms. Brief. Bioinform. 2016;18:bbw035.

72. Chen K, Hu Z, Xia Z, Zhao D, Li W, Tyler JK. The Overlooked Fact: Fundamental Need for Spike-In Control for Virtually All Genome-Wide Analyses. Mol. Cell. Biol. American Society for Microbiology (ASM); 2016;36 : 662.

73. Egan B, Yuan C-C, Craske ML, Labhart P, Guler GD, Arnott D, et al. An Alternative Approach to ChIP-Seq Normalization Enables Detection of Genome-Wide Changes in Histone H3 Lysine 27 Trimethylation upon EZH2 Inhibition. PLoS One. Public Library of Science; 2016;11:e0166438. doi: 10.1371/journal.pone.0166438 27875550

74. Santos RR, Schoevers EJ, Roelen BAJ. Usefulness of bovine and porcine IVM/IVF models for reproductive toxicology. Reprod. Biol. Endocrinol. BioMed Central; 2014;12 : 117. doi: 10.1186/1477-7827-12-117 25427762

75. Benayoun BA, Pollina EA, Ucar D, Mahmoudi S, Karra K, Wong ED, et al. H3K4me3 breadth is linked to cell identity and transcriptional consistency. Cell. NIH Public Access; 2014;158 : 673–88. doi: 10.1016/j.cell.2014.06.027 25083876

76. Topalovic V, Schwirtlich M, Stevanovic M, Mojsin M. Histone modifications on the promoters of human OCT4 and NANOG genes at the onset of neural differentiation of NT2/D1 cells. Biochem. 2017;82 : 715–22.

77. Zhao S, Xu J, Liu S, Cui K, Li Z, Liu N. Dppa3 in pluripotency maintenance of ES cells and early embryogenesis. J. Cell. Biochem. 2019;120 : 4794–9. doi: 10.1002/jcb.28063 30417435

78. Bakhtari A, Ross PJ. DPPA3 prevents cytosine hydroxymethylation of the maternal pronucleus and is required for normal development in bovine embryos. Epigenetics. 2014;9 : 1271–9. doi: 10.4161/epi.32087 25147917

79. Fogarty NME, McCarthy A, Snijders KE, Powell BE, Kubikova N, Blakeley P, et al. Genome editing reveals a role for OCT4 in human embryogenesis. Nature. Nature Research; 2017;550 : 67–73. doi: 10.1038/nature24033 28953884

80. Tang F, Barbacioru C, Bao S, Lee C, Nordman E, Wang X, et al. Tracing the derivation of embryonic stem cells from the inner cell mass by single-cell RNA-Seq analysis. Cell Stem Cell. Elsevier; 2010;6 : 468–78. doi: 10.1016/j.stem.2010.03.015 20452321

81. Kasinathan S, Orsi GA, Zentner GE, Ahmad K, Henikoff S. High-resolution mapping of transcription factor binding sites on native chromatin. Nat. Methods. 2014;11 : 203–9. doi: 10.1038/nmeth.2766 24336359