?
Elaine R Mardis
Elaine R. Mardis is at the Washington University School of Medicine, St. Louis, Missouri 63108, USA.
Next-generation sequencing technology combines with chromatin immunoprecipitation to provide a genome-wide look at transcription-factor binding.
Next-generation sequencing technologies, capable of producing tens of millions of sequence reads during each instrument run, are quickly being applied in a myriad of creative ways to answer genome-wide questions. In this issue, Robertson and colleagues describe such an application, comparing chromatin immunoprecipitation (ChIP) of the Stat1 transcription factor using a next-generation sequencing platform and a conventional microarray-based platform1.
This report provides an elegant example of the power of next-generation sequencing platforms to expand what once was a focused assay to a genome-wide scope. In the process, our ability to characterize and understand phenomena such as alterations in transcription-factor binding in response to environmental stimuli can be evaluated for the entire genome in a single experiment. Hence, the ramifications for the pace of biological inquiry and the functional annotation of genomes are profound.
ChIP, first described by Varshavsky and colleagues2 as a method to study protein-DNA interactions, comprises three basic steps. First, covalent cross-links between proteins and DNA are formed, typically by treating cells with formaldehyde or another chemical reagent. In the second step, an antibody specific to the protein of interest is used to selectively coimmunoprecipitate the protein-bound DNA fragments that were covalently cross-linked. Finally, the immunoprecipitated protein-DNA links are reversed and the recovered DNA is assayed to determine the sequences bound by that protein (Fig. 1). Because random protein-DNA cross-linking can occur, and nonspecific DNA can be pulled down in the immunoprecipitation step, the ChIP-selected DNA is typically compared to a mock sample of DNA collected without antibody addition during the immunoprecipitation step. Typically, these two DNA populations are differentially labeled and compared by hybridization to a genomic microarray ('ChIP-chip'), as initially reported by Ren and colleagues3 in yeast.
Although ChIP-chip approaches have greatly expanded our understanding of genome-wide protein-DNA associations, the substitution of next-generation sequencing technology to analyze the DNA fragments released after ChIP ('ChIP-seq') has distinct advantages over microarray hybridization. As shown in Robertson et al.1, the Solexa sequencing technology4 provided short read length sequences of |[sim]|30 base pairs that were ideal for characterizing ChIP-derived fragments. The researchers mapped the resulting sequences back to the reference genome, whereby the most frequently sequenced fragments formed peaks at specific genomic regions. They then analyzed sequences under these peaks by comparison known Stat1 binding site sequences and locations, and for their proximity to genes. They also compared the results to previous Stat1 ChIP-chip data. Their comparison of Stat1 binding locations in human HeLa S3 cells stimulated by interferon gamma versus unstimulated cells showed that stimulated cells provided evidence of fourfold more Stat1-bound sites, and that specific sites bound in these interferon gamma|[ndash]|stimulated cells correlated well with expectations from previous studies5, 6. Peaks for both unstimulated and stimulated cells showed the highest density at approximately 100 base pairs upstream of the transcriptional start sites of nearby genes.
In summary, ChIP-seq offers important advantages over ChIP-chip, including lower cost, minimal hands-on processing and a requirement for fewer replicate experiments as well as less input material. Moreover, the Stat1 experimental ChIP-seq data have a high degree of similarity to results obtained by ChIP-chip for the same type of experiment, with >64% of peaks in shared genomic regions. Because the data are sequence reads, ChIP-seq offers a rapid analysis pipeline (as long as a high-quality genome sequence is available for read mapping) as well as the potential to detect mutations in binding-site sequences, which may directly support any observed changes in protein binding and gene regulation.
This report, and a similar study by Johnson and colleagues7 that defined genome-wide binding sites of the neuron-restrictive silencer factor (NRSF), provide initial evidence that next-generation sequencing platforms like Solexa are being cleverly coupled with previously focused techniques to produce genome-wide views of protein-binding phenomena. Variations on this theme have also provided data that delineate changes in histone methylation and thus give insights into chromatin packaging of the human genome8, 9.
In the near future these sequencing platforms are likely to identify new microRNAs and provide quantitative tag-based gene expression data, for example. These approaches will be combined to complete the picture of biological phenomena such as embryonic stem cell development and cancer progression. Their net impact will be to rapidly enhance the functional annotation of genomes, well beyond our present understanding. Improved annotation will in turn dramatically improve our ability to interpret sequence-level variations that concomitantly are being discovered by resequencing of genomes using next-generation technologies.
REFERENCES
- Robertson, G. et al. Nat. Methods 4, 651–657 (2007). | Article |
- Solomon, M.J., Larsen, P.L. & Varshavsky, A. Cell 53, 937–947 (1988). | Article | PubMed | ISI | ChemPort |
- Ren, B. et al. Science 290, 2306–2309 (2000). | Article | PubMed | ISI | ChemPort |
- Bentley, D.R. Curr. Opin. Genet. Dev. 16, 545–552 (2006). | Article | PubMed | ISI | ChemPort |
- Brierley, M.M. & Fish, E.N. J. Interferon Cytokine Res. 25, 733–744 (2005). | Article | PubMed | ISI | ChemPort |
- Schroder, K. et al. J. Leukoc. Biol. 75, 163–189 (2004). | Article | PubMed | ISI | ChemPort |
- Johnson, D.S. et al. Science 316, 1497–1502 (2007). | Article | PubMed | ISI | ChemPort |
- Barski, A. et al. Cell 129, 823–837 (2007). | Article | PubMed | ISI | ChemPort |
- Mikkelsen, T.S. et al. Nature; published online 1 July 2007.