Advancing Epigenetics Towards Systems Biology

Identification of Transcription Factor Binding Sites in ChIP-exo using R/Bioconductor (Prot 68)

Pedro Madrigal1,2

Introduction

Precisely mapping protein-DNA binding to genomic sites is a pivotal task in order to understand gene regulation. Chromatin immunoprecipitation (ChIP) followed by microarray hybridization (ChIP-chip) or sequencing (ChIP-seq) have been extensively used to map transcription factor binding sites (TFBSs), with ChIP-seq comparing favourably with respect to ChIP-chip in terms of resolution and signal-to-noise ratio (Ho et al., 2011). While ChIP-seq remains the standard, most-used methodology (Furey, 2012), λ exonuclease digestion followed by high-throughput sequencing, or ChIP-exo, has recently emerged as a powerful and promising technique able to substitute ChIP-seq, and to circumvent its limitations (Rhee and Pugh, 2011; Mendenhall and Bernstein, 2012). In this protocol, the distribution of mapped reads is characterised by pairs of two distinct peaks, one at each DNA strand, centred at the λ exonuclease borders and separated frequently at fixed distances (Rhee and Pugh, 2011). Importantly, the improved resolution of ChIP-exo can provide novel insights into protein-DNA interactions (Rhee and Pugh, 2011; Serandour et al., 2013). Furthermore, ChIP-exo distinguishes weaker peaks more confidently, and also closely-located binding events, that in ChIP-seq are generally unresolved or deconvolved through computational approaches (e.g., Guo et al. (2012)).
In this protocol, first I describe the differences between ChIP-seq and ChIP-exo data analysis pipelines, and then concentrate on peak calling using the R/Bioconductor package CexoR. Unlike (for example) the popular ChIP-seq peak caller MACS (Feng et al., 2012), CexoR analyses multiple ChIP-exo replicates together, allowing a better identification of narrow peaks and simpler downstream analysis.
CexoR is able to locate reproducible protein-DNA interaction in ChIP-exo datasets with no need of genome sequence information, manual matching of peak-pairs, paired control data (inputs), or downstream assessment of replicate reproducibility. In addition, the R statistical environment allows integration with other pipelines and downstream analyses via other R and Bioconductor packages.

PDF version

Pedro Madrigal1,2

1 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
2 Wellcome Trust-MRC Cambridge Stem Cell Institute, Anne McLaren Laboratory for Regenerative Medicine, Department of Surgery, University of Cambridge, Cambridge, CB2 0SZ, UK

Corresponding author: Pedro Madrigal
Email feedback to: This email address is being protected from spambots. You need JavaScript enabled to view it.

Pedro Madrigal
Wed, May 1st 2019- Sat, May 4th 2019

Epigenetics refers to heritable change in gene expression that does not involve changes to the underlying DNA sequence. At least three systems including DNA methylation, histone modifications and non-...

Mon, Jun 10th 2019- Tue, Jun 11th 2019

Following the success of the previous conferences, we are excited to announce that “7th International Conference on Hypertension & Healthcare” will be held during June 10-11, 2019 at the most beautif...

Mon, Jun 17th 2019- Wed, Jun 19th 2019

Being successful in the previous 3 conferences in the series, Cardiologists 2016 in Berlin, Germany, Cardiologists 2017 in Paris, France and Cardiologists 2018 in Barcelona, Spain, this year we are mo...

Mon, Jun 17th 2019- Tue, Jun 18th 2019

RNA modifications are ubiquitous in biology and present in all classes of cellular RNAs. Recent studies have shown that RNA modifications are critical to the processing and metabolism of different RNA...

LAST EVENTS

EpiGeneSys Final
Meeting in Paris

Thur. 11 February 2016 - Sat. 13 February 2016

More than 280 scientists attended the fifth Annual Meeting of EpiGeneSys. The conference kicked off with a talk by coordinator Geneviève Almouzni, Director of the Research Center at the Institut Curie, highlighting the achievements of the network over more than five years...

Maison des océans - Paris Read more

PAST EVENTS

The Non-Coding Genome ...

December 3-4 th, 2015

The last training workshop of the EpiGeneSys network

Hotel Mediterraneo - Rome, Italy Read more

Paris / TriRhena Chromatin Club

July 9th, 2015

...exciting talks and network with members of the Chromatin community!

... An EpiGeneSys TAB workshop

June 11st-12nd , 2015

... learn about current approaches to single cell epigenetics and to meet up and network with...

Montpellier, FranceRead more

Latest publications

2017-06-27

The Histone Acetyltransferase Mst2 Protects Active Chromatin from Epigenetic Silencing by Acetylating the Ubiquitin Ligase Brl1.

Read more
2017-05-26

Proliferation Drives Aging-Related Functional Decline in a Subpopulation of the Hematopoietic Stem Cell Compartment.

Read more
2017-04-30

The impact of rare and low-frequency genetic variants in common disease.

Read more