To mock or not: A comprehensive comparison of mock IP and DNA input for ChIP-seq

Jinrui Xu; Michelle M. Kudron; Alec Victorsen; Jiahao Gao; Haneen N. Ammouri; Fabio C.P. Navarro; Louis Gevirtzman; Robert H. Waterston; Kevin P. White; Valerie Reinke; Mark Gerstein

doi:10.1093/nar/gkaa1155

To mock or not: A comprehensive comparison of mock IP and DNA input for ChIP-seq

Jinrui Xu, Michelle M. Kudron, Alec Victorsen, Jiahao Gao, Haneen N. Ammouri, Fabio C.P. Navarro, Louis Gevirtzman, Robert H. Waterston, Kevin P. White, Valerie Reinke, Mark Gerstein

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Chromatin immunoprecipitation (IP) followed by sequencing (ChIP-seq) is the gold standard to detect transcription-factor (TF) binding sites in the genome. Its success depends on appropriate controls removing systematic biases. The predominantly used controls, i.e. DNA input, correct for uneven sonication, but not for nonspecific interactions of the IP antibody. Another type of controls, 'mock' IP, corrects for both of the issues, but is not widely used because it is considered susceptible to technical noise. The tradeoff between the two control types has not been investigated systematically. Therefore, we generated comparable DNA input and mock IP experiments. Because mock IPs contain only nonspecific interactions, the sites predicted from them using DNA input indicate the spurious-site abundance. This abundance is highly correlated with the 'genomic activity' (e.g. chromatin openness). In particular, compared to cell lines, complex samples such as whole organisms have more spurious sites-probably because they contain multiple cell types, resulting in more expressed genes and more open chromatin. Consequently, DNA input and mock IP controls performed similarly for cell lines, whereas for complex samples, mock IP substantially reduced the number of spurious sites. However, DNA input is still informative; thus, we developed a simple framework integrating both controls, improving binding site detection.

Original language	English (US)
Article number	e17
Journal	Nucleic acids research
Volume	49
Issue number	3
DOIs	https://doi.org/10.1093/nar/gkaa1155
State	Published - Feb 22 2021
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 2021 The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access

10.1093/nar/gkaa1155

OpenUrl availability

Full text

Cite this

@article{d57af6a63df54cf2956d65f5ed21a0be,

title = "To mock or not: A comprehensive comparison of mock IP and DNA input for ChIP-seq",

abstract = "Chromatin immunoprecipitation (IP) followed by sequencing (ChIP-seq) is the gold standard to detect transcription-factor (TF) binding sites in the genome. Its success depends on appropriate controls removing systematic biases. The predominantly used controls, i.e. DNA input, correct for uneven sonication, but not for nonspecific interactions of the IP antibody. Another type of controls, 'mock' IP, corrects for both of the issues, but is not widely used because it is considered susceptible to technical noise. The tradeoff between the two control types has not been investigated systematically. Therefore, we generated comparable DNA input and mock IP experiments. Because mock IPs contain only nonspecific interactions, the sites predicted from them using DNA input indicate the spurious-site abundance. This abundance is highly correlated with the 'genomic activity' (e.g. chromatin openness). In particular, compared to cell lines, complex samples such as whole organisms have more spurious sites-probably because they contain multiple cell types, resulting in more expressed genes and more open chromatin. Consequently, DNA input and mock IP controls performed similarly for cell lines, whereas for complex samples, mock IP substantially reduced the number of spurious sites. However, DNA input is still informative; thus, we developed a simple framework integrating both controls, improving binding site detection.",

author = "Jinrui Xu and Kudron, {Michelle M.} and Alec Victorsen and Jiahao Gao and Ammouri, {Haneen N.} and Navarro, {Fabio C.P.} and Louis Gevirtzman and Waterston, {Robert H.} and White, {Kevin P.} and Valerie Reinke and Mark Gerstein",

note = "Publisher Copyright: {\textcopyright} 2021 The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.",

year = "2021",

month = feb,

day = "22",

doi = "10.1093/nar/gkaa1155",

language = "English (US)",

volume = "49",

journal = "Nucleic acids research",

issn = "0305-1048",

publisher = "Oxford University Press",

number = "3",

}

TY - JOUR

T1 - To mock or not

T2 - A comprehensive comparison of mock IP and DNA input for ChIP-seq

AU - Xu, Jinrui

AU - Kudron, Michelle M.

AU - Victorsen, Alec

AU - Gao, Jiahao

AU - Ammouri, Haneen N.

AU - Navarro, Fabio C.P.

AU - Gevirtzman, Louis

AU - Waterston, Robert H.

AU - White, Kevin P.

AU - Reinke, Valerie

AU - Gerstein, Mark

PY - 2021/2/22

Y1 - 2021/2/22

N2 - Chromatin immunoprecipitation (IP) followed by sequencing (ChIP-seq) is the gold standard to detect transcription-factor (TF) binding sites in the genome. Its success depends on appropriate controls removing systematic biases. The predominantly used controls, i.e. DNA input, correct for uneven sonication, but not for nonspecific interactions of the IP antibody. Another type of controls, 'mock' IP, corrects for both of the issues, but is not widely used because it is considered susceptible to technical noise. The tradeoff between the two control types has not been investigated systematically. Therefore, we generated comparable DNA input and mock IP experiments. Because mock IPs contain only nonspecific interactions, the sites predicted from them using DNA input indicate the spurious-site abundance. This abundance is highly correlated with the 'genomic activity' (e.g. chromatin openness). In particular, compared to cell lines, complex samples such as whole organisms have more spurious sites-probably because they contain multiple cell types, resulting in more expressed genes and more open chromatin. Consequently, DNA input and mock IP controls performed similarly for cell lines, whereas for complex samples, mock IP substantially reduced the number of spurious sites. However, DNA input is still informative; thus, we developed a simple framework integrating both controls, improving binding site detection.

AB - Chromatin immunoprecipitation (IP) followed by sequencing (ChIP-seq) is the gold standard to detect transcription-factor (TF) binding sites in the genome. Its success depends on appropriate controls removing systematic biases. The predominantly used controls, i.e. DNA input, correct for uneven sonication, but not for nonspecific interactions of the IP antibody. Another type of controls, 'mock' IP, corrects for both of the issues, but is not widely used because it is considered susceptible to technical noise. The tradeoff between the two control types has not been investigated systematically. Therefore, we generated comparable DNA input and mock IP experiments. Because mock IPs contain only nonspecific interactions, the sites predicted from them using DNA input indicate the spurious-site abundance. This abundance is highly correlated with the 'genomic activity' (e.g. chromatin openness). In particular, compared to cell lines, complex samples such as whole organisms have more spurious sites-probably because they contain multiple cell types, resulting in more expressed genes and more open chromatin. Consequently, DNA input and mock IP controls performed similarly for cell lines, whereas for complex samples, mock IP substantially reduced the number of spurious sites. However, DNA input is still informative; thus, we developed a simple framework integrating both controls, improving binding site detection.

UR - http://www.scopus.com/inward/record.url?scp=85102216664&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85102216664&partnerID=8YFLogxK

U2 - 10.1093/nar/gkaa1155

DO - 10.1093/nar/gkaa1155

M3 - Article

C2 - 33347581

AN - SCOPUS:85102216664

SN - 0305-1048

VL - 49

JO - Nucleic acids research

JF - Nucleic acids research

IS - 3

M1 - e17

ER -

To mock or not: A comprehensive comparison of mock IP and DNA input for ChIP-seq

Abstract

Bibliographical note

UN SDGs

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this