TY - JOUR
T1 - Quantitative assessment reveals the dominance of duplicated sequences in germline-derived extrachromosomal circular DNA
AU - Mouakkad-Montoya, Lila
AU - Murata, Michael M.
AU - Sulovari, Arvis
AU - Suzuki, Ryusuke
AU - Osia, Beth
AU - Malkova, Anna
AU - Katsumata, Makoto
AU - Giuliano, Armando E.
AU - Eichler, Evan E.
AU - Tanaka, Hisashi
N1 - Publisher Copyright:
© 2021 National Academy of Sciences. All rights reserved.
PY - 2021/11/23
Y1 - 2021/11/23
N2 - Extrachromosomal circular DNA (eccDNA) originates from linear chromosomal DNA in various human tissues under physiological and disease conditions. The genomic origins of eccDNA have largely been investigated using in vitro-amplified DNA. However, in vitro amplification obscures quantitative information by skewing the total population stoichiometry. In addition, the analyses have focused on eccDNA stemming from single-copy genomic regions, leaving eccDNA from multicopy regions unexamined. To address these issues, we isolated eccDNA without in vitro amplification (naïve small circular DNA, nscDNA) and assessed the populations quantitatively by integrated genomic, molecular, and cytogenetic approaches. nscDNA of up to tens of kilobases were successfully enriched by our approach and were predominantly derived from multicopy genomic regions including segmental duplications (SDs). SDs, which account for 5% of the human genome and are hotspots for copy number variations, were significantly overrepresented in sperm nscDNA, with three times more sequencing reads derived from SDs than from the entire singlecopy regions. SDs were also overrepresented in mouse sperm nscDNA, which we estimated to comprise 0.2% of nuclear DNA. Considering that eccDNA can be integrated into chromosomes, germline-derived nscDNA may be a mediator of genome diversity.
AB - Extrachromosomal circular DNA (eccDNA) originates from linear chromosomal DNA in various human tissues under physiological and disease conditions. The genomic origins of eccDNA have largely been investigated using in vitro-amplified DNA. However, in vitro amplification obscures quantitative information by skewing the total population stoichiometry. In addition, the analyses have focused on eccDNA stemming from single-copy genomic regions, leaving eccDNA from multicopy regions unexamined. To address these issues, we isolated eccDNA without in vitro amplification (naïve small circular DNA, nscDNA) and assessed the populations quantitatively by integrated genomic, molecular, and cytogenetic approaches. nscDNA of up to tens of kilobases were successfully enriched by our approach and were predominantly derived from multicopy genomic regions including segmental duplications (SDs). SDs, which account for 5% of the human genome and are hotspots for copy number variations, were significantly overrepresented in sperm nscDNA, with three times more sequencing reads derived from SDs than from the entire singlecopy regions. SDs were also overrepresented in mouse sperm nscDNA, which we estimated to comprise 0.2% of nuclear DNA. Considering that eccDNA can be integrated into chromosomes, germline-derived nscDNA may be a mediator of genome diversity.
KW - Extrachromosomal circular DNA
KW - Multi-mapped reads
KW - Segmental duplications
KW - Sperm
UR - http://www.scopus.com/inward/record.url?scp=85120314815&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85120314815&partnerID=8YFLogxK
U2 - 10.1073/pnas.2102842118
DO - 10.1073/pnas.2102842118
M3 - Article
C2 - 34789574
AN - SCOPUS:85120314815
SN - 0027-8424
VL - 118
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 47
M1 - e2102842118
ER -