Benchmarking mouse contamination removing protocols in patient-derived xenografts genomic profiling

Research output: Contribution to journalArticlepeer-review

Abstract

Patient-derived xenograft (PDX) models are widely used in cancer research. Genomic and transcriptomic profiling of PDXs are inevitably contaminated by sequencing reads originated from mouse cells. Here, we examine the impact of mouse read contamination on RNA sequencing (RNAseq), Whole Exome Sequencing (WES), and Whole Genome Sequencing (WGS) data of 21 PDXs. We also systematically benchmark the performance of 12 computational protocols for removing mouse reads from PDXs. We find that mouse read contamination increases expression of immune and stromal related genes, and inflates the number of somatic mutations. However, detection of gene fusions and copy number alterations is minimally affected by mouse read contamination. Using gold standard datasets, we find that pseudo-alignment protocols often demonstrate better prediction performance and computing efficiency. The best performing tool is a relatively new tool Xengsort. Our results emphasize the importance of removing mouse reads from PDXs and the need to adopt new tools in PDX genomic studies.

Original languageEnglish (US)
Article number113
Journalnpj Precision Oncology
Volume9
Issue number1
DOIs
StatePublished - Dec 2025

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Fingerprint

Dive into the research topics of 'Benchmarking mouse contamination removing protocols in patient-derived xenografts genomic profiling'. Together they form a unique fingerprint.

Cite this