TY - JOUR
T1 - ProBAMsuite, a bioinformatics framework for genome-based representation and analysis of proteomics data
AU - Wang, Xiaojing
AU - Slebos, Robbert J.C.
AU - Chambers, Matthew C.
AU - Tabb, David L.
AU - Liebler, Daniel C.
AU - Zhang, Bing
N1 - Publisher Copyright:
© 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
PY - 2016/3
Y1 - 2016/3
N2 - To facilitate genome-based representation and analysis of proteomics data, we developed a new bioinformatics framework, proBAMsuite, in which a central component is the protein BAM (proBAM) file format for organizing peptide spectrum matches (PSMs)1 within the context of the genome. proBAMsuite also includes two R packages, pro-BAMr and proBAMtools, for generating and analyzing pro-BAM files, respectively. Applying proBAMsuite to three recently published proteomics datasets, we demonstrated its utility in facilitating efficient genome-based sharing, interpretation, and integration of proteomics data. First, the interpretation of proteomics data is significantly enhanced with the rich genomic annotation information. Second, PSMs can be easily reannotated using user-specified gene annotation schemes and assembled into both protein and gene identifications. Third, using the genome as a common reference, proBAMsuite facilitates seamless proteomics and proteogenomics data integration. Finally, proBAM files can be readily visualized in genome browsers and thus bring proteomics data analysis to a general audience beyond the proteomics community. Results from this study establish proBAMsuite as a useful bioinformatics framework for proteomics and proteogenomics research.
AB - To facilitate genome-based representation and analysis of proteomics data, we developed a new bioinformatics framework, proBAMsuite, in which a central component is the protein BAM (proBAM) file format for organizing peptide spectrum matches (PSMs)1 within the context of the genome. proBAMsuite also includes two R packages, pro-BAMr and proBAMtools, for generating and analyzing pro-BAM files, respectively. Applying proBAMsuite to three recently published proteomics datasets, we demonstrated its utility in facilitating efficient genome-based sharing, interpretation, and integration of proteomics data. First, the interpretation of proteomics data is significantly enhanced with the rich genomic annotation information. Second, PSMs can be easily reannotated using user-specified gene annotation schemes and assembled into both protein and gene identifications. Third, using the genome as a common reference, proBAMsuite facilitates seamless proteomics and proteogenomics data integration. Finally, proBAM files can be readily visualized in genome browsers and thus bring proteomics data analysis to a general audience beyond the proteomics community. Results from this study establish proBAMsuite as a useful bioinformatics framework for proteomics and proteogenomics research.
UR - http://www.scopus.com/inward/record.url?scp=84962503174&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84962503174&partnerID=8YFLogxK
U2 - 10.1074/mcp.M115.052860
DO - 10.1074/mcp.M115.052860
M3 - Article
C2 - 26657539
AN - SCOPUS:84962503174
SN - 1535-9476
VL - 15
SP - 1164
EP - 1175
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 3
ER -