TY - JOUR
T1 - Ab-origin
T2 - An enhanced tool to identify the sourcing gene segments in germline for rearranged antibodies
AU - Wang, Xiaojing
AU - Wu, Di
AU - Zheng, Siyuan
AU - Sun, Jing
AU - Tao, Lin
AU - Li, Yixue
AU - Cao, Zhiwei
N1 - Funding Information:
We thank Mr. Yeo WeeKiang for helping us improving the English writing, and the anonymous reviewers for their constructive comments. We also thank Dr. Wu Wei for fruitful and helpful discussion. This work was supported in part by grants from Ministry of Science and Technology China (2004CB720103, 2006AA02Z317), National Natural Science Foundation of China (30500107), Shanghai Municipal Education Commission (2000236018, 2000236016) and Science and technology commission of Shanghai municipality (06PJ14072).
PY - 2008/12/12
Y1 - 2008/12/12
N2 - Background: In the adaptive immune system, variable regions of immunoglobulin (IG) are encoded by random recombination of variable (V), diversity (D), and joining (J) gene segments in the germline. Partitioning the functional antibody sequences to their sourcing germline gene segments is vital not only for understanding antibody maturation but also for promoting the potential engineering of the therapeutic antibodies. To date, several tools have been developed to perform such "trace-back" calculations. Yet, the predicting ability and processing volume of those tools vary significantly for different sets of data. Moreover, none of them give a confidence for immunoglobulin heavy diversity (IGHD) identification. Developing fast, efficient and enhanced tools is always needed with the booming of immunological data. Results: Here, a program named Ab-origin is presented. It is designed by batch query against germline databases based on empirical knowledge, optimized scoring scheme and appropriate parameters. Special efforts have been paid to improve the identification accuracy of the short and volatile region, IGHD. In particular, a threshold score for certain sensitivity and specificity is provided to give the confidence level of the IGHD identification. Conclusion: When evaluated using different sets of both simulated data and experimental data, Ab-origin outperformed all the other five popular tools in terms of prediction accuracy. The features of batch query and confidence indication of IGHD identification would provide extra help to users. The program is freely available at http://mpsq.biosino.org/ ab-origin/supplementary.html.
AB - Background: In the adaptive immune system, variable regions of immunoglobulin (IG) are encoded by random recombination of variable (V), diversity (D), and joining (J) gene segments in the germline. Partitioning the functional antibody sequences to their sourcing germline gene segments is vital not only for understanding antibody maturation but also for promoting the potential engineering of the therapeutic antibodies. To date, several tools have been developed to perform such "trace-back" calculations. Yet, the predicting ability and processing volume of those tools vary significantly for different sets of data. Moreover, none of them give a confidence for immunoglobulin heavy diversity (IGHD) identification. Developing fast, efficient and enhanced tools is always needed with the booming of immunological data. Results: Here, a program named Ab-origin is presented. It is designed by batch query against germline databases based on empirical knowledge, optimized scoring scheme and appropriate parameters. Special efforts have been paid to improve the identification accuracy of the short and volatile region, IGHD. In particular, a threshold score for certain sensitivity and specificity is provided to give the confidence level of the IGHD identification. Conclusion: When evaluated using different sets of both simulated data and experimental data, Ab-origin outperformed all the other five popular tools in terms of prediction accuracy. The features of batch query and confidence indication of IGHD identification would provide extra help to users. The program is freely available at http://mpsq.biosino.org/ ab-origin/supplementary.html.
UR - http://www.scopus.com/inward/record.url?scp=57649176319&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=57649176319&partnerID=8YFLogxK
U2 - 10.1186/1471-2105-9-S12-S20
DO - 10.1186/1471-2105-9-S12-S20
M3 - Article
C2 - 19091020
AN - SCOPUS:57649176319
SN - 1471-2105
VL - 9
JO - BMC bioinformatics
JF - BMC bioinformatics
IS - SUPPL. 12
M1 - S20
ER -