mRNA sequence predictions from homologous protein sequences

G. Valiquette, E. A. Zimmerman, J. L. Roberts

2 Scopus citations


A model has been developed that permits the prediction of mRNA nucleic acid sequence from the sequences of the translated proteins. The model relies on the information obtained from the comparison of protein sequences in related species to reduce the number of possible codons for those amino acids where mutations are observed. The predictions so obtained have been tested by applying the model to proteins whose mRNA sequences are known. The model's predictions have been found to be 100% accurate if three or more different amino acids are known at a given position and if the protein sequences are restricted to relatively closely related species (within the same class). The use of this model may permit a reduction of the mRNA sequence degeneracy and therefore be helpful in the synthesis of cDNA probes or for the prediction of restriction endonuclease sites. Computer programs have been developed to ease the use of the model.

Original languageEnglish (US)
Pages (from-to)445-458
Number of pages14
JournalJournal of Theoretical Biology
Issue number3
Publication statusPublished - Feb 7 1985


ASJC Scopus subject areas

  • Statistics and Probability
  • Modeling and Simulation
  • Biochemistry, Genetics and Molecular Biology(all)
  • Immunology and Microbiology(all)
  • Agricultural and Biological Sciences(all)
  • Applied Mathematics

Cite this