TY - JOUR
T1 - Genomics of Compositae crops
T2 - Reference transcriptome assemblies and evidence of hybridization with wild relatives
AU - Hodgins, Kathryn A.
AU - Lai, Zhao
AU - Oliveira, Luiz O.
AU - Still, David W.
AU - Scascitelli, Moira
AU - Barker, Michael S.
AU - Kane, Nolan C.
AU - Dempewolf, Hannes
AU - Kozik, Alex
AU - Kesseli, Richard V.
AU - Burke, John M.
AU - Michelmore, Richard W.
AU - Rieseberg, Loren H.
PY - 2014/1
Y1 - 2014/1
N2 - Although the Compositae harbours only two major food crops, sunflower and lettuce, many other species in this family are utilized by humans and have experienced various levels of domestication. Here, we have used next-generation sequencing technology to develop 15 reference transcriptome assemblies for Compositae crops or their wild relatives. These data allow us to gain insight into the evolutionary and genomic consequences of plant domestication. Specifically, we performed Illumina sequencing of Cichorium endivia, Cichorium intybus, Echinacea angustifolia, Iva annua, Helianthus tuberosus, Dahlia hybrida, Leontodon taraxacoides and Glebionis segetum, as well as 454 sequencing of Guizotia scabra, Stevia rebaudiana, Parthenium argentatum and Smallanthus sonchifolius. Illumina reads were assembled using Trinity, and 454 reads were assembled using MIRA and CAP3. We evaluated the coverage of the transcriptomes using BLASTX analysis of a set of ultra-conserved orthologs (UCOs) and recovered most of these genes (88-98%). We found a correlation between contig length and read length for the 454 assemblies, and greater contig lengths for the 454 compared with the Illumina assemblies. This suggests that longer reads can aid in the assembly of more complete transcripts. Finally, we compared the divergence of orthologs at synonymous sites (Ks) between Compositae crops and their wild relatives and found greater divergence when the progenitors were self-incompatible. We also found greater divergence between pairs of taxa that had some evidence of postzygotic isolation. For several more distantly related congeners, such as chicory and endive, we identified a signature of introgression in the distribution of Ks values.
AB - Although the Compositae harbours only two major food crops, sunflower and lettuce, many other species in this family are utilized by humans and have experienced various levels of domestication. Here, we have used next-generation sequencing technology to develop 15 reference transcriptome assemblies for Compositae crops or their wild relatives. These data allow us to gain insight into the evolutionary and genomic consequences of plant domestication. Specifically, we performed Illumina sequencing of Cichorium endivia, Cichorium intybus, Echinacea angustifolia, Iva annua, Helianthus tuberosus, Dahlia hybrida, Leontodon taraxacoides and Glebionis segetum, as well as 454 sequencing of Guizotia scabra, Stevia rebaudiana, Parthenium argentatum and Smallanthus sonchifolius. Illumina reads were assembled using Trinity, and 454 reads were assembled using MIRA and CAP3. We evaluated the coverage of the transcriptomes using BLASTX analysis of a set of ultra-conserved orthologs (UCOs) and recovered most of these genes (88-98%). We found a correlation between contig length and read length for the 454 assemblies, and greater contig lengths for the 454 compared with the Illumina assemblies. This suggests that longer reads can aid in the assembly of more complete transcripts. Finally, we compared the divergence of orthologs at synonymous sites (Ks) between Compositae crops and their wild relatives and found greater divergence when the progenitors were self-incompatible. We also found greater divergence between pairs of taxa that had some evidence of postzygotic isolation. For several more distantly related congeners, such as chicory and endive, we identified a signature of introgression in the distribution of Ks values.
KW - Compositae
KW - Crop genomics
KW - Hybridization
KW - Introgression
KW - Transcriptome
UR - http://www.scopus.com/inward/record.url?scp=84890147590&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84890147590&partnerID=8YFLogxK
U2 - 10.1111/1755-0998.12163
DO - 10.1111/1755-0998.12163
M3 - Article
C2 - 24103297
AN - SCOPUS:84890147590
SN - 1755-098X
VL - 14
SP - 166
EP - 177
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
IS - 1
ER -