Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations

Haixiao Hu; Malachy T. Campbell; Trevor H. Yeats; Xuying Zheng; Daniel E. Runcie; Giovanny Covarrubias-Pazaran; Corey Broeckling; Linxing Yao; Melanie Caffe-Treml; Lucı́a Gutiérrez; Kevin P. Smith; James Tanaka; Owen A. Hoekenga; Mark E. Sorrells; Michael A. Gore; Jean Luc Jannink

doi:10.1007/s00122-021-03946-4

Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations

Haixiao Hu, Malachy T. Campbell, Trevor H. Yeats, Xuying Zheng, Daniel E. Runcie, Giovanny Covarrubias-Pazaran, Corey Broeckling, Linxing Yao, Melanie Caffe-Treml, Lucı́a Gutiérrez, Kevin P. Smith, James Tanaka, Owen A. Hoekenga, Mark E. Sorrells, Michael A. Gore, Jean Luc Jannink

Agronomy and Plant Genetics

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

Key message: Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Abstract: Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.

Original language	English (US)
Pages (from-to)	4043-4054
Number of pages	12
Journal	Theoretical and Applied Genetics
Volume	134
Issue number	12
DOIs	https://doi.org/10.1007/s00122-021-03946-4
State	Published - Dec 2021

Bibliographical note

Publisher Copyright:
© 2021, This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.

Access

10.1007/s00122-021-03946-4

OpenUrl availability

Full text

Cite this

Hu, H., Campbell, M. T., Yeats, T. H., Zheng, X., Runcie, D. E., Covarrubias-Pazaran, G., Broeckling, C., Yao, L., Caffe-Treml, M., Gutiérrez, L., Smith, K. P., Tanaka, J., Hoekenga, O. A., Sorrells, M. E., Gore, M. A., & Jannink, J. L. (2021). Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. Theoretical and Applied Genetics, 134(12), 4043-4054. https://doi.org/10.1007/s00122-021-03946-4

Hu, H, Campbell, MT, Yeats, TH, Zheng, X, Runcie, DE, Covarrubias-Pazaran, G, Broeckling, C, Yao, L, Caffe-Treml, M, Gutiérrez, L, Smith, KP, Tanaka, J, Hoekenga, OA, Sorrells, ME, Gore, MA & Jannink, JL 2021, 'Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations', Theoretical and Applied Genetics, vol. 134, no. 12, pp. 4043-4054. https://doi.org/10.1007/s00122-021-03946-4

@article{d3e3167b2576432e9fca4d49361c1787,

title = "Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations",

abstract = "Key message: Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Abstract: Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.",

author = "Haixiao Hu and Campbell, {Malachy T.} and Yeats, {Trevor H.} and Xuying Zheng and Runcie, {Daniel E.} and Giovanny Covarrubias-Pazaran and Corey Broeckling and Linxing Yao and Melanie Caffe-Treml and Luc{\'ı}a Guti{\'e}rrez and Smith, {Kevin P.} and James Tanaka and Hoekenga, {Owen A.} and Sorrells, {Mark E.} and Gore, {Michael A.} and Jannink, {Jean Luc}",

note = "Publisher Copyright: {\textcopyright} 2021, This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.",

year = "2021",

month = dec,

doi = "10.1007/s00122-021-03946-4",

language = "English (US)",

volume = "134",

pages = "4043--4054",

journal = "Theoretical and Applied Genetics",

issn = "0040-5752",

publisher = "Springer Verlag",

number = "12",

}

TY - JOUR

T1 - Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations

AU - Hu, Haixiao

AU - Campbell, Malachy T.

AU - Yeats, Trevor H.

AU - Zheng, Xuying

AU - Runcie, Daniel E.

AU - Covarrubias-Pazaran, Giovanny

AU - Broeckling, Corey

AU - Yao, Linxing

AU - Caffe-Treml, Melanie

AU - Gutiérrez, Lucı́a

AU - Smith, Kevin P.

AU - Tanaka, James

AU - Hoekenga, Owen A.

AU - Sorrells, Mark E.

AU - Gore, Michael A.

AU - Jannink, Jean Luc

PY - 2021/12

Y1 - 2021/12

N2 - Key message: Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Abstract: Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.

AB - Key message: Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Abstract: Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.

UR - http://www.scopus.com/inward/record.url?scp=85116966643&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85116966643&partnerID=8YFLogxK

U2 - 10.1007/s00122-021-03946-4

DO - 10.1007/s00122-021-03946-4

M3 - Article

C2 - 34643760

AN - SCOPUS:85116966643

SN - 0040-5752

VL - 134

SP - 4043

EP - 4054

JO - Theoretical and Applied Genetics

JF - Theoretical and Applied Genetics

IS - 12

ER -

Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations

Abstract

Bibliographical note

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this