A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding

Montesinos-Lopez, O.A.; Martin-Vallejo, J.; Crossa, J.; Gianola, D.; Hernández Suárez, C.M.; Montesinos-Lopez, A.; Juliana, P.; Singh, R.P.

A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding

Genomic selection is revolutionizing plant breeding. However, still lacking are better statistical models for ordinal phenotypes to improve the accuracy of the selection of candidate genotypes. For this reason, in this paper we explore the genomic based prediction performance of two popular machine learning methods: the Multi Layer Perceptron (MLP) and support vector machine (SVM) methods vs. the Bayesian threshold genomic best linear unbiased prediction (TGBLUP) model. We used the percentage of cases correctly classified (PCCC) as a metric to measure the prediction performance, and seven real data sets to evaluate the prediction accuracy, and found that the best predictions (in four out of the seven data sets) in terms of PCCC occurred under the TGLBUP model, while the worst occurred under the SVM method. Also, in general we found no statistical differences between using 1, 2 and 3 layers under the MLP models, which means that many times the conventional neuronal network model with only one layer is enough. However, although even that the TGBLUP model was better, we found that the predictions of MLP and SVM were very competitive with the advantage that the SVM was the most efficient in terms of the computational time required.

Saved in:

Bibliographic Details
Main Authors:	Montesinos-Lopez, O.A., Martin-Vallejo, J., Crossa, J., Gianola, D., Hernández Suárez, C.M., Montesinos-Lopez, A., Juliana, P., Singh, R.P.
Format:	Article biblioteca
Language:	English
Published:	Genetics Society of America 2019
Subjects:	AGRICULTURAL SCIENCES AND BIOTECHNOLOGY, Threshold GBLUP, Deep Learning, Support Vector Machine, Genomic Selection, Genomic Prediction, GenPred, Shared Data Resources, BAYESIAN THEORY, STATISTICAL METHODS, MACHINE LEARNING, ARTIFICIAL SELECTION, PLANT BREEDING, CROP FORECASTING, DATA ANALYSIS,
Online Access:	https://hdl.handle.net/10883/20090
Tags:	Add Tag No Tags, Be the first to tag this record!

id	dig-cimmyt-10883-20090
record_format	koha
spelling	dig-cimmyt-10883-200902024-03-13T15:16:57Z A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding Montesinos-Lopez, O.A. Martin-Vallejo, J. Crossa, J. Gianola, D. Hernández Suárez, C.M. Montesinos-Lopez, A. Juliana, P. Singh, R.P. AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Threshold GBLUP Deep Learning Support Vector Machine Genomic Selection Genomic Prediction GenPred Shared Data Resources BAYESIAN THEORY STATISTICAL METHODS MACHINE LEARNING ARTIFICIAL SELECTION PLANT BREEDING CROP FORECASTING DATA ANALYSIS Genomic selection is revolutionizing plant breeding. However, still lacking are better statistical models for ordinal phenotypes to improve the accuracy of the selection of candidate genotypes. For this reason, in this paper we explore the genomic based prediction performance of two popular machine learning methods: the Multi Layer Perceptron (MLP) and support vector machine (SVM) methods vs. the Bayesian threshold genomic best linear unbiased prediction (TGBLUP) model. We used the percentage of cases correctly classified (PCCC) as a metric to measure the prediction performance, and seven real data sets to evaluate the prediction accuracy, and found that the best predictions (in four out of the seven data sets) in terms of PCCC occurred under the TGLBUP model, while the worst occurred under the SVM method. Also, in general we found no statistical differences between using 1, 2 and 3 layers under the MLP models, which means that many times the conventional neuronal network model with only one layer is enough. However, although even that the TGBLUP model was better, we found that the predictions of MLP and SVM were very competitive with the advantage that the SVM was the most efficient in terms of the computational time required. 601-618 2019-03-20T01:25:13Z 2019-03-20T01:25:13Z 2019 Article Published Version ESSN: 2160-1836 https://hdl.handle.net/10883/20090 10.1534/g3.118.200998 English Open Access PDF Bethesda, MD Genetics Society of America 2 9 G3: Genes, Genomes, Genetics
institution	CIMMYT
collection	DSpace
country	México
countrycode	MX
component	Bibliográfico
access	En linea
databasecode	dig-cimmyt
tag	biblioteca
region	America del Norte
libraryname	CIMMYT Library
language	English
topic	AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Threshold GBLUP Deep Learning Support Vector Machine Genomic Selection Genomic Prediction GenPred Shared Data Resources BAYESIAN THEORY STATISTICAL METHODS MACHINE LEARNING ARTIFICIAL SELECTION PLANT BREEDING CROP FORECASTING DATA ANALYSIS AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Threshold GBLUP Deep Learning Support Vector Machine Genomic Selection Genomic Prediction GenPred Shared Data Resources BAYESIAN THEORY STATISTICAL METHODS MACHINE LEARNING ARTIFICIAL SELECTION PLANT BREEDING CROP FORECASTING DATA ANALYSIS
spellingShingle	AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Threshold GBLUP Deep Learning Support Vector Machine Genomic Selection Genomic Prediction GenPred Shared Data Resources BAYESIAN THEORY STATISTICAL METHODS MACHINE LEARNING ARTIFICIAL SELECTION PLANT BREEDING CROP FORECASTING DATA ANALYSIS AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Threshold GBLUP Deep Learning Support Vector Machine Genomic Selection Genomic Prediction GenPred Shared Data Resources BAYESIAN THEORY STATISTICAL METHODS MACHINE LEARNING ARTIFICIAL SELECTION PLANT BREEDING CROP FORECASTING DATA ANALYSIS Montesinos-Lopez, O.A. Martin-Vallejo, J. Crossa, J. Gianola, D. Hernández Suárez, C.M. Montesinos-Lopez, A. Juliana, P. Singh, R.P. A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
description	Genomic selection is revolutionizing plant breeding. However, still lacking are better statistical models for ordinal phenotypes to improve the accuracy of the selection of candidate genotypes. For this reason, in this paper we explore the genomic based prediction performance of two popular machine learning methods: the Multi Layer Perceptron (MLP) and support vector machine (SVM) methods vs. the Bayesian threshold genomic best linear unbiased prediction (TGBLUP) model. We used the percentage of cases correctly classified (PCCC) as a metric to measure the prediction performance, and seven real data sets to evaluate the prediction accuracy, and found that the best predictions (in four out of the seven data sets) in terms of PCCC occurred under the TGLBUP model, while the worst occurred under the SVM method. Also, in general we found no statistical differences between using 1, 2 and 3 layers under the MLP models, which means that many times the conventional neuronal network model with only one layer is enough. However, although even that the TGBLUP model was better, we found that the predictions of MLP and SVM were very competitive with the advantage that the SVM was the most efficient in terms of the computational time required.
format	Article
topic_facet	AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Threshold GBLUP Deep Learning Support Vector Machine Genomic Selection Genomic Prediction GenPred Shared Data Resources BAYESIAN THEORY STATISTICAL METHODS MACHINE LEARNING ARTIFICIAL SELECTION PLANT BREEDING CROP FORECASTING DATA ANALYSIS
author	Montesinos-Lopez, O.A. Martin-Vallejo, J. Crossa, J. Gianola, D. Hernández Suárez, C.M. Montesinos-Lopez, A. Juliana, P. Singh, R.P.
author_facet	Montesinos-Lopez, O.A. Martin-Vallejo, J. Crossa, J. Gianola, D. Hernández Suárez, C.M. Montesinos-Lopez, A. Juliana, P. Singh, R.P.
author_sort	Montesinos-Lopez, O.A.
title	A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
title_short	A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
title_full	A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
title_fullStr	A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
title_full_unstemmed	A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
title_sort	benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding
publisher	Genetics Society of America
publishDate	2019
url	https://hdl.handle.net/10883/20090
work_keys_str_mv	AT montesinoslopezoa abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT martinvallejoj abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT crossaj abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT gianolad abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT hernandezsuarezcm abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT montesinoslopeza abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT julianap abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT singhrp abenchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT montesinoslopezoa benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT martinvallejoj benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT crossaj benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT gianolad benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT hernandezsuarezcm benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT montesinoslopeza benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT julianap benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding AT singhrp benchmarkingbetweendeeplearningsupportvectormachineandbayesianthresholdbestlinearunbiasedpredictionforpredictingordinaltraitsinplantbreeding
_version_	1794797624980668416

A benchmarking between deep learning, support vector machine and bayesian threshold best linear unbiased prediction for predicting ordinal traits in plant breeding

Similar Items

Resource Map