Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.

Genomic selection (GS) emphasizes the simultaneous prediction of the genetic effects of thousands of scattered markers over the genome. Several statistical methodologies have been used in GS for the prediction of genetic merit. In general, such methodologies require certain assumptions about the data, such as the normality of the distribution of phenotypic values. To circumvent the non-normality of phenotypic values, the literature suggests the use of Bayesian Generalized Linear Regression (GBLASSO). Another alternative is the models based on machine learning, represented by methodologies such as Artificial Neural Networks (ANN), Decision Trees (DT) and related possible refinements such as Bagging, Random Forest and Boosting. This study aimed to use DT and its refinements for predicting resistance to orange rust in Arabica coffee. Additionally, DT and its refinements were used to identify the importance of markers related to the characteristic of interest. The results were compared with those from GBLASSO and ANN. Data on coffee rust resistance of 245 Arabica coffee plants genotyped for 137 markers were used. The DT refinements presented equal or inferior values of Apparent Error Rate compared to those obtained by DT, GBLASSO, and ANN. Moreover, DT refinements were able to identify important markers for the characteristic of interest. Out of 14 of the most important markers analyzed in each methodology, 9.3 markers on average were in regions of quantitative trait loci (QTLs) related to resistance to disease listed in the literature.

Saved in:
Bibliographic Details
Main Authors: SOUSA, I. C. de, NASCIMENTO, M., SILVA, G. N., NASCIMENTO, A. C. C., CRUZ, C. D., SILVA, F. F. e, ALMEIDA, D. P. de, PESTANA, K. N., AZEVEDO, C. F., ZAMBOLIM, L., CAIXETA, E. T.
Other Authors: Ithalo Coelho de Sousa, Universidade Federal de Viçosa; Moysés Nascimento, Universidade Federal de Viçosa; Gabi Nunes Silva, Universidade Federal de Rondônia; Ana Carolina Campana Nascimento, Universidade Federal de Viçosa; Cosme Damião Cruz, Universidade Federal de Viçosa; Fabyano Fonseca e Silva, Universidade Federal de Viçosa; Dênia Pires de Almeida, Universidade Federal de Viçosa; Kátia Nogueira Pestana, Embrapa Mandioca e Fruticultura; Camila Ferreira Azevedo, Universidade Federal de Viçosa; Laércio Zambolim, Universidade Federal de Viçosa; EVELINE TEIXEIRA CAIXETA MOURA, CNPCa.
Format: Artigo de periódico biblioteca
Language:Ingles
English
Published: 2020-10-15
Subjects:Statistical learning, Hemileia Vastatrix, Plant breeding, Artificial intelligence,
Online Access:http://www.alice.cnptia.embrapa.br/alice/handle/doc/1125524
http://dx.doi.org/10.1590/1678-992X-2020-0021
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-alice-doc-1125524
record_format koha
spelling dig-alice-doc-11255242020-10-16T09:14:23Z Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms. SOUSA, I. C. de NASCIMENTO, M. SILVA, G. N. NASCIMENTO, A. C. C. CRUZ, C. D. SILVA, F. F. e ALMEIDA, D. P. de PESTANA, K. N. AZEVEDO, C. F. ZAMBOLIM, L. CAIXETA, E. T. Ithalo Coelho de Sousa, Universidade Federal de Viçosa; Moysés Nascimento, Universidade Federal de Viçosa; Gabi Nunes Silva, Universidade Federal de Rondônia; Ana Carolina Campana Nascimento, Universidade Federal de Viçosa; Cosme Damião Cruz, Universidade Federal de Viçosa; Fabyano Fonseca e Silva, Universidade Federal de Viçosa; Dênia Pires de Almeida, Universidade Federal de Viçosa; Kátia Nogueira Pestana, Embrapa Mandioca e Fruticultura; Camila Ferreira Azevedo, Universidade Federal de Viçosa; Laércio Zambolim, Universidade Federal de Viçosa; EVELINE TEIXEIRA CAIXETA MOURA, CNPCa. Statistical learning Hemileia Vastatrix Plant breeding Artificial intelligence Genomic selection (GS) emphasizes the simultaneous prediction of the genetic effects of thousands of scattered markers over the genome. Several statistical methodologies have been used in GS for the prediction of genetic merit. In general, such methodologies require certain assumptions about the data, such as the normality of the distribution of phenotypic values. To circumvent the non-normality of phenotypic values, the literature suggests the use of Bayesian Generalized Linear Regression (GBLASSO). Another alternative is the models based on machine learning, represented by methodologies such as Artificial Neural Networks (ANN), Decision Trees (DT) and related possible refinements such as Bagging, Random Forest and Boosting. This study aimed to use DT and its refinements for predicting resistance to orange rust in Arabica coffee. Additionally, DT and its refinements were used to identify the importance of markers related to the characteristic of interest. The results were compared with those from GBLASSO and ANN. Data on coffee rust resistance of 245 Arabica coffee plants genotyped for 137 markers were used. The DT refinements presented equal or inferior values of Apparent Error Rate compared to those obtained by DT, GBLASSO, and ANN. Moreover, DT refinements were able to identify important markers for the characteristic of interest. Out of 14 of the most important markers analyzed in each methodology, 9.3 markers on average were in regions of quantitative trait loci (QTLs) related to resistance to disease listed in the literature. 2020-10-16T09:14:16Z 2020-10-16T09:14:16Z 2020-10-15 2021 Artigo de periódico Scientia Agricola, v. 78, n. 4, e20200021, 2021. http://www.alice.cnptia.embrapa.br/alice/handle/doc/1125524 http://dx.doi.org/10.1590/1678-992X-2020-0021 Ingles en openAccess
institution EMBRAPA
collection DSpace
country Brasil
countrycode BR
component Bibliográfico
access En linea
databasecode dig-alice
tag biblioteca
region America del Sur
libraryname Sistema de bibliotecas de EMBRAPA
language Ingles
English
topic Statistical learning
Hemileia Vastatrix
Plant breeding
Artificial intelligence
Statistical learning
Hemileia Vastatrix
Plant breeding
Artificial intelligence
spellingShingle Statistical learning
Hemileia Vastatrix
Plant breeding
Artificial intelligence
Statistical learning
Hemileia Vastatrix
Plant breeding
Artificial intelligence
SOUSA, I. C. de
NASCIMENTO, M.
SILVA, G. N.
NASCIMENTO, A. C. C.
CRUZ, C. D.
SILVA, F. F. e
ALMEIDA, D. P. de
PESTANA, K. N.
AZEVEDO, C. F.
ZAMBOLIM, L.
CAIXETA, E. T.
Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.
description Genomic selection (GS) emphasizes the simultaneous prediction of the genetic effects of thousands of scattered markers over the genome. Several statistical methodologies have been used in GS for the prediction of genetic merit. In general, such methodologies require certain assumptions about the data, such as the normality of the distribution of phenotypic values. To circumvent the non-normality of phenotypic values, the literature suggests the use of Bayesian Generalized Linear Regression (GBLASSO). Another alternative is the models based on machine learning, represented by methodologies such as Artificial Neural Networks (ANN), Decision Trees (DT) and related possible refinements such as Bagging, Random Forest and Boosting. This study aimed to use DT and its refinements for predicting resistance to orange rust in Arabica coffee. Additionally, DT and its refinements were used to identify the importance of markers related to the characteristic of interest. The results were compared with those from GBLASSO and ANN. Data on coffee rust resistance of 245 Arabica coffee plants genotyped for 137 markers were used. The DT refinements presented equal or inferior values of Apparent Error Rate compared to those obtained by DT, GBLASSO, and ANN. Moreover, DT refinements were able to identify important markers for the characteristic of interest. Out of 14 of the most important markers analyzed in each methodology, 9.3 markers on average were in regions of quantitative trait loci (QTLs) related to resistance to disease listed in the literature.
author2 Ithalo Coelho de Sousa, Universidade Federal de Viçosa; Moysés Nascimento, Universidade Federal de Viçosa; Gabi Nunes Silva, Universidade Federal de Rondônia; Ana Carolina Campana Nascimento, Universidade Federal de Viçosa; Cosme Damião Cruz, Universidade Federal de Viçosa; Fabyano Fonseca e Silva, Universidade Federal de Viçosa; Dênia Pires de Almeida, Universidade Federal de Viçosa; Kátia Nogueira Pestana, Embrapa Mandioca e Fruticultura; Camila Ferreira Azevedo, Universidade Federal de Viçosa; Laércio Zambolim, Universidade Federal de Viçosa; EVELINE TEIXEIRA CAIXETA MOURA, CNPCa.
author_facet Ithalo Coelho de Sousa, Universidade Federal de Viçosa; Moysés Nascimento, Universidade Federal de Viçosa; Gabi Nunes Silva, Universidade Federal de Rondônia; Ana Carolina Campana Nascimento, Universidade Federal de Viçosa; Cosme Damião Cruz, Universidade Federal de Viçosa; Fabyano Fonseca e Silva, Universidade Federal de Viçosa; Dênia Pires de Almeida, Universidade Federal de Viçosa; Kátia Nogueira Pestana, Embrapa Mandioca e Fruticultura; Camila Ferreira Azevedo, Universidade Federal de Viçosa; Laércio Zambolim, Universidade Federal de Viçosa; EVELINE TEIXEIRA CAIXETA MOURA, CNPCa.
SOUSA, I. C. de
NASCIMENTO, M.
SILVA, G. N.
NASCIMENTO, A. C. C.
CRUZ, C. D.
SILVA, F. F. e
ALMEIDA, D. P. de
PESTANA, K. N.
AZEVEDO, C. F.
ZAMBOLIM, L.
CAIXETA, E. T.
format Artigo de periódico
topic_facet Statistical learning
Hemileia Vastatrix
Plant breeding
Artificial intelligence
author SOUSA, I. C. de
NASCIMENTO, M.
SILVA, G. N.
NASCIMENTO, A. C. C.
CRUZ, C. D.
SILVA, F. F. e
ALMEIDA, D. P. de
PESTANA, K. N.
AZEVEDO, C. F.
ZAMBOLIM, L.
CAIXETA, E. T.
author_sort SOUSA, I. C. de
title Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.
title_short Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.
title_full Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.
title_fullStr Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.
title_full_unstemmed Genomic prediction of leaf rust resistance to Arabica coffee using machine learning algorithms.
title_sort genomic prediction of leaf rust resistance to arabica coffee using machine learning algorithms.
publishDate 2020-10-15
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/1125524
http://dx.doi.org/10.1590/1678-992X-2020-0021
work_keys_str_mv AT sousaicde genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT nascimentom genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT silvagn genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT nascimentoacc genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT cruzcd genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT silvaffe genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT almeidadpde genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT pestanakn genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT azevedocf genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT zamboliml genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
AT caixetaet genomicpredictionofleafrustresistancetoarabicacoffeeusingmachinelearningalgorithms
_version_ 1756027149957988352