Classification of Cotton Genotypes with Mixed Continuous and Categorical Variables: Application of Machine Learning Models

Bishnoi, Sudha (author): Department of Mathematics and Statistics, Chaudhary Charan Singh Haryana Agricultural University, Hisar 125004, Haryana, India

Al-Ansari, Nadhir, 1947- (author): Luleå tekniska universitet, Geoteknologi

Khan, Mujahid (author): Agricultural Research Station, Sri Karan Narendra Agriculture University, Jobner 332301, Rajasthan, India

Heddam, Salim (author): Agronomy Department, Faculty of Science, Hydraulics Division University, 20 Août 1955, Route El Hadaik, BP 26, Skikda 21024, Algeria

Malik, Anurag (author): Regional Research Station, Punjab Agricultural University, Bathinda 151001, Punjab, India

2022-10-21
2022
English.
In: Sustainability. MDPI. ISSN 2071-1050; 14:20

Related links: https://doi.org/10.3...; https://ltu.diva-por... (primary) (Raw object); https://urn.kb.se/re...

Journal article (peer-reviewed)

Abstract

Mixed data, a combination of continuous and categorical variables, occurs frequently in fields such as agriculture, remote sensing, biology, medical science, and marketing, but only limited work has been done with this type of data. In this study, data on continuous and categorical characters of 452 genotypes of cotton (Gossypium hirsutum) were obtained from an experiment conducted by the Central Institute of Cotton Research (CICR), Sirsa, Haryana (India) during the Kharif season of 2018–2019. The machine learning (ML) classifiers considered for classifying the cotton genotypes were k-nearest neighbor (KNN), Classification and Regression Tree (CART), C4.5, Naïve Bayes, random forest (RF), bagging, and boosting. Their performance was compared with each other and with linear discriminant analysis (LDA) and logistic regression. The models were validated with the holdout method, using an 80:20 split of training and testing data. On the holdout test data, RF and AdaBoost performed best, each with only two misclassifications, an accuracy of 97.26%, and an error rate of 2.74%. The LDA classifier performed the worst in terms of accuracy, with nine misclassifications. The other performance measures, namely sensitivity, specificity, precision, F1 score, and G-mean, were considered together to identify the best ML classifier among those compared. RF and AdaBoost had the highest values of all the performance measures, with 96.97% sensitivity and 97.50% specificity. Thus, these two models were found to be the best at classifying low- and high-yielding cotton genotypes.
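The performance measures named in the abstract can all be derived from a binary confusion matrix. The sketch below is illustrative only: the record does not give the confusion matrix, so the counts used (32 true positives, 1 false negative, 39 true negatives, 1 false positive) are an assumption chosen because they are consistent with the reported two misclassifications, 97.26% accuracy, 96.97% sensitivity, and 97.50% specificity. Which class is treated as "positive" is also an assumption, as is the definition of G-mean as the geometric mean of sensitivity and specificity, which is a common convention but is not confirmed by the abstract.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Compute common binary-classification measures from confusion-matrix counts."""
    total = tp + fn + tn + fp
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)   # recall on the positive class
    specificity = tn / (tn + fp)   # recall on the negative class
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    # Assumed definition: geometric mean of sensitivity and specificity.
    g_mean = math.sqrt(sensitivity * specificity)
    return {
        "accuracy": accuracy,
        "error_rate": 1 - accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "g_mean": g_mean,
    }


# Hypothetical counts, not taken from the paper: chosen only to be
# consistent with the percentages quoted in the abstract.
metrics = binary_metrics(tp=32, fn=1, tn=39, fp=1)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

With these assumed counts, the accuracy, error rate, sensitivity, and specificity round to the values quoted in the abstract; the precision, F1, and G-mean values printed here follow from the assumed counts and are not figures reported in this record.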

Subject headings

NATURAL SCIENCES -- Computer and Information Sciences -- Computer Engineering (hsv//eng)
NATURAL SCIENCES -- Chemical Sciences -- Analytical Chemistry (hsv//eng)

Keywords

machine learning classifiers
supervised classification
mixed data
heterogeneous data
cotton genotypes
Soil Mechanics
Geotechnical engineering

Publication and content type

ref (subject category)
art (subject category)