EconPapers    
Economics at your fingertips  
 

Morphological traits and machine learning for genetic lineage prediction of two reef-building corals

Guinther Mitushasi, Yuko F Kitano, Nicolas Oury, Hélène Magalon, David A Paz-García, Eric Armstrong, Benjamin C C Hume, Barbara Porro, Clémentine Moulin, Emilie Boissin, Guillaume Bourdin, Guillaume Iwankow, Julie Poulain, Sarah Romac, Maggie M Reddy, Tara Pacific Consortium Coordinators, Serge Planes, Denis Allemand, Christian R Voolstra, Didier Forcioli and Sylvain Agostini

PLOS ONE, 2025, vol. 20, issue 6, 1-20

Abstract: Integrating multiple lines of evidence that support molecular taxonomy analysis has proven to be a robust method for species delimitation in scleractinian corals. However, morphology often conflicts with genetic approaches due to high phenotypic plasticity and convergence. Understanding morphological variation among species is crucial to studying coral distribution, life history, ecology, and evolution. Here, we present an application of Random Forest models for coral species identification based on morphological annotation of the corallum and corallites. We show that the integration of molecular and morphological trait analysis can be improved using machine learning. Morphological traits were documented for Porites and Pocillopora coral species that were collected and genotyped through genome-wide, genetical hierarchical clustering, and coalescence analyses for the Tara Pacific Expedition. While Porites only included three tentative species, most Pocillopora species were accounted by included specimens from the western Indian Ocean, tropical Southwestern Pacific, and southeast Polynesia. Two Random Forest models per genus were trained on the morphological annotations using the genetic lineage labels. One model was developed for in-situ image identification and used corallum traits measured from in-situ photographs. Another model for integrative species identification combined corallum and corallite data measured on scanning electron micrographs. Random Forest models outperformed traditional dimension reduction methods like PCA and FAMD followed by k-means and hierarchical clustering by classifying the correct genetic lineage despite morphological clusters overlapping. This machine learning approach is reproducible, cost-effective, and accessible, reducing the need for taxonomic expertise. It can complement molecular and phylogenetic studies and support image identification, highlighting its potential to advance a coral integrative taxonomy workflow.

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0326095 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 26095&type=printable (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0326095

DOI: 10.1371/journal.pone.0326095

Access Statistics for this article

More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().

 
Page updated 2025-06-21
Handle: RePEc:plo:pone00:0326095