Enhancing the classification of isolated theropod teeth using machine learning: a comparative study

View article
Paleontology and Evolutionary Science

Main article text

 

Introduction

Methods

Data processing and preparation for the modelling phase

Data modeling

Model evaluation metrics

Comparison between log-scaled and non-log-scaled variables

Implementation information

Application of the methodology in the sample of isolated theropod teeth from Guimarota fossil site

Results

Classification at genus level

Classification at higher taxonomic level

Discussion

Classification algorithms

Application of ML classifiers to new findings of isolated theropod teeth

Application of the methodology in the sample of isolated theropod teeth from Guimarota fossil site

Conclusions

Supplemental Information

Data used to train the models, the original dataset presented by (Hendrickx et al., 2023), and the data table with the new isolated theropod teeth measurements from Guimarota fossil site

DOI: 10.7717/peerj.19116/supp-1

Code used to pre-process the data, train the models, visualize the model results, and predict new isolated theropod teeth

DOI: 10.7717/peerj.19116/supp-2

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Carolina S. Marques conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Emmanuel Dufourq performed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Soraia Pereira analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Vanda F. Santos analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Elisabete Malafaia conceived and designed the experiments, performed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The code are available at Github and Zenodo:

- https://github.com/MCarolinaMarques/Enhancing-Fossil-Classification-A-Guide-to-Machine-Learning-Methods-for-Theropod-Teeth- contains all data, code, results and models.

- Carolina S. Marques. (2025). MCarolinaMarques/Enhancing-Fossil-Classification-A-Guide-to-Machine-Learning-Methods-for-Theropod-Teeth: Code and data for the article “Enhancing Fossil Classification: A Guide to Machine Learning Methods for Theropod Teeth” (Code). Zenodo. https://doi.org/10.5281/zenodo.14907721.

The metric results for all the models are available at ShinyApp: https://ml4paleontology.shinyapps.io/teethshinyapp.

The published dataset containing the measurements are available at Zenodo: S. Marques, C., Dufourq, E., Pereira, S., Faria dos Santos, V., & Malafaia, E. (2025). Isolated theropod teeth from Guimarota fossil site presented in “Enhancing the classification of isolated theropod teeth using machine learning: a comparative study” [Data set]. In Enhancing the classification of isolated theropod teeth using machine learning: a comparative study. Zenodo. https://doi.org/10.5281/zenodo.14632904.

Funding

This work was supported by Portuguese government funds through FCT - Fundação para a Ciência e Tecnologia under the doctoral scholarship FCT/CEAUL (UI/BD/154258/2022), the individual contract CEECIND/01770/2018 (https://doi.org/10.54499/CEECIND/01770/2018/CP1534/CT0004), CEAUL’s strategic projects: UID/00006/2025 and UIDB/00006/2020 (https://doi.org/10.54499/UIDB/00006/2020) for the APC, and FCT, I.P./MCTES through national funds (PIDDAC): UID/50019/2025, UIDB/50019/2020 (https://doi.org/10.54499/UIDB/50019/2020) and LA/P/0068/2020 (https://doi.org/10.54499/LA/P/0068/2020). Emmanuel Dufourq was funded by a grant from the Carnegie Corporation of New York (provided through the AIMS Research and Innovation Centre). There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more