Outline

JEIM

Artificial Intelligence: Assisted Automated Annotation and Pattern Detection in Large Phylogenetic Trees

Author(s): Ivica Letunic1, Lars Nielsen1, Peer Bork2,3
1Biobyte solutions GmbH, Bothestr 142, 69126 Heidelberg, German
2EMBL, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
3Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany.
Letunic, Ivica ., Nielsen, Lars ., and Bork, Peer . “Artificial Intelligence: Assisted Automated Annotation and Pattern Detection in Large Phylogenetic Trees.” Journal of Computational and Engineering Sciences Issue 1: 11-16, doi:.

Abstract

Phylogenetic trees are central to evolutionary biology, comparative genomics, microbial ecology, and molecular systematics. As sequencing technologies continue to generate high-volume and high-diversity datasets, researchers increasingly rely on interactive visualization platforms to inspect, annotate, and communicate tree-based knowledge. Although current tools such as the Interactive Tree of Life (iTOL) provide sophisticated mechanisms for tree display and metadata integration, the interpretation of large phylogenetic trees remains largely manual, time-consuming, and dependent on expert judgment. In this paper, we propose an artificial intelligence–assisted framework for automated annotation and pattern detection in large phylogenetic trees. The framework combines tree-structural descriptors, branch-level metadata, taxonomy-aware node information, and supervised machine learning models to identify biologically meaningful clades, classify functional patterns, prioritize labels, and recommend visual annotations for interactive display environments. A Random Forest model is used as the primary predictive engine due to its interpretability and robustness, while clustering and anomaly-detection components are incorporated to reveal hidden evolutionary structures. The proposed framework is designed to complement visualization systems such as iTOL by automatically generating annotation suggestions, highlighting candidate clades, and assisting users in exploratory phylogenetic interpretation. Benchmark experiments on microbial, functional, and taxonomic tree datasets indicate that AI-assisted annotation improves consistency, speeds up interpretation, and enhances the discovery of relevant biological patterns. The study establishes a foundation for next-generation intelligent tree visualization platforms that move beyond static rendering toward semi-automated evolutionary knowledge extraction.

Keywords
phylogenetic trees, artificial intelligence, tree annotation, machine learning, comparative genomics, metadata integration, iTOL

Related Articles

Azra Nishat1, Shazadi Sultana2, Arshiya Sultana2
1Department of Obstetrics and Gynecology, Government Nizamia Tibbi College Charminar, Hyderabad, India
2Department of Obstetrics and Gynecology, National Institute of Unani Medicine, PG Institute of Research, Bengaluru, Karnataka, India.
Rinaldo Bellomo1,2, John A. Kellum3, Claudio Ronco4
1Department of Intensive Care, Austin Hospital, Melbourne, Australia
2School of Medicine, The University of Melbourne, Australia
3Center for Critical Care Nephrology, University of Pittsburgh, USA.
4International Renal Research Institute, Vicenza, Italy.
Michael Henderson1, Carol Inatsuka2, Amanda J. Sheets1, Corinne L. Williams3, David J. Benaron2, Gina M. Donato4, Mary C. Gray4, Erik L. Hewlett4, Peggy A. Cotter1,3
1Department of Microbiology and Immunology, School of Medicine, University of North Carolina at Chapel Hill, USA.
2Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, USA.
3Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, USA
4Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia School of Medicine, USA
Eisha Nisar1, Ayesha Samar1, Rong Chen2, Atul Janardhan Butte2
1Faisalabad Medical University, faisalabad, Pakistan.
2Lucile Packard Children’s Hospital, Palo Alto, California, USA.