in

Deep learning and attention mechanisms to identify key genes and their implications for the origin of insect wings


Abstract

Wings are a key trait innovation in the evolutionary history of insects, and contributes to the largest diversity of animals on the planet. However, we still have an incomplete understanding of the functional changes in genes behind this diversification. Insect, Malacostraca and Chelicerata species originated as primitive arthropods during the Cambrian period. Malacostraca as the ancestral taxa of winged insects, are key to understanding this radiation. Here, a deep learning (DL) model for wing genes identification (DeepWG) based on bidirectional long short-term memory (BiLSTM) and attention mechanism (AM) was constructed based on the protein sequences of 119 species. DeepWG demonstrated a strong potential in mining key genes of insect wings, achieving an accuracy rate of 97.3% on the test set. Our research found that the 351 key genes identified by DeepWG and their orthologs exhibit transcriptional similarity in wing and gill tissues, providing molecular evidence consistent with the Ttracheal gill theory of insect wing origin. This study not only proposes a new method for identifying key genes, but also lays the foundation for genetic studies of key evolutionary adaptations in winged insects.

Similar content being viewed by others

Mechanics of Drosophila wing deployment

A hemimetabolous wing development suggests the wing origin from lateral tergum of a wingless ancestor

Application of wings interferential patterns (WIPs) and deep learning (DL) to classify some Culex. spp (Culicidae) of medical or veterinary importance

Funding

This work was supported by Natural Science Foundation of China (32560249), Forest Ecological Big Data Open Project of Key Laboratory of State Forestry and Grassland Administration (SWFU-BIC-2024016) and Scientific Research Foundation of Yunnan Education Department (2024Y612).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to
Youjie Zhao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1. (download ZIP )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, F., Cao, Y., Qian, S. et al. Deep learning and attention mechanisms to identify key genes and their implications for the origin of insect wings.
Sci Rep (2026). https://doi.org/10.1038/s41598-026-49441-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41598-026-49441-y

Keywords

  • Insect evolution
  • Wings origin
  • Gene identification
  • Wing genes
  • Deep learning
  • Attention mechanisms


Source: Ecology - nature.com

Tree community resource economics control soil food web multifunctionality

Experimental analysis of flash flood-induced scour on semi-arid hillslopes

Back to Top