in

A chromosome-level genome assembly of Coryphaenoides armatus


Abstract

Coryphaenoides armatus is a deep-sea species with broad geographic and bathymetric distribution and a highly developed olfactory system, rendering it a potential indicator species for deep-sea mining regions and a model for studying environmental adaptation. Genomic resources for this species are limited, restricting insights into its adaptive evolution. Here, we present a chromosome-level genome assembly of C. armatus, constructed using PacBio HiFi long-read sequencing, Illumina short-read polishing, and Hi-C scaffolding. The final assembly spans 811.1 Mb, achieves a scaffold N50 of 33.3 Mb, and is organized into 24 chromosomes. The complete BUSCO score at the chromosome-level assembly was 90.9%. A total of 24,818 protein-coding genes were annotated in the assembly. This high-quality genome assembly of C. armatus provides a solid foundation for understanding physiological processes, identifying potential indicator species in deep-sea mining regions and exploring adaptive evolution in extreme environments.

Similar content being viewed by others

An improved chromosome-level genome assembly of a deep-sea limpet (Bathyacmaea lactea)

Chromosome-level genome assembly of Decorus tungting, an endemic cyprinid from China

Chromosome-level genome assembly of the northern Pacific seastar Asterias amurensis

Data availability

The raw sequencing data is at NCBI SRA SRP66121047, the chromosome assembly is at Genbank GCA_053525285.148, and the annotation files at Figshare49. In addition, the raw sequence data is also available at NGDC BioProject PRJCA05159850.

Code availability

All data analyzing tools and software used in this study were performed following the instructions and guidelines. There was no custom code applied to analyze the data in our study.

References

  1. Somero, G. N. Biochemical ecology of deep-sea animals. Experientia. 48(6), 537–543 (1992).

    Google Scholar 

  2. O’Hara, T. D. et al. Spatiotemporal faunal connectivity across global sea floors. Nature. (2025).

  3. Bai, S., Shang, K., Zeng, S., Huang, Z. & Han, Z. Genome analysis of Salinimicrobium sp. 3283s, a deep-sea bacterium isolated from the sediments of South China Sea, China. Mar Genomics 76, 101125 (2024).

    Google Scholar 

  4. Luo, J. C., Long, H., Zhang, J., Zhao, Y. & Sun, L. Characterization of a deep sea Bacillus toyonensis isolate: Genomic and pathogenic features. Front Cell Infect Microbiol 11, 629116 (2021).

    Google Scholar 

  5. Cao, X. et al. Genomic analysis of Shewanella eurypsychrophilus YLB-09 reveals backgrounds related to its deep sea environment adaptation. Mar Genomics. 64, 100956 (2022).

    Google Scholar 

  6. Chen, J. et al. Pseudo-chromosome-length genome assembly for a deep-sea eel Ilyophis brunneus sheds light on the deep-sea adaptation. Sci China Life Sci. 66(6), 1379–1391 (2023).

    Google Scholar 

  7. Xu, W. et al. Chromosome-level genome assembly of hadal snailfish reveals mechanisms of deep-sea adaptation in vertebrates. Elife. 12, RP87198 (2023).

    Google Scholar 

  8. Li, W. et al. Genome sequencing of Coryphaenoides yaquinae reveals convergent and lineage-specific molecular evolution in deep-sea adaptation. Mol Ecol Resour. 24(6), e13989 (2024).

    Google Scholar 

  9. Gaither, M. R. et al. Genomics of habitat choice and adaptive evolution in a deep-sea fish. Nat Ecol Evol. 2(4), 680–687 (2018).

    Google Scholar 

  10. IDSSE. Deep-sea fish sample from the Mariana Trench: Coryphaenoides rudis. Genbank https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_048487635.1 (2025).

  11. Wagner, H. J. & Mattheus, U. Pineal organs in deep demersal fish. Cell Tissue Res. 307(1), 115–127 (2002).

    Google Scholar 

  12. Fröhlich, E., Negishi, K. & Wagner, H. J. Patterns of rod proliferation in deep-sea fish retinae. Vision Res. 35(13), 1799–1811 (1995).

    Google Scholar 

  13. Samerotte, A. L., Drazen, J. C., Brand, G. L., Seibel, B. A. & Yancey, P. H. Correlation of trimethylamine oxide and habitat depth within and among species of teleost fish: an analysis of causation. Physiol Biochem Zool. 80(2), 197–208 (2007).

    Google Scholar 

  14. Smith, A., Trudeau, V. L., Williams, L. M., Martinoli, M. G. & Priede, I. G. Melatonin receptors are present in non-optic regions of the brain of a deep-sea fish living in the absence of solar light. J Neuroendocrinol. 8(9), 655–688 (1996).

    Google Scholar 

  15. Wagner, H. J. Volumetric analysis of brain areas indicates a shift in sensory orientation during development in the deep-sea grenadier Coryphaenoides armatus. Mar Biol. 142, 791–797 (2003).

    Google Scholar 

  16. Daniel O. B. J., Jeff A. A., Ana C. & Jennifer M. D. Environmental considerations for impact and preservation reference zones for deep-sea polymetallic nodule mining. Mar Policy. 118 (2020).

  17. Stegeman, J. J., Kloepper-Sams, P. J. & Farrington, J. W. Monooxygenase induction and chlorobiphenyls in the deep-Sea fish Coryphaenoides armatus. Science. 231(4743), 1287–1289 (1986).

    Google Scholar 

  18. Lemaire, B. et al. Molecular adaptation to high pressure in cytochrome P450 1A and aryl hydrocarbon receptor systems of the deep-sea fish Coryphaenoides armatus. Biochim Biophys Acta Proteins Proteom. 1866(1), 155–165 (2018).

    Google Scholar 

  19. Morita, T. High-pressure adaptation of muscle proteins from deep-sea fishes, Coryphaenoides yaquinae and C. armatus. Ann N Y Acad Sci. 1189, 91–94 (2010).

    Google Scholar 

  20. Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience. 7(1), 1–6 (2018).

    Google Scholar 

  21. Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J Mol Biol. 215(3), 403–410 (1990).

    Google Scholar 

  22. Liu, B. et al. Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. Quant Biol. 35(s 1-3), 62–67 (2013).

    Google Scholar 

  23. Gertz, E. M., Yu, Y. K., Agarwala, R., Schäffer, A. A. & Altschul, S. F. Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST. BMC Biol. 4, 41 (2006).

    Google Scholar 

  24. Kim, D. et al. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol 37, 907–915 (2019).

    Google Scholar 

  25. Kovaka, S. et al. Transcriptome assembly from long read RNA-seq alignments with StringTie2. Genome Biol. 20, 278 (2019).

    Google Scholar 

  26. Cantarel, B. L. et al. MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18, 188–196 (2008).

    Google Scholar 

  27. Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open-source ab initio eukaryotic gene-finders. Bioinformatics. 20(16), 2878–2879 (2004).

    Google Scholar 

  28. Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 21, 5654–5666 (2003).

    Google Scholar 

  29. Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, 109–114 (2012).

    Google Scholar 

  30. Ashburner, M. et al. Gene Ontology: Tool for the unification of biology. Nat Genet. 25, 25–29 (2000).

    Google Scholar 

  31. Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003).

    Google Scholar 

  32. Buchfink, B., Reuter, K. & Drost, H. G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods. 18, 366–368 (2021).

    Google Scholar 

  33. Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 30, 1236–1240 (2014).

    Google Scholar 

  34. Mitchell, A. et al. The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Res. 43, 213–221 (2015).

    Google Scholar 

  35. Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).

    Google Scholar 

  36. Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35, 3100–3108 (2007).

    Google Scholar 

  37. Nawrocki, E. P., Kolbe, D. L. & Eddy, S. R. Infernal 1.0: inference of RNA alignments. Bioinformatics. 25, 1335–1337 (2009).

    Google Scholar 

  38. Ioanna, K. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Res. 49, D192–D200 (2021).

    Google Scholar 

  39. Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).

    Google Scholar 

  40. Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci USA 117(17), 9451–9457 (2020).

    Google Scholar 

  41. Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics 4, 4.10.1–4.10.14 (2009).

    Google Scholar 

  42. Jurka, J. Repbase Update – a database and an electronic journal of repetitive elements. Trends Genet. 16, 418–420 (2000).

    Google Scholar 

  43. Jurka, J. et al. Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 110, 462–467 (2005).

    Google Scholar 

  44. Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).

    Google Scholar 

  45. Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19(9), 1639–1645 (2009).

    Google Scholar 

  46. NCBI BioProject https://identifiers.org/ncbi/bioproject:PRJNA1311777 (2025).

  47. NCBI SRA https://identifiers.org/ncbi/insdc.sra:SRP661210 (2025).

  48. NCBI Genome https://identifiers.org/ncbi/insdc.gca:GCA_053525285 (2025).

  49. Wu, B. Coryphaenoides armatus Genome Annotation. figshare. Dataset. https://doi.org/10.6084/m9.figshare.30606560.v1 (2025).

  50. NGDC GSA https://ngdc.cncb.ac.cn/gsa/browse/CRA034151 (2025).

  51. Zhang, Y. H. et al. Comparative genomics reveal shared genomic changes in syngnathid fishes and signatures of genetic convergence with placental mammals. Natl Sci Rev. 7(6), 964–977 (2020).

    Google Scholar 

Download references

Acknowledgements

We would like to thank all the crew and scientists on board the DY79 cruises to collect and preserve the specimens. The work was supported by the National Key Research and Development Program of China (2023YFC2811501), the National Natural Science Foundation of China (42176120 and 42230409), the Development Fund of South China Sea Institute of Oceanology of the Chinese Academy of Sciences (SCSIO202202), the Guangdong Basic and Applied Basic Research Foundation (2024A1515012304), and the Science and Technology Planning Project of Guangdong Province, China (2023B1212060047).

Author information

Authors and Affiliations

Authors

Contributions

Q.L. and Y.H.Z. conceived the project; B.L. collected the sample; T.D.L. completed the species identification. B.Q.W. and H.Y.Y. did the bioinformatic analyses; Y.H.Z. evaluated the data; B.Q.W. and Y.H.Z. wrote the manuscript. All authors have reviewed and approved the manuscript.

Corresponding authors

Correspondence to
Qiang Lin or Yanhong Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, B., Yu, H., Luo, T. et al. A chromosome-level genome assembly of Coryphaenoides armatus.
Sci Data (2026). https://doi.org/10.1038/s41597-026-06696-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41597-026-06696-4


Source: Ecology - nature.com

Fast-growing alien trees surge as slow native species decline worldwide

Distribution patterns and formation mechanisms of stable landslide dams in the Qinghai-Tibet plateau: a preliminary study

Back to Top