Abstract
Coryphaenoides armatus is a deep-sea species with broad geographic and bathymetric distribution and a highly developed olfactory system, rendering it a potential indicator species for deep-sea mining regions and a model for studying environmental adaptation. Genomic resources for this species are limited, restricting insights into its adaptive evolution. Here, we present a chromosome-level genome assembly of C. armatus, constructed using PacBio HiFi long-read sequencing, Illumina short-read polishing, and Hi-C scaffolding. The final assembly spans 811.1 Mb, achieves a scaffold N50 of 33.3 Mb, and is organized into 24 chromosomes. The complete BUSCO score at the chromosome-level assembly was 90.9%. A total of 24,818 protein-coding genes were annotated in the assembly. This high-quality genome assembly of C. armatus provides a solid foundation for understanding physiological processes, identifying potential indicator species in deep-sea mining regions and exploring adaptive evolution in extreme environments.
Similar content being viewed by others
An improved chromosome-level genome assembly of a deep-sea limpet (Bathyacmaea lactea)
Chromosome-level genome assembly of Decorus tungting, an endemic cyprinid from China
Chromosome-level genome assembly of the northern Pacific seastar Asterias amurensis
Data availability
The raw sequencing data is at NCBI SRA SRP66121047, the chromosome assembly is at Genbank GCA_053525285.148, and the annotation files at Figshare49. In addition, the raw sequence data is also available at NGDC BioProject PRJCA05159850.
Code availability
All data analyzing tools and software used in this study were performed following the instructions and guidelines. There was no custom code applied to analyze the data in our study.
References
Somero, G. N. Biochemical ecology of deep-sea animals. Experientia. 48(6), 537–543 (1992).
O’Hara, T. D. et al. Spatiotemporal faunal connectivity across global sea floors. Nature. (2025).
Bai, S., Shang, K., Zeng, S., Huang, Z. & Han, Z. Genome analysis of Salinimicrobium sp. 3283s, a deep-sea bacterium isolated from the sediments of South China Sea, China. Mar Genomics 76, 101125 (2024).
Luo, J. C., Long, H., Zhang, J., Zhao, Y. & Sun, L. Characterization of a deep sea Bacillus toyonensis isolate: Genomic and pathogenic features. Front Cell Infect Microbiol 11, 629116 (2021).
Cao, X. et al. Genomic analysis of Shewanella eurypsychrophilus YLB-09 reveals backgrounds related to its deep sea environment adaptation. Mar Genomics. 64, 100956 (2022).
Chen, J. et al. Pseudo-chromosome-length genome assembly for a deep-sea eel Ilyophis brunneus sheds light on the deep-sea adaptation. Sci China Life Sci. 66(6), 1379–1391 (2023).
Xu, W. et al. Chromosome-level genome assembly of hadal snailfish reveals mechanisms of deep-sea adaptation in vertebrates. Elife. 12, RP87198 (2023).
Li, W. et al. Genome sequencing of Coryphaenoides yaquinae reveals convergent and lineage-specific molecular evolution in deep-sea adaptation. Mol Ecol Resour. 24(6), e13989 (2024).
Gaither, M. R. et al. Genomics of habitat choice and adaptive evolution in a deep-sea fish. Nat Ecol Evol. 2(4), 680–687 (2018).
IDSSE. Deep-sea fish sample from the Mariana Trench: Coryphaenoides rudis. Genbank https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_048487635.1 (2025).
Wagner, H. J. & Mattheus, U. Pineal organs in deep demersal fish. Cell Tissue Res. 307(1), 115–127 (2002).
Fröhlich, E., Negishi, K. & Wagner, H. J. Patterns of rod proliferation in deep-sea fish retinae. Vision Res. 35(13), 1799–1811 (1995).
Samerotte, A. L., Drazen, J. C., Brand, G. L., Seibel, B. A. & Yancey, P. H. Correlation of trimethylamine oxide and habitat depth within and among species of teleost fish: an analysis of causation. Physiol Biochem Zool. 80(2), 197–208 (2007).
Smith, A., Trudeau, V. L., Williams, L. M., Martinoli, M. G. & Priede, I. G. Melatonin receptors are present in non-optic regions of the brain of a deep-sea fish living in the absence of solar light. J Neuroendocrinol. 8(9), 655–688 (1996).
Wagner, H. J. Volumetric analysis of brain areas indicates a shift in sensory orientation during development in the deep-sea grenadier Coryphaenoides armatus. Mar Biol. 142, 791–797 (2003).
Daniel O. B. J., Jeff A. A., Ana C. & Jennifer M. D. Environmental considerations for impact and preservation reference zones for deep-sea polymetallic nodule mining. Mar Policy. 118 (2020).
Stegeman, J. J., Kloepper-Sams, P. J. & Farrington, J. W. Monooxygenase induction and chlorobiphenyls in the deep-Sea fish Coryphaenoides armatus. Science. 231(4743), 1287–1289 (1986).
Lemaire, B. et al. Molecular adaptation to high pressure in cytochrome P450 1A and aryl hydrocarbon receptor systems of the deep-sea fish Coryphaenoides armatus. Biochim Biophys Acta Proteins Proteom. 1866(1), 155–165 (2018).
Morita, T. High-pressure adaptation of muscle proteins from deep-sea fishes, Coryphaenoides yaquinae and C. armatus. Ann N Y Acad Sci. 1189, 91–94 (2010).
Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience. 7(1), 1–6 (2018).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J Mol Biol. 215(3), 403–410 (1990).
Liu, B. et al. Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. Quant Biol. 35(s 1-3), 62–67 (2013).
Gertz, E. M., Yu, Y. K., Agarwala, R., Schäffer, A. A. & Altschul, S. F. Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST. BMC Biol. 4, 41 (2006).
Kim, D. et al. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol 37, 907–915 (2019).
Kovaka, S. et al. Transcriptome assembly from long read RNA-seq alignments with StringTie2. Genome Biol. 20, 278 (2019).
Cantarel, B. L. et al. MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18, 188–196 (2008).
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open-source ab initio eukaryotic gene-finders. Bioinformatics. 20(16), 2878–2879 (2004).
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 21, 5654–5666 (2003).
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, 109–114 (2012).
Ashburner, M. et al. Gene Ontology: Tool for the unification of biology. Nat Genet. 25, 25–29 (2000).
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003).
Buchfink, B., Reuter, K. & Drost, H. G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods. 18, 366–368 (2021).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 30, 1236–1240 (2014).
Mitchell, A. et al. The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Res. 43, 213–221 (2015).
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35, 3100–3108 (2007).
Nawrocki, E. P., Kolbe, D. L. & Eddy, S. R. Infernal 1.0: inference of RNA alignments. Bioinformatics. 25, 1335–1337 (2009).
Ioanna, K. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Res. 49, D192–D200 (2021).
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci USA 117(17), 9451–9457 (2020).
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics 4, 4.10.1–4.10.14 (2009).
Jurka, J. Repbase Update – a database and an electronic journal of repetitive elements. Trends Genet. 16, 418–420 (2000).
Jurka, J. et al. Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 110, 462–467 (2005).
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19(9), 1639–1645 (2009).
NCBI BioProject https://identifiers.org/ncbi/bioproject:PRJNA1311777 (2025).
NCBI SRA https://identifiers.org/ncbi/insdc.sra:SRP661210 (2025).
NCBI Genome https://identifiers.org/ncbi/insdc.gca:GCA_053525285 (2025).
Wu, B. Coryphaenoides armatus Genome Annotation. figshare. Dataset. https://doi.org/10.6084/m9.figshare.30606560.v1 (2025).
NGDC GSA https://ngdc.cncb.ac.cn/gsa/browse/CRA034151 (2025).
Zhang, Y. H. et al. Comparative genomics reveal shared genomic changes in syngnathid fishes and signatures of genetic convergence with placental mammals. Natl Sci Rev. 7(6), 964–977 (2020).
Acknowledgements
We would like to thank all the crew and scientists on board the DY79 cruises to collect and preserve the specimens. The work was supported by the National Key Research and Development Program of China (2023YFC2811501), the National Natural Science Foundation of China (42176120 and 42230409), the Development Fund of South China Sea Institute of Oceanology of the Chinese Academy of Sciences (SCSIO202202), the Guangdong Basic and Applied Basic Research Foundation (2024A1515012304), and the Science and Technology Planning Project of Guangdong Province, China (2023B1212060047).
Author information
Authors and Affiliations
Contributions
Q.L. and Y.H.Z. conceived the project; B.L. collected the sample; T.D.L. completed the species identification. B.Q.W. and H.Y.Y. did the bioinformatic analyses; Y.H.Z. evaluated the data; B.Q.W. and Y.H.Z. wrote the manuscript. All authors have reviewed and approved the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
Reprints and permissions
About this article
Cite this article
Wu, B., Yu, H., Luo, T. et al. A chromosome-level genome assembly of Coryphaenoides armatus.
Sci Data (2026). https://doi.org/10.1038/s41597-026-06696-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-026-06696-4
Source: Ecology - nature.com
