Abstract
The concept of the ecological niche, defined as the basic habitat requirements for a species, is central to understanding species geographic distributions and predicting their responses to environmental change. However, identifying the essential niche for large regional communities remains a challenge because niche axes can be “hidden” by the complexity of the underlying ecological processes. Here, applying advanced species distribution modelling to nationwide environmental DNA survey data, we identified hidden niche axes of the Japanese coastal fish community and investigated the response diversity to these axes. Our survey detected 1,220 coastal fish species. The hidden niche axes collectively explained most of the variation in fish biodiversity and revealed five biogeographic boundaries for the regional community. These niches of the Japanese fish community may primarily relate to several processes due to ocean currents, such as environmental filters, transport from source areas and dispersal barriers. We also found that the response diversity to niche axes was positively correlated with species richness, although local communities with particularly high response diversity were geographically biased. A better understanding of the niche axes of the regional ecological community should help to mitigate the loss of biodiversity and ecosystem services caused by ongoing environmental change.
Similar content being viewed by others
Niche partitioning of the ubiquitous and ecologically relevant NS5 marine group
Unveiling the transition from niche to dispersal assembly in ecology
Trophic niches of native and nonnative fishes along a river-reservoir continuum
Introduction
The ecological niche is a fundamental concept for explaining the geographic distributions and coexistence of species1,2,3,4,5,6. According to a widely accepted definition3, the niche of a species is represented as a multidimensional space with abiotic and biotic axes in which populations can persist. Numerous field and experimental studies have underscored the critical role of the ecological niche in shaping biodiversity from both ecological and evolutionary perspectives6,7,8,9,10. In most studies, the axes of niche space can be explored by modelling species distributions using abiotic and biotic variables6,11. However, identifying the essential niche axes for a large regional community with diverse species remains challenging, mainly because niche axes can be “hidden” when the underlying ecological processes are too complex to measure (e.g. local species interactions, dispersal limitation, and evolutionary history). The existence of hidden niche axes obscures our understanding of how regional communities will respond to environmental change. Given the current biodiversity crisis due to habitat degradation and global warming12,13, the identification of hidden niche axes is an urgent issue. The aim of this study is to identify the hidden niche axes of the nationwide coastal fish community by combining a large-scale environmental DNA (eDNA) biodiversity survey with a recently developed joint species distribution model14,15,16.
eDNA metabarcoding techniques have improved the efficiency and comprehensiveness of biodiversity surveys, especially for fish, based on analyses of extra-organismal DNA suspended in the surrounding environment17,18,19. Their technical developments provide an opportunity to explore the hidden niche axes of regional communities, as eDNA surveys can cover extensive areas in a relatively short period. Here, we conducted a nationwide eDNA survey to investigate the fish community in the coastal area of the Japanese archipelago. This region is well-known for its high fish diversity, not only because the wide range of climatic zones provides a variety of coastal environments, but also because four major ocean currents result in high productivity in this area20,21. Our eDNA biodiversity survey was conducted at 528 sampling sites along the Japanese coastline from June to August 2017 (Fig. 1 and Supplementary Table S1). Of these sites, 318 were located on the four major islands (Hokkaido, Honshu, Shikoku, and Kyushu) and the remaining 210 were located on 60 smaller, remote islands. The study area extended over 19,000 km of coastline, encompassing a climate range from subtropical to temperate and subarctic zones (24.2°N–45.5°N).
(a) Sampling sites (points) were geographically grouped into eight districts: Hokkaido Islands (HKD, n = 71), East Main Islands along the Pacific Ocean (EMP, n = 39), East Main Islands along the Japan Sea (EMJ, n = 39), West Main Islands along the Pacific Ocean (WMP, n = 95), West Main Islands along the Japan Sea (WMJ, n = 90), West Main Islands along inland sea (WMI, n = 63), Izu-Ogasawara Islands (IOI, n = 51), and Satsuma-Ryukyu Islands (SRI, n = 80). Kuroshio and Tsushima warm currents (red arrows) and Oyashio and Liman cold currents (blue arrows) result in high productivity in the study area. (b) Comparisons of rarefied species richness and (c) nonlinear multivariate analysis (t-SNE) were conducted. Coloured points represent sampling sites from different districts; 50% probability ellipses are shown to clarify the spatial structure of the fish communities in the multivariate analysis.
In this study, we adopted the practical definition of ecological niche axes for regional communities: it contributes to the population persistence of the majority of compositional species. In this definition, ecological niche can be related to various ecological processes, including not only abiotic factors (environmental filter)1,3 but also biotic factors (species interactions, dispersal limitation, evolutionary history, etc.)2,4,5. It is noteworthy that because niche axes were explored by modelling species distributions, it may be difficult to identify niche axes whose contributions to the population persistence fluctuate over time and space6.
Results and discussion
After completing DNA extraction, parallel sequencing, and taxonomic assignment (see Methods and Supplementary Information S1), we detected 1,220 MOTUs (molecular operational taxonomic units; hereafter species) of coastal fishes from our survey, spanning 148 families and 501 genera (Supplementary Tables S2–S4). Local species richness at each site varied from 6 to 118 with a median of 34 species. The well-known latitudinal biodiversity gradient22,23 was observed, with lower richness in the north and higher richness in the south (Fig. 1). Specifically, Hokkaido Islands (HKD) and East Main Islands along the Pacific Ocean (EMP) had the lowest species richness, while West Main Islands along the Pacific Ocean (WMP), Izu-Ogasawara Islands (IOI) and Satsuma-Ryukyu Islands (SRI) had the highest species richness. A nonlinear multivariate analysis (t-SNE) of the entire fish community revealed clear geographically associated clusters (Fig. 1). The subtropical communities in IOI and SRI and the subarctic communities in HKD and EMP formed completely separate clusters, and these clusters were connected by the temperate communities in other regions. The coastal fish communities included many uncommon species that occupied fewer than six sites but accounted for more than half of the total species richness (670 out of 1,220 species).
To infer hidden niche axes that shape the regional coastal fish community, we employed a generalised linear latent variable model (GLLVM)14,15,16. This model estimates species responses not only to environmental covariates (e.g. seawater temperature and salinity) but also to spatially referenced latent variables for explaining species distribution ranges (see Methods). Spatially referenced latent variables, estimated from shared residuals of species co-occurrence patterns24,25, can account for the hidden axes of shared niche space (e.g. unmeasured environments, local species interactions, dispersal limitation and evolutionary history). To avoid the identification of false niche axes due to model overfitting, we fitted the model with 80% of the sampling sites and validated the fitted model with the remaining 20% to compare the predictive performance of 18 candidate models (see Methods). The best-fit model had three latent variables, which collectively explained 98.3% of the variance in species co-occurrence (58.9%, 37.4% and 2.0% for each latent variable; Fig. 2). Measured environmental covariates explained only 1.7% of the variance (1.0% for temperature and 0.7% for salinity).
(a–c) Geological variation in latent variables estimated as hidden niche axes. Colour gradient from blue to red represents the values of scaled latent variables. Areas with similar colours exhibit similar niche values. Points represent sampling sites. For latent variables 1 and 2, we identified five hypothetical boundaries between distinct local communities (B1–B5; red lines). The latent variables explained 58.9%, 37.4%, and 2.0% of the total variance, respectively. Range represents the scale of spatial autocorrelation obtained from Matérn covariance. (d) Niche space with latent variables 1 and 2 as axes. Gray points represent the values for the sampling sites and triangles represent the niche centres of the analysed species. Red dotted lines represent zero-value lines corresponding to above five hypothetical boundaries. Ellipses represent representative species types based on the niche space.
The first and second latent variables, most influential on the regional fish community of the Japanese coast, revealed five hypothetical biogeographic boundaries between distinct local communities (Fig. 2, B1–B5; see Methods for boundary identification). These hypothetical boundaries aligned with those documented in previous studies26,27,28. Boundary B1 is known as the Osumi Line, a dispersal barrier due to the broad and fast-flowing Kuroshio Current, by which many sister species are separately distributed only off the Japanese mainland or the Satsuma-Ryukyu Islands26. Boundaries B2–B3 and B4–B5 correspond to the boundaries of three marine provinces (Sino-Japanese, Oriental, and Kuril Provinces)27,28. The fish fauna of the area south of Boundary B2–B3 is likely explained by the transport of tropical fishes from the northern Philippines and eastern Taiwan by warm currents, while that of the area north of Boundary B4–B5 is explained by the transport of subarctic fishes from the Russian Far East by cold currents. Boundary B2 appeared to arise from the change in coastal depth. The shallow southern area is dominated by warm water from Tsushima Current29 and the deep northern area is dominated by cold water from the abyssal circulation of the Japan Sea30. Interestingly, these boundaries may also explain the distributions of large seagrasses and seaweeds31 and terrestrial plants32,33. The ecological processes underlying these hidden niche axes of coastal fishes may be important for other taxonomic groups with diverse life histories.
Using the first and second latent variables as axes, we reconstructed the niche space in which the species niche centres were located (Fig. 2d). We observed that for the Japanese coastal fish community, the niche centres of many species were in the subtropical (35.3% of all species) and temperate Kuroshio regions (25.4%), whereas fewer species had niche centres in the subarctic region (10.4%). This distribution of niche centres was consistent with the latitudinal variation in species richness (Fig. 1). Thus, our analysis reemphasises the critical role of ecological niches in characterising regional biodiversity6. When the niche space was constructed for each family with more than five species (Supplementary Figure S1), we found that for some families, the niche centres occupied only a portion of the niche space. For example, most species of Acanthuridae, Chaetodontidae, Balistidae and Monacanthidae, which inhabit mainly tropical seas and coral reefs, occupied the niche space from the subtropical to temperate Kuroshio regions. This pattern suggests that these niche axes play an important role in driving regional spatial turnover at the family level.
The third latent variable revealed a niche axis associated with geographically scattered habitats that are important to the regional fish community (Fig. 2c and Supplementary Figure S2). These habitats included enclosed inland seas, such as Ariake Sound, Seto Inland Sea, Ise-Mikawa Bay, and Tokyo Bay, which have lower salinities and colder temperatures than those of adjacent open seas. These enclosed inland seas harbour distinct species that may have originated from the geographic isolation of the Japan Sea about 6 million years ago33. Thus, the niche space of the Japanese coastal fish community appears to be shaped by the interplay of environmental factors as well as dispersal limitation and evolutionary history. This may explain why it is often difficult to capture hidden niche axes with a limited set of measured variables. Of note, we did not find niche axes corresponding to local species interactions (i.e. the Eltonian niche)2. This may be explained by the Eltonian noise hypothesis6, which predicts that the effect of species interactions will generally take place at finer spatial scales and not necessarily across broad biogeographic regions.
Finally, we investigated the response diversity of local fish communities to the latent variables (Fig. 3). Response diversity, defined as the variation in responses to environmental change among species in a community, is a key determinant of the stability of ecosystem functions34,35. Community-level functions are expected to be less variable in communities with high response diversity. As expected, we found a positive relationship between species richness and response diversity (Fig. 3). However, species richness did not fully explain the variation in response diversity. When local fish communities were classified into groups with high and low response diversity by clustering, the communities with high response diversity were geographically biased, particularly along the Pacific coast of western Japan (Fig. 3 and Supplementary Figure S3). This geographic bias may be explained by habitat diversity provided by complex coastal inlets and coral reefs, in addition to fish migration from lower latitudes due to the Kuroshio current. Further studies are needed to understand the mechanisms underlying the increased response diversity of the coastal fish community in specific geographic regions.
(a–c) Relationship between expected species richness and response diversity to each latent variable (i.e. niche axis), calculated from the species-specific coefficients of the generalised linear latent variable model. Local fish communities at sampling sites (points) were classified into groups with high and low response diversity (red and black, respectively) using spectral clustering. Lines and shaded areas represent regression lines and standard deviations. (d–f) Locations of sites with high response diversity (red points) and low response diversity (black points).
Our model showed that field-measured environmental variables (seawater temperature and salinity) exhibited surprisingly low explanatory power for the regional fish community. To investigate the reason, we conducted additional GLLVM analysis with satellite-measured variables as environmental covariates (see Methods and Supplementary Information S2). Satellite-measured variables include annual minimum and maximum sea surface temperatures (SST), mean chlorophyll a concentration, and mean particulate inorganic carbon concentration. Although the predictive performance of the additional GLLVM was considerably less than original GLLVM, the results showed that environmental covariates explained about 80% of the variation in fish biodiversity, while latent variables explained the remaining 20%. In particular, annual SST and chlorophyll a concentration explained most of this variation (about 50% and 20%, respectively). This suggests that long-term environmental factors may be more critical for the regional fish community than environmental factors at the time of sampling. Thus, incorporating appropriate environmental variables as covariates can reduce the variation explained by latent variables, which require careful interpretation.
Additionally, the importance of identifying niche axes based on measured environmental variables must be recognised. Measured environmental niche axes will provide more robust predictions outside of study areas and under future scenarios than hidden niche axes identified by latent variables. As shown in our study, the point of using latent variables is that they allow us to interpret niche axes that are difficult to measure. Our approach, which uses both measured environmental variables and hidden latent variables as covariates, will be a good starting point for identifying ecological niche of regional communities.
Coastal ecosystems are highly productive and diverse ecosystems36. They cover only 11% of oceans but support 90% of the catch from marine fisheries37. Our study demonstrates the application of nationwide eDNA survey data for the identification of hidden niche axes shaping regional fish diversity and for understanding underlying ecological processes in a coastal ecosystem. In particular, ocean currents likely relate to the underlying ecological processes as environmental filters, transport from source area, and dispersal barriers. Global warming may therefore have a serious impact on the Japanese coastal ecosystem because it not only directly increases seawater temperatures but also indirectly alters current water transport through the global climate oscillation38. To predict changes in the regional fish community due to global warming, future efforts must be made to explicitly incorporate these ecological processes into atmosphere-ocean coupled general circulation models39. The United Nations recently declared 2021–2030 the Decade on Ecosystem Restoration, with the aim of preventing, halting and reversing the degradation of ecosystems on every continent and in every ocean40. eDNA surveys based on recently developed metabarcoding techniques will undoubtedly contribute to achieving this ambitious goal.
Methods
Sampling site selection
The Japanese archipelago extends over 3,000 km along the northeastern coast of the Eurasian continent, facing the Sea of Okhotsk, Pacific Ocean, Seto Inland Sea, East China Sea, and Japan Sea. To sample the vast coastal areas of Japan, 528 sampling sites were selected (Fig. 1a), covering a wide range of the climatic zones from subtropical to temperate and subarctic (24.227–45.552˚N, 122.955–145.815˚E). For the four main islands (Hokkaido, Honshu, Shikoku, and Kyushu), 318 sites were located along the concatenated coastlines with an average distance of 60 km between each site. Of the more than 6,000 smaller islands around the main islands, 60 islands were selected for more comprehensive geographic coverage. The 210 sites were located on these 60 islands 1–1,000 km offshore from the main islands with an average distance of 27 km between each site. Exact site locations were determined (1) to evenly divide the lengths of the coastlines, (2) to ensure safety during the water sampling and (3) to safeguard against contamination from exogenous DNA. It is noteworthy that fish eDNA is considered to reflect its local abundance in marine environments41. Therefore, given that the distance between sampling sites is sufficiently large, the transport of eDNA itself by currents will have little impact on our results.
Water sampling and on-site filtration
Seawater sampling was conducted at 528 sites during the summer of 2017, from June 5 to August 18 (Supplementary Table S1). At each sampling site, surface seawater was collected by casting a polypropylene bucket tied to a 15-m rope onto the sea surface. Seawater collection was repeated ten times to minimise sampling bias at each site. Disposable gloves were worn on both hands during water sampling, and the bucket inside and the tip of the rope were thoroughly decontaminated with a foam-style 10% bleach solution to reduce contamination risk. On-site filtration was performed by two researchers to obtain duplicate samples. For a single bucket cast, each researcher filtered 50 ml of seawater twice using a Sterivex filter cartridge (pore size 0.45 μm; Merck Millipore, Billerica, MA) and a 50-ml disposable syringe (Terumo Corp., Tokyo, Japan). The final filtration volume reached 1,000 ml × 2 with ten bucket casts. When the filter was clogged before reaching 1,000 ml filtration, the total volume of water filtered was recorded (200–900 ml from 28 sites; Supplementary Table S1). After on-site filtration, an outlet port of the filter cartridge was sealed with Parafilm (LMS, Tokyo, Japan), and 1.6 ml of RNAlater (Thermo Fisher Scientific, Waltham, MA) was added to the cartridge from an inlet port using a disposable capillary pipette (As One Corp., Osaka, Japan) to prevent eDNA degradation. The inlet port was sealed with either the film or a cap for preservation. A filtration blank (FB) was prepared by filtering 500 ml of purified water in the same manner at the end of each sampling day. The filtered cartridges were transported to the laboratory in a portable cooler and then kept at − 20 °C in the freezer until eDNA extraction.
DNA extraction
A physically separated workspace was prepared for DNA extraction and subsequent column-based steps. The workspace and equipment were sterilised with hypochlorite solution before DNA extraction. To prevent contamination and the loss of environmental DNA, low-retention filtered pipette tips and microtubes were used. DNA extraction was performed according to the previously developed method42 with slight modifications. After removing redundant seawater and RNAlater from the filter cartridge by an aspirator, crude eDNA was extracted from the filter membrane using proteinase K buffer, consisting of 220 µl of PBS, 20 µl of Proteinase K, and 200 µl of Buffer AL, included in a DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany). After sealing proteinase K buffer into the filter cartridge and stirring at 56 °C for 10 min, the extracted DNA was collected and further purified using the kit. The final elution volume of Buffer AE from the column was 200 µl. During this process, an extraction blank (EB) was prepared using Milli-Q Water-enclosed filter cartridge.
Paired-end library Preparation and sequencing
The workspace and equipment in the pre-PCR area were sterilised with hypochlorite solution before library preparation. Low-retention filtered pipette tips and microtubes were used and pre- and post-PCR manipulations were performed in two different dedicated rooms to safeguard against carryover contamination. Two-step PCR (first-round PCR and index PCR) was applied for paired-end next generation sequencer (NGS) library preparation, following previously developed methods43,44. For the first-round PCR (1st PCR), a mixture of two primer pairs was used: MiFish-U-forward (5′–ACA CTC TTT CCC TAC ACG ACG CTC TTC CGA TCT NNN NNN GTC GGT AAA ACT CGT GCC AGC–3′), MiFish-U-reverse (5′–GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC TNN NNN NCA TAG TGG GGT ATC TAA TCC CAG TTT G–3′), MiFish-E-forward-v2 (5′–ACA CTC TTT CCC TAC ACG ACG CTC TTC CGA TCT NNN NNN RGT TGG TAA ATC TCG TGC CAG C–3′) and MiFish-E-reverse-v2 (5′–GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC TNN NNN NGC ATA GTG GGG TAT CTA ATC CTA GTT TG–3′). The 1st PCR was carried out with a 12-µl reaction volume in eight replications using a strip of eight 0.2-ml tubes. A 1st PCR blank (1B) was also prepared during this process. To reduce the experiment costs, a single 12-µl reaction was prepared for each type of blank (FB, EB, and 1B). After the 1st PCR, equal volumes of the PCR products were pooled from the eight replications in a single tube and purified using a GeneRead Size Selection Kit (Qiagen, Hilden, Germany). Subsequently, the purified target products (ca. 300 bp) were quantified using TapeStation 2200 (Agilent Technologies, Tokyo, Japan). After diluting samples to 0.1 ng/µl with Milli-Q water, 1.5 µl of each diluted product was subjected to the index PCR (2nd PCR) in a 12-µl reaction volume to append dual-indexed sequences and flow cell-binding sites for NGS platform. For the three types of blanks (FB, EB, and 1B), the 1st PCR products were purified in the same manner but were not quantified. They were diluted at the average dilution ratio of the positive samples to be used as templates for the 2nd PCR. The following two primers were used for the 2nd PCR to append dual-indexed sequences (eight nucleotides indicated by Xs) and flow cell-binding sites for the MiSeq platform (5′–ends of the sequences before eight Xs): 2nd-PCR-forward (5–AAT GAT ACG GCG ACC ACC GAG ATC TAC ACX XXX XXX XAC ACT CTT TCC CTA CAC GAC GCT CTT CCG ATC T–3′) and 2nd-PCR-reverse (5′–CAA GCA GAA GAC GGC ATA CGA GAT XXX XXX XXG TGA CTG GAG TTC AGA CGT GTG CTC TTC CGA TCT–3′). A 2nd PCR blank (2B) was made during this process.
To monitor contamination during on-site filtration, DNA extraction, and 1st and 2nd PCRs, 321 blanks were made (FB = 191, EB = 52, 1B = 65, 2B = 13) and subjected to the library preparation procedure. They were divided into four sets and the libraries were pooled in equal volumes into four 1.5-ml tubes. The target amplicons (ca. 370 bp) were purified using a 2% E-Gel Size Select agarose gel (Invitrogen, Carlsbad, CA). The concentration of the size-selected libraries was measured using a Qubit dsDNA HS Assay Kit and a Qubit fluorometer (Life Technologies, Carlsbad, CA), diluted to 12.0 pM with HT1 buffer (Illumina), and sequenced on the MiSeq platform (Illumina, San Diego, CA) using a MiSeq v2 Reagent Kit, 300 cycles (Illumina) with a PhiX Control v3 (Illumina, San Diego, CA) spike-in (expected at 5%), following the manufacturer’s protocol. To remove residual contamination from the MiSeq flow path, the flow channel was washed with hypochlorous acid before each operation. All raw DNA sequence data and associated information were deposited in DDBJ/EMBL/GenBank (accession number DRA007474).
Data preprocessing and taxonomic assignment
Data preprocessing and analyses were performed using MiSeq raw reads from the 528 samples. Using USEARCH ver. 10.0.24045, quality-filtered forward and reverse reads were merged, primer sequences were removed, low-quality reads were filtered, and reads were dereplicated and denoised to obtain amplicon sequence variants (ASVs). Finally, fish species (molecular operational taxonomic units; MOTUs) were assigned to ASVs with a sequence identity of > 80% with the reference sequences. A custom reference database (see details in Supplementary Information S1) was used. Four expert ichthyologists (M.M., H.Motomura, T.Sado, and H.S.) revised the taxonomic assignments based on phylogenetic relationships combined with their knowledge of species distributions and dominance. The details of data processing, database assembly, and taxonomic assignment are described in Supplementary Information S1.
To focus on coastal fishes, all fishes principally inhabiting regions outside of the coastal areas (deep-sea fishes, oceanic epipelagic fishes, pure freshwater fishes, and all non-native fishes originating from food materials and carryover contamination) were excluded (Supplementary Table S2). Read counts were subsequently rarefied to the approximate minimum numbers (Supplementary Table S3).
Collecting environmental covariates
To reduce the burden of fieldwork, only seawater temperature and salinity were measured at each site after water sampling and on-site filtration. Additionally, four environmental covariates were collected from publicly available satellite data46: annual minimum and maximum sea surface temperatures in 2017, mean chlorophyll a concentration in June-August 2017 and mean particulate inorganic carbon concentration in June-August 2017. The resolution of all collected satellite data is 4 km × 4 km. The field-measured seawater temperature and salinity were missing at ten sites due to instrument failure. Therefore, data from these sites were removed from the subsequent analysis. Furthermore, the following MOTU data were removed: (1) MOTUs categorised as species complex (i.e. including several species) and (2) MOTUs occurring at fewer than six sites. Consequently, the subsequent analyses included 519 species from 518 sampling sites.
Nonlinear multivariate analysis
To better understand the regional community structure, sampling sites were geographically grouped into eight districts based on commonly used climate classifications and geographic proximity: Hokkaido Island (HKD, n = 71), East Main Island along the Pacific Ocean (EMP, n = 39), East Main Island along the Japan Sea (EMJ, n = 39), West Main Island along the Pacific Ocean (WMP, n = 95), West Main Island along the Japan Sea (WMJ, n = 90), West Main Island along inland sea (WMI, n = 63), Izu-Ogasawara Island (IOI, n = 51) and Satsuma-Ryukyu Island (SRI, n = 80). A nonlinear dimensionality reduction method, t-distributed stochastic neighbor embedding (t-SNE)47, was used to capture the structures of the entire regional fish community based on Jaccard distances between 518 sites.
Generalised linear latent variable model
Joint species distribution models are used to analyse co-occurrence patterns while accounting for missing predictors. In this study, a generalised linear latent variable model (GLLVM)7,8,9 with spatially autocorrelated latent variables (LVs) was used as a joint species distribution model to explore the hidden niche space. The GLLVM decomposes high-dimensional residual correlations using low-dimensional LVs, thus allowing fitting to relatively large datasets. The model structure was a logistic regression with environmental covariates ((:{varvec{x}}_{varvec{i}})) and spatially autocorrelated LVs ((:{varvec{u}}_{varvec{i}})) as explanatory variables and species presence-absence ((:{y}_{ij})) as response variables:
for fish species (:j) = 1, …, 519 at sampling site (:i) = 1, …, 518. (:{varvec{beta:}}_{j}) and (:{varvec{gamma:}}_{j}) are species-specific responses related to the environmental covariates and LVs, respectively. (:{beta:}_{j0}) is the species-specific intercept and (:{alpha:}_{i}) is random site effects. The random site effects can mitigate the bias owing to the variation in total eDNA concentrations among sites9. The spatial autocorrelation was modelled by Gaussian Markov random field48 (Supplementary Figure S4). The details of the model structure are described in Supplementary Information S2.
This study considered three candidate models with different sets of environmental covariates. As environmental covariates, Model (a) included the two field-measured variables, Model (b) included the four satellite variables and Model (c) included both the two field-measured variables and the four satellite variables. Furthermore, models with different numbers of latent variables (from 0 to 5) were also considered for each candidate model. Using five-fold cross validation with negative log-likelihoods as the validation loss, the Model (a) with three spatially autocorrelated LVs was selected as the best-fit model from 18 candidate models. In the main text, the results of this best-fit model were mainly reported for simplicity, whereas the results of the best-fit model of Models (b–c) were reported in Supplementary Information S2. Model (a) exhibited considerably higher model predictive ability for the regional fish community than Models (b–c) (Supplementary Table S5). Nevertheless, these models estimated similar geological variations in latent variables 1 and 2 and reproduced the five hypothetical biogeographic boundaries described in the following paragraph (Supplementary Figure S5).
As several previous studies49,50 did, our study defined biogeographic boundaries based on the ordination space (Fig. 2). Given that GLLVM directly relates ordination axes (i.e. latent variables) to species occurrence probabilities, community compositions are expected to change largely at the boundary of the zero-value line. Thus, zero-value lines were used to identify hypothetical biogeographic boundaries. The first and second LVs revealed five hypothetical boundaries between distinct local communities (Fig. 2 and Supplementary Figure S2). The third LV showed broad zero-value areas (not zero-value lines), suggesting no boundaries between communities (Supplementary Figure S2). Species niche centres were defined as the position with the highest occurrence probability in the niche space. For this, species occurrence records in geographic space were mapped onto the niche space and then the occurrence probability on the niche space was calculated using a kernel density estimation method.
Calculation of response diversity
To calculate the response diversity of a local community to each LV, 100 communities were randomly generated with estimated occurrence probabilities ((:varvec{p})). The interquartile range of the response to each LV ((:varvec{gamma:})) was calculated for the species comprising each generated community and the interquartile ranges over all 100 generated communities were averaged as the expected response diversity. The interquartile range was used instead of the other dispersion proxies (e.g. variance) to reduce the influence of outliers due to estimation error.
Spectral clustering and linear regression were used to investigate the relationship between species richness and response diversity. Local communities were classified into high and low expected response diversity using spectral clustering (the number of clusters was set to two). Then, linear regression was applied to each cluster:
To ensure homogeneity of variances, both species richness and response diversity were log-transformed. Because spectral clustering can produce several different results due to the randomness in the optimisation process, clustering was repeated 100 times and finally the clustering result with the highest log-likelihood was selected for linear regressions. All codes are available at https://doi.org/10.5281/zenodo.17823192.
Data availability
Data are available on Supplementary Table S1-S4 and will be deposited in ANEMONE DB (https://db.anemone.bio/). All raw DNA sequence data and associated information are deposited in DDBJ/EMBL/GenBank (accession number DRA007474). The custom reference database is available on https://doi.org/10.5281/zenodo.17823192.
References
Grinnell, J. The niche-relationships of the California thrasher. Auk 34, 427–433 (1917).
Google Scholar
Elton, C. S. Animal Ecology (Sidgwick and Jackson, 1927).
Hutchinson, G. E. Concluding remarks. Cold Spring Harbor Symp. 22, 415–427 (1957).
Google Scholar
Holt, R. D. Bringing the Hutchinsonian niche into the 21st century: Ecological and evolutionary perspectives. Proc. Natl. Acad. Sci. USA106, 19659–19665 (2009).
Colwell, R. K. & Rangel, T. F. Hutchinson’s duality: the once and future niche. Proc. Natl. Acad. Sci. USA. 106, 19651–19658 (2009).
Google Scholar
Peterson, A. T. et al. Ecological Niches and Geographic Distribution (Princeton University Press, 2011).
Harpole, W. S. & Tilman, D. Grassland species loss resulting from reduced niche dimension. Nature 446, 791–793 (2007).
Google Scholar
Harpole, W. S. et al. Addition of multiple limiting resources reduces grassland diversity. Nature 537, 93–96 (2016).
Google Scholar
Wiens, J. J. & Graham, C. H. Niche conservatism: integrating evolution, ecology, and conservation biology. Annu. Rev. Ecol. Evol. Sys. 36, 519–539 (2005).
Google Scholar
Larcombe, M. J., Jordan, G. J., Bryant, D. & Higgins, S. I. The dimensionality of niche space allows bounded and unbounded processes to jointly influence diversification. Nat. Comm. 9, 4258 (2018).
Google Scholar
Sillero, N. et al. Want to model a species niche? A step-by-step guideline on correlative ecological niche modelling. Ecol. Model. 456, 109671 (2021).
Google Scholar
Butchart, S. H. M. et al. Global biodiversity: indicators of recent declines. Science 328, 1164–1168 (2010).
Google Scholar
Pereira, H. M. et al. Scenarios for global biodiversity in the 21st century. Science 330, 1496–1501 (2010).
Google Scholar
Thorson, J. T. et al. Spatial factor analysis: a new tool for estimating joint species distributions and correlations in species range. Methods Ecol. Evol. 6, 627–637 (2015).
Google Scholar
Ovaskainen, O., Roy, D. B., Fox, R. & Anderson, B. J. Uncovering hidden Spatial structure in species communities with Spatially explicit joint species distribution models. Methods Ecol. Evol. 7, 428–436 (2016).
Google Scholar
Niku, J., Hui, F. K. C., Taskinen, S., Warton, D. I. & gllvm Fast analysis of multivariate abundance data with generalized linear latent variable models in R. Methods Ecol. Evol. 10, 2173–2182 (2019).
Google Scholar
Cristescu, M. E. & Hebert, P. D. N. Uses and misuses of environmental DNA in biodiversity science and conservation. Ann. Rev. Ecol. Evol. Sys. 49, 209–230 (2018).
Google Scholar
Miya, M., Gotoh, R. O. & Sado, T. MiFish metabarcoding: a high–throughput approach for simultaneous detection of multiple fish species from environmental DNA and other samples. Fisheries Sci. 86, 939–970 (2020).
Google Scholar
Miya, M. Environmental DNA Metabarcoding: A novel method for biodiversity monitoring of marine fish communities. Ann. Rev. Mar. Sci. 14, 161–185 (2022).
Google Scholar
Fujikura, K., Kindsay, D., Kitazato, H., Nishida, S. & Shirayama, Y. Marine biodiversity in Japanese waters. PloS One. 5, e11836 (2010).
Google Scholar
Nakabo, T. Fish of Japan with Pictorial Keys to the Species (Tokai University, 2013).
Tittensor, D. P. et al. Global patterns and predictors of marine biodiversity across taxa. Nature 466, 1098–1101 (2010).
Google Scholar
Stuart-Smith, R. D. et al. Integrating abundance and functional traits reveals new global hotspots of fish diversity. Nature 501, 539–542 (2013).
Google Scholar
Warton, D. I. et al. So many variables: joint modeling in community ecology. Trends Ecol. Evol. 30, 766–779 (2015).
Google Scholar
Poggiato, G. et al. On the interpretations of joint modeling in community ecology. Trends Ecol. Evol. 36, 391–401 (2021).
Google Scholar
Motomura, H. & Mtsunuma, M. Fish diversity along the Kuroshio current. In Fish Diversity of Japan: Evolution, Zoogeography, and Conservation (eds Kai, Y. et al.). 63–78. (Springer, 2022).
Google Scholar
Briggs, J. C. & Bowen, B. W. A realignment of marine biogeographic provinces with particular reference to fish distributions. J. Biogeogr. 39, 12–30 (2012).
Google Scholar
Toonen, R. J., Bowen, B. W., Iacchei, M. & Briggs, J. C. Marine biogeography. Ency Evol. Biol. 1, 166–178 (2016).
Kai, Y. & Motomura, H. Origins and present distribution of fishes in Japan. In Fish Diversity of Japan: Evolution, Zoogeography, and Conservation (eds Kai, Y. et al.). 19–31. (Springer, 2022).
Google Scholar
Senjyu, T. et al. Deep flow field in the Japan/East sea as deduced from direct current measurements. Deep-sea Res. II. 52, 1726–1741 (2005).
Google Scholar
Japan Fisheries Research and Education Agency. Guidebook for Estimating CO2 Storage in Seagrass and Seaweed Beds (In Japanese). (2023). https://www.fra.go.jp/home/kenkyushokai/press/pr2023/20231101_kaisou.html
Zhai, S. N., Comes, H. P., Nakamura, K., Yan, H. F. & Qiu, Y. X. Late pleistocene lineage divergence among populations of Neolitsea sericea (Lauraceae) across a deep sea-barrier in the Ryukyu Islands. J. Biogeogr. 39, 1347–1360 (2015).
Google Scholar
Matsuura, K. (ed) Fishes in the Kuroshio Current (In Japanese) (Tokai University, 2012).
Elmqvist, T. et al. Response diversity, ecosystem change, and resilience. Front. Ecol. Environ. 1, 488–494 (2003).
Google Scholar
Mori, A. S., Furukawa, T. & Sasaki, T. Response diversity determines the resilience of ecosystems to environmental change. Biol. Rev. 88, 349–364 (2013).
Google Scholar
Martínez, M. L. et al. The Coasts of our world: Ecological, economic and social importance. Ecol. Econ. 63, 254–272 (2007).
Google Scholar
Barbier, E. B. Marine ecosystem services. Curr. Biol. 27, R507–R510 (2017).
Google Scholar
Yamazaki, A., Watanabe, T., Tsunogai, U., Iwase, F. & Yamano, H. A 150-year variation of the Kuroshio transport inferred from coral nitrogen isotope signature. Paleoceanography 31, 838–846 (2016).
Google Scholar
Luo, J. J., Masson, S., Roeckner, E., Madec, G. & Yamagata, T. Reducing climatology bias in an ocean-atmosphere OGCM with improved coupling physics. J. Clim. 18, 2344–2360 (2005).
Google Scholar
United Nations Environment Agency. United Nations Decade on Ecosystem Restoration (2021–2030). (2019). https://undocs.org/A/RES/73/284
Murakami, H. et al. Dispersion and degradation of environmental DNA from caged fish in a marine environment. Fish. Sci. 85, 327–337 (2019).
Google Scholar
Miya, M. et al. Use of a filter cartridge for filtration of water samples and extraction of environmental DNA. J. Visual Exper. 117, 5474 (2016).
Miya, M. et al. MiFish, a set of universal PCR primers for metabarcoding environmental DNA from fishes: detection of more than 230 subtropical marine species. R Soc. Open. Sci. 2, 150088 (2015).
Google Scholar
Minamoto, T. et al. An illustrated manual for environmental DNA research: water sampling guidelines and experimental protocols. Environ. DNA. 3, 8–13 (2021).
Google Scholar
Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).
Google Scholar
NASA Ocean Biology Processing Group. MODIS-Aqua Level-3 Mapped Ocean Color Data, version R2022.0, NASA Ocean Biology Distributed Active Archive Center (2018). Accessed 10 July 2025.
van der Maaten, L. J. P. & Hinton, G. E. Visualizing high-dimensional data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Lindgren, F., Rue, H. & Lindström, J. An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach. J. R Statist Soc. B. 73, 423–498 (2011).
Google Scholar
Kreft, H. & Jetz, W. A framework for delineating biogeographical regions based on species distributions. J. Biogeogr. 37, 2029–2053 (2010).
Google Scholar
Pang, S. E. H., Silk, J. W. F., Zurell, D. & Webb, E. L. The clustering of spatially associated species unravels patterns in tropical tree species distributions. Ecosphere 14, e4589 (2023).
Google Scholar
Acknowledgements
We sincerely thank T.S. Jo, M.K. Sakata, M.U. Kawata, R. Osawa, K. Maegawa, Q. Wu, T. Inoue, J. Kawai, R. Shimoyama, K. Kido, R. Asada, S. Harasaki, S. Chiba, M. Shiomi, M. Ogata, T. Komai, and Y. Ikoma for help with field sampling.
Funding
This research was supported financially by the JSPS CREST program (JPMJCR13A2) and JSPS KAKENHI Grants (19H05641, 20H03311, 21H03651, 21K15171, 25H01190).
Author information
Authors and Affiliations
Contributions
Y.O. and M.K. conceived and designed the research; M.K., M.M., H.A., A.K., R.M., T.Minamoto, and S.S. planned the project on which the research was based; H.Yamanaka managed logistics for field sampling; M.M., H.D., R.M., T.T., S.Yamamoto, M.A., T.F., R.O.G., M.H., M.I., T.I., T.Kajita, T.Kanbe, T.Kenta, Y.K., T.Matsuura, T.Mizumoto, H.Motomura, H.Murakami, K.N., S.O., T.Sado, H.S., K.S., T.Sunobe, H.T., K.Takayama, K.Tanaka, and S.Yoon performed field sampling; M.M., T.F., T.Sado, S.Yokoyama, and H.Yamakawa performed laboratory experiments; M.M., H.Motomura, T.Sado, and H.S. refined taxonomic assignments; Y.O., K.F., and R.O.G analyzed the data; and Y.O., M.M., H.Yamakawa, and M.K. wrote the first draft and completed the final manuscript with contributions from all co-authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Supplementary Material 1
Supplementary Material 2
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Reprints and permissions
About this article
Cite this article
Osada, Y., Miya, M., Araki, H. et al. Large-scale environmental DNA survey reveals niche axes of a regional coastal fish community.
Sci Rep 16, 3276 (2026). https://doi.org/10.1038/s41598-025-31307-4
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-31307-4
Source: Ecology - nature.com
