More stories

  • in

    Oceanic vertical migrators in a warming world

    Seibel, B. A. & Birk, M. A. Nat. Clim. Change https://doi.org/10.1038/s41558-022-01491-6 (2022).Article 

    Google Scholar 
    Urban, M. C. et al. Science 353, aad8466 (2016).Article 

    Google Scholar 
    Pörtner, H. O. & Knust, R. Science 315, 95–97 (2007).Article 

    Google Scholar 
    Verberk, W. C. E. P., Bilton, D. T., Calosi, P. & Spicer, J. I. Ecology 92, 1565–1572 (2011).Article 

    Google Scholar 
    Rubalcaba, J. G., Verberk, W. C., Hendriks, A. J., Saris, B. & Woods, H. A. Proc. Natl Acad. Sci. USA 117, 31963–31968 (2020).CAS 
    Article 

    Google Scholar 
    Deutsch, C. et al. Proc. Natl Acad. Sci. USA 119, e2201345119 (2022).CAS 
    Article 

    Google Scholar 
    Stramma, L. et al. Nat. Clim. Change 2, 33–37 (2012).CAS 
    Article 

    Google Scholar 
    Vergés, A. et al. Proc. R. Soc. B 281, 20140846 (2014).Article 

    Google Scholar  More

  • in

    Protecting the Amazon forest and reducing global warming via agricultural intensification

    Study regions and recent trends in land use changeOur analysis focuses on four biomes (referred to as regions in the rest of the text), accounting for nearly all soybean area in Brazil: the Pampa, the Atlantic Forest, the Cerrado and the Amazon (Supplementary Section 1). Soybean production is negligible in the Pantanal and the Caatinga, so these two regions were excluded from our analysis. We focused on soybean-based systems in Brazil, either those that include one crop per year (single soybean) or those including a second-crop maize. In the latter system, soybean is sown in September–October, and maize is sown right after the soybean harvest in late January–February. Single soybean is common in the Pampa, where the drier climate does not allow double cropping. In contrast, higher precipitation allows double cropping in the Amazon, the Cerrado and most of the Atlantic Forest (Supplementary Section 2).Recent trends in yield, area and production for soybean and second-crop maize were derived from official statistics for the 2007–2019 period16. We fitted linear models to derive the annual rate of yield improvement and harvested area for soybean and second-crop maize, separately for each region (Fig. 1 and Extended Data Fig. 1). Land use change arising from soybean expansion was estimated using data from the MapBiomas project (v.5.0)10 (Supplementary Table 1). Our estimation of land use change accounted for the time lag between land conversion and the beginning of soybean production, which can include transitional stages such as the cultivation of upland rice or short-term pasture-based livestock systems42. To account for this, we looked at the new land brought into soybean production during the 2008–2019 period, and we analysed how much of this land was under a different land use type (forest, savannah, grassland, pasture or other crops) in 2000 (Extended Data Fig. 2).Estimation of yield potential and yield gapsWe used results on yield potential for Brazil that we generated through the Global Yield Gap Atlas project43 using well-validated process-based crop models and the best available sources of weather, soil and management data. Briefly, we selected 32 sites to portray the distribution of the soybean harvested area within the country, following protocols that ensure representativeness and a reasonable coverage of the national crop area44. The 32 sites collectively accounted for half of the soybean harvested area in Brazil. These sites were located within agro-climatic zones accounting for 86% of the national soybean production and accounted for 72–92% of the soybean area in each region. Following protocols that gave preference to measured data at a high level of spatial and temporal resolution45, we collected databases on weather, soil, management and crop yields for soybean for each site, and also for second-crop maize at those sites where double-cropping is practised (Supplementary Tables 2 and 3 and Supplementary Section 3).Yield potential was simulated for widespread cultivars in each region using the CROPGRO soybean model embedded in DSSAT v.4.546 and the Hybrid-Maize model47. Both models simulate crop growth and development on a daily time step. Growth rates are determined by simulating both CO2 assimilation and respiration, with partitioning coefficients to different organs dependent on developmental stage. The model phenological coefficients were calibrated to portray the crop cycle of the most dominant cultivars in each region in Brazil. We used generic default coefficients for growth-related model internal parameters such as photosynthesis, respiration, leaf area expansion, light interception, biomass partitioning and grain filling. In all cases, simulations of yield potential assumed the absence of insect pests, weeds and diseases and no nutrient limitations. In simulating yield potential, both models account for solar radiation, photoperiod, temperature, and the timing and amount of rainfall as well as soil properties influencing crop water balance.We first evaluated the CROPGRO and Hybrid-Maize models on the ability to reproduce measured phenology and yields across 40 well-managed experiments located across the four regions. The models showed satisfactory performance at reproducing the measured values (Extended Data Fig. 3). We then simulated soybean yield potential for the dominant agricultural soils at each site (usually two or three), as determined from the soil maps generated by the Radambrasil project48. The simulations were based on long-term (1999–2018) measured daily weather data retrieved from the Brazilian Institute of Meteorology49. Soybean yield potential was simulated for each year of the time series. We also simulated yield potential for second-crop maize for those sites where double-cropping is practised. To do so, we used sowing dates and cultivar maturities that maximize the overall productivity of the soybean–maize system; these sowing dates and cultivar maturities are within the current ranges in each region21,28. To estimate the average yield potential for each site, we weighted the simulated values for each soil type by soil area fraction at each site. In all cases, the simulations assumed no limitations to crop growth due to nutrient deficiencies or incidence of biotic stresses such as weeds, insect pests and pathogens. The results were upscaled from site to region and then to country following van Bussel et al.44. Briefly, the average yield potential for each region was estimated by averaging the simulated yields across the sites located within each region, weighing sites according to their share of the soybean area within each region. A similar approach was followed to upscale yield potential from region to the national level. Details on crop modelling, data sources and upscaling are provided in Supplementary Section 3.The average farmer yield was calculated separately for soybean and second-crop maize on the basis of the average yield reported over the 2012–2017 period for the municipalities that overlap with each site, weighing municipalities on the basis of their share of the soybean or maize area within each site16. Including more years before 2012 would have led to a biased estimate of average actual yield due to the technological yield trend in Brazil. Average farmer yields were estimated at the region and country levels following the same upscaling approach as for yield potential. Finally, the exploitable yield gap was calculated as the difference between attainable yield and average farmer yield. The attainable yield was calculated as 80% of the simulated yield potential, which is considered a reasonable yield for farmers with adequate access to inputs, markets and technical information (Supplementary Section 2).Assessing scenarios of intensification and land use changeWe explored three scenarios with different soybean and maize yields and areas by 2035 and assessed their outcomes in terms of production, land use change and GWP (Supplementary Table 4). A 15-year future timespan is long enough to facilitate the implementation of long-term policies, investments and technologies devoted to closing the exploitable yield gap and to implement land-use policies, but it is short enough to minimize long-term effects from climate change on crop yields and cropping systems. In the BAU scenario, historical (2007–2019) trends of soybean and second-crop maize area and yield (Extended Data Fig. 1) remain unchanged in all regions between the baseline year (2019) and the final year (2035). Likewise, soybean area expands following the same pattern of land use change observed during 2008–2019 (Extended Data Fig. 2).To explore the available opportunity for increasing production on the existing production area, we considered an NCE scenario in which there is no physical expansion of cropland while full closure of the exploitable yield gap occurs in the regions where the current yield gaps are small (the Pampa and the Atlantic Forest), and 50% closure of the exploitable yield gap takes place in regions where the current yield gaps are large (the Amazon and the Cerrado) (Supplementary Table 4). These rates are comparable to historical yield gains in the Pampa and the Atlantic Forest. A scenario of full yield closure in the Amazon and the Cerrado would have been unrealistic, as it would have required rates of yield improvement that are three to four times higher than historical rates, much higher than those in the Pampa and the Atlantic Forest, and well beyond those reported for main soybean-producing countries. In the case of second-crop maize, we assumed full closure of the exploitable yield gap by 2035 because historical rates of yield improvement are adequate to reach that yield level. Regarding second-crop maize area, we projected the proportion of double-cropping to increase from the current 47% (Amazon), 39% (Cerrado) and 31% (Atlantic Forest) to 100%, 70% and 50%, respectively, as determined on the basis of the degree of water limitation in each region (Supplementary Section 4).Finally, we explored a third scenario of intensification plus target area expansion (INT), in which identical yield gain rates and the adoption of double-cropping equivalent to those in the NCE scenario were assumed, but with physical expansion of the soybean–maize system allowed in low-C ecosystems (that is, pastures and grasslands). In this scenario, soybean expansion is limited to 5% of existing pastures and grasslands in the Pampa, the Atlantic Forest and the Cerrado (total of 5.7 Mha) as a result of a parallel intensification in the pasture-based livestock sector that frees up land for soybean production. The latter would require an increase of current stocking rates, not only for freeing up 5% of the area for soybean cultivation but also to meet the projected 7% beef production increase during the study period (2020–2035)17. Hence, an overall 12% increase in stocking rates would be required within our 15-year timeframe, which is a reasonable target as reported in previous studies and based on current trends in stocking rates16,29,32,33.Another assumption is that the yield potential of pasture and grasslands converted for soybean production is similar to that in existing soybean areas in each region. Cropland expansion into grassland and pastures was allowed in all regions except for the Amazon to prevent ‘leaking’ effects and the impact of road development on land clearing50,51. Similarly, the conversion of area cultivated with food crops for soybean production was not allowed to avoid the negative impact of indirect land use change52.Estimation of GWP and gross incomeWe estimated GHG emissions, including carbon dioxide (CO2), methane (CH4) and nitrous oxides (N2O), associated with land conversion (GHGLUC) and crop production (GHGPROD) for the baseline year (2019) and for the three scenarios by year 2035 (BAU, NCE and INT). GHGLUC includes emissions associated with changes in C stocks from aboveground and belowground biomass when land is converted for soybean production (GHGBIO), as well as GHG emissions derived from changes in soil organic C (GHGSOC). For each land use type, annual GHGBIO was estimated on the basis of the difference between C stocks of the land use type that was converted for production (Supplementary Table 5) and, depending on the scenario and region, the average C stocks of the new cropping system53,54,55:$${mathrm{GHG}}_{{mathrm{BIO}}} = {sum} {left( {{mathrm{TDM}}_i-{mathrm{TDM}}_{{mathrm{crop}}}} right) times A_i}$$
    (1)
    where i is the land cover type, TDM is the total dry matter (tC ha−1) in land cover type i and in cropland (crop), and Ai is the annual area converted from land use type i for soybean cultivation (Supplementary Table 4). C stocks for single soybean and soybean–second-crop maize systems were assumed at 2 and 5 tC ha−1, respectively53,54,55. Changes in SOC stocks were estimated following the Intergovernmental Panel on Climate Change 2019 guidelines54, available country-specific emission factors56 and the SOC values estimated for each region57,58:$${mathrm{GHG}}_{{mathrm{SOC}}} = {sum} {left( {{mathrm{SOC}}_{{mathrm{REF}},i} times F_{{mathrm{LU}}}} right) times A_i}$$
    (2)
    where SOCREF is the SOC stock for mineral soils in the upper 30 cm for the reference condition (tC ha−1)57 in land cover type i (Supplementary Table 5), and FLU is the stock change factor for SOC land-use systems for a particular land use (Supplementary Table 4). Because no-till is the predominant soil management strategy in Brazil59, we used FLU = 0.96 for natural vegetation converted to no-till annual crop production, and FLU = 1.16 for pasture and grassland converted to no-till annual crop production56. Because we wanted to assess the full impact of the three scenarios (BAU, NCE and INT) on GWP, we assigned all GHGBIO and GHGSOC derived from land conversion to the first year after land conversion and expressed them as CO2 equivalents by multiplying changes in C stocks by 3.67.Annual GHG emissions derived from soybean and second-crop maize production (GHGPROD) were calculated for each scenario and included those derived from manufacturing, packaging and transportation of agricultural inputs, fossil fuel use for field operations, soil N2O emissions derived from the application of nitrogen (N) fertilizer, and domestic grain transportation. For the baseline year (2019), annual GHG emissions from N, phosphorous (P) and potassium (K) fertilizers and other inputs (lime, pesticides and fuel) were calculated on the basis of current average input rates for soybean and second-crop maize in each region as derived from the crop management data collected for each region (Supplementary Table 6 and Supplementary Section 3.4). To calculate GHG emissions associated with manufacturing, packaging and transportation of N, P and K fertilizers and lime, we used specific updated emissions factors for South America60, selecting those fertilizer sources that are most commonly used for soybean and second-crop maize production: urea (N), monoammonium phosphate (P) and potassium chloride (K). Our calculations also included the extra lime application that is needed to correct soil acidity in converted areas. Emission factors associated with seed production, pesticides and diesel were derived from ref. 61. Soil N2O emissions derived from N fertilizer application were calculated assuming an N2O emission factor of 1% of the applied N fertilizer on the basis of the country-specific emission factor62. Emissions derived from domestic grain transportation for each region were estimated using the GHGs per ton of grain as reported by previous studies for each region63. We assumed that inputs other than nutrient fertilizer will not change relative to the baseline in the BAU scenario. In the INT scenario, applied inputs were calculated on the basis of those reported for current high-yield fields where the yield gap is small. We estimated fertilizer nutrient rates for the three scenarios following a nutrient-balance approach that depends on the projected yield for each scenario (Supplementary Table 6 and Supplementary Section 3.4).GHGPROD in the baseline year (2019) and for the three scenarios in 2035 (BAU, NCE and INT) was estimated for each region by multiplying the emissions per unit of area by the annual soybean harvested area, summing them to estimate GHG emissions at the national level. Overall 100-year GWP was estimated as the sum of GHGLUC and GHGPROD, both expressed as CO2e to account for the higher warming potential of CH4 and N2O, which are 25 and 298 times the intensity of CO2 on a per mass basis, respectively. The gross income was estimated for each scenario by multiplying the annual crop production by the average price for soybean and maize grain during the past ten years (US$453 and US$184 per t for soybean and maize, respectively1). Finally, to combine the environmental and economic impacts into one metric, we calculated the GWP intensity as the ratio between GWP and gross income.Reporting summaryFurther information on research design is available in the Nature Research Reporting Summary linked to this article. More

  • in

    Inter-annual variability patterns of reef cryptobiota in the central Red Sea across a shelf gradient

    Knowlton, N. et al. in Life in the World’s Oceans 65–78 (Wiley-Blackwell, 2010).Fisher, R. et al. Species richness on coral reefs and the pursuit of convergent global estimates. Curr. Biol. 25, 500–505. https://doi.org/10.1016/j.cub.2014.12.022 (2015).CAS 
    Article 
    PubMed 

    Google Scholar 
    Brandl, S. J., Goatley, C. H. R., Bellwood, D. R. & Tornabene, L. The hidden half: Ecology and evolution of cryptobenthic fishes on coral reefs. Biol. Rev. 93, 1846–1873. https://doi.org/10.1111/brv.12423 (2018).Article 
    PubMed 

    Google Scholar 
    Appeltans, W. et al. The magnitude of global marine species diversity. Curr. Biol. 22, 2189–2202. https://doi.org/10.1016/j.cub.2012.09.036 (2012).CAS 
    Article 
    PubMed 

    Google Scholar 
    Carvalho, S. et al. Beyond the visual: Using metabarcoding to characterize the hidden reef cryptobiome. Proc. R. Soc. B Biol. Sci. https://doi.org/10.1098/rspb.2018.2697 (2019).Article 

    Google Scholar 
    Kramer, M. J., Bellwood, O., Fulton, C. J. & Bellwood, D. R. Refining the invertivore: Diversity and specialisation in fish predation on coral reef crustaceans. Mar. Biol. 162, 1779–1786. https://doi.org/10.1007/s00227-015-2710-0 (2015).CAS 
    Article 

    Google Scholar 
    Brandl, S. J. et al. Demographic dynamics of the smallest marine vertebrates fuel coral reef ecosystem functioning. Science 364, 1189–1192. https://doi.org/10.1126/science.aav3384 (2019).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Kramer, M. J., Bellwood, D. R. & Bellwood, O. Cryptofauna of the epilithic algal matrix on an inshore coral reef, Great Barrier Reef. Coral Reefs 31, 1007–1015. https://doi.org/10.1007/s00338-012-0924-x (2012).ADS 
    Article 

    Google Scholar 
    Rocha, L. A. et al. Specimen collection: An essential tool. Science 344, 814–815. https://doi.org/10.1126/science.344.6186.814 (2014).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Berumen, M. L. et al. The status of coral reef ecology research in the Red Sea. Coral Reefs 32, 737–748. https://doi.org/10.1007/s00338-013-1055-8 (2013).ADS 
    Article 

    Google Scholar 
    Paknia, O., Sh, H. R. & Koch, A. Lack of well-maintained natural history collections and taxonomists in megadiverse developing countries hampers global biodiversity exploration. Org. Divers. Evol. 15, 619–629. https://doi.org/10.1007/s13127-015-0202-1 (2015).Article 

    Google Scholar 
    Knowlton, N. & Leray, M. Censusing marine life in the twentyfirst Century. Genome 58, 238–238 (2015).
    Google Scholar 
    Yu, D. W. et al. Biodiversity soup: Metabarcoding of arthropods for rapid biodiversity assessment and biomonitoring. Methods Ecol. Evol. 3, 613–623. https://doi.org/10.1111/j.2041-210X.2012.00198.x (2012).Article 

    Google Scholar 
    Ransome, E. et al. The importance of standardization for biodiversity comparisons: A case study using autonomous reef monitoring structures (ARMS) and metabarcoding to measure cryptic diversity on Mo’orea coral reefs, French Polynesia. PLoS ONE https://doi.org/10.1371/journal.pone.0175066 (2017).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Coker, D. J., DiBattista, J. D., Sinclair-Taylor, T. H. & Berumen, M. L. Spatial patterns of cryptobenthic coral-reef fishes in the Red Sea. Coral Reefs 37, 193–199. https://doi.org/10.1007/s00338-017-1647-9 (2018).ADS 
    Article 

    Google Scholar 
    Pearman, J. K. et al. Cross-shelf investigation of coral reef cryptic benthic organisms reveals diversity patterns of the hidden majority. Sci. Rep. 8, 8090. https://doi.org/10.1038/s41598-018-26332-5 (2018).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Pearman, J. K. et al. Disentangling the complex microbial community of coral reefs using standardized Autonomous Reef Monitoring Structures (ARMS). Mol. Ecol. 28, 3496–3507. https://doi.org/10.1111/mec.15167 (2019).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Selkoe, K. A. et al. The DNA of coral reef biodiversity: Predicting and protecting genetic diversity of reef assemblages. Proc. R. Soc. B-Biol. Sci. https://doi.org/10.1098/rspb.2016.0354 (2016).Article 

    Google Scholar 
    DiBattista, J. D. et al. Digging for DNA at depth: Rapid universal metabarcoding surveys (RUMS) as a tool to detect coral reef biodiversity across a depth gradient. PeerJ https://doi.org/10.7717/peerj.6379 (2019).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    DiBattista, J. D. et al. Assessing the utility of eDNA as a tool to survey reef-fish communities in the Red Sea. Coral Reefs 36, 1245–1252. https://doi.org/10.1007/s00338-017-1618-1 (2017).ADS 
    Article 

    Google Scholar 
    Nester, G. M. et al. Development and evaluation of fish eDNA metabarcoding assays facilitate the detection of cryptic seahorse taxa (family: Syngnathidae). Environ. DNA 2, 614–626 (2020).Article 

    Google Scholar 
    West, K. M. et al. eDNA metabarcoding survey reveals fine-scale coral reef community variation across a remote, tropical island ecosystem. Mol. Ecol. 29, 1069–1086. https://doi.org/10.1111/mec.15382 (2020).CAS 
    Article 
    PubMed 

    Google Scholar 
    DiBattista, J. D. et al. Environmental DNA can act as a biodiversity barometer of anthropogenic pressures in coastal ecosystems. Sci. Rep. https://doi.org/10.1038/s41598-020-64858-9 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Worm, B. et al. Impacts of biodiversity loss on ocean ecosystem services. Science 314, 787–790. https://doi.org/10.1126/science.1132294 (2006).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Spalding, M. et al. Mapping the global value and distribution of coral reef tourism. Mar. Policy 82, 104–113. https://doi.org/10.1016/j.marpol.2017.05.014 (2017).Article 

    Google Scholar 
    Thomsen, P. F. & Willerslev, E. Environmental DNA – An emerging tool in conservation for monitoring past and present biodiversity. Biol. Cons. 183, 4–18. https://doi.org/10.1016/j.biocon.2014.11.019 (2015).Article 

    Google Scholar 
    Hughes, T. P. et al. Spatial and temporal patterns of mass bleaching of corals in the Anthropocene. Science 359, 80–83. https://doi.org/10.1126/science.aan8048 (2018).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Monroe, A. A. et al. In situ observations of coral bleaching in the central Saudi Arabian Red Sea during the 2015/2016 global coral bleaching event. PLoS ONE https://doi.org/10.1371/journal.pone.0195814 (2018).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Roth, F. et al. Coral reef degradation affects the potential for reef recovery after disturbance. Mar. Environ. Res. 142, 48–58. https://doi.org/10.1016/j.marenvres.2018.09.022 (2018).CAS 
    Article 
    PubMed 

    Google Scholar 
    Foster, T. & Gilmour, J. P. Seeing red: Coral larvae are attracted to healthy-looking reefs. Mar. Ecol. Prog. Ser. 559, 65–71. https://doi.org/10.3354/meps11902 (2016).ADS 
    CAS 
    Article 

    Google Scholar 
    Karcher, D. B. et al. Nitrogen eutrophication particularly promotes turf algae in coral reefs of the central Red Sea. PeerJ https://doi.org/10.7717/peerj.8737 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Pancrazi, I., Ahmed, H., Cerrano, C. & Montefalcone, M. Synergic effect of global thermal anomalies and local dredging activities on coral reefs of the Maldives. Marine Pollut. Bull. https://doi.org/10.1016/j.marpolbul.2020.111585 (2020).Article 

    Google Scholar 
    Vercelloni, J. et al. Forecasting intensifying disturbance effects on coral reefs. Glob. Change Biol. 26, 2785–2797. https://doi.org/10.1111/gcb.15059 (2020).ADS 
    Article 

    Google Scholar 
    González-Barrios, F. J., Cabral-Tena, R. A. & Alvarez-Filip, L. Recovery disparity between coral cover and the physical functionality of reefs with impaired coral assemblages. Glob. Change Biol. 27, 640–651. https://doi.org/10.1111/gcb.15431 (2020).ADS 
    Article 

    Google Scholar 
    Rice, M. M., Ezzat, L. & Burkepile, D. E. Corallivory in the anthropocene: Interactive effects of anthropogenic stressors and corallivory on coral reefs. Front. Marine Sci. https://doi.org/10.3389/fmars.2018.00525 (2019).Article 

    Google Scholar 
    Lin, Y.-J. et al. Long-term ecological changes in fishes and macro-invertebrates in the world’s warmest coral reefs. Sci. Total Environ. 750, 142254. https://doi.org/10.1016/j.scitotenv.2020.142254 (2021).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Loreau, M. & de Mazancourt, C. Biodiversity and ecosystem stability: A synthesis of underlying mechanisms. Ecol. Lett. 16, 106–115. https://doi.org/10.1111/ele.12073 (2013).Article 
    PubMed 

    Google Scholar 
    Cardinale, B. J. et al. Biodiversity loss and its impact on humanity. Nature 486, 59–67. https://doi.org/10.1038/nature11148 (2012).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Handley, L. L. How will the “molecular revolution’ contribute to biological recording?. Biol. J. Lin. Soc. 115, 750–766. https://doi.org/10.1111/bij.12516 (2015).Article 

    Google Scholar 
    Ducklow, H. W., Doney, S. C. & Steinberg, D. K. Contributions of long-term research and time-series observations to marine ecology and biogeochemistry. Ann. Rev. Mar. Sci. 1, 279–302. https://doi.org/10.1146/annurev.marine.010908.163801 (2009).Article 
    PubMed 

    Google Scholar 
    Hughes, B. B. et al. Long-term studies contribute disproportionately to ecology and policy. Bioscience 67, 271–281. https://doi.org/10.1093/biosci/biw185 (2017).Article 

    Google Scholar 
    Kraft, N. J. B. et al. Community assembly, coexistence and the environmental filtering metaphor. Funct. Ecol. 29, 592–599. https://doi.org/10.1111/1365-2435.12345 (2015).Article 

    Google Scholar 
    Leibold, M. A. et al. The metacommunity concept: A framework for multi-scale community ecology. Ecol. Lett. 7, 601–613. https://doi.org/10.1111/j.1461-0248.2004.00608.x (2004).Article 

    Google Scholar 
    Vellend, M. The Theory of Ecological Communities (MPB-57). (Princeton University Press, 2016).Condon, R. H. et al. Recurrent jellyfish blooms are a consequence of global oscillations. Proc. Natl. Acad. Sci. U.S.A. 110, 1000–1005. https://doi.org/10.1073/pnas.1210920110 (2013).ADS 
    Article 
    PubMed 

    Google Scholar 
    Boero, F., Kraberg, A. C., Krause, G. & Wiltshire, K. H. Time is an affliction: Why ecology cannot be as predictive as physics and why it needs time series. J. Sea Res. 101, 12–18. https://doi.org/10.1016/j.seares.2014.07.008 (2015).ADS 
    Article 

    Google Scholar 
    Pearman, J. K., Anlauf, H., Irigoien, X. & Carvalho, S. Please mind the gap – Visual census and cryptic biodiversity assessment at central Red Sea coral reefs. Mar. Environ. Res. 118, 20–30. https://doi.org/10.1016/j.marenvres.2016.04.011 (2016).CAS 
    Article 
    PubMed 

    Google Scholar 
    David, R. et al. Lessons from photo analyses of autonomous reef monitoring structures as tools to detect (bio-)geographical, spatial, and environmental effects. Mar. Pollut. Bull. 141, 420–429. https://doi.org/10.1016/j.marpolbul.2019.02.066 (2019).CAS 
    Article 
    PubMed 

    Google Scholar 
    Pennesi, C. & Danovaro, R. Assessing marine environmental status through microphytobenthos assemblages colonizing the autonomous reef monitoring structures (ARMS) and their potential in coastal marine restoration. Mar. Pollut. Bull. 125, 56–65. https://doi.org/10.1016/j.marpolbul.2017.08.001 (2017).CAS 
    Article 
    PubMed 

    Google Scholar 
    Chang, J. J. M., Ip, Y. C. A., Bauman, A. G. & Huang, D. MinION-in-ARMS: Nanopore sequencing to expedite barcoding of specimen-rich macrofaunal samples from Autonomous Reef Monitoring Structures. Front. Marine Sci. https://doi.org/10.3389/fmars.2020.00448 (2020).Article 

    Google Scholar 
    Hazeri, G. et al. Latitudinal species diversity and density of cryptic crustacean (Brachyura and Anomura) in micro-habitat Autonomous Reef Monitoring Structures across Kepulauan Seribu, Indonesia. Biodivers. J. Biol. Divers. 20 (2019).Al-Rshaidat, M. M. D. et al. Deep COI sequencing of standardized benthic samples unveils overlooked diversity of Jordanian coral reefs in the northern Red Sea. Genome 59, 724–737. https://doi.org/10.1139/gen-2015-0208 (2016).CAS 
    Article 
    PubMed 

    Google Scholar 
    Pearman, J. K. et al. Pan-regional marine benthic cryptobiome biodiversity patterns revealed by metabarcoding Autonomous Reef Monitoring Structures. Mol. Ecol. https://doi.org/10.1111/mec.15692 (2020).Article 
    PubMed 

    Google Scholar 
    Leray, M. & Knowlton, N. DNA barcoding and metabarcoding of standardized samples reveal patterns of marine benthic diversity. Proc. Natl. Acad. Sci. U.S.A. 112, 2076–2081. https://doi.org/10.1073/pnas.1424997112 (2015).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Obst, M. et al. A marine biodiversity observation network for genetic monitoring of hard-bottom communities (ARMS-MBON). Front. Marine Sci. https://doi.org/10.3389/fmars.2020.572680 (2020).Article 

    Google Scholar 
    Hughes, T. P. et al. Ecological memory modifies the cumulative impact of recurrent climate extremes. Nat. Clim. Chang. 9, 40–43. https://doi.org/10.1038/s41558-018-0351-2 (2019).ADS 
    Article 

    Google Scholar 
    Hughes, T. P., Kerry, J. T. & Simpson, T. Large-scale bleaching of corals on the Great Barrier Reef. Ecology 99, 501–501. https://doi.org/10.1002/ecy.2092 (2018).CAS 
    Article 
    PubMed 

    Google Scholar 
    Furby, K. A., Bouwmeester, J. & Berumen, M. L. Susceptibility of central Red Sea corals during a major bleaching event. Coral Reefs 32, 505–513. https://doi.org/10.1007/s00338-012-0998-5 (2013).ADS 
    Article 

    Google Scholar 
    Froehlich, C. Y. M., Klanten, O. S., Hing, M. L., Dowton, M. & Wong, M. Y. L. Uneven declines between corals and cryptobenthic fish symbionts from multiple disturbances. Sci. Rep. https://doi.org/10.1038/s41598-021-95778-x (2021).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Bellwood, D. R. et al. Coral recovery may not herald the return of fishes on damaged coral reefs. Oecologia 170, 567–573. https://doi.org/10.1007/s00442-012-2306-z (2012).ADS 
    Article 
    PubMed 

    Google Scholar 
    Archana, A. & Baker, D. M. Multifunctionality of an urbanized coastal marine ecosystem. Front. Marine Sci. https://doi.org/10.3389/fmars.2020.557145 (2020).Article 

    Google Scholar 
    Servis, J. A., Reid, B. N., Timmers, M. A., Stergioula, V. & Naro-Maciel, E. Characterizing coral reef biodiversity: Genetic species delimitation in brachyuran crabs of Palmyra Atoll Central Pacific. Mitochondrial DNA Part A 31, 178–189. https://doi.org/10.1080/24701394.2020.1769087 (2020).CAS 
    Article 

    Google Scholar 
    Chaves-Fonnegra, A. et al. Bleaching events regulate shifts from corals to excavating sponges in algae-dominated reefs. Glob. Change Biol. 24, 773–785. https://doi.org/10.1111/gcb.13962 (2018).ADS 
    Article 

    Google Scholar 
    Perry, C. T. & Morgan, K. M. Post-bleaching coral community change on southern Maldivian reefs: Is there potential for rapid recovery?. Coral Reefs 36, 1189–1194. https://doi.org/10.1007/s00338-017-1610-9 (2017).ADS 
    Article 

    Google Scholar 
    DeCarlo, T. M. The past century of coral bleaching in the Saudi Arabian central Red Sea. PeerJ https://doi.org/10.7717/peerj.10200 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Cortés, J. et al. in Coral Reefs of the Eastern Tropical Pacific: Persistence and Loss in a Dynamic Environment (eds Peter W. Glynn, Derek P. Manzello, & Ian C. Enochs) 203–250 (Springer Netherlands, 2017).Enochs, I. C. & Manzello, D. P. Species richness of motile cryptofauna across a gradient of reef framework erosion. Coral Reefs 31, 653–661. https://doi.org/10.1007/s00338-012-0886-z (2012).ADS 
    Article 

    Google Scholar 
    Timmers, M. A. et al. Biodiversity of coral reef cryptobiota shuffles but does not decline under the combined stressors of ocean warming and acidification. Proc. Natl. Acad. Sci. 118, e2103275118. https://doi.org/10.1073/pnas.2103275118 (2021).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Khalil, M. T., Bouwmeester, J. & Berumen, M. L. Spatial variation in coral reef fish and benthic communities in the central Saudi Arabian Red Sea. PeerJ https://doi.org/10.7717/peerj.3410 (2017).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Roik, A. et al. Year-long monitoring of physico-chemical and biological variables provide a comparative baseline of coral reef functioning in the central Red Sea. PLoS ONE https://doi.org/10.1371/journal.pone.0163939 (2016).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Largier, J. L. Considerations in estimating larval dispersal distances from oceanographic data. Ecol. Appl. 13, S71–S89 (2003).Article 

    Google Scholar 
    Volkov, I., Banavar, J. R., Hubbell, S. P. & Maritan, A. Patterns of relative species abundance in rainforests and coral reefs. Nature 450, 45–49. https://doi.org/10.1038/nature06197 (2007).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Alsaffar, Z., Cúrdia, J., Borja, A., Irigoien, X. & Carvalho, S. Consistent variability in beta-diversity patterns contrasts with changes in alpha-diversity along an onshore to offshore environmental gradient: The case of Red Sea soft-bottom macrobenthos. Mar. Biodivers. 49, 247–262. https://doi.org/10.1007/s12526-017-0791-3 (2017).Article 

    Google Scholar 
    Alsaffar, Z. et al. The role of seagrass vegetation and local environmental conditions in shaping benthic bacterial and macroinvertebrate communities in a tropical coastal lagoon. Sci. Rep. https://doi.org/10.1038/s41598-020-70318-1 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Rocha, L. A. et al. Mesophotic coral ecosystems are threatened and ecologically distinct from shallow water reefs. Science 361, 281–284. https://doi.org/10.1126/science.aaq1614 (2018).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Soininen, J., Lennon, J. J. & Hillebrand, H. A multivariate analysis of beta diversity across organisms and environments. Ecology 88, 2830–2838. https://doi.org/10.1890/06-1730.1 (2007).Article 
    PubMed 

    Google Scholar 
    Chust, G. et al. Dispersal similarly shapes both population genetics and community patterns in the marine realm. Sci. Rep. https://doi.org/10.1038/srep28730 (2016).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Gianuca, A. T., Declerck, S. A. J., Lemmens, P. & De Meester, L. Effects of dispersal and environmental heterogeneity on the replacement and nestedness components of beta-diversity. Ecology 98, 525–533. https://doi.org/10.1002/ecy.1666 (2017).Article 
    PubMed 

    Google Scholar 
    Enochs, I. C., Toth, L. T., Brandtneris, V. W., Afflerbach, J. C. & Manzello, D. P. Environmental determinants of motile cryptofauna on an eastern Pacific coral reef. Mar. Ecol. Prog. Ser. 438, 105-U127. https://doi.org/10.3354/meps09259 (2011).ADS 
    Article 

    Google Scholar 
    Hughes, T. P. et al. Coral reefs in the anthropocene. Nature 546, 82–90. https://doi.org/10.1038/nature22901 (2017).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Fabricius, K. E. Effects of terrestrial runoff on the ecology of corals and coral reefs: Review and synthesis. Mar. Pollut. Bull. 50, 125–146. https://doi.org/10.1016/j.marpolbul.2004.11.028 (2005).CAS 
    Article 
    PubMed 

    Google Scholar 
    Chaidez, V., Dreano, D., Agusti, S., Duarte, C. M. & Hoteit, I. Decadal trends in Red Sea maximum surface temperature. Sci. Rep. https://doi.org/10.1038/s41598-018-25731-y (2018).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Hubbell, S. P. in Monographs in Population Biology. The unified neutral theory of biodiversity and biogeography Vol. 32 Monographs in Population Biology i-xiv, 1–375 (2001).Dornelas, M., Connolly, S. R. & Hughes, T. P. Coral reef diversity refutes the neutral theory of biodiversity. Nature 440, 80–82. https://doi.org/10.1038/nature04534 (2006).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Baselga, A. Partitioning the turnover and nestedness components of beta diversity. Glob. Ecol. Biogeogr. 19, 134–143. https://doi.org/10.1111/j.1466-8238.2009.00490.x (2010).Article 

    Google Scholar 
    Legendre, P. Interpreting the replacement and richness difference components of beta diversity. Glob. Ecol. Biogeogr. 23, 1324–1334. https://doi.org/10.1111/geb.12207 (2014).Article 

    Google Scholar 
    Hollander, M. & Wolfe, D. A. Nonparametric statistical methods. Ergonomics 18, 701–702 (1975).
    Google Scholar 
    Kohler, K. E. & Gill, S. M. Coral point count with excel extensions (CPCe): A visual basic program for the determination of coral and substrate coverage using random point count methodology. Comput. Geosci. 32, 1259–1269. https://doi.org/10.1016/j.cageo.2005.11.009 (2006).ADS 
    Article 

    Google Scholar 
    Geller, J., Meyer, C., Parker, M. & Hawk, H. Redesign of PCR primers for mitochondrial cytochrome c oxidase subunit I for marine invertebrates and application in all-taxa biotic surveys. Mol. Ecol. Resour. 13, 851–861. https://doi.org/10.1111/1755-0998.12138 (2013).CAS 
    Article 
    PubMed 

    Google Scholar 
    Hao, X., Jiang, R. & Chen, T. Clustering 16S rRNA for OTU prediction: A method of unsupervised Bayesian clustering. Bioinformatics 27, 611–618. https://doi.org/10.1093/bioinformatics/btq725 (2011).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Callahan, B. J. et al. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods 13, 581. https://doi.org/10.1038/nmeth.3869 (2016).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2020).Ranwez, V., Harispe, S., Delsuc, F. & Douzery, E. J. P. MACSE: Multiple alignment of coding SEquences accounting for frameshifts and stop codons. PLoS ONE https://doi.org/10.1371/journal.pone.0022594 (2011).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Machida, R. J., Leray, M., Ho, S. L. & Knowlton, N. Data Descriptor: Metazoan mitochondrial gene sequence reference datasets for taxonomic assignment of environmental samples. Sci. Data https://doi.org/10.1038/sdata.2017.27 (2017).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 73, 5261–5267. https://doi.org/10.1128/aem.00062-07 (2007).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Generate High-Resolution Venn and Euler Plots v. 1.6.20 (2018).Ginestet, C. ggplot2: Elegant graphics for data analysis. J. R. Stat. Soc. Ser. Stat. Soc. 174, 245–245. https://doi.org/10.1111/j.1467-985X.2010.00676_9.x (2011).Article 

    Google Scholar 
    McMurdie, P. J. & Holmes, S. phyloseq: An R package for reproducible interactive analysis and graphics of microbiome census data. PLoS ONE https://doi.org/10.1371/journal.pone.0061217 (2013).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Anderson, M. J. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 26, 32–46. https://doi.org/10.1111/j.1442-9993.2001.01070.pp.x (2001).Article 

    Google Scholar 
    Hervé, M. Testing and plotting procedures for biostatistics v. 0.9-79. Retrieved from https://cran.r-project.org/web/packages/RVAideMemoire/index.html (2021).De Caceres, M. & Legendre, P. Associations between species and groups of sites: Indices and statistical inference. Ecology 90, 3566–3574. https://doi.org/10.1890/08-1823.1 (2009).Article 
    PubMed 

    Google Scholar 
    Legendre, P. & Anderson, M. J. Distance-based redundancy analysis: Testing multispecies responses in multifactorial ecological experiments. Ecol. Monogr. 69, 1–24. https://doi.org/10.1890/0012-9615(1999)069[0001:dbratm]2.0.co;2 (1999).Article 

    Google Scholar 
    Roberts, D. Ordination and multivariate analysis for ecology v. 2.0-1. Retrieved from http://ecology.msu.montana.edu/labdsv/R (2019).Dray, S., Bauman, D., Blanchet, G., Borcard, D., Clappe, S., Guenard, G. & Wagner, H. Adespatial: Multivariate multiscale spatial analysis v. 0.3-13. Retrieved from https://cran.r-project.org/package=adespatial (2021). More

  • in

    Spatial structure of city population growth

    Overview of U.S. domestic migration flowsThe most recent ACS county-to-county flow dataset26 reports that about 45.6 million people migrated to the U.S. per year during the period 2015–2019, which corresponds to 14.2% of the U.S. population27. Approximately 43.5 million annual moves corresponded to domestic migration (moves within the U.S.28), while 2.1 million accounted for inflows of individuals from other countries (viz. international immigration).With respect to domestic migration, 25.7 million people per year migrated within the same county, thus showing that the highest share of domestic flows (59%) is intra-county. Annually, about 10.4 people moved between different counties within the same state, thus intra-state flows account for 24% of the domestic migration (Supplementary Fig. 1), mainly driven by the search for more affordable housing, better jobs, and for family reasons such as change in marital status29. Long distance moves, captured by inter-state flows, represent the remaining 17% of domestic flows, which comprises about 7.5 million moves per year. Here, we will refer to these domestic migration flows as inflows or outflows, and netflows (inflows-outflows).The United States Office of Management and Budget (OMB) classifies counties as metropolitan, micropolitan, or neither30. A metropolitan statistical area contains a core urban area of at least 50,000 population. A metro area represents a functional delineation of an urban area with a network of strong socioeconomic ties, and provision of infrastructure services31,32,33. A micropolitan statistical area contains an urban core of at lest 10,000 but less than 50,000 inhabitants. There are over 380 metropolitan statistical areas in the U.S., each composed of one or more counties, accounting for about 86% of the total U.S. population and comprising approximately 28% of the land area of the country. For this reason, our analysis focuses on the growth dynamics of MSA counties. Supplementary Fig. 2 shows the 3141 counties (administrative subdivisions of the states) in the U.S., comprising about 321 million inhabitants in the starting year of the ACS 5-Year survey period (2015–2019) of our analysis26.Population growth has two components, namely natural growth and migration. Natural growth accounts for births minus deaths, and migration comprises domestic and international migration. With recent trends showing that births and natural increase have declined in the U.S. and in recent years contribute less to overall city population growth34,35, migration patterns become more relevant to the study of city population growth. Because the ACS flow files contain international inflows only, the relative importance of migrations on population growth is here addressed by x = ∣Inflows−Outflows∣/∣Births−Deaths∣ (Supplementary Figs. 3, 4), which is the ratio between domestic netflows and natural growth. The statistical distribution of this quantity computed for all U.S. counties is well fitted by a lognormal distribution, and shows that x≥1 for 76.5% of counties. For most counties, domestic migration dominates population growth, and understanding the spatial structure of domestic netflows (and their distribution within a city) is crucial to the comprehension of the mechanisms behind the heterogeneity of city population growth.At this spatial granularity, we observe a strong heterogeneity among the U.S. counties (Supplementary Fig. 2) for the period 2015 − 2019, along with examples of specific MSAs. In particular, the relative dispersion of counties relative growth due to netflows is higher than one for about 85% of the metro areas, indicating a large heterogeneity within the same city and pointing towards the spatial structure of domestic migration. The observed difference in the netflows stresses the relevance of our approach: counties belonging to the same city may have specific growth rates due to population flow patterns, thus indicating preferential flow destinations and pinpointing the direction in which the city has expanded.Heterogeneity of inter- and intra-city flowsInter-city flows represent the major component of the total flows (~55%), while intra-city flows represent ~25%. Flows between metro and micro areas, and between metro and non-statistical areas are the smallest components, with ~13% and ~7%, respectively. Given that about 80% of the domestic migration are composed of intra- and inter-city flows, we will focus our attention on describing the structure of intra- and inter-city flows, but in the Supplementary Information we offer a brief analysis of flows between metro and micro areas, and between metro and non-statistical areas.Inter-city flows are not uniform across the U.S. cities. The most intense annual netflows ( >2000 people per year), accounting for approximately 17% of the entire inter-city U.S. netflows, are mainly from New York and Chicago to California and Florida (Fig. 2), and from Los Angeles to neighboring cities. Notably, netflows among the Midwestern cities are mostly negative and below the threshold we set. These flows are mainly responsible for increasing or decreasing the population of a given city. Intra-city flow patterns, illustrated with the 7 most populous U.S. cities with more than 5 counties, are also non-uniform.Fig. 2: Heterogeneity of inter- and intra-city netflows.The map (A) suggests that the domestic redistribution of people between different U.S. metro areas are non-uniform: the black arrows, indicating the direction of the most intense inter-city netflows (higher than 2000 people per year), reveal migration trends from northern and eastern cities to western and southern regions. Cities (composed of one or more counties) are colored according to the relative growth (viz. population growth adjusted by population) of the whole MSA during the 2015–2019 period, and the black intensity and the thickness of the arrows are proportional to the netflows. Alaska and Hawaii are not shown. Panels (B–H), which are close-up of New York (B), Chicago (C), Dallas (D), Houston (E), Washington D.C. (F), Philadelphia (G), Atlanta (H), suggest that the most intense intra-city netflows are oriented radially outwards: people are moving from core to external counties. Here, counties are colored according to their relative growth in the 2015–2019 period and the width of the arrows is proportional to the netflows between origin and destination counties.Full size imageOur analysis reveals that city centers (defined as the core county with the highest population density) are more likely to have negative netflows, indicating that people are leaving the central regions of cities. The arrows in Fig. 2 indicate the direction of the most intense netflows, supporting this finding and highlighting that there is a trend of people moving from internal to external regions, contributing to population growth and spatial expansion of U.S. cities. In fact, we found no correlation between relative population growth (viz. population growth by county size) and distance from the core county (Supplementary Fig. 5A) for the 46 cities with more than 5 counties, with relative growth about 0.03 ± 0.05. On the other hand, we found that relative natural growth (Supplementary Fig. 5B) is negatively correlated with the distance to core county, thus natural growth is less relevant as a component of growth in the outer regions of cities. Consequently, our results show that not only the contribution of each component of growth changes with distance to core county, but also that the internal redistribution of people is an important mechanism of growth, mainly in the external counties.We also examined variability in inter- and intra-city flows within the 50 states (Supplementary Fig. 6). Total flows within a state increase, as expected, with the state population. Two special cases are, however, of interest: (1) two states (Vermont and Rhode Island) with small populations have only one MSA, in which case within-state inter-city flows are zero; and (2) nearly 40%, or 149, of MSAs have only one county, in which case intra-city flows could not be estimated. For all other cases, we observe on average an equal split between inter- and intra-city flows, but with considerable variability among the states, with a mean about 0.5 and standard deviation about 0.2. A generalization of the intra- and inter-city migratory patterns for all 46 cities with more than 5 counties shows that the percentage of migrants from intra- and inter-city flows are of the same order of magnitude (Fig. 3).Fig. 3: Roles of intra- and inter-city flows in driving the heterogeneous population growth of cities.We define the core county as the one with the highest population density, and we plot the percentage of inflows due to intra- (A) and inter-city flows (B) of each county within a city as a function of its distance to the core county. The percentage of outflows due to intra- and inter-city flows are shown in (C) and (D), respectively. The positive correlation of the relative growth with distance due to intra-city flows in (E), along with the lack of correlation due to inter-city flows in (F), indicates that intra-city flows are mainly responsible for increasing the population in the external regions of cities. The sizes of red circles and blue squares are proportional to the city population. The range of distances is split into equally spaced bins. The number of counties n within each bin, from left to right, is 46, 1, 4, 7, 7, 17, 21, 31, 36, 38, 34, 31, 31, 30, 20, 20, 21, 14, 17, 9, 9, 6, 4, 2, 5, 5, 2, 1. The black dots and the error bars indicate the mean and the 90% interval, respectively, of the counties within the corresponding bin. We also show the Pearson correlation coefficient R and the p-value associated with the two-sided test of the null hypothesis of non-correlation.Full size imageApart from the core county, flows from the same city correspond to about 50% of the inflow of people in the counties, presenting a slightly positive correlation with their distance from the city center (Fig. 3A). The low percentage for the core county indicates that it is not the major destination of flows from the same city. The percentage of inflows from other cities is higher in the core county and decays as we move towards the suburbs (Fig. 3B). The moderate negative correlation of this percentage with the distance reveals that inflows from other cities are more likely to concentrate in the core regions of a city.The percentage of outflows directed from the core county to other counties within the same city has a slightly negative correlation with the distance of the origin county to the city center, so it is more likely to find intra-city flows with outflows from internal regions (Fig. 3C). The core county is an exception again, suggesting that it is less likely that someone leaving the core county will move to another county within the same city. The slightly negative correlation of the percentage of outflows directed to other cities suggests that there is a trend of people leaving the core county and the central regions to move to other cities (Fig. 3D). The high percentage of inflows (Fig. 3B) and outflows (Fig. 3D) in the central region due to inter-city flows implies that the central regions of cities are more dynamic and diverse and that people tend to move to counties with similar levels of urbanization. The same pattern is observed for flows between metro and micro areas, and for metro and non-statistical areas, allowing us to conclude that people moving from rural areas are more likely to move to the external regions of a city (Supplementary Fig. 7).The positive correlation of the relative growth with the distance due to intra-city flows (Fig. 3E) shows that the resulting intra-city redistribution of people, given by the difference between inflows and outflows, is such that there is a trend from core county to the external counties (viz. suburbs). When compared to the relative growth due to inter-city flows (Fig. 3F), which do not show any trend and that have negative values for the most distant counties, it becomes clear that intra-city flows play a major role in the population increase observed in outer regions of cities. Interestingly, large circle and square dots in Fig. 3E and F suggest that the loss of people due to inter-city netflows is more intense than the gain of people due to intra-city netflows in some external counties of the largest metro areas, thus explaining the population decline in some outer regions of New York and Chicago (as shown in Fig. 2B and C).The population growth due to intra-city flows is also depicted in Fig. 4. The concentration of flows below the diagonal captures the heterogeneity and the preferential destination of intra-city netflows. We observe that people are more likely to move to lower population density counties when moving from one place to another within the same city, as exemplified by 7 cities in panel A. Panel B summarizes this analysis for the 46 cities with more than 5 counties by showing the fraction ({{{{{{{mathcal{F}}}}}}}}) of intra-city netflows to lower density counties. We note that more than 93% of the cities have ({{{{{{{mathcal{F}}}}}}}} > 0.5) and that there is a positive correlation of ({{{{{{{mathcal{F}}}}}}}}) with the city population, and C shows the rank of cities according to the fraction of intra-city netflows to lower density counties.Fig. 4: People are moving to counties with lower population density.A The population density of the origin (ρo) and destination (ρd) counties of intra-city netflows for New York, Chicago, Dallas, Houston, Washington D.C., Philadelphia, Atlanta, reveal that the majority of the flows occur from high to low-density counties. The size of the symbols are proportional to the intensity of the netflow, and the black line corresponds to y = x. B The fraction of netflows to lower density counties ({{{{{{{mathcal{F}}}}}}}}) has a positive correlation with city population when we consider the 46 MSAs with more than 5 counties, suggesting that intra-city netflows to lower density counties are more frequent as the city size increases. We also show the Pearson correlation coefficient R and the p-value associated with the two-sided test of the null hypothesis of non-correlation. C The ranking of the cities according to ({{{{{{{mathcal{F}}}}}}}}).Full size imagePopulation density does not seem to play a major role in driving flows between counties of different cities. The fraction of inter-city netflows to lower density counties is about 57% when we consider all the 384 MSAs. The heterogeneity in the inter-city netflow pattern can be assessed by analyzing ({{{{{{{mathcal{F}}}}}}}}) versus the population of the destination city (Fig. 5A, B) and ({{{{{{{mathcal{F}}}}}}}}) versus the population of the origin city (Fig. 5C, D). The negative correlation of ({{{{{{{mathcal{F}}}}}}}}) with the population of the destination city in panel A indicates that inflows are more likely to come from lower density counties as the destination city size increases. The positive correlation of ({{{{{{{mathcal{F}}}}}}}}) with the population of the origin city in panel C reveals that outflows tend to be directed to lower density counties as the origin city size increases. The trends observed in panels A and C reveal that inter-city flows are more likely between counties with different population densities rather than between counties with similar population densities. Panels B and D show the rank order of cities according to a function of the destination city size and the origin city size, respectively.Fig. 5: Inter-city flow patterns depend on the population size of the origin and destination cities.Each point corresponds to a particular city. A Fraction ({{{{{{{mathcal{F}}}}}}}}) of netflows going to lower density counties versus the population of the destination city. Inflows to counties of large cities (with population greater than 106, dashed line) usually comes from counties with lower population densities. B Rank of cities according to the share of inflows from lower density counties. C Fraction ({{{{{{{mathcal{F}}}}}}}}) versus the population of the origin city. Outflows from counties of large cities usually go to cities with lower density counties. D The rank of cities according to the share of inter-city netflows to lower density counties is presented. The dots are colored according to the city population density (darker red means higher density). We also show the Pearson correlation coefficient R and the p-value associated with the two-sided test of the null hypothesis of non-correlation.Full size imageWe would expect that there might be preferential locations within a given city to which people move due to various factors such as lower costs of housing and employment opportunities. However, it seems that house prices have little to no effect on intra-city netflows (Supplementary Fig. 8). While the fraction of intra-city netflows to counties with less expensive houses is about 0.8 for cities like New York, Chicago and Washington, this fraction is about 0.2 for cities like Dallas, Houston and Philadelphia. The lack of a clear national pattern highlights the specificity of each city and the heterogeneity of the regional housing market in the U.S.36,37. On the other hand, the fraction of intra-city netflows to counties with lower unemployment rates is higher than 0.5 for about 2/3 of the cities (Supplementary Fig. 9), thus showing that people are more likely to move to counties with lower unemployment rates.Statistical structure of inter-city flowsIntra-city flows capture the internal redistribution of population, without altering the total city population. In this context, we focus on inter-city flows to investigate whether or not extreme flows play an important role in shaping the growth of counties as observed at the city level5. For cities, Verbavatz and Barthelemy5 introduce a stochastic equation to describe population growth, composed of two terms. The first term accounts for out-of-system growth, which includes natural growth and international migration, and the second term accounts for the growth due to domestic netflows. They find that total netflows adjusted by population size can be well approximated by a Lévy distribution, and this heavy-tailed distribution indicates that rare and extreme inter-city flows (viz. migratory shocks) dominate city population growth.Here, we find that, for counties, the distribution of total netflows adjusted by population size, which is represented by ζi and captures the intensity of inter-city migratory flows (see the section “Methods” for details), can be approximated by a Gaussian distribution (Fig. 6). The lack of a heavy tail in the empirical distribution of ζi suggests the absence of extreme flows at the county level, thus indicating that the growth of counties can be described by smoother migratory process than cities. Given that cities do experience migratory shocks5, our findings indicate that cities redistribute inflows among its different counties, leading to a spill-over effect that dampens flow shocks at the county level.Fig. 6: Extreme shocks are dissipated at the county level.The distribution of ζi, which is the sum of the netflows of a county i adjusted by its population, suggests that migratory events are exponentially bounded at the county level since ζi is well described by a Gaussian distribution. The distribution of ζi is computed here for all the counties with at least 50.000 inhabitants. We also show the result of the two-sided KS test under the null hypothesis that ζi follows a Gaussian distribution.Full size imageHeterogeneity of international inflowsThe highest share of international inflows is concentrated in large cities. About 40% of the international inflows are destined to the top 10 (~2.6%) largest metro areas of the U.S. New York is the first with 8.5% of international inflows, followed by Los Angeles and Miami with 5.4% and 5.0%, respectively. Indeed, international inflows Yk scale superlinearly with the population Sk of the metro area k (Fig. 7A), thus larger cities have more immigrants per capita than smaller cities.Fig. 7: International inflow scales superlinearly with city size.Panel (A) shows the number of international immigrants as a function of the city size S for the 384 U.S. metro areas. The performance of the model Y = Y0Sθ, in which θ = 1.19 (95% CI [1.13, 1.24]) and Y0 = 4.10−4 is a normalization constant, is assessed by the coefficient of determination R2. Note that the spread of empirical data around the model narrows as the size of the city increases. Panel (B) shows the rank of the metro areas and the residues, which captures the deviation from the null model thus highligthing cities receiving more/less than expected international inflows. Names of the cities are followed by two-letter state abbreviations.Full size imageInterestingly, this gain with scale is also observed in socioeconomic city metrics as crime, GDP, innovation and wealth creation due to the manifestation of nonlinear agglomeration phenomena38,39,40. Using Y = Y0Sθ as a null model, we can compute deviations from the average behavior by means of residuals given by (log ({Y}_{k}/{Y}_{0}{S}_{k}^{theta }))38. The rank of the residues (Fig. 7B) shows that college towns are among the top metro areas receiving more international inflows than expected, while large cities as Los Angeles, New York, Atlanta, and Chicago are among the metro areas receiving less international inflows than expected.The spatial distribution of international inflows within cities is shown in Supplementary Fig. 10. The highest share of inflows is concentrated at core counties, and the percentage of inflows decreases dramatically with the distance from the core county. This result suggests that inflow of international migrants is an important component of population growth, particularly at the core regions of large cities.Robustness of our findingsPatterns of population redistribution change from time to time in the U.S., and are affected by several factors. For instance, in the 1960s non-metropolitan counties lost about 3 million people due to outflows to metropolitan counties, while the reverse trend was observed in the 1970s when non-metropolitan counties experienced net inflows of about 2.6 million people41. Wardwell and Brown in41 indicate that three factors might be among the main reasons of such change, namely economic decentralization, preference for rural living, and modernization of rural life. The temporal influence of factors as socioeconomic conditions, transportation infrastructure, natural amenities, and land use and development on population growth in rural and suburban areas is explored in42. Changes in rural migration patterns are also studied in43, where age-specific rural migration patterns from 1950 to 1995 are analyzed. In44, the authors explore redistribution trends across U.S. counties from 1980 to 1995 split into three five year periods (1980–1985, 1985–1990, 1990–1995), and45 analyzes changes in age-specific nationwide migration patterns from 1950 to 2010.The spatial structure of migration patterns may indeed change from time to time; our results correspond to the current intra- and inter-city redistribution trends, based on the most recent ACS migration flow files. We present a thorough empirical and statistical analysis of domestic migration flows among U.S. cities ans counties. Our study also introduces a framework that can be used for analyzing and comparing internal redistribution of people across different time periods. Indeed, we extended our analysis for two other time periods, 2005–2009 and 2010–2014. With respect to the spatial distribution of intra- and inter-city flows, similar trends are observed in both periods (Supplementary Figs. 12, 13), namely inter-city flows are responsible for the highest share of inflows to core counties, and intra-city flows are responsible for the highest share of inflows to external counties. We also explored the role of population density in driving netflows between counties within the same metro area in 2005–2009 and 2010–2014. The results (Supplementary Figs. 14 and 15) indicate that 95.7% of cities were dominated by intra-city moves to lower density counties in 2005–2009, and this percentage dropped to 76.1% in 2010–2014. Our findings indicate that the trends we report here are taking place since 2005 but with different intensities.The robustness of our findings is checked with additional migration data from the Internal Revenue Service (IRS), which reports the year-to-year address changes on individual tax returns filled with the IRS46. The results obtained with the analysis of IRS datasets from periods 2015–2016, 2016–2017, 2017–2018, 2018–2019 (Supplementary Figs. 16, 17, 18, 19), reveal similar trends to those we found using ACS data. Particularly, we observe that, for all periods considered, the correlation between intra-city netflow/S and distance to core county is stronger than we found with ACS data, thus highlighting the role of intra-city flows in driving population to external regions of cities. The main difference between both datasets is in the percentage of intra- and inter-city inflows and outflows: while ACS data indicates that both flows have approximately the same contribution to the total flows, the IRS data indicates that, besides the core county, intra-city flows are responsible for about 80% of inflows and outflows of metro areas. More

  • in

    A real-time rural domestic garbage detection algorithm with an improved YOLOv5s network model

    Attention combination mechanismDue to the difficulty in extracting features from target areas in images, the high computational effort of the model and the low accuracy of detection are addressed. As shown in Fig. 3, we introduce a lightweight feedforward convolutional attention module CBAM after the backbone network Focus module of the YOLOv5s network model. The SE-Net (Squeeze and Excitation Networks) channel attention module is posted at the end of the backbone network. We propose an attention combination mechanism based on the YOLOv5s network model and name the improved network model YOLOv5s-CS. Where the CBAM module has a channel number of 128, a convolutional kernel size of 3 and a step size of 2, the SELayer has a channel number of 1024 and a step size of 4.Figure 3YOLOv5 backbone network structure before and after improvement.Full size imageConvolutional block attention module networkIn 2018, Woo et al.25 proposed the lightweight feedforward convolutional attention module CBAM. The CBAM module focuses on feature information from both channels and space dimensions and combines feature information to some extent to obtain more comprehensive reliable attentional information26. CBAM consists of two submodules, the channel attention module (CAM) and spatial attention module (SAM), and its overall module structure is shown in Fig. 4a.Figure 4Principle of CBAM.Full size imageThe working principle of the CAM is shown in Fig. 4b. First, the feature map F is input at the input entrance. Second, the global maximum pooling operation and the global average pooling operation are applied to the width and height of the feature map respectively to obtain two feature maps of the same size. Third, two feature maps of the same size are input to the shared parameter network MLP at the same time. Finally, the new feature map output from the shared parameter network is subjected to a summation operation and a sigmoid activation function to obtain the channel attention features ({M}_{c}).The channel attention module CAM is calculated as shown in Formula (1):$${text{M}}_{rm{c}}({text{F}}){=sigma}({text{MLP (AvgPool (F))}}+ {text MLP (MaxPool (F)))}{=sigma}({rm{W}}_{1}({text{W}}_{0}({text{F}}_{{{rm{avg}}}^{rm{c}}}))+{rm{W}}_{0}({rm{W}}_{1}({rm{F}}_{{{rm{max}}}^{rm{c}}})))$$
    (1)
    where σ represents the sigmoid function, MLP represents the shared parameter network, ({text{W}}_{0}) and ({text{W}}_{1}) represent the shared weights, ({text{F}}_{text{avg}}^{text{c}}) is the result of feature map F after global average pooling, and ({text{F}}_{text{max}}^{text{c}}) is the result of feature map F after global maximum pooling.The working principle of SAM is shown in Fig. 4c. The feature map F’ is regarded as the input of the SAM. F’ is obtained by multiplying the input of SAM with the output of CAM. First, the global maximum pooling operation and the global average pooling operation are applied to the channels of the feature map to obtain two feature maps of the same size. Second, two feature maps that have completed the pooling operation are stitched at the channels and the feature channels are dimensioned down using the convolution operation to obtain a new feature map. Finally, spatial attention features ({text{M}}_{text{s}}) are generated using the sigmoid activation function.The spatial attention module (SAM) is calculated, as shown in Formula (2):$${text{M}}_{text{s}}left({text{F}}right) {=sigma}left({text{f}}^{7 times 7}left(left[{text{AvgPool}}left({text{F}}right)text{;MaxPool}left({text{F}}right)right]right)right) {=sigma}left({text{f}}^{7 times 7}left(left[{text{F}}_{text{avg}}^{text{s}} ; {text{F}}_{text{max}}^{text{s}}right]right)right)$$
    (2)
    where σ is the sigmoid function, ({text{f}}^{7 times 7}) denotes the convolution operation with a filter size of 7 × 7, ({text{F}}_{text{avg}}^{text{s}}) is the result of the feature map after global average pooling, and ({text{F}}_{text{max}}^{text{s}}) is the result of the feature map after global maximum pooling.Squeeze and excitation networkIn 2018, Hu et al.27 proposed a single-path attention network structure SE-Net. SE-Net uses the idea of an attention mechanism to analyze the relationship feature maps by modeling and adaptively learning to obtain the importance of each feature map28 and then assigns different weights to the original feature map for updating according to the importance. In this way, SE-Net pays more attention to the features that are useful for the target task while suppressing useless feature information and allocates computational resources rationally to different channels to train the model to achieve better results.The SE-Net attention module is mainly composed of two parts: the squeeze operation and excitation operation. The structure of the SE-Net module is shown in Fig. 5.Figure 5The SE-Net module structure.Full size imageThe squeeze operation uses global average pooling to encode all spatial features on the channel as local features. Second, each feature map is compressed into a real number that has global information on the feature maps. Finally, the squeeze results of each feature map are combined into a vector as the weights of each group of feature maps. It is calculated as shown in Eq. (3):$${text{Z}}_{text{c}}={text{F}}_{text{sq}}left({text{u}}_{text{c}}right)=frac{1}{text{H} times {text{W}}}sum_{text{i=1}}^{text{H}}sum_{text{j=1}}^{text{W}}{{text{u}}}_{text{c}}left(text{i,j}right) , , , $$
    (3)
    where H is the height of the feature map, W is the feature map width, u is the result after convolution, z is the global attention information of the corresponding feature map, and the subscript c indicates the number of channels.After completing the squeeze operation to obtain the channel information, the feature vector is subjected to the excitation operation. First, it passes through two fully connected layers. Second, it uses the sigmoid function. Finally, the output weights are assigned to the original features. It is calculated as follows:$$text{s} = {text{F}}_{text{ex}}left(text{z,W}right){=sigma}left({text{g}}left(text{z,W}right)right){=sigma}left({text{W}}_{2}{delta}left({text{W}}_{1}{text{z}}right)right)$$
    (4)
    $$widetilde{{text{x}}_{rm{c}}}={text{F}}_{rm{scale}}left({text{u}}_{rm{c}}, {text{s}}_{rm{c}}right)={text{s}}_{rm{c}}{{text{u}}}_{rm{c}}$$
    (5)
    where σ is the ReLU activation function, δ represents the sigmoid activation function, and ({text{W}}_{1}) and ({text{W}}_{2}) represent two different fully connected layers. The vector s represents the set of feature mapping weights obtained through the fully connected layer and the activation function. (widetilde{{x}_{c}}) is the feature mapping of the x feature channel, ({text{s}}_{text{c}}) is a weight, and ({text{u}}_{text{c}}) is a two-dimensional matrix.Target detection layerThe garbage in rural areas is a smaller target and has fewer pixel characteristics, such as capsule, button butteries. Therefore, we insert a small target detection layer to improve the head network structure based on the original YOLOv5s network model for detecting objects with small targets to optimize the problem of missed detection in the original network model. The YOLOv5s network structure with the addition of the small target detection layer is shown in Fig. 6 and named YOLOv5s-STD.Figure 6The YOLOv5s-STD network structure.Full size imageIn the seventeenth layer of the neck network, operations such as upsampling are performed on the feature maps so that the feature maps continue to expand. Meanwhile, in the twentieth layer, the feature maps obtained from the neck network are fused with the feature maps extracted from the backbone network. We insert a detection layer capable of predicting small targets in the thirty-first layer. To improve the detection accuracy, we use a total of four detection layers for the output feature maps, which are capable of detecting smaller target objects. In addition to the three initial anchor values based on the original model, an additional set of anchor values is added as a way to detect smaller targets. The anchor values of the improved YOLOv5s network model are set to [5, 6, 8, 14, 15, 11], [10, 13, 16, 30, 33, 23], [30, 61, 62, 45, 59, 119] and [116, 90, 156, 198, 373, 326].Bounding box regression loss functionThe loss function is an important indicator of the generalization ability of a model. In 2016, Yu et al.29 proposed a new joint intersection loss function IoU for bounding box prediction. IoU stands for intersection over union, which is a frequently used metric in target detection. It is used not only to determine the positive and negative samples, but also to determine the similarity between the predicted bounding box and the ground truth bounding box. It can be described as shown in the Eq. (6):$$text{IoU} = frac{left|text{A} capleft.{text{B}}right|right.}{left|{text{A}} cupleft.{text{B}}right|right.}$$
    (6)
    where the value domain of IoU ranges from [0,1]. A and B are the areas of arbitrary regions. Additionally, when IoU is used as a loss function, it has to scale invariance, as shown in Eq. (7):$$text{IoU_Loss} = 1-frac{left|text{A} cap left.{text{B}}right|right.}{left|{text{A}} cup left.{text{B}}right|right.}$$
    (7)
    However, when the prediction bounding box and the ground truth bounding box do not intersect, namely IoU = 0, the distance between the arbitrary region area of A and B cannot be calculated. The loss function at this point is not derivable and cannot be used to optimize the two disjoint bounding boxes. Alternatively, when there are different intersection positions, where the overlapping parts are the same but in different overlapping directions, the IoU loss function cannot be predicted.To address these issues, the idea of GIoU (Generalized Intersection over Union)30, in which a minimum rectangular Box C of A and B is added, was proposed in 2019 by Rezatofighi et al. Suppose the prediction bounding box is B, the ground truth bounding box is A, the area where A and B intersect is D, and the area containing two bounding boxes is C, as shown in Fig. 7.Figure 7GIoU evaluation chart.Full size imageThen, the GIoU calculation, as shown in Formula (8), is:$$text{GIoU}= text{IoU}-frac{text{|C}-left({text{A}} cup {text{B}}right)text{|}}{text{|C|}}$$
    (8)
    The GIoU_Loss is calculated as (9):$$text{GIoU_Loss=1}-{text{IoU}}-frac{text{|C}-left({text{A}} cup {text{B}}right)text{|}}{text{|C|}}$$
    (9)
    The original YOLOv5 algorithm uses GIoU_Loss as the loss function. Comparing Eqs. (6) and (8), it can be seen that GIoU is a new penalty term (frac{text{|C}-left({text{A}} cup {text{B}}right)text{|}}{text{|C|}}) that is added to IoU and is clearly represented by Fig. 7.Although the GIoU loss function solves the problem that the gradient of the IoU loss function cannot be updated in time and the prediction bounding box, the direction of the ground truth bounding box is not consistent when predicting, but there are still disadvantages, as shown in Fig. 8.Figure 8Comparsion of loss values.Full size imageFigure 8 shows three different position relationships formed when the predicted bounding box and the ground truth bounding box overlap exactly. Among them, the ratio of the length to width of the green grounding truth bounding box is 1:2, and the red predicted bounding box has the same aspect ratio as the ground truth bounding box, but the size is only one-half of the green ground truth bounding box. When the prediction bounding box and the ground truth bounding box completely overlap, the GIoU degenerates to the IoU, and the GIoU value and IoU value for the three different position cases are 0.45 at this time. The GIoU loss function does not directly reflect the distance between the prediction bounding box and the ground truth bounding box. Therefore, we introduce the CIoU (Complete Intersection over Union)31 loss function to replace the original GIoU loss function in the YOLOv5 algorithm and continue to optimize the prediction bounding box.Therefore, the CIoU is calculated as (10):$$text{GIoU_Loss}=1-text{IoU}-frac{text{|C}-left({text{A}} cup {text{B}}right)text{|}}{text{|C|}}$$
    (10)
    where b and ({text{b}}^{text{gt}}) denote the centroids of the prediction bounding box and the ground truth bounding box, respectively, ({rho}) is the Euclidean distance between the two centroids, and c is the diagonal length of the minimum closed area formed by the prediction bounding box and the ground truth bounding box.(alpha) is the parameter used to balance the scale, and v is the scale consistency used to measure the aspect ratio between the prediction bounding box and the ground truth bounding box, as shown in Eqs. (11) and (12).$$alpha =frac{text{v}}{left(1-text{IoU}right)+{text{v}}^{{prime}}}$$
    (11)
    $$text{v} = frac{4}{{pi}^{2}}{left({text{arctan}}frac{{omega}^{text{gt}}}{{text{h}}^{text{gt}}}- text{arctan}frac{{omega}^{text{p}}}{{text{h}}^{text{p}}}right)}^{2}$$
    (12)
    Therefore, the expression of CIoU_Loss can be obtained according to Eqs. (10), (11) and (12).$$text{CIoU_Loss} =1-text{CIoU}=1-text{IoU}+frac{{rho}^{2}left(text{b,}{text{b}}^{text{gt}}right)}{{text{c}}^{2}}{+ alpha v }$$
    (13)
    Optimization algorithmAfter optimizing the loss function of the network model, the next step is to optimize the hyperparameters of the network model. The function of the optimizer is to adjust the hyperparameters to the most appropriate values while making the loss function converge as much as possible32. In the target detection algorithm, the optimizer is mainly used to calculate the gradient of the loss function and to iteratively update the parameters.The optimizer used in YOLOv5 is stochastic gradient descent (SGD). Since a large number of problems in deep learning satisfy the strict saddle function, all the local optimal solutions obtained are almost as ideal. Therefore, SGD algorithm is not trapped in the saddle point and has strong generality. However, the slow convergence speed and the number of iterations of SGD algorithm are still problems that need to be improved. Adam algorithm has both the first-order momentum in the SGD algorithm and combines the second-order momentum in AdaGrad algorithm and AdaDelta algorithm, Adaptive&Momentum. Adam formula can be described as follows:$${m}_{t}={beta }_{1}{m}_{t-1}+left(1-{beta }_{1}right){g}_{t}$$
    (14)
    $${v}_{t}={beta }_{2}{v}_{t-1}+left(1-{beta }_{2}right){g}_{t}^{2}$$
    (15)
    $${widehat{m}}_{t}=frac{{m}_{t}}{1-{beta }_{1}^{t}}$$
    (16)
    $${widehat{v}}_{t}=frac{{v}_{t}}{1-{beta }_{2}^{t}}$$
    (17)
    where ({beta }_{1}) and ({beta }_{2}) parameters are hyperparameters and g is the current gradient value of the error function, ({m}_{t}) is the gradient of the first-order momentum and ({v}_{t}) is the gradient of the second-order momentum.Adam is an adaptive one-step random objective function optimization algorithm based on a low-order moment. It can replace the traditional first-order optimization algorithm for the stochastic gradient descent process. It is able to update the weights of the neural network adaptively based on the data trained during the iterative process. The Adam optimizer occupies fewer memory resources during the training process and is suitable for solving the problems of sparse gradients and large fluctuations in loss values33. Therefore, we use the Adam optimization algorithm instead of the SGD optimization algorithm to train the network model based on the YOLOv5s network model. The calculation is shown in Table 3.Table 3 Computing method of the Adam optimizer.Full size tablewhere ({alpha}) is a factor controlling the learning rate of the network, ({beta}^{{prime}}) is the exponential decay rate of the first-order moment estimate, ({beta}^{{primeprime}}) is the exponential decay rate of the second-order moment estimate, and ({varepsilon}) is a constant that tends to zero infinitely as the denominator. More

  • in

    Applying genomic approaches to delineate conservation strategies using the freshwater mussel Margaritifera margaritifera in the Iberian Peninsula as a model

    Funk, W. C., McKay, J. K., Hohenlohe, P. A. & Allendorf, F. W. Harnessing genomics for delineating conservation units. Trends Ecol. Evol. 27, 489–496. https://doi.org/10.1016/j.tree.2012.05.012 (2012).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Hohenlohe, P. A., Funk, W. C. & Rajora, O. P. Population genomics for wildlife conservation and management. Mol. Ecol. 30, 62–82. https://doi.org/10.1111/mec.15720 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Helyar, S. J. et al. Application of SNPs for population genetics of nonmodel organisms: New opportunities and challenges. Mol. Ecol. Resour. 11, 123–136. https://doi.org/10.1111/j.1755-0998.2010.02943.x (2011).ADS 
    Article 
    PubMed 

    Google Scholar 
    Allendorf, F. W. Genetics and the conservation of natural populations: Allozymes to genomes. Mol. Ecol. 26, 420–430. https://doi.org/10.1111/mec.13948 (2017).CAS 
    Article 
    PubMed 

    Google Scholar 
    Zimmerman, S. J., Aldridge, C. L. & Oyler-McCance, S. J. An empirical comparison of population genetic analyses using microsatellite and SNP data for a species of conservation concern. BMC Genomics 21, 38. https://doi.org/10.1186/s12864-020-06783-9 (2020).CAS 
    Article 

    Google Scholar 
    Lemopoulos, A. et al. Comparing RADseq and microsatellites for estimating genetic diversity and relatedness—implications for brown trout conservation. Ecol. Evol. 9, 2106–2120. https://doi.org/10.1002/ece3.4905 (2019).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Kleinman-Ruiz, D. et al. Novel efficient genome-wide SNP panels for the conservation of the highly endangered Iberian lynx. BMC Genomics 18, 556. https://doi.org/10.1186/s12864-017-3946-5 (2017).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Geist, J. Strategies for the conservation of endangered freshwater pearl mussels (Margaritifera margaritifera L.): A synthesis of conservation genetics and ecology. Hydrobiologia 644, 69–88. https://doi.org/10.1007/s10750-010-0190-2 (2010).Lopes-Lima, M. et al. Conservation status of freshwater mussels in Europe: State of the art and future challenges. Biol. Rev. 92, 572–607. https://doi.org/10.1111/brv.12244 (2017).Article 
    PubMed 

    Google Scholar 
    Outeiro, A., Ondina, P., Fernández, C., Amaro, R. & Miguel, E. S. Population density and age structure of the freshwater Pearl mussel, Margaritifera margaritifera, in two Iberian rivers. Freshw. Biol. 53, 485–496. https://doi.org/10.1111/j.1365-2427.2007.01913.x (2008).CAS 
    Article 

    Google Scholar 
    Clements, E. A., Thomas, R. & Adams, C. E. An investigation of salmonid host utilisation by the endangered freshwater pearl mussel (Margaritifera margaritifera) in north-west Scotland. Aquat. Conserv.: Mar. Freshw. Ecosyst. 28, 764–768. https://doi.org/10.1002/aqc.2900 (2018).Taeubert, J-E. & Geist, J. The relationship between the Freshwater Pearl Mussel (Margaritifera margaritifera) and its hosts. Biol. Bull. 44, 67–73. https://doi.org/10.1134/S1062359017010149 (2017).Sousa, R. et al. Conservation status of the freshwater pearl mussel Margaritifera margaritifera in Portugal. Limnologica 50, 4–10. https://doi.org/10.1016/j.limno.2014.07.004 (2015).Article 

    Google Scholar 
    Almodóvar, A., Nicola, G. G., Ayllón, D. & Elvira, B. Global warming threatens the persistence of Mediterranean brown trout. Glob. Change Biol. 18, 1549–1560. https://doi.org/10.1111/j.1365-2486.2011.02608.x (2012).ADS 
    Article 

    Google Scholar 
    Nicola, G. G., Elvira, B., Johnson, B., Ayllón, D. & Almodóvar, A. Local and global climatic drivers of Atlantic salmon decline in southern Europe. Fish. Res. 198, 78–85. https://doi.org/10.1016/j.fishres.2017.10.012 (2018).Article 

    Google Scholar 
    da Silva, J. P. et al. Predicting climatic threats to an endangered freshwater mussel in Europe: The need to account for fish hosts. Freshw. Biol. 00, 1–15. https://doi.org/10.1111/fwb.13885 (2022).Article 

    Google Scholar 
    Strayer, D. L., Geist, J., Haag, W. R., Jackson, J. K. & Newbold, J. D. Essay: Making the most of recent advances in freshwater mussel propagation and restoration. Conserv. Sci. Pract. 43, e53. https://doi.org/10.1111/csp2.53 (2009).Article 

    Google Scholar 
    Geist, J., Bayerl, H., Stoeckle, B. C. & Kuehn, R. Securing genetic integrity in freshwater pearl mussel propagation and captive breeding. Sci. Rep. 11, 16019. https://doi.org/10.1038/s41598-021-95614-2 (2021).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Gomes dos Santos, A. et al. The Crown Pearl: a draft genome assembly of the European freshwater pearl mussel Margaritifera margaritifera (Linnaeus, 1758). DNA Res. 28, dsab002. https://doi.org/10.1093/dnares/dsab002 (2021).Bouza, C. et al. Threatened freshwater pearl mussel Margaritifera margaritifera L. in NW Spain: low and very structured genetic variation in southern peripheral assessed using microsatellite markers. Conserv. Genet. 8: 937–948. https://doi.org/10.1007/s10592-006-9248-0 (2007).Stoeckle, B. C. et al. Strong genetic differentiation and low genetic diversity of the freshwater pearl mussel (Margaritifera margaritifera L.) in the southwestern European distribution range. Conserv. Genet. 18, 147–157. https://doi.org/10.1007/s10592-016-0889-3 (2017).Geist, J., Söderberg, H., Karllberg, A. & Kuehn, R. Drainage-independent genetic structure and high genetic diversity of endangered freshwater pearl mussels (Margaritifera margaritifera) in northern Europe. Conserv. Genet. 11, 1339–1350. https://doi.org/10.1007/s10592-009-9963-4 (2010).Article 

    Google Scholar 
    implications for conservation and management. Geist, J. & Kuehn, R. Genetic diversity and differentiation of central European freshwater pearl mussel (Margaritifera margaritifera L.) populations. Mol. Ecol. 14, 239–425. https://doi.org/10.1111/j.1365-294X.2004.02420.x (2005).CAS 
    Article 

    Google Scholar 
    Farrington, S. J., King, R. W., Baker, J. A. & Gibbons, J. G. Population genetics of freshwater pearl mussel (Margaritifera margaritifera) in central Massachusetts and implications for conservation. Aquat. Conserv.: Mar. Freshw. Ecosyst. 30, 1945–1958. https://doi.org/10.1002/aqc.3439 (2020).Zanatta, D. T. et al. High genetic diversity and low differentiation in North American Margaritifera margaritifera (Bivalvia: Unionida: Margaritiferidae). Biol. J. Linn. Soc. Lond., 123, 850–863. https://doi.org/10.1093/biolinnean/bly010. (2018)Garrison, N. L., Johnson, P. D. & Whelan, N. V. Conservation genomics reveals low genetic diversity and multiple parentage in the threatened freshwater mussel Margaritifera hembeli. Conser. Genet. 22, 217–231. https://doi.org/10.1007/s10592-020-01329-8 (2021).Article 

    Google Scholar 
    Roe, K. & Kim, K. S. Genome-wide SNPs redefine species-boundaries and conservation units in the freshwater mussel genus Cyprogenia of North America. Sci. Rep. 11, 10752. https://doi.org/10.1038/s41598-021-90325-0 (2021).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Wingett, S. W. & Andrews, S. FastQ Screen: A tool for multi-genome mapping and quality control. F1000Res 7, 1338. https://doi.org/10.12688/f1000research.15931.2 (2018).Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048. https://doi.org/10.1093/bioinformatics/btw354 (2016).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Rochette, N. C., Rivera-Colón, A. G. & Catchen, J. M. Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics. Mol. Ecol. 28, 4737–4754. https://doi.org/10.1111/mec.15253 (2019).CAS 
    Article 
    PubMed 

    Google Scholar 
    Paris, R. J., Stevens, J. R. & Catchen, J. M. Lost in parameter space: A road map for STACKS. Methods Ecol. Evol. 8, 1360–1373. https://doi.org/10.1111/2041-210X.12775 (2017).Article 

    Google Scholar 
    Limin, F., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: Accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152. https://doi.org/10.1093/bioinformatics/bts565 (2012).CAS 
    Article 

    Google Scholar 
    Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997v2. https://doi.org/10.48550/arXiv.1303.3997 (2013).Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. https://doi.org/10.1093/bioinformatics/btp352 (2009).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. arXiv:1207.3907v2. https://doi.org/10.48550/arXiv.1207.3907 (2012)Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158. https://doi.org/10.1093/bioinformatics/btr330 (2011).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Excoffier, L. & Lischer, H. E. L. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Resour. 10, 564–567. https://doi.org/10.1111/j.1755-0998.2010.02847.x (2010).Weir, B. S. & Cockerham, C. Estimating F-statistics for the analysis of population structure. Evol. 38, 1358–1370. https://doi.org/10.2307/2408641 (1984).CAS 
    Article 

    Google Scholar 
    Frichot, E. & François, O. LEA: An R package for landscape and ecological association studies. Methods Ecol. Evol. 6, 925–929. https://doi.org/10.1111/2041-210X.12382 (2015).Article 

    Google Scholar 
    Alexander, D. H, Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. https://doi.org/10.1101/gr.094052.109 (2009).Jakobsson, M. & Rosenberg, N. A. CLUMPP: A cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806. https://doi.org/10.1093/bioinformatics/btm233 (2007).CAS 
    Article 
    PubMed 

    Google Scholar 
    Frankham, R. et al. A practical guide for genetic management of fragmented animal and plant populations. Oxford University Press, New York. 174. https://doi.org/10.1093/oso/9780198783411.001.0001 (2019).Wacker, S., Larson, B., Jakobsen, P. & Karlssona, S. Multiple paternity promotes genetic diversity in captive breeding of a freshwater mussel. Glob. Ecol. Cons. 17, e00564. https://doi.org/10.1016/j.gecco.2019.e00564 (2019).Article 

    Google Scholar 
    Cao, R. et al. Genetic structure and diversity of Australian freshwater crocodiles (Crocodylus johnstoni) from the Kimberley Western Australia. Conserv. Genet. 21, 421–429. https://doi.org/10.1007/s10592-020-01259-5 (2020).Article 

    Google Scholar 
    Escalante, M. A. et al. Genotyping-by-sequencing reveals the effects of riverscape, climate and interspecific introgression on the genetic diversity and local adaptation of the endangered Mexican Golden trout (Oncorhynchus chrysogaster). Conserv. Genet. 21, 907–926. https://doi.org/10.1371/journal.pone.0141775 (2020).CAS 
    Article 

    Google Scholar 
    Bauer, G. Reproductive strategy of the freshwater pearl mussel Margaritifera margaritifera. J. Anim. Ecol. 56, 691–704. https://doi.org/10.2307/5077 (1987).Article 

    Google Scholar 
    Machordom, A., Araujo, R., Erpenbeck, D. & Ramos, M. A. Phylogeography and conservation genetics of endangered European Margaritiferidae (Bibalvia: Unionoidae). Biol. J. Linn. Soc. 78, 235–252. https://doi.org/10.1046/j.1095-8312.2003.00158.x (2003).Article 

    Google Scholar 
    Viveen, W., Schoorl, J. M., Veldkamp, A., van Balen, R. T. & Vidal-Romani, J. R. Fluvial terraces of the northwest Iberian lower Miño River. J. Maps 9, 513–522. https://doi.org/10.1080/17445647.2013.821096 (2013).Article 

    Google Scholar 
    Pérez-Granados, C., López-Iborra, G. & Seoane, J. A multi-scale analysis of habitat selection in peripheral populations of the endangered Dupont’s Lark Chersophilus duponti. Bird Conserv. Intern. 27, 398–413. https://doi.org/10.1017/S0959270916000356 (2017).Article 

    Google Scholar 
    Sanz Ball-Llosera, N., Garcìa-Marìn, J. & Pla, C. Managing fish populations under mosaic relationships. The case of brown trout (Salmo trutta) in peripheral Mediterranean populations. Conserv. Genet. 3, 385–400. https://doi.org/10.1023/A:1020527420654 (2002).Vila, M. et al. Phylogeography and Conservation Genetics of the Ibero-Balearic Three-Spined Stickleback (Gasterosteus aculeatus). PLoS One 12, e0170685. https://doi.org/10.1371/journal.pone.0170685 (2017)Hamed, Y. et al. Climate impacto n Surface and groundwater in North Africa: A global synthesis of findings and recommendations. Euro-Mediterr. J. Environ. Integr. 3, 25. https://doi.org/10.1007/s41207-018-0067-8 (2018).Article 

    Google Scholar 
    Krijgsman, W. et al. The Gibraltar Corridor: Watergate of the Messinian Salinity Crisis. Mar. Geol. 403, 238–246. https://doi.org/10.1016/j.margeo.2018.06.008 (2018).ADS 
    Article 

    Google Scholar 
    Zanatta, D. T. & Wilson, C. C. Testing congruency of geographic and genetic population structure for a freshwater mussel Bivalvia: Unionoida) and its host fish. Biol. J. Linn. Soc. 102, 669–685. https://doi.org/10.1111/j.1095-8312.2010.01596.x (2011).Article 

    Google Scholar 
    Österling, E. M., Ferm, J. & Piccolo, J.J. Parasitic freshwater pearl mussel larvae (Margaritifera margaritifera L.) reduce the drift-feeding rate of juvenile brown trout (Salmo trutta L.). Environ. Biol. Fish. 97, 543–549. https://doi.org/10.1007/s10641-014-0251-x (2014).Geist, J. et al. Genetic structure of Irish freshwater pearl mussels (Margaritifera margaritifera and Margaritifera durrovensis): Validity of subspecies, roles of host fish, and conservation implications. Aquat. Conserv. Mar. Freshw. Ecosyst. 28, 923–933. https://doi.org/10.1002/aqc.2913 (2018)Wacker, S., Larsen, B. M., Karlsson, S. & Hindar, K. Host specificity drives genetic structure in a freshwater mussel. Sci. Rep. 9, 10409 (2019).Machordom, A., Suárez, J., Almodóvar, A. & Bautista, J. Mitochondrial haplotype variation and phylogeography of Iberian brown trout populations. Mol. Ecol. 9, 1325–1338. https://doi.org/10.1046/j.1365-294x.2000.01015.x (2000).CAS 
    Article 

    Google Scholar 
    Suárez, J., Bautista, J. M., Almodóvar, A. & Machordom, A. Evolution of the mitocondrial control region in Paleartic brown trout (Salmo trutta) populations: The biogeographical role of the Iberian Peninsula. Heredity 87, 198–206. https://doi.org/10.1046/j.1365-2540.2001.00905.x (2001).Article 
    PubMed 

    Google Scholar 
    Velasco, J. C. et al. Descubiertos algunos ejemplares de Margaritifera margaritifera (L.) (Bivalvia, Unionoida) en el alto Duero (Soria, España). Iberus 32(2), 97–104 (2014).Geist, J. & Kuehn, R. Host-parasite interactions in oligotrophic stream ecosystems: the roles of life history strategy and ecological niche. Mol. Ecol. 17, 997–1008. https://doi.org/10.1111/j.1365-294X.2007.03636.x. (2008)Ledoux, J.-B., et al. Gradients of genetic diversity and differentiation across the distribution range of a Mediterranean coral: Patterns, processes and conservation implications. Divers. Distrib. 27, 2104–2123 https://doi.org/10.1111/ddi.13382 (2021).Hervella F, & Caballero P. Inventario piscícola dos ríos galegos. Consellería de Medio Ambiente. Xunta de Galicia. Santiago de Compostela (1999).Saura, M., Caballero, P. & Morán, P. Are there Atlantic salmon in the River Tambre?. J. Fish Biol. 72, 1223–1229. https://doi.org/10.1111/j.1095-8649.2007.01782.x (2008).Article 

    Google Scholar 
    Hoban, S. et al. Genetic diversity targets and indicators in the CBD post-2020 global biodiversity framework must be improved. Biol. Conserv. 248, 108654. https://doi.org/10.1016/j.biocon.2020.108654 (2020).Article 

    Google Scholar 
    Rilov, G. et al. Adaptive marine conservation planning in the face of climate change: What can we learn from physiological, ecological and genetic studies?. Glob. Ecol. Conserv. 17, e00566. https://doi.org/10.1016/j.gecco.2019.e00566 (2019).Article 

    Google Scholar 
    Muniz, F. L. et al. Delimitation of evolutionary units in Cuvier’s dwarf caiman, Paleosuchus palpebrosus (Cuvier, 1807): Insights from conservation of a broadly distributed species. Conserv. Genet. 19, 599–610. https://doi.org/10.1007/s10592-017-1035-6 (2018).Article 

    Google Scholar 
    Gum, B., Lange, M. & Geist, J. A critical reflection on the success of rearing and culturing of juvenile freshwater mussels with a focus on the endangered freshwater pearl mussel (Margaritifera margaritifera L.). Aquat. Conserv. 21, 743–751. https://doi.org/10.1002/aqc.1222 (2011).Thomas, G. R., Taylor, J. & García de Leaniz, C. Captive breeding of the endangered freshwater Pearl mussel Margaritifera margaritifera. Endanger. Species Res. 12, 1–9. https://doi.org/10.3354/esr00286 (2010).Wilson, C. D. et al. The importance of population genetic information in formulating ex situ conservation strategies for the freshwater pearl mussel (Margaritifera margaritifera L.) in Northern Ireland. Anim. Conserv. 15, 595–602. https://doi.org/10.1111/j.1469-1795.2012.00553.x (2012).Pires, D., Reis, J., Benites, L. & Rodrigues, P. Minimizing dams impacts on biodiversity by way of translocations: the case of freshwater mussels. Impact Assess. Proj. Apprais. 39, 110–117. https://doi.org/10.1080/14615517.2020.1836710. (2021) More

  • in

    Unique H2-utilizing lithotrophy in serpentinite-hosted systems

    Serpentinite-hosted systems are rare and extreme habitats in which a hydrothermal process, serpentinization, alters ultramafic mantle rocks and yields hyperalkaline fluid rich in molecular hydrogen (H2) and reduced one-carbon compounds [1,2,3,4,5,6,7,8]. These fluids are often electron acceptor depleted—oxygen, nitrate, sulfate, etc. are absent (i.e., anoxic) and even the least favorable exogenous acceptor, carbon dioxide (CO2), is limiting due to the high alkalinity. Though previous studies explore the diversity of organisms in serpentinite-hosted systems, we have little insight into how indigenous H2-utilizing microorganisms combat the unique metabolic challenges in situ. One recent study shows strategies that methane-generating archaea employ to oxidize H2 in situ [9], but how other microorganisms (i.e., H2-utilizing anaerobic bacteria) overcome the electron acceptor limitation is poorly understood. Further, given that life is theorized to have emerged as H2-utilizing lithotrophs in early Earth serpentinite-hosted systems [10,11,12,13,14,15,16,17,18,19,20,21,22,23,24], modern lithotrophs inhabiting such ecosystems may represent valuable extant windows into the metabolism of primordial organisms. In this study, we pair metagenomics and thermodynamics to characterize uncultured putative anaerobic H2 utilizers inhabiting alkaline H2-rich serpentinite-hosted systems (Hakuba Happo hot springs in Hakuba, Japan, and The Cedars springs in California, USA; pH ~10.9 and ~11.9, respectively [25,26,27]) and elucidate novel, potentially ancient, lithotrophic strategies.Thermodynamics and geochemistryThe two primary strategies for utilizing H2 under anoxic conditions without favorable exogenous electron acceptors are methanogenesis and homoacetogenesis. As bacteria were detected in both Hakuba and The Cedars, yet archaea were absent in Hakuba, we focused our analyzes on metabolic strategies supporting bacterial H2 utilization (i.e., homoacetogenesis). To evaluate whether homoacetogenesis is viable in situ, we examined the in situ geochemical environment and the thermodynamics of H2/formate utilization and homoacetogenesis. The spring waters of both Hakuba and The Cedars contained H2 (e.g., 201–664 μM in Hakuba [27]). Formate, another compound thought to be abiotically generated through serpentinization, was also detected in Hakuba (8 μM in drilling well #3 [28]) and The Cedars (6.9 µM in GPS1). Acetate has also been detected in situ (4 μM in Hakuba [28] and 69.3 µM in The Cedars GPS1), suggesting these ecosystems may host novel H2- and/or formate-utilizing homoacetogens. Thermodynamic calculations using newly measured and published geochemical data (Tables S1 and S2) confirmed that H2 and formate are reductants in situ (i.e., H2 = 2H+ + 2e−/Formate− = H+ + CO2 + 2e−): the Gibbs free energy yields (∆G) for oxidation (coupled with physiological electron carriers NADP+, NAD+, and ferredoxin) are less than −4.78 kJ per mol H2 and −24.92 kJ per mol formate in Hakuba, and −10.73 and −22.03 in The Cedars respectively (H2 concentration was not available for The Cedars so the highest concentration observed in Hakuba [664 µM] was used; see Supplementary Results). However, serpentinite-hosted systems impose a unique challenge to homoacetogenesis—a key substrate, CO2, is at extremely low concentrations due to the high alkalinity. We estimate that the aqueous CO2 concentration is below 0.0006 nM in Hakuba (pH 10.7 and  +24.92 or +22.02 kJ per mol formate). Thus, catabolic reduction of CO2 to acetate is thermodynamically challenging in situ and may only run if investing ATP (e.g., Calvin–Benson–Bassham cycle [−6 ATP; ∆G of −361.68 kJ per mol acetate in Hakuba] or reductive tricarboxylic acid [−1 ATP; −61.68 kJ per mol]). Under CO2 limitation, autotrophs are known to accelerate CO2 uptake through HCO3− dehydration to CO2 (carbonic anhydrase) or carbonate mineral dissolution, but both only modify kinetics and are not effective in changing the maximum CO2 concentration (determined by equilibrium with carbonate species). In addition, in Hakuba, the CO32− concentration is too low (84.7 nM CO32−) to cause carbonate mineral precipitation (e.g., [CO32−] must exceed 38.5 µM given Ks of 5 ×  10−9 for CaCO3 and [Ca2+] of 0.13 mM).Based on thermodynamic calculations, the energy obtainable from H2/CO2-driven homoacetogenesis is too small to support life in many serpentinite-hosted systems, yet acetate is detected in some of these ecosystems (Fig. S2 and Table S2; note that we cannot exclude the possibility that acetate may be produced abiotically by water–rock reactions [30]). Thus, CO2-independent electron-disposing metabolism may have been necessary for extremophilic organisms to gain energy from H2 in the hyperalkaline fluids of hydrothermal systems. Here, we explore the metabolic capacities of organisms living in serpentinite-hosted systems to gain insight into potential metabolic strategies for utilizing H2 under the extreme conditions in situ.Diverse putative H2- and formate-utilizing organismsThrough metagenomic exploration of the two serpentinite-hosted systems (Table S3), we discover a plethora of phylogenetically novel organisms encoding genes for H2 and formate metabolism (19 bins with 73.2–94.8% completeness and 0.0–8.1% contamination [86.1% and 3.8% on average respectively]; available under NCBI BioProject PRJNA453100) despite challenges in acquisition of genomic DNA (15.7 and 18.9 ng of DNA from 233 and 720 L of filtered Hakuba Happo spring water, respectively; RNA was below the detection limit). We find metagenome-assembled genomes (MAGs) affiliated with lineages of Firmicutes (e.g., Syntrophomonadaceae and uncultured family SRB2), Actinobacteria, and candidate division NPL-UPA2 [31] (Fig. S3). We also recovered MAGs for a novel lineage, herein referred to as “Ca. Lithacetigenota”, that inhabits both Hakuba and The Cedars and, to our knowledge, no other ecosystems (Figs. 1, 2a, S3, and S4). The average amino acid identity (AAI) between Ca. Lithacetigenota and neighboring phyla (Coprothermobacterota, Dictyoglomi, Thermodesulfobiota [GTDB-defined phylum], Thermotogae, and Caldiserica) was comparable to the average interphylum AAI among the neighboring phyla (45.33 ± 0.86% vs 45.17 ± 0.99%), suggesting that Ca. Lithacetigenota represents a novel phylum-level lineage (Fig. 2a, b). These genomes encode enzymes for oxidizing H2 and formate (i.e., hydrogenases and formate dehydrogenases [32,33,34,35,36,37,38,39]; see Supplementary Results), suggesting that organisms in situ can employ H2 and formate as electron donors.Fig. 1: Ribosomal protein tree including high-quality MAGs from 74 GTDB-defined phylum-level lineages.Representative genomes (highest quality based on a score defined as completeness – 5*contamination, both estimated by CheckM) were chosen for bacterial classes that contain at least one genome the meet the following criteria: (i) cultured organisms with ≥90% completeness, ≤5% contamination (as estimated by CheckM), and ≤ 20 contigs; (ii) uncultured organisms with ≥85% completeness, ≤3% contamination, and ≤20 contigs; and (iii) Ca. Patescibacteria with ≥60% completeness and ≤1 contig. Universally conserved ribosomal proteins were collected from each genome, aligned with MAFFT v7.394, trimmed with BMGE v1.12 (-m BLOSUM30 -g 0.67 -b 3), and concatenated. A maximum likelihood tree was calculated using IQ-TREE v2.1.3 with the UDM0064LCLR model (-m Poisson+UDM0064LCLR), ultrafast bootstrap approximation, and SH-like approximate likelihood ratio test (-B 1000 -alrt 1000; bootstrap values are recalculated with BOOSTER using the -tbe option). Branches with ≥90% ultrafast bootstrap support and ≥80% SH-alrt support are indicated with black circles. Archaeal and eukaryotic genomes were used as an outgroup. The inter-domain branch was shortened with a break to 1/10 of the calculated length for illustrative purposes. Phylogenetic groups corresponding to “Gracilicutes” and “Terrabacteria” are indicated yellow and blue respectively. Ca. Lithacetigenota are highlighted (magenta). See Supplementary Fig. S4 for full tree.Full size imageFig. 2: “Ca. Lithacetigenota” phylogeny, lithotrophic acetate generation pathways, and comparative genomics with neighboring phyla.a A maximum likelihood tree was calculated for a concatenated alignment of universally conserved ribosomal protein sequences from representative genomes of individual phyla (aligned with MAFFT v7.394 [default parameters] and trimmed with BMGE v1.12 (−m BLOSUM30 −g 0.67 −b 3) using IQ-TREE v2.1.3 with the UDM0064LCLR model (-m Poisson+UDM0064LCLR), ultrafast bootstrap approximation, and SH-like approximate likelihood ratio test (-B 1000 -alrt 1000; bootstrap values are recalculated with BOOSTER using the -tbe option). Branches with ≥90% ultrafast bootstrap support and ≥80% SH-alrt support are indicated with black circles. Phylum names are shown for NCBI taxonomy (italicized) or GTDB classification (*). b The average inter-phylum AAI (as calculated by CompareM) was calculated using GTDB species representatives. c Putative metabolic pathways potentially adapted to the CO2-limited hyperalkaline conditions encoded by “Ca. Lithacetigenota” members and others: formate- and glycine-reducing acetate generation. Arrow colors indicate oxidative (pink), reductive (blue), ATP-yielding (orange), and ATP-consuming (green) steps. d Venn diagram of COGs/NOGs (as predicted by eggnog-mapper) fully conserved across all members of each phylum (genomes included in GTDB release 95 with completeness ≥85% and contamination ≤5%). COGs/NOGs related to lithotrophy and alkaliphily are highlighted. * “COG” abbreviated.Full size image“Ca. Lithacetigenota” has unique site-adapted metabolismInspection of the serpentinite-hosted environment-exclusive phylum “Ca. Lithacetigenota” reveals specialization to H2-driven lithotrophy potentially suitable for the low-CO2 in situ conditions (Fig. 2c). We discover that The Cedars-inhabiting population (e.g., MAG BS5B28, 94.8% completeness and 2.9% contamination) harbors genes for H2 oxidation ([NiFe] hydrogenase Hox), a nearly complete Wood-Ljungdahl pathway, and an oxidoreductase often associated with acetogenesis—NADH:ferredoxin oxidoreductase Rnf [40, 41] (Tables S4 and S5). One critical enzyme, the formate dehydrogenase, is missing from all three “Ca. Lithacetigenota” MAGs from The Cedars (and unbinned contigs), indicating that these bacteria can neither perform H2/CO2-driven nor formate-oxidizing acetogenesis (Fig. 2c). However, even without the formate dehydrogenase, the genes present can form a coherent pathway that uses formate rather than CO2 as a starting point for the “methyl branch” of the Wood–Ljungdahl pathway (i.e., formate serves as an electron acceptor; Fig. 2c). This is a simple yet potentially effective strategy for performing homoacetogenesis while circumventing the unfavorable reduction of CO2 to formate. Coupling H2 oxidation with this formate-reducing pathway is thermodynamically viable as it halves the usage of CO2 (3H2 + Formate− + CO2 = Acetate− + 2H2O; ∆G of −29.62 kJ per mol acetate) and, as a pathway, is simply an intersection between the conventional H2/CO2-driven and formate-disproportionating acetogenesis (Fig. 2c and S5). Although use of formate as an electron acceptor for formate-oxidizing acetogenesis is quite common, no previous homoacetogens have been observed to couple H2 oxidation with acetogenesis from formate, likely because CO2 has a much higher availability than formate in most ecosystems.The Hakuba-inhabiting “Ca. Lithacetigenota” (HKB210 and HKB111) also encodes Hox for H2 oxidation but lacks genes for homoacetogenesis (no homologs closely related to The Cedars population genes were detected even in unbinned metagenomic contigs). We suspect that this population forgoes the above H2/formate-driven homoacetogenesis because the estimated energy yield of the net reaction in situ (∆G of −19.94 kJ per mol acetate) is extremely close to the thermodynamic threshold of microbial catabolism (slightly above −20 kJ per mol) and, depending on the actual threshold for “Ca. Lithacetigenota” and/or even slight changes in the surrounding conditions (e.g., ∆G increases by 1 kJ per mol if H2 decreases by 20 µM decreases in Hakuba), the metabolism may be unable to recover energy. Through searching the physicochemical environment for alternative exogenous electron acceptors and MAGs for electron-disposing pathways, we detected a low concentration of glycine in situ (5.4 ± 1.6 nM; Table S6) and found genes specific to catabolic glycine reduction (see next paragraph). We suspect that some portion of this glycine is likely geochemically generated in situ, given that (a) glycine is often detected as the most abundant amino acid produced by both natural and laboratory-based serpentinization (e.g., H2 + Formate = Formaldehyde ⇒ Formaldehyde + NH3 = Glycine) [10, 16, 42,43,44,45,46,47] and (b) no other amino acid was consistently detectable (if glycine was cell-derived, other amino acids ought to also be consistently detected).For utilization of the putatively abiotic glycine, the Hakuba “Ca. Lithacetigenota” encodes glycine reductases (Grd; Fig. 3 and S6; Tables S4 and S5)—a unidirectional selenoprotein for catabolic glycine reduction [48, 49]. Based on the genes available, this population likely specializes in coupling H2 oxidation and glycine reduction (H2 + Glycine− → Acetate− + NH3; Fig. 2c). Firstly, the genomes encode NADP-linked thioredoxin reductases (NADPH + Thioredoxinox → NADP+ + Thioredoxinred) that can bridge electron transfer from H2 oxidation (H2 + NADP → NADPH + H+) to glycine reduction (Glycine− + Thioredoxinred → Acetyl-Pi + NH3 + Thioredoxinox). Secondly, though glycine reduction is typically coupled with amino acid oxidation (i.e., Stickland reaction in Firmicutes and Synergistetes [48, 50]), similar metabolic couplings have been reported for some organisms (i.e., formate-oxidizing glycine reduction [via Grd] [51] and H2-oxidizing trimethylglycine reduction [via Grd-related betaine reductase] [52]). Thirdly, Grd is a rare catabolic enzyme, so far found in organisms that specialize in amino acid (or peptide) catabolism, many of which are reported to use glycine for the Stickland reaction (e.g., Peptoclostridium of Firmicutes and Aminobacterium of Synergistetes [53]). Lastly, the population lacks any discernable fermentative (propionate [methylmalonyl-CoA pathway], butyrate [reverse beta oxidation], lactate [lactate dehydrogenase], and alanine [alanine dehydrogenase]) and respiratory (aerobic [terminal oxidases], nitrate [nitrate reductase, nitrite reductase, nitric oxide reductase, nitrous oxide reductase], sulfate [dissimilatory sulfate reductase and sulfite reductase], other sulfurous compounds [molybdopterin-binding protein family sulfurous compound reductases], and metals [outer membrane cytochrome OmcB]) electron disposal pathways and oxidative organotrophy (Tables S4 and S5). Although the BS5B28 genome encodes a bifunctional alcohol/aldehyde dehydrogenase and aldehyde:ferredoxin oxidoreductase, no complete sugar or amino acid degradation pathways could be identified, suggesting that these genes have a physiological role unrelated to ethanol fermentation. Further, though formate and glycine transporters were absent in the genomes, a survey of transporters (annotated in UniProtKB 2021_03 [54]) revealed that no alkaliphiles (organisms with optimum pH ≥ 9.5 in the DSMZ BacDive database [55]) encoded known formate transporters (focA; TIGR04060) or amino acid permeases (PF00324) (ABC transporters were not considered as substrate specificity for these complexes cannot be annotated reliably), indicating that alkaliphiles likely employ unknown transport proteins. Reflecting the lack of other catabolic pathways, the Hakuba “Ca. Lithacetigenota” MAGs display extensive genome streamlining, comparable to that of Aurantimicrobium [56, 57], “Ca. Pelagibacter” [58], and Rhodoluna [59] in aquatic systems, as also reported for other organisms inhabiting serpentinite-hosted systems [60, 61] (Fig. S7). Thermodynamic calculations show that H2-oxidizing glycine reduction is favorable in situ (∆G°’ of −70.37 kJ per mol glycine [∆G of −85.84 in Hakuba]; Fig. S1). Further, based on the pathway identified, this putative metabolism is >10 times more efficient in recovering energy from H2 (1 mol ATP per mol H2) than acetogenesis utilizing H2/CO2 (0.075 mol ATP per mol H2 based on the pathway Acetobacterium woodii utilizes) or H2/formate (0.075 mol ATP per mol H2, assuming no energy recovery associated with the formate dehydrogenase). We also detect glycine reductases in The Cedars “Ca. Lithacetigenota”, indicating that it may also perform this metabolism (∆G of −76.87 in The Cedars, assuming 201 µM H2).Fig. 3: Evolution and distribution of glycine reductases.a Phylogeny of serpentinite-hosted microbiome glycine reductase subunit GrdBE homologs (Hakuba Happo hot spring*, The Cedars springs†, and other serpentinite-hosted system metagenomes#) and a brief scheme for evolutionary history of Grd. Grd-related COG1978 homologs were collected from the representative species genomes in GTDB, filtered using a GrdB motif conserved across members of phyla known to perform glycine-reducing Stickland reaction (see Methods and Supplementary Fig. S6) and clustered with 75% amino acid sequence similarity using CD-HIT (-c 0.75). GrdB-related sarcosine reductase subunits were excluded by identification of a GrdF motif conserved across sequences that form a distinct cluster around the biochemically characterized Peptoclostridium acidaminophilum GrdF. GrdE neighboring GrdB were collected. D-proline reductase subunits PrdBA (homologous to GrdB and GrdE respectively) was used as an outgroup. GrdB+PrdB and GrdE+PrdA were aligned (MAFFT v7.394) and trimmed (BMGE v1.12 -m BLOSUM30 -g 0.05) separately, then concatenated. A maximum likelihood tree was calculated using IQ-TREE v2.1.3 (-m LG+C20+G+F) and 1000 ultrafast bootstrap replicates (bootstrap values are recalculated with BOOSTER). Branches with ≥95% ultrafast bootstrap support are indicated with pink circles. Serpentinite-hosted system-derived sequences are shown in blue and taxa that may have gained GrdB through horizontal transfer are shown in green. Though the GrdB motif did not match, the closest (and only) detectable archaeal homolog (COG1978) identified in Ca. Bathyarchaeota is included. An axis break is used for the branch connecting GrdBE (and the Ca. Bathyarchaeota homolog) and outlier PrdBA for readability (10% of actual length). See Supplementary Fig. S6 for complete tree and full branch length between GrdBE and PrdBA. In the brief scheme of Grd evolution (top left), the cladogram topology is based on Fig. S4. Vertical transfer (red lines in cladogram) and horizontal transfer (black arrows) inferred from tree structures are shown. Phyla that may have acquired Grd vertically (red) and horizontally (gray) are indicated. GTDB phyla belonging to Firmicutes were grouped together. * GTDB-defined phylum-level lineage nomenclature. b Number of glycine reductase-encoding GTDB-defined species representatives (GTDB r95) associated with different environments. Only genomes with both GrdB and GrdE were included.Full size imageGiven the phylogenetic and metabolic uniqueness of these populations, we report provisional taxonomic assignment to “Ca. Lithacetigenota” phyl. nov., “Ca. Lithacetigena glycinireducens” gen. nov., sp. nov. (HKB111 and HKB210), and “Ca. Psychracetigena formicireducens” gen. nov., sp. nov. (BS525, BS5B28, and GPS1B18) (see Supplementary Results). Based on a concatenated ribosomal protein tree, this serpentinite-hosted ecosystem-associated candidate phylum is closely related to the deepest-branching group of bacterial phyla in “Terrabacteria”, one of the two major of lineages Bacteria (Fig. 1). Comparative genomics shows that “Ca. Lithacetigenota” shares 623 core functions (based on Bacteria-level COGs/NOGs predicted by eggnog-mapper shared by the two highest quality Hakuba and The Cedars MAGs HKB210 and BS5B28; Fig. 2d). When compared with the core functions of two closest related phyla (Caldiserica and Coprothermobacterota), 176 functions were unique to “Ca. Lithacetigenota”, including those for NiFe hydrogenases (and their maturation proteins), selenocysteine utilization (essential for Grd), and sodium:proton antiporter for alkaliphily. With Coprothermobacterota, 232 functions were shared, including Grd, thioredoxin oxidoreductase (essential for electron transfer to Grd), and additional proteins for NiFe hydrogenases and selenocysteine utilization, pointing toward importance of H2 metabolism and glycine reduction for these closely related phyla. More importantly, among bacterial phyla in the deep-branching group, “Ca. Lithacetigenota” represents the first lineage inhabiting hyperalkaliphilic serpentinite-hosted ecosystems, suggesting that these organisms may be valuable extant windows into potential physiologies of primordial organisms who are thought to have lived under hyperalkaline conditions (albeit with 4 billion years of evolution in between; see discussion regarding Grd below).Widespread glycine reduction in serpentinite-hosted systemsUncultured members of Chloroflexi (Chloroflexota) class Dehalococcoidia inhabiting The Cedars and Firmicutes (Firmicutes_D) class SRB2 in Hakuba and The Cedars also possess glycine reductases (Table S5). In addition, these populations encode hydrogenases and formate dehydrogenases, suggesting that they may also link H2 and formate metabolism to glycine reduction. Closely related glycine reductases were also detected in other studied serpentinite-hosted systems (47–94% amino acid similarity in Tablelands, Voltri Massif, and Coast Range Ophiolite) [1, 2, 7]. Phylogenetic analysis of the glycine-binding “protein B” subunits GrdB and GrdE reveals close evolutionary relationships between glycine reductases from distant/remote sites (Fig. 3a and S6). Note that Tablelands spring glycine reductase sequences were not included in the analysis as they were only detected in the unassembled metagenomic reads (4460690.3; 69.7–82.2% similarity to Hakuba SRB2). Overall, “Ca. Lithacetigenota”, Dehalococcoidia, and SRB2 glycine reductases are all detected in at least two out of the seven metagenomically investigated systems despite the diverse environmental conditions (e.g., temperature). Thus, we propose glycine as an overlooked thermodynamically and energetically favorable electron acceptor for H2 oxidation in serpentinite-hosted systems. We suspect that glycine reduction may be a valuable catabolic strategy as the pathway requires few genes/proteins (a hydrogenase, Grd, acetate kinase, and thioredoxin oxidoreductase) and conveniently provides acetate, ammonia, and ATP as basic forms of carbon, nitrogen, and energy.Phylogenetic analysis of glycine reductases (Fig. 3a and S6) shows that the novel homologs recovered from serpentinite-hosted systems represent deep-branching lineages distantly related from those detectable in published genomes (GTDB r95 species representatives). Further comparison of the topology with a ribosomal protein-based genome tree (Fig. 2a) indicates that the two deep-branching serpentinite-hosted system-affiliated lineages (Ca. Lithacetigenota and novel Chloroflexi family) and Firmicutes vertically inherited glycine reductases. Thus, catabolic glycine reduction can be traced back to the concestor of these three lineages, suggesting the metabolism at least dates back to the ancestor of “Terrabacteria”. We further identified an archaeal GRD homolog (in Miscellaneous Crenarchaeota Group [MCG] or Ca. Bathyarchaeota member BA-1; Fig. 3a and S6), but whether this gene functions as a glycine reductase (GrdB motif not fully conserved) and, further, truly belong to this clade (source is metagenome-assembled genome) remains to be verified. Reconstruction of the ancestral Grd sequence and estimation of its pH preference (via AcalPred) showed that the ancestral enzyme likely had good efficiency under alkaline conditions (pH  > 9; probability of 0.9973 and 0.9858 for GrdB and GrdE, respectively). Thus, the currently available data suggest that Grd (and catabolic glycine reduction) is an ancient bacterial catabolic innovation in an alkaline habitat, dating back to one of the deepest nodes in the bacterial tree.While we detect glycine reductases in many serpentinite-hosted systems, examination of genomes derived from other natural ecosystems shows that only 107 species (species representatives in GTDB r95; 0.35% of all GTDB species) inhabiting such habitats encode GrdBE (Fig. 3b and S6). This is a level comparable to rare artificial contaminant-degrading enzymes (e.g., tetrachloroethane dehalogenase pceA—65 species [encoding KEGG KO K21647 based on AnnoTree with GTDB r95 and default settings [62]]; dibenzofuran dioxygenase—258 species [K14599 and K14600]). Most glycine reductase homologs are found in species affiliated with host-associated (mostly human body and rumen) or artificial habitats (360 species), the majority of which belong to the phylum Firmicutes. We suspect that glycine reduction has low utility in most natural ecosystems (e.g., no excess glycine via abiotic generation and no severe nutrient/electron acceptor limitation) and has been repurposed by some anaerobes for the fermentative Stickland reaction in organic-rich ecosystems (e.g., host-associated ecosystems) where excess amino acids are available but access to favorable electron acceptors is limited (Fig. 3b) (notably, glycine is the dominant amino acid in collagen [ >30%], the most abundant protein in vertebrate bodies).Other characteristics of putative indigenous homoacetogensIn contrast with members of “Ca. Lithacetigenota”, several other putative homoacetogenic populations encode the complete Wood–Ljundgahl pathway (Tables S4 and S5), indicating that other forms of acetogenesis may also be viable in situ. One putative homoacetogen in The Cedars, NPL-UPA2, lacks hydrogenases but encodes formate dehydrogenases. Although the NPL-UPA2 population cannot perform H2/formate-driven acetogenesis, it may couple formate oxidation with formate-reducing acetogenesis—another thermodynamically viable metabolism (∆G of –50.90 kJ per mol acetate in The Cedars; Fig. S5). The pathway uses CO2 as a substrate but has lower CO2 consumption compared to H2/CO2 homoacetogenesis and can produce intracellular CO2 from formate. A recent study also points out that methanogens inhabiting serpentinite-hosted environments oxidize formate presumably to generate intracellular CO2 [9]. In Hakuba, an Actinobacteria population affiliated with the uncultured class UBA1414 (MAG HKB206) encodes hydrogenases and a complete Wood–Ljungdahl pathway (Table S5) and, thus, may be capable of H2/formate or the above formate-disproportionating acetogenesis (Fig. 2c and S5). Indeed, the UBA1414 population was enriched in Hakuba-derived cultures aiming to enrich acetogens using the H2 generated by the metallic iron–water reaction [63] (Fig. S8). Many populations encoding a complete Wood–Ljungdahl pathway possess monomeric CO dehydrogenases (CooS unassociated with CODH/ACS subunits; NPL-UPA2, Actinobacteria, Syntrophomonadaceae [Hakuba and The Cedars], and Dehalococcoidia [The Cedars]; Table S4). Although CO is below the detection limit in Hakuba (personal communication with permission from Dr. Konomi Suda), another study shows that CO metabolism takes place in an actively serpentinizing system with no detectable CO [64]. Given that CO is a known product of serpentinization [7, 64], it may be an important substrate for thermodynamically favorable acetogenesis in situ. However, further investigation is necessary to verify this (e.g., need to measure CO at multiple time points).Another interesting adaptation observed for all putative homoacetogens detected in Hakuba and The Cedars was possession of an unusual CODH/ACS complex. Although Bacteria and Archaea are known to encode structurally distinct forms of CODH/ACS (designated as Acs and Cdh respectively for this study), all studied Hakuba/The Cedars putative homoacetogens encode genes for a hybrid CODH/ACS that integrate archaeal subunits for the CO dehydrogenase (AcsA replaced with CdhAB) and acetyl-CoA synthase (AcsB replaced with CdhC) and bacterial subunits for the corrinoid protein and methyltransferase components (AcsCDE) (Table S4). The Firmicutes lineages also additionally encode the conventional bacterial AcsABCDE. Given that all of the identified putative homoacetogens encode this peculiar hybrid complex, we suspect that such CODH/ACS’s may have features adapted to the high-pH low-CO2 conditions (e.g., high affinity for CO2 and/or CO). In agreement, a similar hybrid CODH/ACS has also been found in the recently isolated “Ca. Desulforudis audaxviator” inhabiting an alkaline (pH 9.3) deep subsurface environment with a low CO2 concentration (below detection limit [65, 66]) [67].Implications for primordial biologyThe last universal common ancestor (LUCA) is hypothesized to have evolved within alkaline hydrothermal mineral deposits at the interface of serpentinization-derived fluid and ambient water (e.g., Hadean weakly acidic seawater) [22,23,24]. Although such interfaces no longer exist (i.e., ancient Earth lacked O2 but most water bodies contain O2 on modern Earth), modern anoxic terrestrial and oceanic ecosystems harboring active serpentinization [1,2,3,4,5,6,7,8] may hold hints for how primordial organisms utilized H2 under hyperalkaline CO2-depleted conditions (e.g., post-LUCA H2-utilizing organisms that ventured away from the interface towards the alkaline fluids). Our findings suggest that unconventional modes of lithotrophy that take advantage of geogenic reduced carbon compounds (e.g., formate and glycine) as exogenous electron acceptors may have been viable approaches to circumventing thermodynamic issues and obtaining energy from H2 oxidation in situ. The strategies we discover are largely exclusive to the bacterial domain (archaeal CO2 reduction does not involve formate as an intermediate and, to our knowledge, glycine reduction is limited to Bacteria) and originated deep in the bacterial tree, suggesting they may have been relevant in the divergence towards the bacterial and archaeal domains. Notably, the estimated alkaliphily of the ancestral Grd also points towards the relevance of this metabolism in ancient alkaline habitats. More

  • in

    Nitrogen and carbon stable isotope analysis sheds light on trophic competition between two syntopic land iguana species from Galápagos

    Luiselli, L., Akani, G. & Capizzi, D. Food resource partitioning of a community of snakes in a swamp rainforest of south-eastern Nigeria. J. Zool. 246(2), 125–133. https://doi.org/10.1111/j.1469-7998.1998.tb00141.x (1998).Article 

    Google Scholar 
    Rouag, R., Djilali, H., Gueraiche, H. & Luiselli, L. Resource partitioning patterns between two sympatric lizard species from Algeria. J. Arid Environ. 69, 158–168. https://doi.org/10.1016/j.jaridenv.2006.08.008 (2007).ADS 
    Article 

    Google Scholar 
    Bergeron, R. & Blouin-Demers, G. Niche partitioning between two sympatric lizards in the Chiricahua Mountains of Arizona. Copeia 108(3), 570–577. https://doi.org/10.1643/CH-19-268 (2020).Article 

    Google Scholar 
    Lucek, K., Butlin, R. K. & Patsiou, T. Secondary contact zones of closely-related Erebia butterflies overlap with narrow phenotypic and parasitic clines. J. Evol. Biol. 33(9), 1152–1163. https://doi.org/10.1111/jeb.13669 (2020).CAS 
    Article 
    PubMed 

    Google Scholar 
    Freeman, B. G. Competitive interaction upon secondary contact drive elevational divergence in tropical birds. Am. Nat. 186(4), 470–479. https://doi.org/10.5061/dryad.6qg3g (2015).Article 
    PubMed 

    Google Scholar 
    Schoener, T. W. Resource partitioning in ecological communities. Science 185(4145), 27–39 (1974).ADS 
    CAS 
    Article 

    Google Scholar 
    Rivas, L. R. A Reinterpretation of the concepts “sympatric” and “allopatric” with proposal of the additional terms “syntopic” and “allotopic”. Syst. Zool. 13(1), 42 (1964).Article 

    Google Scholar 
    Macarthur, R. & Levins, R. The limiting similarity, convergence, and divergence of coexisting species. Am. Nat. 101(921), 377–385 (1967).Article 

    Google Scholar 
    Dayan, T. & Simberloff, D. Ecological and community-wide character displacement: The next generation. Ecol. Lett. 8(8), 875–894. https://doi.org/10.1111/j.1461-0248.2005.00791.x (2005).Article 

    Google Scholar 
    Holomuzki, J. R., Feminella, J. W. & Power, M. E. Biotic interactions in freshwater benthic habitats. J. N. Am. Benthol. Soc. 29(1), 220–244. https://doi.org/10.1899/08-044.1 (2010).Article 

    Google Scholar 
    Ferretti, F. et al. Competition between wild herbivores: Reintroduced red deer and Apennine chamois. Behav. Ecol. 26(2), 550–559. https://doi.org/10.1093/beheco/aru226 (2015).Article 

    Google Scholar 
    Takada, H., Yano, R., Katsumata, A., Takatsuki, S. & Minami, M. Diet compositions of two sympatric ungulates, the Japanese serow (Capricornis crispus) and the sika deer (Cervus nippon), in a montane forest and an alpine grassland of Mt. Asama central, Japan. Mamm. Biol. 101, 681–694. https://doi.org/10.1007/s42991-021-00122-5 (2021).Article 

    Google Scholar 
    Hubbel, S. P. The Unified Neutral Theory of Biodiversity and Biogeography (Princeton University Press, 2001) (ISBN 9780691021287).
    Google Scholar 
    Bell, G. Neutral macroecology. Science 293, 2413–2418. https://doi.org/10.1126/science.293.5539.2413 (2001).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Rosindell, J., Hubbel, S. P. & Etienne, R. S. The unified neutral theory of biodiversity and biogeography at age ten. Trends Ecol. Evol. 26(7), 340–348. https://doi.org/10.1016/j.tree.2011.03.024 (2011).Article 
    PubMed 

    Google Scholar 
    Cowie, R. H. & Holland, B. S. Dispersal is fundamental to biogeography and the evolution of biodiversity on oceanic islands. J. Biogeogr. 33, 193–198. https://doi.org/10.1111/j.1365-2699.2005.01383.x (2006).Article 

    Google Scholar 
    Amarasekare, P. & Nisbet, R. M. Spatial heterogeneity, source-sink dynamics, and the local coexistence of competing species. Am. Nat. 158(6), 572–584. https://doi.org/10.1086/323586 (2001).CAS 
    Article 
    PubMed 

    Google Scholar 
    Kumar, K., Gentile, G. & Grant, T. D. Conolophus subcristatus. The IUCN Red List of Threatened Species 2020, e.T5240A3014082 (2020). https://doi.org/10.2305/IUCN.UK.2020-2.RLTS.T5240A3014082.enGentile, G. Conolophus marthae. The IUCN Red List of Threatened Species 2012, e. T174472A1414375 (2012). https://doi.org/10.2305/IUCN.UK.2012-1.RLTS.T174472A1414375.enGentile, G., Marquez, C., Snell, H. L., Tapia, W. & Izurieta, A. Conservation of a New Flagship Species: The Galápagos Pink Land Iguana (Conolophus marthae Gentile and Snell, 2009). In Problematic Wildlife: A Cross-Disciplinary Approach (ed. Angelici, F. M.) 315–336 (Springer International Publishing, 2016). https://doi.org/10.1007/978-3-319-22246-2_15.Chapter 

    Google Scholar 
    Gentile, G. & Snell, H. L. Conolophus marthae sp. Nov. (Squamata, iguanidae), a new species of land iguana from the Galápagos Archipelago. Zootaxa 2201, 1–10 (2009).Article 

    Google Scholar 
    Colosimo, G. et al. Chemical signatures of femoral pore secretions in two syntopic but reproductively isolated species of Galápagos land iguanas (Conolophus marthae and C. subcristatus). Sci. Rep. 10(1), 14314. https://doi.org/10.1038/s41598-020-71176-7 (2020).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Jackson, M. Galápagos: A Natural History, Revised and Expanded (University of Calgary Press, 1994).
    Google Scholar 
    Traveset, A. et al. Galápagos land iguana (Conolophus subcristatus) as a seed disperser. Integr. Zool. 11(3), 207–213. https://doi.org/10.1111/1749-4877.12187 (2016).Article 
    PubMed 

    Google Scholar 
    Di Giambattista, L. et al. Molecular data exclude current hybridization between iguanas Conolophus marthae and C. subcristatus on Wolf volcano (Galápagos islands). Conserv. Genet. 19(6), 1461–1469. https://doi.org/10.1007/s10592-018-1114-3 (2018).Article 

    Google Scholar 
    MacLeod, A. et al. Hybridization masks speciation in the evolutionary history of the Galápagos marine iguana. Proc. R. Soc. B 282, 1–9. https://doi.org/10.1098/rspb.2015.0425 (2015).Article 

    Google Scholar 
    Gause, G. F. The Struggle for Existence (Williams and Wilkins Company, 1934).Book 

    Google Scholar 
    Hardin, G. The competitive exclusion principle. Science 131(3409), 1292–1297 (1960).ADS 
    CAS 
    Article 

    Google Scholar 
    Ashrafi, S., Beck, A., Rutishauser, M., Arlettaz, R. & Bontadina, F. Trophic niche partitioning of cryptic species of long-eared bats in Switzerland: Implications for conservation. Eur. J. Wildl. Res. 57, 843–849. https://doi.org/10.1007/s10344-011-0496-z (2011).Article 

    Google Scholar 
    Bleyhl, B. et al. Assessing niche overlap between domestic and threatened wild sheep to identify conservation priority areas. Divers. Distrib. 25(1), 129–141. https://doi.org/10.1111/ddi.12839 (2019).Article 

    Google Scholar 
    Newsome, S. D., del Rio, C. M., Bearhop, S. & Phillips, D. L. A niche for isotopic ecology. Front. Ecol. Environ. 5(8), 429–436. https://doi.org/10.1890/060150.1 (2007).Article 

    Google Scholar 
    Riera, P., Stal, L. J. & Nieuwenhuize, J. δ13C versus δ15N of co-occurring mollusks within a community dominated by Crassostrea gigas and Crepidula ornicate (Oossterschelde, The Netherlands). Mar. Ecol. Prog. Ser. 240, 291–295 (2002).ADS 
    Article 

    Google Scholar 
    Page, B., McKenzie, J. & Goldsworthy, S. D. Dietary resources partitioning among sympatric New Zealand and Australian fur seals. Mar. Ecol. Prog. Ser. 293, 283–302 (2005).ADS 
    Article 

    Google Scholar 
    DeNiro, M. J. & Epstein, S. Influence of diet on the distribution of carbon isotopes in animals. Geochim. Cosmochim. Acta 42(5), 495–506 (1978).ADS 
    CAS 
    Article 

    Google Scholar 
    DeNiro, M. J. & Epstein, S. Influence of diet on the distribution of nitrogen isotopes in animals. Geochim. Cosmochim. Acta 45(3), 341–351 (1981).ADS 
    CAS 
    Article 

    Google Scholar 
    Post, D. M. Using stable isotopes to estimate trophic position: Models, methods, and assumptions. Ecology 83(3), 703–718. https://doi.org/10.1890/0012-9658(2002)083[0703:USITET]2.0.CO;2 (2002).Article 

    Google Scholar 
    Crawford, K., McDonald, R. A. & Bearhop, S. Applications of stable isotope techniques to the ecology of mammals. Mammal. Rev. 38(1), 87–107. https://doi.org/10.1111/j.1365-2907.2008.00120.x (2008).Article 

    Google Scholar 
    Trueman, M. & d’Ozouville, N. Characterizing the Galápagos terrestrial climate in the face of global climate change. Gala Res. 67, 26–37 (2010).
    Google Scholar 
    Paltán, H. A. et al. Climate and sea surface trends in the Galápagos Islands. Sci. Rep. 11(1), 1–13. https://doi.org/10.1038/s41598-021-93870-w (2021).CAS 
    Article 

    Google Scholar 
    Rivas-Torres, G. F., Benítez, F. L., Rueda, D., Sevilla, C. & Mena, C. F. A methodology for mapping native and invasive vegetation coverage in archipelagos: An example from the Galápagos islands. Prog. Phys. Geogr. 42(1), 83–111. https://doi.org/10.1177/0309133317752278 (2018).Article 

    Google Scholar 
    Gentile, G., Ciambotta, M. & Tapia, W. Illegal wildlife trade in Galápagos: Molecular tools help taxonomic identification and guide rapid repatriation of confiscated iguanas. Conserv. Genet. Resour. 5, 867–872. https://doi.org/10.1007/s12686-013-9915-7 (2013).Article 

    Google Scholar 
    Stephens, R. B., Ouimette, A. P., Hobbie, E. A. & Rowe, R. J. Re-evaluating trophic discrimination factors (Δδ13C and Δδ15N) for diet reconstruction. Ecol. Mono 92, e1525. https://doi.org/10.1002/ecm.1525 (2022).CAS 
    Article 

    Google Scholar 
    Hobson, K. A. & Clark, R. G. Assessing avian diets using stable isotopes I: Turnover of 13C in tissues. The Condor 94(1), 181–188. https://doi.org/10.2307/1368807 (1992).Article 

    Google Scholar 
    Li, C.-H., Roth, J. D. & Detwiler, J. T. Isotopic turnover rates and diet-tissue discrimination depend on feeding habits of freshwater snails. PLoS ONE 13(7), e0199713. https://doi.org/10.1371/journal.pone.0199713 (2018).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Steinitz, R., Lemm, J., Pasachnik, S. & Kurle, C. Diet-tissue stable isotope (δ13C and δ15N) discrimination factors for multiple tissues from terrestrial reptiles. Rapid Commun. Mass Spectrom. 30(1), 9–21. https://doi.org/10.1002/rcm.7410 (2016).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    Ethier, D. M., Kyle, C. J., Kyser, T. K. & Nocera, J. J. Variability in the growth patterns of the cornified claw sheath among vertebrates: Implications for using biogeochemistry to study animal movement. Can. J. Zool. 88(11), 1043–1051. https://doi.org/10.1139/Z10-073 (2010).Article 

    Google Scholar 
    Aresco, M. J. & James, F. C. Ecological relationships of turtles in northern Florida lakes: A study of omnivory and the structure of a lake food web. Florida Fish and Wildlife Conservation Commission (2005). https://www.semanticscholar.org/paper/ECOLOGICAL-RELATIONSHIPS-OF-TURTLES-IN-NORTHERN-A-A-Aresco-James/f6d59265eb6494aa19cfde7d2d80bb165e6432acLourenço, P. M., Granadeiro, J. P., Guilherme, J. L. & Catry, T. Turnover rates of stable isotopes in avian blood and toenails: Implications for dietary and migration studies. J. Exp. Mar. Biol. Ecol. 472, 89–96. https://doi.org/10.1016/j.jembe.2015.07.006 (2015).CAS 
    Article 

    Google Scholar 
    Jackson, A. L., Inger, R., Parnell, A. C. & Bearhop, S. Comparing isotopic niche widths among and within communities: SIBER—Stable isotope Bayesian ellipses in r. J. Animal Ecol. 80(3), 595–602. https://doi.org/10.1111/j.1365-2656.2011.01806.x (2011).Article 

    Google Scholar 
    Wikelski, M. & Romero, L. M. Body size, performance and fitness in Galápagos marine iguanas. Integr Comp Biol 43(3), 376–386. https://doi.org/10.1093/icb/43.3.376 (2003).Article 
    PubMed 

    Google Scholar 
    Iverson, J., Smith, G. & Pieper, L. Factors Affecting Long-Term Growth of the Allen Cays Rock Iguana in the Bahamas. In Iguanas: Biology and Conservation (eds Alberts, A. et al.) 176–192 (University of California Press, 2004). https://doi.org/10.1525/9780520930117-018.Chapter 

    Google Scholar 
    Smith, G. R. & Iverson, J. B. Effects of tourism on body size, growth, condition, and demography in the Allen Cay Iguana. Herpetol. Conserv. Biol. 11, 214–221 (2016).
    Google Scholar 
    Wikelski, M., Carrillo, V. & Trillmich, F. Energy limits to body size in a grazing reptile, the Galápagos Marine Iguana. Ecology 78(7), 2204–2217. https://doi.org/10.2307/2265956 (1997).Article 

    Google Scholar 
    Bulakhova, N. A. et al. Inter-observer and intra-observer differences in measuring body length: A test in the common lizard, Zootoca vivipara. Amphibia-Reptilia 32(4), 477–484. https://doi.org/10.1163/156853811X601636 (2011).Article 

    Google Scholar 
    R Development Core Team. R: A language and environment for statistical computing (2021). https://cran.r-project.orgGoslee, S. C. & Urban, D. L. The ecodist package for dissimilarity-based analysis of ecological data. J. Stat. Softw. 22(7), 1–19. https://doi.org/10.18637/jss.v022.i07 (2007).Article 

    Google Scholar 
    Randin, C. F., Jaccard, H., Vittoz, P., Yoccoz, N. G. & Guisan, A. Land use improves spatial predictions of mountain plant abundance but not presence–absence. J. Veg. Sci. 20, 996–1008. https://doi.org/10.1111/j.1654-1103.2009.01098.x (2009).Article 

    Google Scholar 
    Broennimann, O., Di Cola, V. & Guisan, A. ecospat: Spatial Ecology Miscellaneous Methods. R package version 3.2.1 (2022) https://CRAN.R-project.org/package=ecospatBorcard, D., Legendre, P. & Drapeau, P. Partialling out the spatial component of ecological variation. Ecology 73(3), 1045–1055. https://doi.org/10.2307/1940179 (1992).Article 

    Google Scholar 
    Wood, S. N. Generalized Additive Models: An Introduction with R 2nd edn. (Chapman and Hall/CRC, 2017). https://doi.org/10.1201/9781315370279.Book 
    MATH 

    Google Scholar 
    Van Marken Lichtenbelt, W. D. Optimal foraging of a herbivorous lizard, the green iguana in a seasonal environment. Oecologia 95, 246–256. https://doi.org/10.1007/BF00323497 (1993).ADS 
    Article 
    PubMed 

    Google Scholar 
    Pasachnik, S. A. & Martin-Velez, V. An evaluation of the diet of Cyclura iguanas in the Dominican Republic. Herpetol. Bull. 140, 6–12 (2017).
    Google Scholar 
    Cerling, T. E. et al. Global vegetation change through the Miocene/Pliocene boundary. Nature 389(6647), 153–158. https://doi.org/10.1038/38229 (1997).ADS 
    CAS 
    Article 

    Google Scholar 
    O’Leary, M. H. Carbon isotopes in photosynthesis. Bioscience 38(5), 328–336. https://doi.org/10.2307/1310735 (1988).Article 

    Google Scholar 
    Snell, H. L. & Tracy, C. R. Behavioral and morphological adaptations by Galapagos land iguanas (Conolophus subcristatus) to water and energy requirements of eggs and neonates. Am. Zool. 25(4), 1009–1018. https://doi.org/10.1093/icb/25.4.1009 (1985).Article 

    Google Scholar 
    Christian, K., Tracy, C. R. & Porter, W. P. Diet, digestion, and food preferences of Galápagos land iguanas. Herpetologica 40(2), 205–212 (1984).
    Google Scholar 
    Mallona, I., Egea-Cortines, M. & Weiss, J. Conserved and divergent rhythms of crassulacean acid metabolism-related and core clock gene expression in the cactus Opuntia ficus-indica. Plant Physiol. 156, 1978–1989. https://doi.org/10.1104/pp.111.179275 (2011).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    San Sebastián, O., Navarro, J., Llorente, G. A. & Richter-Boix, Á. Trophic strategies of a non-native and a native amphibian species in shared ponds. PLoS ONE 10(6), 1–17. https://doi.org/10.1371/journal.pone.0130549 (2015).CAS 
    Article 

    Google Scholar 
    Perga, M. E. & Grey, J. Laboratory measures of isotope discrimination factors: Comments on Caut, Angulo & Courchamp (2008, 2009). J. Appl. Ecol. 47(4), 942–947. https://doi.org/10.1111/j.1365-2664.2009.01730.x (2010).CAS 
    Article 

    Google Scholar 
    Freeman, B. Sexual niche partitioning in two species of new Guinean Pachycephala whistlers. J. Field Ornithol. 85(1), 23–30. https://doi.org/10.1111/jofo.12046 (2014).Article 

    Google Scholar 
    Werner, D. I. Social Organization and Ecology of Land Iguanas, Conolophus subcristatus, on Isla Fernandina, Galápagos. In Iguanas of the World: Their Behavior, Ecology, and Conservation (eds Burghardt, G. M. & Rand, A. S.) 342–365 (Noyes Publications, 1982).
    Google Scholar 
    Doi, H., Akamatsu, F. & González, A. L. Starvation effects on nitrogen and carbon stable isotopes of animals: An insight from meta-analysis of fasting experiments. R. Soc. Open Sci. 4(8), 170633. https://doi.org/10.1098/rsos.170633 (2017).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    Persaud, A., Dillon, P., Molot, L. & Hargan, K. Relationships between body size and trophic position of consumers in temperate freshwater lakes. Aquat. Sci. 74(1), 203–212. https://doi.org/10.1007/s00027-011-0212-9 (2012).Article 

    Google Scholar 
    Keppeler, F. W. et al. Body size, trophic position, and the coupling of different energy pathways across a saltmarsh landscape. Limnol. Oceanogr. Lett. 6(6), 360–368. https://doi.org/10.1002/lol2.10212 (2021).Article 

    Google Scholar 
    Hanson, J. O. et al. Feeding across the food web: The interaction between diet, movement and body size in estuarine crocodiles (Crocodylus porosus). Austral. Ecol. 40(3), 275–286. https://doi.org/10.1111/aec.12212 (2015).Article 

    Google Scholar 
    Gustavino, B., Terrinoni, S., Paglierani, C. & Gentile, G. Conolophus marthae vs. Conolophus subcristatus: Does the skin pigmentation pattern exert a protective role against DNA damaging effect induced by UV light exposure? Analysis of blood smears through the micronucleus test. Paper presented at the Galápagos Land and Marine Iguanas Workshop, IUCN SSC Iguana Specialist Group Meeting, Puerto Ayora, 28–29 October 2014.Di Giacomo, C. et al. 25–Hydroxivitamin D plasma levels in natural populations of pigmented and partially pigmented land iguanas from Galápagos (Conolophus spp.). Hind 2022, 1–9. https://doi.org/10.1155/2022/7741397 (2022).CAS 
    Article 

    Google Scholar 
    Percie du Sert, N. et al. Reporting animal research: Explanation and elaboration for the ARRIVE guidelines 2.0. PLoS Biol. 18(7), e3000411. https://doi.org/10.1371/journal.pbio.3000411 (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar  More