More stories

  • in

    Abundant and cosmopolitan lineage of cyanopodoviruses lacking a DNA polymerase gene

    Suttle CA. Marine viruses-major players in the global ecosystem. Nat Rev Microbiol. 2007;5:801–12.CAS 
    PubMed 

    Google Scholar 
    Fuhrman JA. Marine viruses and their biogeochemical and ecological effects. Nature 1999;399:541–8.CAS 
    PubMed 

    Google Scholar 
    Rohwer F, Thurber RV. Viruses manipulate the marine environment. Nature 2009;459:207–12.CAS 
    PubMed 

    Google Scholar 
    Breitbart M, Bonnain C, Malki K, Sawaya NA. Phage puppet masters of the marine microbial realm. Nat Microbiol. 2018;3:754–66.CAS 
    PubMed 

    Google Scholar 
    Zimmerman AE, Howard-Varona C, Needham DM, John SG, Worden AZ, Sullivan MB, et al. Metabolic and biogeochemical consequences of viral infection in aquatic ecosystems. Nat Rev Microbiol. 2020;18:21–34.CAS 
    PubMed 

    Google Scholar 
    Rosenwasser S, Ziv C, Creveld SGV, Vardi A. Virocell metabolism: metabolic innovations during host-virus interactions in the ocean. Trends Microbiol. 2016;24:821–32.CAS 
    PubMed 

    Google Scholar 
    Fuchsman CA, Carlson MCG, Garcia Prieto D, Hays MD, Rocap G. Cyanophage host-derived genes reflect contrasting selective pressures with depth in the oxic and anoxic water column of the Eastern Tropical North Pacific. Environ Microbiol. 2021;23:2782–2800.CAS 
    PubMed 

    Google Scholar 
    Roux S, Brum JR, Dutilh BE, Sunagawa S, Duhaime MB, Loy A, et al. Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses. Nature 2016;537:689–93.CAS 
    PubMed 

    Google Scholar 
    Gregory AC, Zayed AA, Conceição-Neto N, Temperton B, Bolduc B, Alberti A, et al. Marine DNA viral macro-and microdiversity from pole to pole. Cell 2019;177:1109–23.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Brum JR, Ignacio-Espinoza JC, Roux S, Doulcier G, Acinas SG, Alberti A, et al. Patterns and ecological drivers of ocean viral communities. Science 2015;348:1261498.PubMed 

    Google Scholar 
    Dion MB, Oechslin F, Moineau S. Phage diversity, genomics and phylogeny. Nat Rev Microbiol. 2020;18:125–38.CAS 
    PubMed 

    Google Scholar 
    Sullivan MB, Waterbury JB, Chisholm SW. Cyanophages infecting the oceanic cyanobacterium Prochlorococcus. Nature 2003;424:1047–51.CAS 
    PubMed 

    Google Scholar 
    Mann NH. Phages of the marine cyanobacterial picophytoplankton. FEMS Microbiol Rev. 2003;27:17–34.CAS 
    PubMed 

    Google Scholar 
    Ni T, Zeng Q. Diel infection of cyanobacteria by cyanophages. Front Mar Sci. 2016;2:123.
    Google Scholar 
    Flombaum P, Gallegos JL, Gordillo RA, Rincon J, Zabala LL, Jiao N, et al. Present and future global distributions of the marine Cyanobacteria Prochlorococcus and Synechococcus. Proc Natl Acad Sci USA 2013;110:9824–9.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Biller SJ, Berube PM, Lindell D, Chisholm SW. Prochlorococcus: the structure and function of collective diversity. Nat Rev Microbiol 2015;13:13–27.CAS 
    PubMed 

    Google Scholar 
    Proctor LM, Fuhrman JA. Viral mortality of marine-bacteria and cyanobacteria. Nature 1990;343:60–62.
    Google Scholar 
    Carlson MCG, Ribalet F, Maidanik I, Durham BP, Hulata Y, Ferron S, et al. Viruses affect picocyanobacterial abundance and biogeography in the North Pacific Ocean. Nat Microbiol 2022;7:570–80.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Matteson AR, Loar SN, Pickmere S, DeBruyn JM, Ellwood MJ, Boyd PW, et al. Production of viruses during a spring phytoplankton bloom in the South Pacific Ocean near of New Zealand. FEMS Microbiol Ecol 2012;79:709–19.CAS 
    PubMed 

    Google Scholar 
    Ribalet F, Swalwell J, Clayton S, Jimenez V, Sudek S, Lin Y, et al. Light-driven synchrony of Prochlorococcus growth and mortality in the subtropical Pacific gyre. Proc Natl Acad Sci USA. 2015;112:8008–12.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Demory D, Liu R, Chen Y, Zhao F, Coenen AR, Zeng Q, et al. Linking light-dependent life history traits with population dynamics for Prochlorococcus and cyanophage. mSystems 2020;5:e00586–19.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Avrani S, Wurtzel O, Sharon I, Sorek R, Lindell D. Genomic island variability facilitates Prochlorococcus-virus coexistence. Nature 2011;474:604–8.CAS 
    PubMed 

    Google Scholar 
    Marston MF, Pierciey FJ Jr, Shepard A, Gearin G, Qi J, Yandava C, et al. Rapid diversification of coevolving marine Synechococcus and a virus. Proc Natl Acad Sci USA 2012;109:4544–9.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Xiao X, Guo W, Li X, Wang C, Chen X, Lin X, et al. Viral lysis alters the optical properties and biological availability of dissolved organic matter derived from Prochlorococcus picocyanobacteria. Appl Environ Microbiol. 2021;87:e02271–20.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Xiao X, Zeng Q, Zhang R, Jiao N. Prochlorococcus viruses—From biodiversity to biogeochemical cycles. Sci China Earth Sci. 2018;61:1728–36.
    Google Scholar 
    Jover LF, Effler TC, Buchan A, Wilhelm SW, Weitz JS. The elemental composition of virus particles: implications for marine biogeochemical cycles. Nat Rev Microbiol. 2014;12:519–28.CAS 
    PubMed 

    Google Scholar 
    Puxty RJ, Millard AD, Evans DJ, Scanlan DJ. Viruses inhibit CO2 fixation in the most abundant phototrophs on earth. Curr Biol 2016;26:1585–9.CAS 
    PubMed 

    Google Scholar 
    Weitz JS, Stock CA, Wilhelm SW, Bourouiba L, Coleman ML, Buchan A, et al. A multitrophic model to quantify the effects of marine viruses on microbial food webs and ecosystem processes. ISME J. 2015;9:1352–64.PubMed 
    PubMed Central 

    Google Scholar 
    Sullivan MB, Coleman ML, Weigele P, Rohwer F, Chisholm SW. Three Prochlorococcus cyanophage genomes: signature features and ecological interpretations. PLoS Biol. 2005;3:e144.PubMed 
    PubMed Central 

    Google Scholar 
    Sullivan MB, Krastins B, Hughes JL, Kelly L, Chase M, Sarracino D, et al. The genome and structural proteome of an ocean siphovirus: a new window into the cyanobacterial ‘mobilome’. Environ Microbiol. 2009;11:2935–51.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, Weigele PR, et al. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ Microbiol. 2010;12:3035–56.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Sabehi G, Shaulov L, Silver DH, Yanai I, Harel A, Lindell D. A novel lineage of myoviruses infecting cyanobacteria is widespread in the oceans. Proc Natl Acad Sci USA 2012;109:2037–42.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Huang S, Wang K, Jiao N, Chen F. Genome sequences of siphoviruses infecting marine Synechococcus unveil a diverse cyanophage group and extensive phage-host genetic exchanges. Environ Microbiol. 2012;14:540–58.CAS 
    PubMed 

    Google Scholar 
    Labrie SJ, Frois-Moniz K, Osburne MS, Kelly L, Roggensack SE, Sullivan MB, et al. Genomes of marine cyanopodoviruses reveal multiple origins of diversity. Environ Microbiol. 2013;15:1356–76.CAS 
    PubMed 

    Google Scholar 
    Dekel-Bird NP, Avrani S, Sabehi G, Pekarsky I, Marston MF, Kirzner S, et al. Diversity and evolutionary relationships of T7-like podoviruses infecting marine cyanobacteria. Environ Microbiol. 2013;15:1476–91.CAS 
    PubMed 

    Google Scholar 
    Huang S, Wilhelm SW, Jiao N, Chen F. Ubiquitous cyanobacterial podoviruses in the global oceans unveiled through viral DNA polymerase gene sequences. ISME J. 2010;4:1243–51.PubMed 

    Google Scholar 
    Baran N, Goldin S, Maidanik I, Lindell D. Quantification of diverse virus populations in the environment using the polony method. Nat Microbiol. 2018;3:62–72.CAS 
    PubMed 

    Google Scholar 
    Chow C-ET, Suttle CA. Biogeography of viruses in the sea. Annu Rev Virol. 2015;2:41–66.CAS 
    PubMed 

    Google Scholar 
    Chen F, Lu JR. Genomic sequence and evolution of marine cyanophage P60: a new insight on lytic and lysogenic phages. Appl Environ Microbiol. 2002;68:2589–94.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Huang S, Zhang S, Jiao N, Chen F. Comparative genomic and phylogenomic analyses reveal a conserved core genome shared by estuarine and oceanic cyanopodoviruses. PLoS One. 2015;10:e0142962.PubMed 
    PubMed Central 

    Google Scholar 
    Pope WH, Weigele PR, Chang J, Pedulla ML, Ford ME, Houtz JM, et al. Genome sequence, structural proteins, and capsid organization of the cyanophage Syn5: A “horned’ bacteriophage of marine Synechococcus. J Mol Biol. 2007;368:966–81.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Huang S, Sun Y, Zhang S, Long L. Temporal transcriptomes of a marine cyanopodovirus and its Synechococcus host during infection. Microbiologyopen 2021;10:e1150.CAS 
    PubMed 

    Google Scholar 
    Wang K, Chen F. Prevalence of highly host-specific cyanophages in the estuarine environment. Environ Microbiol. 2008;10:300–12.CAS 
    PubMed 

    Google Scholar 
    Chen F, Wang K, Huang S, Cai H, Zhao M, Jiao N, et al. Diverse and dynamic populations of cyanobacterial podoviruses in the Chesapeake Bay unveiled through DNA polymerase gene sequences. Environ Microbiol. 2009;11:2884–92.PubMed 

    Google Scholar 
    Goldin S, Hulata Y, Baran N, Lindell D. Quantification of T4-like and T7-like cyanophages using the polony method show they are significant members of the virioplankton in the North Pacific Subtropical Gyre. Front Microbiol. 2020;11:1210.PubMed 
    PubMed Central 

    Google Scholar 
    Nasko DJ, Chopyk J, Sakowski EG, Ferrell BD, Polson SW, Wommack KE. Family A DNA polymerase phylogeny uncovers diversity and replication gene organization in the virioplankton. Front Microbiol. 2018;9:3053.PubMed 
    PubMed Central 

    Google Scholar 
    Dekel-Bird NP, Sabehi G, Mosevitzky B, Lindell D. Host-dependent differences in abundance, composition and host range of cyanophages from the Red Sea. Environ Microbiol. 2015;17:1286–99.CAS 
    PubMed 

    Google Scholar 
    Hanson CA, Marston MF, Martiny JBH. Biogeographic variation in host range phenotypes and taxonomic composition of marine cyanophage isolates. Front Microbiol. 2016;7:983.PubMed 
    PubMed Central 

    Google Scholar 
    Rocap G, Larimer FW, Lamerdin J, Malfatti S, Chain P, Ahlgren NA, et al. Genome divergence in two Prochlorococcus ecotypes reflects oceanic niche differentiation. Nature 2003;424:1042–7.CAS 
    PubMed 

    Google Scholar 
    Chen B, Wang L, Song S, Huang B, Sun J, Liu H. Comparisons of picophytoplankton abundance, size, and fluorescence between summer and winter in northern South China Sea. Cont Shelf Res. 2011;31:1527–40.
    Google Scholar 
    Lindell D, Jaffe JD, Coleman ML, Futschik ME, Axmann IM, Rector T, et al. Genome-wide expression dynamics of a marine virus and host reveal features of co-evolution. Nature 2007;449:83–86.CAS 
    PubMed 

    Google Scholar 
    Zhao Y, Qin F, Zhang R, Giovannoni SJ, Zhang Z, Sun J, et al. Pelagiphages in the Podoviridae family integrate into host genomes. Environ Microbiol. 2019;21:1989–2001.CAS 
    PubMed 

    Google Scholar 
    Leptihn S, Gottschalk J, Kuhn A. T7 ejectosome assembly: A story unfolds. Bacteriophage 2016;6:e1128513.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Thompson LR, Zeng Q, Kelly L, Huang KH, Singer AU, Stubbe J, et al. Phage auxiliary metabolic genes and the redirection of cyanobacterial host carbon metabolism. Proc Natl Acad Sci USA 2011;108:E757–64.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Zeng Q, Chisholm SW. Marine viruses exploit their host’s two-component regulatory system in response to resource limitation. Curr Biol 2012;22:124–8.CAS 
    PubMed 

    Google Scholar 
    Zeng Q, Bonocora RP, Shub DA. A free-standing homing endonuclease targets an intron insertion site in the psbA gene of cyanophages. Curr Biol. 2009;19:218–22.CAS 
    PubMed 

    Google Scholar 
    Lindell D, Jaffe JD, Johnson ZI, Church GM, Chisholm SW. Photosynthesis genes in marine viruses yield proteins during host infection. Nature 2005;438:86–89.CAS 
    PubMed 

    Google Scholar 
    Breitbart M, Thompson LR, Suttle CA, Sullivan MB. Exploring the vast diversity of marine viruses. Oceanography. 2007;20:135–9.
    Google Scholar 
    Kazlauskas D, Venclovas C. Computational analysis of DNA replicases in double-stranded DNA viruses: relationship with the genome size. Nucleic Acids Res. 2011;39:8291–305.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Liu X, Zhang Q, Murata K, Baker ML, Sullivan MB, Fu C, et al. Structural changes in a marine podovirus associated with release of its genome into Prochlorococcus. Nat Struct Mol Biol. 2010;17:830–6.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Dai W, Fu C, Raytcheva D, Flanagan J, Khant HA, Liu XG, et al. Visualizing virus assembly intermediates inside marine cyanobacteria. Nature 2013;502:707–10.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Liu R, Liu Y, Chen Y, Zhan Y, Zeng Q. Cyanobacterial viruses exhibit diurnal rhythms during infection. Proc Natl Acad Sci USA 2019;116:14077–82.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Maidanik I, Kirzner S, Pekarski I, Arsenieff L, Tahan R, Carlson MCG, et al. Cyanophages from a less virulent clade dominate over their sister clade in global oceans. ISME J. 2022;16:2169–80.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Shitrit D, Hackl T, Laurenceau R, Raho N, Carlson MCG, Sabehi G, et al. Genetic engineering of marine cyanophages reveals integration but not lysogeny in T7-like cyanophages. ISME J. 2022;16:488–99.CAS 
    PubMed 

    Google Scholar 
    Liang Y, Wang L, Wang Z, Zhao J, Yang Q, Wang M, et al. Metagenomic analysis of the diversity of DNA viruses in the surface and deep sea of the South China Sea. Front Microbiol. 2019;10:1951.PubMed 
    PubMed Central 

    Google Scholar 
    Pedrós-Alió C, Potvin M, Lovejoy C. Diversity of planktonic microorganisms in the Arctic Ocean. Prog Oceanogr. 2015;139:233–43.
    Google Scholar 
    Luo E, Eppley JM, Romano AE, Mende DR, DeLong EF. Double-stranded DNA virioplankton dynamics and reproductive strategies in the oligotrophic open ocean water column. ISME J. 2020;14:1304–15.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Steidinger BS, Crowther TW, Liang J, Van Nuland ME, Werner GDA, Reich PB, et al. Climatic controls of decomposition drive the global biogeography of forest-tree symbioses. Nature 2019;569:404–8.CAS 
    PubMed 

    Google Scholar 
    Xie X, Wu T, Zhu M, Jiang G, Xu Y, Wang X, et al. Comparison of random forest and multiple linear regression models for estimation of soil extracellular enzyme activities in agricultural reclaimed coastal saline land. Ecol Indic. 2021;120:106925.CAS 

    Google Scholar 
    Lee SJ, Richardson CC. Choreography of bacteriophage T7 DNA replication. Curr Opin Chem Biol. 2011;15:580–6.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Kulczyk AW, Richardson CC. The replication system of bacteriophage T7. Enzymes. 2016;39:89–136.CAS 
    PubMed 

    Google Scholar 
    Benkovic SJ, Valentine AM, Salinas F. Replisome-mediated DNA replication. Annu Rev Biochem. 2001;70:181–208.CAS 
    PubMed 

    Google Scholar 
    Johnson A, O’Donnell M. Cellular DNA replicases: components and dynamics at the replication fork. Annu Rev Biochem. 2005;74:283–315.CAS 
    PubMed 

    Google Scholar 
    Seco EM, Zinder JC, Manhart CM, Lo Piano A, McHenry CS, Ayora S. Bacteriophage SPP1 DNA replication strategies promote viral and disable host replication in vitro. Nucleic Acids Res. 2013;41:1711–21.CAS 
    PubMed 

    Google Scholar 
    Mruwat N, Carlson MCG, Goldin S, Ribalet F, Kirzner S, Hulata Y, et al. A single-cell polony method reveals low levels of infected Prochlorococcus in oligotrophic waters despite high cyanophage abundances. ISME J. 2021;15:41–54.CAS 
    PubMed 

    Google Scholar 
    Moore LR, Rocap G, Chisholm SW. Physiology and molecular phylogeny of coexisting Prochlorococcus ecotypes. Nature 1998;393:464–7.CAS 
    PubMed 

    Google Scholar 
    Puxty RJ, Millard AD, Evans DJ, Scanlan DJ. Shedding new light on viral photosynthesis. Photosynth Res. 2015;126:71–97.CAS 
    PubMed 

    Google Scholar 
    Edwards KF, Steward GF, Schvarcz CR. Making sense of virus size and the tradeoffs shaping viral fitness. Ecol Lett. 2021;24:363–73.PubMed 

    Google Scholar 
    Moore LR, Coe A, Zinser ER, Saito MA, Sullivan MB, Lindell D, et al. Culturing the marine cyanobacterium Prochlorococcus. Limnol Oceanogr Methods. 2007;5:353–62.CAS 

    Google Scholar 
    Hyman P, Abedon ST. Bacteriophage host range and bacterial resistance. Adv Appl Microbiol. 2010;70:217–48.CAS 
    PubMed 

    Google Scholar 
    Fridman S, Flores-Uribe J, Larom S, Alalouf O, Liran O, Yacoby I, et al. A myovirus encoding both photosystem I and II proteins enhances cyclic electron flow in infected Prochlorococcus cells. Nat Microbiol. 2017;2:1350–7.CAS 
    PubMed 

    Google Scholar 
    Fang X, Liu Y, Zhao Y, Chen Y, Liu R, Qin QL, et al. Transcriptomic responses of the marine cyanobacterium Prochlorococcus to viral lysis products. Environ Microbiol. 2019;21:2015–28.CAS 
    PubMed 

    Google Scholar 
    John SG, Mendez CB, Deng L, Poulos B, Kauffman AK, Kern S, et al. A simple and efficient method for concentration of ocean viruses by chemical flocculation. Environ Microbiol Rep. 2011;3:195–202.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 2014;30:2114–20.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics 2011;27:863–4.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:1–10.
    Google Scholar 
    Peng Y, Leung HC, Yiu SM, Chin FY. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 2012;28:1420–8.CAS 
    PubMed 

    Google Scholar 
    Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 2014;30:2068–9.CAS 
    PubMed 

    Google Scholar 
    Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: new models and efficient methods for phylogenetic Inference in the genomic era. Mol Biol Evol. 2020;37:2461–2461.PubMed 
    PubMed Central 

    Google Scholar 
    Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018;35:518–22.CAS 
    PubMed 

    Google Scholar 
    Martinez-Hernandez F, Fornas O, Lluesma Gomez M, Bolduc B, de la Cruz Pena MJ, Martinez JM, et al. Single-virus genomics reveals hidden cosmopolitan and abundant viruses. Nat Commun. 2017;8:15892.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Zhang Z, Qin F, Chen F, Chu X, Luo H, Zhang R, et al. Culturing novel and abundant pelagiphages in the ocean. Environ Microbiol 2021;23:1145–61.CAS 
    PubMed 

    Google Scholar 
    Buchholz HH, Michelsen ML, Bolanos LM, Browne E, Allen MJ, Temperton B. Efficient dilution-to-extinction isolation of novel virus-host model systems for fastidious heterotrophic bacteria. ISME J. 2021;15:1585–98.CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Qin F, Du S, Zhang Z, Ying H, Wu Y, Zhao G, et al. Newly identified HMO-2011-type phages reveal genomic diversity and biogeographic distributions of this marine viral group. ISME J. 2022;16:1363–75.CAS 
    PubMed 
    PubMed Central 

    Google Scholar  More

  • in

    The African Development Corridors Database: a new tool to assess the impacts of infrastructure investments

    The African Development Corridors database is publicly available. The visualisation of the database that can be explored interactively here: https://dcp-unep-wcmc.opendata.arcgis.com/. The data is deposited in the Dryad Digital Repository referenced as Thorn, J. P.R., Mwangi, B.; Juffe Bignoli, D., The African Development Corridors Database, Dryad, Dataset, https://doi.org/10.5061/dryad.9kd51c5hw (2022)43. The final data were compiled into an online Master database spreadsheet, using the project code data as the merging attribute of the spatial and tabular database (AfricanDevelopmentCorridorsDatabase2022.csv). The African Development Corridor Database is presented as a GeoPackage file (.gpkg) and ESRI file Geodatabase (.gdb) composed by line and point feature datasets with the 22 associated attributes for all mapped corridors, a table with corridors that could not be mapped (also with the attributes), and a table with all sources consulted for each project code.We created a data standard to ensure a systematic and standardised data collection (Supplementary Table 2). Each data record in the database represents a project within a development corridor. To group all projects within the same development corridor we used a unique identifier composed by three letters that identified the corridor plus a number unique for each project or record. For example, the Lamu port project in Kenya within the Lamu Port South Sudan Ethiopia Transport Corridor (LAPSSET) was represented as LAP000. In this corridor we identified 20 projects, from LAP0001 which is the Lamu Port to LAP0020 which is the Isiolo-Lokichar-Lodwar-Nadapal Highway in Kenya. In addition to the unique identifier for each project, the data standard includes data attributes that provide detailed information about each project. Table 1 describes the attributes included in the database. Supplementary Table 3 summarises the 79 corridors included in the database.Table 1 List of the attributes included in the African Development Corridors Database.Full size tableInfrastructure types and status of development corridors in AfricaThe data consists of a total of 79 corridors consisting of 184 projects (Fig. 2). Of the 12 infrastructure types, the most predominant form of infrastructure in Africa’s development corridors is roads (n = 64, 34.8%), followed by wet ports (n = 38, 20.7%), passenger and freight railways (n = 33, 17.9%), and airports (n = 14, 7.6%). Fewer resort cities, electricity transmission lines, dry ports, industrial parks, and water pipelines comprise development corridors (all ≤ n = 3, 1.6%) (Fig. 3). We acknowledge our study might not include many infrastructure developments that are components projects of larger programmes but are not yet labelled as corridors. A total of 107 (58.7%) projects are operational, 35 (19%) are in progress, 25 (13.6%) are planned, 25 (13.9%) are being upgraded, and 2(1%) are on hold.Fig. 2Map showing the distribution of all the development corridors included in the African Development Corridors Database and their infrastructure type.Full size imageFig. 3Subset of highest frequencies of key attributes captured in the database.Full size imageSpatial distributionThe linear distance of development corridors in Africa is 122,294 km – which approximates to three times the Earth’s circumference, with an average of 1703.84 ± 213.19 km (mean, SE), ranging from 4–11,141 km. In terms of number of projects per country, Kenya has the most projects (n = 34, 18.5%), followed by Tanzania (n = 18, 9.8%), South Africa and Democratic Republic of the Congo (n = 17, 9.2% ea.), Ethiopia (n = 15, 8.2%), Mozambique and Zambia (n = 14, 7.6%), Angola, Uganda, Guinea and Cameroon (n = 12, 6.5%), Namibia (n = 11, 6.0%), Republic of Congo (n = 10, 5.4%), Burundi and Chad (n = 9, 4.9%), Malawi, Senegal, and Zimbabwe (n = 8, 4.4%), and Burkina Faso and Ghana (n = 7, 3.8%). Due to differences in the frequency and quality that countries publish data on infrastructure and development corridor investments, coverage may be lower for some regions, or some periods (i.e., recent, or further in the past).Investments in development corridorsAdjusting for inflation, the total investment of development corridors that is captured in the database ranges between USD 547.29–658.62 billion. The average cost of a corridor ranges between USD 3.46 ± 1.92 billion and USD 4.17 ± 2.04 billion. This is a notable sum, considering the average GDP in sub-Saharan Africa is USD 1.48 billion44. The most expensive development corridor project is the first of the nine Trans African Highway projects at USD 78.20 billion (when adjusted for inflation) – comprising transcontinental roads across Africa. We were able to capture the budget (or at least a proportion of the budget) for 84.7% of all projects.Temporal evolution of growth of development corridorsInvestments started in the 1800s and have increased exponentially (Fig. 4). Over a fifty-year period, the greatest number of investments took place between 1950 and 2000. Spikes in investments occurred particularly around 1900, which was when there was a wave of new imperialism across the continent, followed in the 1960s when many countries across sub-Saharan Africa gained independence. The third spike in investment was in the last decade, particularly since 2013, when we have seen rapid growth in foreign direct investment in Africa under initiatives such as the Belt and Road Initiative. According to the Ernst and Young Africa Attractiveness Survey (2019)45, the largest foreign direct investment (in terms of capital) between 2014–2018 came from China (USD 72,235 million), France (USD 34,172 million), USA (USD 30,885 million), the United Arab Emirates (USD 25,278 million) and the United Kingdom (USD 17,768 million).Fig. 4(a) Temporal evolution of investment in development corridors in Africa. (b) Annual investments per annum in development corridors in Africa (USD maximum, before adjusting for inflation).Full size imageDonors that are funding development corridorsAcross Africa, regional development banks invested the most in development corridors (30.8%), with the African Development Bank funding the majority (24.3%) of all projects. Outside of Africa, the regional development banks that invested in the most projects are the Export-Import Bank of China (n = 13, 3.8%), the European Investment Bank (n = 10, 2.8%) and the Arab Bank for Economic Development in Africa (n = 4, 1.2% ea.). National governments funded about 29.8% of all projects. The Government of Kenya funded the most projects (n = 26; 7.5%), followed by the Governments of Tanzania (n = 7, 2.0%) and South Africa (n = 4, 1.2%). Multilateral banks funded 10.9% of projects – mostly from the World Bank (n = 33, 9.54%) and a few from the International Finance Corporation (n = 4, 1.6%). The international development community funded only 6.1% – of which the OPEC Fund for International Development funded four projects. Private companies continue to invest in a small percentage of development corridors (3.5%), but this is higher than national banks that invest in 3.2%. Regional Economic Community bodies have invested in 15 (4.8%) of all 184 projects. The average number of donors per corridor ranged from one to 12.Weighting of investments by donor typeIn terms capital funded per donor type, Regional Development Banks invested the most (totalling USD 30.72 billion), followed by national governments (USD 20.45 billion). The figure then drops substantially to international development agencies (USD6.17 billion) and multilateral banks (USD 3.76 billion). These results are limited by the fact that we were only able to capture the amount funded delineated by donor type for 22.58% (or USD 70.24 billion) of the minimum of all investments (USD 311.14 billion) before adjusting for inflation.Commodities transportedA total of 147 commodities were captured. The top twenty commodities traded were rice (n = 52, 28.7% of all projects), sugar (27.0%), fish and petroleum (24.3% ea.), passengers (21.6%), textiles (21.1%), maize (19.5%), coffee (18.9%), cement and timber (17.8% ea.) followed by cotton, crude petroleum, vehicle spare parts, beverages, copper, fruit, fertilisers, gold, pharmaceutical products, and tobacco.Beneficiaries and net supplier or receiverApproximately 213 different beneficiaries were identified – predominantly local communities (n = 134 of projects, 72.8%), followed by national and local governments (63.0%), traders (51.1%), agricultural farmers and livestock producers (27.7%), ports (27.2%), industries (25.5%), truck drivers (22.3%), tourists (17.4%), entrepreneurs (12.0%), and logistics companies (11.4%). Almost all (89.1%) of corridors are net receivers and suppliers of commodities, while only 13 (7.1%) are net suppliers and seven are net receivers (3.8%). More

  • in

    Essential oils of plants and their combinations as an alternative adulticides against Anopheles gambiae (Diptera: Culicidae) populations

    WHO. Global plan for insecticide management. (World Health Organisation, Geneva, Switzerland 130, 2012).WHO. Paludisme: situation mondiale. vol. 2507. World Health Organisation, Geneva, Switzerland, (2020).WHO. Procédures pour tester la résistance aux insecticides chez les moustiques vecteurs du paludisme Seconde édition. (World Health Organisation, Geneva, Switzerland, 2017).WHO. Guidelines for Malaria Vector Control. (World Health Organisation, Geneva, Switzerland, 2019).Churcher, T. S., Lissenden, N., Griffin, J. T., Worrall, E. & Ranson, H. The impact of pyrethroid resistance on the efficacy and effectiveness of bednets for malaria control in Africa. Elife 5, 16090 (2016).
    Google Scholar 
    Hemingway, J. et al. Averting a malaria disaster: Will insecticide resistance derail malaria control?. Lancet 387, 1785–1788 (2016).PubMed 
    PubMed Central 

    Google Scholar 
    Dabiré, K. R. et al. Trends in insecticide resistance in natural populations of malaria vectors in Burkina Faso, West Africa: 10 Years surveys K. INTECH 32, 479–502 (2012).
    Google Scholar 
    WHO. WHO Global Malaria Programme: Global Plan for insecticide resistance management. (World Health Organisation, Geneva, Switzerland, 2012).Toe, K. H. et al. Do bednets including piperonyl butoxide offer additional protection against populations of Anopheles gambiae s.l. that are highly resistant to pyrethroids? An experimental hut evaluation in Burkina Faso. Med. Vet. Entomol. 32, 407–416 (2018).CAS 
    PubMed 

    Google Scholar 
    Hien, A. S. et al. Evidence supporting deployment of next generation insecticide treated nets in Burkina Faso: Bioassays with either chlorfenapyr or piperonyl butoxide increase mortality of pyrethroid-resistant Anopheles gambiae. Malar. J. 20, 1–12 (2021).
    Google Scholar 
    Zoubiri, S. & Baaliouamer, A. Potentiality of plants as source of insecticide principles. J. Saudi Chem. Soc. 18, 925–938 (2014).
    Google Scholar 
    Tripathi, A. K., Upadhyay, S., Bhuiyan, M. & Bhattacharya, P. R. A review on prospects of essential oils as biopesticide in insect-pest management. J. Pharmacogn. Phytother. 1, 52–63 (2009).CAS 

    Google Scholar 
    Isman, M. B. Plant essential oils for pest and disease management. Crop Prot. 19, 603–608 (2000).ADS 
    CAS 

    Google Scholar 
    Mossa, A. T. H. Green pesticides: Essential oils as biopesticides in insect-pest management. J. Environ. Sci. Technol. 9, 354–378 (2016).CAS 

    Google Scholar 
    Lucia, A. et al. Larvicidal effect of Eucalyptus grandis essential oil and turpentine and their major components on Aedes aegypti larvae. J. Am. Mosq. Control Assoc. 23, 299–303 (2007).CAS 
    PubMed 

    Google Scholar 
    Singh, R., Koul, O. & Rup, P. J. Toxicity of some essential oil constituents and their binary mixtures against Chilo partellus (Lepidoptera: Pyralidae). Int. J. Tropical Insect Sci. 29, 93–101 (2009).CAS 

    Google Scholar 
    Sarma, R., Adhikari, K., Mahanta, S. & Khanikor, B. Combinations of plant essential oil based terpene compounds as larvicidal and adulticidal agent against Aedes aegypti (Diptera: Culicidae). Sci. Rep. 9, 1–13 (2019).ADS 

    Google Scholar 
    Mansour, S. A., Foda, M. S. & Aly, A. R. Mosquitocidal activity of two Bacillus bacterial endotoxins combined with plant oils and conventional insecticides. Ind. Crops Prod. 35, 44–52 (2012).CAS 

    Google Scholar 
    Yaméogo, F., Wendgida, D. W., Sombié, A., Sanon, A. & Badolo, A. Insecticidal activity of essential oils from six aromatic plants against Aedes aegypti, dengue vector from two localities of Ouagadougou Burkina Faso. Arthropod. Plant. Interact. 15, 627–634 (2021).
    Google Scholar 
    Wangrawa, D. W. et al. Essential oils and their binary combinations have synergistic and antagonistic insecticidal properties against Anopheles gambiae s l. (Diptera: Culicidae). Biocatal. Agric. Biotechnol. 42, 102347 (2022).CAS 

    Google Scholar 
    Drabo, S. F., Olivier, G., Bassolé, I. H. N., Nébié, R. C. & Laurence, M. Susceptibility of MED-Q1 and MED-Q3 biotypes of Bemisia tabaci (Hemiptera: Aleyrodidae) populations to essential and seed oils. J. Econ. Entomol. 110, 1031–1038 (2017).
    Google Scholar 
    N’Guessan, R., Corbel, V., Akogbéto, M. & Rowland, M. Treated nets and indoor residual reduced efficacy of insecticide-pyrethroid resistance area benin. Emerg. Infect. Dis. 13, 199–206 (2007).PubMed 
    PubMed Central 

    Google Scholar 
    WHO. Standard operating procedure for testing insecticide susceptibility of adult mosquitoes in WHO tube tests. (World Health Organisation, Geneva, Switzerland 2022).Abbott, W. S. A method of computing the effectiveness of an insecticide. J. Econ. Entomol. 18, 265–267 (1925).CAS 

    Google Scholar 
    Schelz, Z., Molnar, J. & Hohmann, J. Antimicrobial and antiplasmid activities of essential oils. Fitoterapia 77, 279–285 (2006).CAS 
    PubMed 

    Google Scholar 
    Bassolé, I. H. N. & Juliani, H. R. Essential oils in combination and their antimicrobial properties. Molecules 17, 3989–4006 (2012).PubMed 
    PubMed Central 

    Google Scholar 
    WHO. Test Procedures for Insecticide Resistance Monitoring in Malaria Vector Mosquitoes Second edition. (World Health Organisation, Geneva, Switzerland 2016).Tchoumbougnang, F. et al. Activité larvicide sur Anopheles gambiae giles et composition chimique des huiles essentielles extraites de quatre plantes cultivées au Cameroun. Biotechnol. Agron. Soc. Environ. 13, 77–84 (2009).CAS 

    Google Scholar 
    Ranson, H. & Lissenden, N. Insecticide resistance in African Anopheles mosquitoes: A worsening situation that needs urgent action to maintain malaria control. Trends Parasitol. 32, 187–196 (2016).CAS 
    PubMed 

    Google Scholar 
    Wangrawa, D. et al. Insecticidal activity of local plants essential oils against laboratory and field strains of Anopheles gambiae s. L. (Diptera: Culicidae) from Burkina Faso. J. Econ. Entomol. 111, 2844–2853 (2018).CAS 
    PubMed 

    Google Scholar 
    Gbolade, A. A. & Lockwood, G. B. Toxicity of Ocimum sanctum L. essential oil to Aedes aegypti larvae and its chemical composition. J. Essent. Oil Bearing Plants 11, 148–153 (2008).CAS 

    Google Scholar 
    Vani, R. S., Cheng, S. F. & Chuah, C. H. Comparative study of volatile compounds from genus Ocimum. Am. J. Appl. Sci. 6, 523–528 (2009).CAS 

    Google Scholar 
    Bassolé, et al. Ovicidal and larvicidal activity against Aedes aegypti and Anopheles gambiae complex mosquitoes of essential oils extracted from three spontaneous plants of Burkina Faso. Parasitologia 45, 23–26 (2003).
    Google Scholar 
    Peerzada, N. Chemical composition of the essential oil of Hyptis Suaveolens. Molecules 2, 165–168 (1997).CAS 

    Google Scholar 
    Ilboudo, Z. et al. Biological activity and persistence of four essential oils towards the main pest of stored cowpeas, Callosobruchus maculatus (F.) (Coleoptera: Bruchidae). J. Stored Prod. Res. 46, 124–128 (2010).CAS 

    Google Scholar 
    Zulfikar, A. & Sitepu, F. Y. The effect of lemongrass (Cymbopogon nardus) extract as insecticide against Aedes aegypti. Int. J. Mosq. Res. 6, 101–103 (2019).
    Google Scholar 
    Ojewumi, E. M., Oladipupo, A. A. & Ojewumi, O. E. Oil extract from local leaves an alternative to synthetic mosquito repellants. Pharmacophore 9, 1–6 (2018).
    Google Scholar 
    Gnankiné, O. & Bassolé, I. H. N. Essential oils as an alternative to pyrethroids resistance against Anopheles species complex giles (Diptera: Culicidae). Molecules 22, 1321 (2017).PubMed Central 

    Google Scholar 
    Bossou, A. D. et al. Chemical composition and insecticidal activity of plant essential oils from Benin against Anopheles gambiae (Giles). Parasit. Vectors 6, 337 (2013).PubMed 
    PubMed Central 

    Google Scholar 
    Balboné, et al. Essential oils from five local plants: An alternative larvicide for Anopheles gambiae s. I. (Diptera: Culicidae) and Aedes aegypti (Diptera: Culicidae) control in Western Burkina Faso. Front. Trop. Dis. 3, 853405 (2022).
    Google Scholar 
    Bekele, J. & Hassanali, A. Blend effects in the toxicity of the essential oil constituents of Ocimum kilimandscharicum and Ocimum kenyense (Labiateae) on two post-harvest insect pests. Phytochemistry 57, 385–391 (2001).CAS 
    PubMed 

    Google Scholar 
    Pavela, R. Acute and synergistic effects of some monoterpenoid essential oil compounds on the house fly (Musca domestica). J. Essent. Oil Bearing Plants 11, 451–459 (2008).CAS 

    Google Scholar 
    Tanprasit, P. Biological control of dengue fever mosquitoes (Aedes aegypti Linn.) using leaf extracts of Chan (Hyptis suaveolens (L) poit.) and hedge flower Lantana camara Linn.). (2005).Park, H. M. et al. Larvicidal activity of myrtaceae essential oils and their components against Aedes aegypti, acute toxicity on Daphnia magna, and aqueous residue. J. Med. Entomol. 48, 405–410 (2011).CAS 
    PubMed 

    Google Scholar 
    Burt, S. Essential oils: Their antibacterial properties and potential applications in foods—A review. Int. J. Food Microbiol. 94, 223–253 (2004).CAS 
    PubMed 

    Google Scholar 
    Abbassy, M. A., Abdelgaleil, S. A. M. & Rabie, R. Y. A. Insecticidal and synergistic effects of Majorana hortensis essential oil and some of its major constituents. Entomol. Exp. Appl. 131, 225–232 (2009).CAS 

    Google Scholar 
    Chiasson, H., Bélanger, A., Bostanian, N., Vincent, C. & Poliquin, A. Acaricidal properties of Artemisia absinthium and Tanacetum vulgare (Asteraceae) essential oils obtained by three methods of extraction. J. Econ. Entomol. 94, 167–171 (2001).CAS 
    PubMed 

    Google Scholar 
    Luz, T. R. S. A., deMesquita, L. S. S., Amaral, F. M. M. & Coutinho, D. F. Essential oils and their chemical constituents against Aedes aegypti L. (Diptera: Culicidae) larvae. Acta Trop. 212, 105705 (2020).CAS 
    PubMed 

    Google Scholar 
    Deletre, E., Mallent, M., Menut, C., Chandre, F. & Martin, T. Behavioral response of Bemisia tabaci (Hemiptera: Aleyrodidae) to 20 plant extracts. J. Econ. Entomol. 108, 1890–1901 (2015).
    Google Scholar 
    Berenbaum, M. A. Y. & Neal, J. J. Synergism between myristicin and xanthotoxin, a naturally cooccurring plant toxicant. J. Chem. Ecol. 11, 1349–1358 (1985).CAS 
    PubMed 

    Google Scholar 
    Intirach, J. et al. Chemical constituents and combined larvicidal effects of selected essential oils against Anopheles cracens (Diptera: Culicidae). Psyche (London) https://doi.org/10.1155/2012/591616 (2012).
    Google Scholar 
    Pavela, R. Acute, synergistic and antagonistic effects of some aromatic compounds on the Spodoptera littoralis Boisd. (Lep., Noctuidae) larvae. Ind. Crops Prod. 60, 247–258 (2014).CAS 

    Google Scholar 
    Muturi, E. J., Ramirez, J. L., Doll, K. M. & Bowman, M. J. Combined toxicity of three essential oils against Aedes aegypti (Diptera: Culicidae) larvae. J. Med. Entomol. 54, 1684–1691 (2017).CAS 
    PubMed 

    Google Scholar  More

  • in

    Immune-mediated competition benefits protective microbes over pathogens in a novel host species

    Alizon S, de Roode JC, Michalakis Y (2013) Multiple infections and the evolution of virulence. Ecol Lett 16(4):556–67PubMed 

    Google Scholar 
    Bian G, Zhou G, Lu P, Xi Z (2013) Replacing a native Wolbachia with a novel strain results in an increase in endosymbiont load and resistance to dengue virus in a mosquito vector. PLoS Negl Trop Dis 7(6):e2250PubMed 
    PubMed Central 

    Google Scholar 
    Bjørnstad ON, Harvill ET (2005) Evolution and emergence of Bordetella in humans. Trends Microbiol 13(8):355–9PubMed 

    Google Scholar 
    Bosch TC (2013) Cnidarian-microbe interactions and the origin of innate immunity in metazoans. Annu Rev Microbiol 67:499–518CAS 
    PubMed 

    Google Scholar 
    Bull JJ, Turelli M (2013) Wolbachia versus dengue: Evolutionary forecasts. Evol Med Public Health 2013(1):197–207PubMed 
    PubMed Central 

    Google Scholar 
    Cabreiro F, Gems D (2013) Worms need microbes too: microbiota, health and aging in Caenorhabditis elegans. EMBO Mol Med 5(9):1300–10CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Chen F, Krasity BC, Peyer SM, Koehler S, Ruby EG, Zhang X et al. (2017) Bactericidal permeability-increasing proteins shape host-microbe interactions. mBio 8:e00040–17CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Chrostek E, Pelz-Stelinski K, Hurst GDD, Hughes GL (2017) Horizontal Transmission of Intracellular Insect Symbionts via Plants. Front Microbiol 8:2237PubMed 
    PubMed Central 

    Google Scholar 
    Chrostek E, Teixeira L (2015) Mutualism breakdown by amplification of Wolbachia genes. PLoS Biol 13(2):e1002065PubMed 
    PubMed Central 

    Google Scholar 
    Cisani G, Varaldo PE, Grazi G, Soro O (1982) High-level potentiation of lysostaphin anti-staphylococcal activity by lysozyme. Antimicrob Agents Chemother 21(4):531–5CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Clark LC, Hodgkin J (2014) Commensals, probiotics and pathogens in the Caenorhabditis elegans model. Cell Microbiol 16(1):27–38CAS 
    PubMed 

    Google Scholar 
    Coolon JD, Jones KL, Todd TC, Carr BC, Herman MA (2009) Caenorhabditis elegans genomic response to soil bacteria predicts environment-specific genetic effects on life history traits. PLOS Genet 5:e1000503PubMed 
    PubMed Central 

    Google Scholar 
    Dierking K, Yang W, Schulenburg H (2016) Antimicrobial effectors in the nematode Caenorhabditis elegans: an outgroup to the Arthropoda. Philos Trans R Soc Lond B Biol Sci 371:1695
    Google Scholar 
    Dong Y, Manfredini F, Dimopoulos G (2009) Implication of the mosquito midgut microbiota in the defense against malaria parasites. PLoS Pathog 5(5):e1000423PubMed 
    PubMed Central 

    Google Scholar 
    Drew GC, King KC (2022) More or less? The effect of symbiont density in protective mutualisms. Am Nat 199(4):443–54PubMed 

    Google Scholar 
    Ford SA, Kao D, Williams D, King KC (2016) Microbe-mediated host defence drives the evolution of reduced pathogen virulence. Nat Commun 7:13430CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Ford SA, King KC (2016) Harnessing the Power of Defensive Microbes: Evolutionary Implications in Nature and Disease Control. PLoS Pathog 12(4):e1005465PubMed 
    PubMed Central 

    Google Scholar 
    Ford SA, King KC (2021) In Vivo Microbial Coevolution Favors Host Protection and Plastic Downregulation of Immunity. Mol Biol Evol 38(4):1330–1338CAS 
    PubMed 

    Google Scholar 
    Frank SA (1996) Models of parasite virulence. Q Rev Biol 71(1):37–78CAS 
    PubMed 

    Google Scholar 
    Félix MA, Braendle C (2010) The natural history of Caenorhabditis elegans. Curr Biol 20(22):R965–9PubMed 

    Google Scholar 
    Garsin DA, Sifri CD, Mylonakis E, Qin X, Singh KV, Murray BE et al. (2001) A simple model host for identifying Gram-positive virulence factors. Proc Natl Acad Sci USA 98(19):10892–7CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Gerardo NM, Parker BJ (2014) Mechanisms of symbiont-conferred protection against natural enemies: an ecological and evolutionary framework. Curr Opin Insect Sci 4:8–14PubMed 

    Google Scholar 
    Gravato-Nobre MJ, Hodgkin J (2005) Caenorhabditis elegans as a model for innate immunity to pathogens. Cell Microbiol 7(6):741–51CAS 
    PubMed 

    Google Scholar 
    Habets MG, Rozen DE, Brockhurst MA (2012) Variation in Streptococcus pneumoniae susceptibility to human antimicrobial peptides may mediate intraspecific competition. Proc Biol Sci 279(1743):3803–11CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Heath BD, Butcher RD, Whitfield WG, Hubbard SF (1999) Horizontal transfer of Wolbachia between phylogenetically distant insect species by a naturally occurring mechanism. Curr Biol 9(6):313–6CAS 
    PubMed 

    Google Scholar 
    Heikkilä MP, Saris PE (2003) Inhibition of Staphylococcus aureus by the commensal bacteria of human milk. J Appl Microbiol 95(3):471–8PubMed 

    Google Scholar 
    Hoffmann AA, Ross PA, Rašić G (2015) Wolbachia strains for disease control: ecological and evolutionary considerations. Evol Appl 8(8):751–68PubMed 
    PubMed Central 

    Google Scholar 
    Hope IA (1999) C. elegans: a practical approach. Oxford University Press, Oxford
    Google Scholar 
    Huigens ME, de Almeida RP, Boons PA, Luck RF, Stouthamer R (2004) Natural interspecific and intraspecific horizontal transfer of parthenogenesis-inducing Wolbachia in Trichogramma wasps. Proc Biol Sci 271(1538):509–15CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Jaenike J, Polak M, Fiskin A, Helou M, Minhas M (2007) Interspecific transmission of endosymbiotic Spiroplasma by mites. Biol Lett 3(1):23–5CAS 
    PubMed 

    Google Scholar 
    Kaltenpoth M, Engl T (2014) Defensive microbial symbionts in Hymenoptera. Funct Ecol 28(2):315–27
    Google Scholar 
    King KC (2019) Quick guide: defensive symbionts. Curr Biol 29:R78–R80CAS 
    PubMed 

    Google Scholar 
    King KC, Brockhurst MA, Vasieva O, Paterson S, Betts A, Ford SA et al. (2016) Rapid evolution of microbe-mediated protection against pathogens in a worm host. ISME J 10(8):1915–24CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Kong C, Tan MW, Nathan S (2014) Orthosiphon stamineus protects Caenorhabditis elegans against Staphylococcus aureus infection through immunomodulation. Biol Open 3(7):644–55PubMed 
    PubMed Central 

    Google Scholar 
    Kopylova E, Noé L, Touzet H (2012) SortMeRNA: Fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics 14(24):3211–17
    Google Scholar 
    Koziel J, Potempa J (2013) Protease-armed bacteria in the skin. Cell Tissue Res 351:325–37CAS 
    PubMed 

    Google Scholar 
    Lysenko ES, Ratner AJ, Nelson AL, Weiser JN (2005) The role of innate immune responses in the outcome of interspecies competition for colonization of mucosal surfaces. PLoS Pathog 1(1):e1PubMed 
    PubMed Central 

    Google Scholar 
    Magalhaes T, Bergren NA, Bennett SL, Borland EM, Hartman DA, Lymperopoulos K et al. (2019) Induction of RNA interference to block Zika virus replication and transmission in the mosquito Aedes aegypti. Insect Biochem Mol Biol 111:103169CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Margolis E, Yates A, Levin BR (2010) The ecology of nasal colonization of Streptococcus pneumoniae, Haemophilus influenzae and Staphylococcus aureus: the role of competition and interactions with host’s immune response. BMC Microbiol 10:59PubMed 
    PubMed Central 

    Google Scholar 
    Marra A, Hanson MA, Kondo S, Erkosar B, Lemaitre B (2021) Drosophila Antimicrobial Peptides and Lysozymes Regulate Gut Microbiota Composition and Abundance. mBio 12(4):e0082421CAS 
    PubMed 

    Google Scholar 
    Martinez J, Cogni R, Cao C, Smith S, Illingworth CJ, Jiggins FM (2016) Addicted? Reduced host resistance in populations with defensive symbionts. Proc Biol Sci 283:1833
    Google Scholar 
    Martín-Platero AM, Valdivia E, Ruíz-Rodríguez M, Soler JJ, Martín-Vivaldi M, Maqueda M et al. (2006) Characterization of antimicrobial substances produced by Enterococcus faecalis MRR 10-3, isolated from the uropygial gland of the hoopoe (Upupa epops). Appl Environ Microbiol 72(6):4245–9PubMed 
    PubMed Central 

    Google Scholar 
    Mason KL, Stepien TA, Blum JE, Holt JF, Labbe NH, Rush JS et al. (2011) From commensal to pathogen: translocation of Enterococcus faecalis from the midgut to the hemocoel of Manduca sexta. MBio 2(3):e00065–11PubMed 
    PubMed Central 

    Google Scholar 
    Matthews AC, Mikonranta L, Raymond B (2019) Shifts along the parasite-mutualist continuum are opposed by fundamental trade-offs. Proc Biol Sci 286(1900):20190236CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    May G, Nelson P (2014) Defensive mutualisms: do microbial interactions within hosts drive the evolution of defensive traits? Funct Ecol 28(2):356–63
    Google Scholar 
    Mejía LC, Herre EA, Sparks JP, Winter K, García MN, Van Bael SA et al. (2014) Pervasive effects of a dominant foliar endophytic fungus on host genetic and phenotypic expression in a tropical tree. Front Microbiol 5:479PubMed 
    PubMed Central 

    Google Scholar 
    Mergaert P (2018) Role of antimicrobial peptides in controlling symbiotic bacterial populations. Nat prod Rep. 35(4):336–56CAS 
    PubMed 

    Google Scholar 
    Metcalf CJE, Koskella B (2019) Protective microbiomes can limit the evolution of host pathogen defense. Evol Lett 3:534–43PubMed 
    PubMed Central 

    Google Scholar 
    Montalvo-Katz S, Huang H, Appel MD, Berg M, Shapira M (2013) Association with soil bacteria enhances p38-dependent infection resistance in Caenorhabditis elegans. Infect Immun 81(2):514–20CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Moreira LA, Iturbe-Ormaetxe I, Jeffery JA, Lu G, Pyke AT, Hedges LM et al. (2009) A Wolbachia symbiont in Aedes aegypti limits infection with dengue, Chikungunya, and Plasmodium. Cell 139(7):1268–78PubMed 

    Google Scholar 
    O’Neill SL, Ryan PA, Turley AP, Wilson G, Retzki K, Iturbe-Ormaetxe I et al. (2018) Scaled deployment of Wolbachia to protect the community from Aedes transmitted arboviruses. Gates Open Res 2:36PubMed 

    Google Scholar 
    Oliver KM, Campos J, Moran NA, Hunter MS (2008) Population dynamics of defensive symbionts in aphids. Proc Biol Sci 275(1632):293–9PubMed 

    Google Scholar 
    Oliver KM, Smith AH, Russell JA (2014) Defensive symbiosis in the real world ‘96 advancing ecological studies of heritable, protective bacteria in aphids and beyond. Funct Ecol 28(2):341–55
    Google Scholar 
    Pan X, Pike A, Joshi D, Bian G, McFadden MJ, Lu P et al. (2018) The bacterium Wolbachia exploits host innate immunity to establish a symbiotic relationship with the dengue vector mosquito Aedes aegypti. ISME J 12(1):277–88CAS 
    PubMed 

    Google Scholar 
    Parker BJ, Barribeau SM, Laughton AM, de Roode JC, Gerardo NM (2011) Non-immunological defense in an evolutionary framework. Trends Ecol Evol 26(5):242–8PubMed 

    Google Scholar 
    Pastar I, O’Neill K, Padula L, Head CR, Burgess JL, Chen V et al. (2020) Staphylococcus epidermidis Boosts Innate Immune Response by Activation of Gamma Delta T Cells and Induction of Perforin-2 in Human Skin. Front Immunol 11:550946CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Pees B, Kloock A, Nakad R, Barbosa C, Dierking K (2017) Enhanced behavioral immune defenses in a C. elegans C-type lectin-like domain gene mutant. Dev Comp Immunol 74:237–42CAS 
    PubMed 

    Google Scholar 
    Peleg AY, Tampakakis E, Fuchs BB, Eliopoulos GM, Moellering RC, Mylonakis E (2008) Prokaryote-eukaryote interactions identified by using Caenorhabditis elegans. Proc Natl Acad Sci USA 105(38):14585–90CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Petersen C, Dirksen P, Schulenburg H (2015) Why we need more ecology for genetic models such as C. elegans. Trends Genet 31(3):120–7CAS 
    PubMed 

    Google Scholar 
    Pimentel H, Bray NL, Puente S, Melsted P, Pachter L (2017) Differential analysis of RNA-seq incorporating quantification uncertainty. Nat Methods 14(7):687–90CAS 
    PubMed 

    Google Scholar 
    Portal-Celhay C, Blaser MJ (2012) Competition and resilience between founder and introduced bacteria in the Caenorhabditis elegans gut. Infect Immun 80(3):1288–99CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Raberg L, de Roode JC, Bell AS, Stamou P, Gray D, Read AF (2006) The role of immune-mediated apparent competition in genetically diverse malaria infections. Am Nat 168(1):41–53PubMed 

    Google Scholar 
    Rafaluk-Mohr C, Ashby B, Dahan DA, King KC (2018) Mutual fitness benefits arise during coevolution in a nematode-defensive microbe model. Evol Lett 2(3):246–56PubMed 
    PubMed Central 

    Google Scholar 
    Ragland SA, Criss AK (2017) From bacterial killing to immune modulation: Recent insights into the functions of lysozyme. PLoS Pathog 13(9):e1006512PubMed 
    PubMed Central 

    Google Scholar 
    Rancès E, Ye YH, Woolfit M, McGraw EA, O’Neill SL (2012) The relative importance of innate immune priming in Wolbachia-mediated dengue interference. PLoS Pathog 8(2):e1002548PubMed 
    PubMed Central 

    Google Scholar 
    Raudvere U, Kolberg L, Kuzmin I, Arak T, Adler P, Peterson H et al. (2019) g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res 47(W1):W191–W198CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Raymann K, Shaffer Z, Moran NA (2017) Antibiotic exposure perturbs the gut microbiota and elevates mortality in honeybees. PLoS Biol 15(3):e2001861PubMed 
    PubMed Central 

    Google Scholar 
    Rossouw W, Korsten L (2017) Cultivable microbiome of fresh white button mushrooms. Lett Appl Microbiol 64(2):164–70CAS 
    PubMed 

    Google Scholar 
    Russell JA, Moran NA (2005) Horizontal transfer of bacterial symbionts: heritability and fitness effects in a novel aphid host. Appl Environ Microbiol 71(12):7987–94CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Ryu H, Kim SH, Lee HY, Bai JY, Nam YD, Bae JW et al. (2008) Innate immune homeostasis by the homeobox gene Caudal and commensal-gut mutualism in Drosophila. Science 319:777–82CAS 
    PubMed 

    Google Scholar 
    Sellegounder D, Liu Y, Wibisono P, Chen CH, Leap D, Sun J (2019) Neuronal GPCR NPR-8 regulates C. elegans defense against pathogen infection. Sci Adv 5(11):eaaw4717CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Sifri CD, Begun J, Ausubel FM, Calderwood SB (2003) Caenorhabditis elegans as a model host for Staphylococcus aureus pathogenesis. Infect Immun 71(4):2208–17CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Singh UB, Malviya D, Wasiullah, Singh S, Pradhan JK, Singh BP et al. (2016) Bio-protective microbial agents from rhizosphere eco-systems trigger plant defense responses provide protection against sheath blight disease in rice (Oryza sativa L.). Microbiol Res 192:300–12CAS 
    PubMed 

    Google Scholar 
    Trevelline BK, Fontaine SS, Hartup BK, Kohl KD (2019) Conservation biology needs a microbial renaissance: a call for the consideration of host-associated microbiota in wildlife management practices. Proc Biol Sci 286(1895):20182448PubMed 
    PubMed Central 

    Google Scholar 
    Ulrich Y, Schmid-Hempel P (2012) Host modulation of parasite competition in multiple infections. Proc Biol Sci 279(1740):2982–9PubMed 
    PubMed Central 

    Google Scholar 
    Vaishnava S, Yamamoto M, Severson KM, Ruhn KA, Yu X, Koren O et al. (2011) The antibacterial lectin RegIIIgamma promotes the spatial segregation of microbiota and host in the intestine. Science 334(653):255–8CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Varahan S, Iyer VS, Moore WT, Hancock LE (2013) Eep confers lysozyme resistance to enterococcus faecalis via the activation of the extracytoplasmic function sigma factor SigV. J Bacteriol 195(14):3125–34CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Visvikis O, Ihuegbu N, Labed SA, Luhachack LG, Alves AF, Wollenberg AC et al. (2014) Innate host defense requires TFEB-mediated transcription of cytoprotective and antimicrobial genes. Immunity 40(6):896–909CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Vorburger C, Ganesanandamoorthy P, Kwiatkowski M (2013) Comparing constitutive and induced costs of symbiont-conferred resistance to parasitoids in aphids. Ecol Evol 3(3):706–13PubMed 
    PubMed Central 

    Google Scholar 
    Wang S, Dos-Santos ALA, Huang W, Liu KC, Oshaghi MA, Wei G et al. (2017) Driving mosquito refractoriness to Plasmodium falciparum with engineered symbiotic bacteria. Science 357(6358):1399–1402CAS 
    PubMed 

    Google Scholar 
    Wilke AB, Marrelli MT (2015) Paratransgenesis: a promising new strategy for mosquito vector control. Parasit Vectors 8:342PubMed 
    PubMed Central 

    Google Scholar 
    Wong D, Bazopoulou D, Pujol N, Tavernarakis J, Ewbank J (2007) Genome-wide investigation reveals pathogen-specific and shared signatures in the response of Caenorhabditis elegans to infection. Genome Biol 8:R194PubMed 
    PubMed Central 

    Google Scholar  More

  • in

    Addressing the dichotomy of fishing and climate in fishery management with the FishClim model

    DataSea Surface temperature (1850–2019)Sea Surface Temperature (SST, °C) from 1850 to 2019 originated from the COBE SST2 1° × 1° gridded dataset74, https://psl.noaa.gov/data/gridded/data.cobe2.html. SST data were interpolated on a 0.25° latitude × 0.25° longitude grid on a monthly scale from 1850 to 2019.BathymetryBathymetry (m) came from GEBCO Bathymetric Compilation Group 2019 (The GEBCO_2019 Grid—a continuous terrain model of the global oceans and land). Data are provided by the British Oceanographic Data Centre, National Oceanography Centre, NERC, UK. doi:10/c33m. (https://www.bodc.ac.uk/data/published_data_library/catalogue/10.5285/836f016a-33be-6ddc-e053-6c86abc0788e/). These data were interpolated on a 0.25° latitude × 0.25° longitude grid.Biological dataDaily mass concentration of chlorophyll-a in seawater (mg/m3) originated from the Glob Colour project (http://www.globcolour.info/). The product merges together all the daily data from satellites (MODIS, SeaWIFS, VIIRS) available from September 1997 to December 2019, on a 4 km resolution spatial grid. These data were interpolated on a daily scale on a 0.25° latitude × 0.25° longitude grid. These data were only used to map the average maximum standardised SSB (mdSSB) around the North Sea (Fig. 1a). When long-term changes in mdSSB were examined, we used modelled chlorophyll data (see section “Climate projections” below).Cod recrutment at age 1, Spawning Stock Biomass (SSB) and fishing effort F for 1963–2019 originated from ICES35.We used a plankton index of larval cod survival, which was an update of the index proposed by Beaugrand and colleagues33. Based on data from the Continuous Plankton Recorder (CPR)75, the index is based on the simultaneous consideration of six key biological parameters important for the diet and growth of cod larvae and juveniles in the North Sea:76,77 (i) Total calanoid copepod biomass as a quantitative indicator of food for larval cod, (ii) mean size of calanoid copepods as a qualitative indicator of food, (iii-iv) the abundance of the two dominant congeneric species Calanus finmarchicus and C. helgolandicus, (v) the genus Pseudocalanus and (vi) the taxonomic group euphausiids. A standardised Principal Component Analysis (PCA) is performed on the six plankton indicators for each month from March to September for the period 1958–2017 (table 60 years × 7 months-6 indicators). The plankton index is simply the first principal component of the PCA33.Climate projectionsClimate projections for SST and mass concentration of chlorophyll in seawater (kg m−3) originated from the Coupled Model Intercomparison Project Phase 6 (CMIP6)5 and were provided by the Earth System Grid Federation (ESGF). We used the projections known as Shared Socioeconomic Pathways (SSP) 245 and 585 corresponding respectively to a medium and a high radiative forcing by 2100 (2.5 W m−2 and 8.5 W m−2)78. The daily simulations of four different models (i.e. CNRM-ESM2-1, GFDL-ESM4, IPSL-CM6A-LR, and UKESM1-0-LL) covering the time period 1850–2014 (historical simulation) and 2015–2100 (future projections for the two SSPs scenarios) were used. All the data were interpolated on a 0.25° by 0.25° regular grid. Key references (i.e. DOI and dataset version) are provided in Supplementary Text 1. Long-term changes in modelled SSB were based on these data (including modelled daily chlorophyll data).The FishClim modelLet Kt be the maximum standardised Spawning Stock Biomass (mdSSB hereafter) that can be reached by a fish stock at time t for a given environmental regime φt. Xt+1, standardised SSB (dSSB hereafter) at time t+1 was calculated from dSSB at time t as follows:$${X}_{t+1}={X}_{t}+r{X}_{t}left(1-frac{{X}_{t}}{{K}_{t}}right)-alpha {X}_{t}$$
    (1)
    α is the fishing intensity that varies between 0 (i.e. no fishing) and 1 (i.e. 100% of SSB fished in a year). It is important to note that α (see Eq. (10)) should not be mistaken with ICES fishing effort F79 (calculated from SSB). The second term of Eq. (1) is the intrinsic growth rate of the fish stock that is a function of both Kt and the population growth rate r (r was fixed to 0.5 in most analyses, but see Fig. 3d however where r varied from 0.25 to 0.75). The population growth rate r is highly influenced by the life history traits of a species80 but also by environmental variability54,55,81. Here, the population growth rate was assumed to be constant in space and time and the influence of environmental variability occurred exclusively through its effects on Kt. We made this choice to not multiply the sources of complexity and errors (i.e. population growth rate is very difficult to assess and varies with age80). The third term reflects the part of dSSB that is lost by fishing. Note that natural mortality is not explicitly integrated in Eq. (1) because this process is difficult to assess with confidence at the scale of our study. Here, we assumed that the second term of Eq. (1) implicitly considered this process; when K increases, it is likely that natural mortality diminishes, especially at age 134. We tested this assumption below. Most of the time when fishing occurs, Xt {y}_{{{{rm{opt}}}}}$$
    (3)
    Here yopt= 5.4 °C and t1 and t2 were fixed to 5.7 °C and 4 °C, respectively, so that the thermal niche was close to that assessed by Beaugrand and colleagues31 (Supplementary Fig. 2). This Supplementary Figure compares the thermal response curve we chose in the present study with the data analysed in Beaugrand and colleagues31. The figure shows that the response curve (red curve) is close to the histogram showing the number of geographical cells with a cod occurrence as a function of temperature varying between −2 °C (frozen seawater) and 20 °C.Because t1  > t2, the niche was slightly negative asymmetrical (Supplementary Fig. 1). U1(y) was the first component of mdSSB along the thermal gradient y. c was the maximum value of mdSSB; c was fixed to 1 so that mdSSB varied between 0 and 184,85. y was the value of SST. Slight variations in the different parameters of the niche did not alter either the spatial patterns in the distribution of mdSSB nor the correlations with recruitment.To model the bathymetric niche of cod, we used a trapezoidal function. Changes in mdSSB, U2, along bathymetry, were assessed using four points (θ1, θ2, θ3, θ4):$$begin{array}{cc}{{U}}_2({{z}})=0 & {{{{{{{rm{When}}}}}}; z}}le {{{{rm{theta }}}}}_{1}end{array}$$
    (4)
    $$begin{array}{cc}{{U}}_2({{z}})=frac{z-{theta }_{1}}{{theta }_{2}-{theta }_{1}}c & {{{{{rm{When}}}}}},{{{{rm{theta }}}}}_{1} < {{z}}le {{{{rm{theta }}}}}_{2}end{array}$$ (5) $$begin{array}{cc}{{U}}_2({{z}})={{c}} & {{{rm{When}}}},{{{{rm{theta }}}}}_{2} < {{z}} < {{{{rm{theta }}}}}_{3}end{array}$$ (6) $${{U}}_2begin{array}{cc}(z)=frac{{theta }_{4}-z}{{theta }_{4}-{theta }_{3}}c & {{{rm{When}}}},{{{{rm{theta }}}}}_{3}le {{z}} < {{{{rm{theta }}}}}_{4}end{array}$$ (7) $$begin{array}{cc}{{{rm{U}}}}_2({{z}})=0 & {{{rm{When}}}}; {{{rm{z}}}}ge {{{theta }}}_{4}end{array}$$ (8) With θ2 ≥ θ1, θ3 ≥ θ2 and θ4≥ θ3 and y the bathymetry; θ1 = 0, θ2 = 10−4, θ3 = 200 and θ4 = 600 m (Supplementary Fig. 1). These parameters were retrieved from the litterature86,87. Here also c, the maximum abundance reached by the target species was fixed to 1 and U2 varied between 0 and 1. Trapezoidal niches have been used frequently to model the spatial distribution of fish and marine mammals88,89.The trophic niche was modelled by a rectangular function on a daily basis. To the best of our knowledge, no information on the trophic niche is available. We modelled the trophic niche by fixing U3 to 1 when chlorophyll-a concentration was higher than 0.05 mg m−3 during a minimum period of 15 days and 0 otherwise (Supplementary Fig. 1). This minimum of chlorophyll was implemented as a proxy for suitable food, which has been shown to be important in the North Atlantic for cod recruitment and distribution6,33.There exists two ways to combine the different ecological dimensions of a niche: (i) use an additive or (ii) a multiplicative model82,90. We used a multiplicative model because when one dimension is associated to a nil abundance, the resulting abundance combining all dimensions is also nil in contrast to an additive model; therefore only one unsuitable environmental value may explain a nil abundance. All dimensions were associated to abundance values that varied between 0 and 190.Therefore, maximum dSSB, K, for a given environmental regime E was given by multiplying the three niches (thermal, bathymetric and trophic):$$K=mathop{prod }limits_{i=1}^{p}{U}_{i}$$ (9) where p = 3, the three dimensions of the niche.AnalysesMapping of maximum standardised SSBmdSSB is close to the “dynamic B0” approach; B0 is the SSB in the absence of fishing (generally expressed in tonnes)51 whereas mdSSB is the SSB in the absence of fishing standardised between 0 and 1 and assessed from the knowledge of the niche of the species. We first assessed mdSSB in the North-east Atlantic (around UK) at a spatial resolution of 0.25° latitude × 0.25° longitude on a daily basis from 1850 to 2019. For this analysis, FishClim was run on monthly COBE SST (1850–2019), mean bathymetry and a climatology of daily mass concentration of chlorophyll-a in seawater from the Glob Colour project (see Data section). We then calculated an annual average based on the main seasonal productive period around UK, i.e. from March to October90. Finally, we averaged all years to examine spatial patterns in mean mdSSB (Fig. 1a).Temporal changes in maximum standardised SSBWe assessed average long-term changes in mdSSB in the North Sea (51°N–62°N and 3°W–9.5°E); the annual average was calculated from March to October because this is a period of high production90 . We compared long-term changes in mdSSB with cod recruitment at age 1, a plankton index of larval cod survival based on the period March to October33, and ICES-based SSB35 for 1963-2019 (Fig. 1b–d).Correlation analyses with modelled maximum standardised SSBPearson correlations between long-term changes in mdSSB (average for the North Sea, 51°N–62°N and 3°W–9.5°E) and cod recruitment at age 1 in decimal logarithm35, a plankton index of larval cod survival in the North Sea33, and observed ICES SSB in decimal logarithm35 for the period 1963–2019 were calculated (Fig. 1b–d). The same analysis was performed between assessed fishing intensity α from our FishClim model and fishing effort F35 in the North Sea (Fig. 1e). The probability of significance of the coefficients of correlation was adjusted to correct for temporal autocorrelation91.Assessment of fishing intensity from ICES spawning stock biomassUsing North Sea ICES SSB, we applied Eq. (1) to assess fishing intensity α:$$alpha =1+rleft(1-frac{{X}_{t}}{{K}_{t}}right)-frac{{X}_{t+1}}{{X}_{t}}$$ (10) With Xt+1 and Xt the ICES dSSB (in decimal logarithm). Standardisation of ICES SSB, necessary for this analysis, was complicated because many different kinds of standardisation were achievable so long as X remained strictly above 0 (i.e. full cod extirpation, not observed so far35) and strictly below min(K) (i.e. all black curves always below all points of the blue curve were possible, Supplementary Fig. 3). Indeed, ICES SSB includes exploitation and environmental fluctuations whereas K (i.e. mdSSB) integrates only environmental forcing; the difference is mainly caused by the negative influence of fishing. We chose the black curve (ICES SSB) that maximised the correlation between α (fishing intensity in the FishClim model) and F (ICES fishing effort)35.Reconstruction of long-term changes in ICES spawning stock biomassThe estimation of α allowed us to reconstruct long-term changes in cod (ICES) dSSB and to examine the respective influence of fishing and CIEC by means of Eq. (1) (“Methods”) using four hypothetical scenarios (Fig. 1f). First, we fixed fishing intensity and considered exclusively environmental variations through its influence on dSSB. (i–ii) We assessed long-term changes in dSSB from long-term variation in observed mdSSB (called Kt in Eq. (1)) with a constant level of exploitation fixed to (i) minimum (upper blue curve, i.e. the lowest fishing intensity observed in 1963–2019) or (ii) maximum (lower blue curve, i.e. the highest fishing intensity observed in 1963–2019).Second, we fixed the environmental influence on dSSB and considered variations in fishing intensity. We estimated long-term changes in dSSB from long-term variation in estimated α with a constant mdSSB fixed to (iii) minimum (lower red curve, i.e. the lowest mdSSB observed in 1963–2019) or (iv) maximum (upper red curve, i.e. the highest mdSSB observed in 1963–2019). It was possible to compare long-term changes in reconstructed (ICES) dSSB (thick black curve in Fig. 1f) with these four hypothetical scenarios (Fig. 1f); note that these comparisons were not affected by the choice we made earlier on the standardisation of (ICES) SSB.Quantification of the respective influence of fishing and climate/environment on spawning stock biomassUsing the previous curves, we examined the respective influence of fishing and CIEC on reconstructed (ICES) dSSB (Fig. 2). First, the influence of fishing was investigated by estimating the residuals between reconstructed (ICES) dSSB based on long-term changes in mdSSB (i.e. Kt in Eq. (1)) and α (thick black curves in Fig. 1f) and modelled dSSB based on fluctuating fishing intensity α and invariant K (average of the two red curves in Fig. 1f). This calculation led to the red curve in Fig. 2b. Next, we performed the opposite procedure to examine the influence of CIEC on dSSB (i.e. invariant fishing intensity α based on the two blue curves in Fig. 1f). This calculation led to the blue curve in Fig. 2b.A cluster analysis, based on a matrix years × three time series with (i) long-term changes in reconstructed standardised (ICES) SSBs, (ii) fishing and (iii) CIEC, was performed to identify key periods (vertical dashed lines in Fig. 2). We standardised each variable between 0 and 1 and used an Euclidean distance to assess the year (1963–2019) × year (1963–2019) square matrix so that each variable contributed equally to each association coefficient. We used an agglomerative hierarchical clustering technique using average linkage, which was a good compromise between the two extreme single and complete clustering techniques92. In this paper, we were only interested in the timing between the different time periods (i.e. the groups of years) revealed by the cluster analysis (Fig. 2).We also calculated an index of fishing influence (ε, expressed in percentage) by means of two indicators γ and δ, which were slightly different to the ones we used above. The first one, γ, was modelled dSSB with fluctuating fishing intensity and a constant mdSSB based on the best suitable environment observed during 1963–2019 (only the upper red curve in Fig. 1f; fishing influence). The second one, δ, was modelled dSSB based on fluctuating environment and fishing intensity (black curve in Fig. 1f) on modelled dSSB based on a fluctuating environment but a constant fishing intensity fixed to the lowest value of the time series (only the upper blue curve in Fig. 1f; environmental influence). The index of fishing influence (ε, expressed in percentage) was calculated as follows:$$varepsilon =frac{100gamma }{gamma +delta }$$ (11) For each period of 1963–2019 identified by the cluster analysis, we quantified the influence of fishing (and therefore the environment) using a Jackknife procedure93,94. The resampling procedure recalculated ε by removing each time 1 year of the time period, which allowed us to provide a range of values (i.e. minimum and maximum) in addition to the average value (bar{varepsilon }) calculated for each interval, including the whole period (Fig. 2c).Long-term changes in modelled spawning stock biomass (1850–2019, 2020–2100 and 2020-2300)We modelled mdSSB (Kt in Eq. (1)) using outputs from four Earth System models (ESMs) based on two scenarios of SST/Chlorophyll changes (i.e. SSP245 and SSP585) for the period 1850–2100 (and for one scenario and one ESM until 2300; Fig. 3).For the period 1850–2019, we used daily SST/Chlorophyll changes from the four ESMs to estimate potential changes in mdSSB (thin dashed black curves in Fig. 3a). An average of mdSSB was also calculated (thick green curve in Fig. 3a).For the period 2020–2100, we showed all potential changes in mdSSB based on the four ESMs and both scenarios SSP245 (thin dashed blue curves in Fig. 3a) and SSP585 (thin dashed red curves). An average of mdSSB was also calculated for scenarios SSP245 (thick continuous blue curve) and SSP585 (thick continuous red curve). In addition, we assessed dSSB based on a constant standardised catch fixed to the average of 2008–2019, the last period identified by the cluster analysis (G5, i.e. (alpha X) = 0.03 in Eq. (1)), and the average values of all ESMs for SSP245 (thick dashed blue curve in Fig. 3a) and SSP585 (thick dashed red curve). This analysis was performed to show how a constant catch might alter long-term changes in mdSSB. When Xt (Eq. (1)) reached 0.1, the stock was considered as fully extirpated.Understanding how fishing and climate/environment interact now and in the futureWe modelled dSSB as a function of fishing intensity α and CIEC to show how fishing and the environment interact (Fig. 3b, c). We calculated dSSB for fishing intensity between α = 0 and α = 0.5 every step Ɵ = 0.001 and for mdSSB between K = 0 and K = 1 every step Ɵ = 0.001 to represent values of dSSB as a function of fishing and CIEC. We then superimposed reconstructed ICES dSSB (1963–2019) on the diagram for three periods: 1963–1985 (high SSB), 1986–1999 (pronounced reduction in SSB), and 2000–2019 (low SSB). Maximum standardised SSB for 2020–2100 (or 2300 exclusively for Scenario SSP 585 of IPSL ESM) assessed from four ESMs and scenarios SSP245 and SSP585 were also superimposed. Fishing intensity is unpredictable for 2020–2100 and so we arbitrarily fixed it constant between 0.08 and 0.17 in increments of 0.1 for display purposes, starting by ESMs based on scenario SSP 245 followed by scenario SSP 585 (Fig. 3b). When Xt (Eq. (1)) reached 0.1, the stock was considered as fully extirpated.We calculated an index of sensitivity of dSSB as a function of fishing intensity and CIEC. To do so, we first calculated sensitivity of dSSB to fishing intensity α. Index ζi was calculated at point i from dSSB X and fishing intensity α at i−1 and i+1 (see also Eq. (1)):$$begin{array}{cc}{zeta }_{i}=frac{left|{X}_{i+1}-{X}_{i-1}right|}{left|{alpha }_{i+1}-{alpha }_{i-1}right|} & {{{rm{with}}}},{{{rm{min }}}}(alpha )+{{uptheta }}le ile {{{rm{max }}}}(alpha )-{{uptheta }}end{array}$$ (12) With min(α) = 0, max(α) = 0.5 and Ɵ = 0.001.Similarly, we calculated sensitivity of dSSB to K. Index ηj was calculated at point j from dSSB X and mdSSB K at j−1 and j+1 (see also Eq. (1)):$$begin{array}{cc}{eta }_{j}=frac{left|{X}_{j+1}-{X}_{j-1}right|}{left|{K}_{j+1}-{K}_{j-1}right|} & {{{rm{with}}}},{{{rm{min }}}}left(Kright)+{{{rm{theta }}}}le {{j}}le {{{rm{max }}}}({{{rm{K}}}})-{{uptheta }}end{array}$$ (13) With min(K) = 0, max(K) = 1 and Ɵ = 0.001.Then, we summed the two indices to assess the joint sensitivity of dSSB to fishing intensity Z and mdSSB H:$${{{{bf{I}}}}}_{{{i}},{{j}}}={{{bf{Z}}}}({{{{rm{zeta }}}}}_{{{i}}})+{{{bf{H}}}}({eta }_{{{j}}})$$ (14) Matrix I was subsequently standardised between 0 and 1:$${{{{boldsymbol{I}}}}}^{{{{boldsymbol{* }}}}}=frac{{{{boldsymbol{I}}}}-min ({{{boldsymbol{I}}}})}{max left({{{boldsymbol{I}}}}right)-min ({{{boldsymbol{I}}}})}$$ (15) With I* the matrix of sensitivity of dSSB to fishing intensity and mdSSB standardised between 0 and 1 (Fig. 3c).Number of years needed for recovery after stock collapseWe investigated how the number of years needed for a stock to recover after stock collapse (i.e. dSSB=0.05 in Eq. (1); i.e. 10% of mdSSB) varied as a function of mdSSB (between 0 and 1 by increment of 0.001); this was only influenced by the environmental regime φt and population growth rate r. For this analysis, we fixed a target dSSB of 0.4 (vertical dashed green vertical line in Fig. 3d) and three different values of r: 0.25, 0.5 and 0.75. We simulated a hypothetical moratorium with a fishing intensity α = 0 in Eq. (1).Here, stock collapse was defined as dSSB ≤ 0.1 × mdSSB, i.e. when the dSSB reached less than 10% of the unfished biomass mdSSB. This threshold corresponds to values usually defined in the literature; e.g. Pinsky and colleagues95 defined a collapse when landings are below 10% the average of the five highest landings recorded for more than 2 years, Worm and colleagues69 defined stock collapse when the biomass becomes lower than 10% of the unfished biomass, Andersen96 when it is lower than 20% and Thorpe and De Oliveira67 when it is lower than 10–20%.Potential consequences of fisheries management and climate-induced environmental changesWe examined how fishing and CIEC may affect cod stocks and their exploitation around UK with a focus in the North Sea (Figs. 4, 5). For these analyses, we averaged long-term changes in modelled dSSB corresponding to each scenario (all thin dashed blue and thin red curves in Fig. 3a for SSP245 and 585, respectively). In these analyses, the stock was considered fully extirpated when Xt (Eq. (1)) reached 0.1.Year of cod extirpation for 2020–2100 We estimated year of cod extirpation from 2020 to 2100 in each geographical cell based on (i) a constant fishing intensity (α = 0.04) in time and space, and (ii) an adjusted fishing intensity using the concept of Mean Sustainable Yield (MSY). The choice of α = 0.04 did not alter our conclusions; a lower or a higher value delayed or speed cod extirpation in a predictable way, respectively. In fisheries, MSY is defined as the maximum catch (abundance or biomass) that can be removed from a population over an indefinite period with dX/dt = 0, with X for dSSB and t for time. Despite some criticisms about MSY66, the concept remains a key paradigm in fisheries management35,63. We used this concept to show that controlling fishing intensity delayed cod extirpation. From Eq. (1), we calculated fishing intensity, called αMSYt, so that X remained above XMSYt at all time t:$${alpha }_{{{{{rm{MSY}}}}t}}=rleft(1-frac{{X}_{{{{{rm{MSY}}}}t}}}{{K}_{t}}right)$$ (16) In this analysis, we fixed XMSY t = Kt/2. We assessed ({alpha }_{{{{{rm{MSY}}}}t}}) from Eq. (16) and then estimated dSSB from ({alpha }_{{{{{rm{MSY}}}}t}}) and Kt (based on averaged SSP245 and SSP585) by means of Eq. (1). Although results were displayed at the scale of the north-east Atlantic (around UK), we calculated the difference in year of cod extirpation between scenarios of warming (SSP245 and SSP585) and between scenarios of cod management (constant versus adjusted—MSY— fishing intensity). Differences were presented by means of histograms (Fig. 4). From each histogram, we calculated the median of the differences in year of cod extirpation E97. Pooled standardised catch by 2100 (2020–2100) In term of fishing exploitation, we assessed pooled standardised catch (i.e. pooled dSSB) in 2100 (2020–2100), again for two scenarios of CIEC (SSP245 and 585) and two scenarios of cod management (constant versus adjusted—MSY—fishing intensity; Fig. 5). We then calculated the percentage of reduction in pooled standardised catch caused by fishing or the intensity of warming. Finally, we assessed the median of the percentage of reduction in pooled standardised catch for the North Sea area (51°N–62°N and 3°W–9.5°E). The goal of this analysis was to demonstrate that controlling fishing intensity optimises cod exploitation. Statistics and reproducibilityAll statistical analyses can be reproduced from the equations provided in the text, the cited references or the data available in Supplementary Data.Reporting summaryFurther information on research design is available in the Nature Research Reporting Summary linked to this article. More

  • in

    Genotyping-by-sequencing reveals range expansion of Adonis vernalis (Ranunculaceae) from Southeastern Europe into the zonal Euro-Siberian steppe

    Wesche, K. et al. The Palaearctic steppe biome: A new synthesis. Biodivers. Conserv. 25, 2197–2231 (2016).
    Google Scholar 
    Hurka, H. et al. The Eurasian steppe belt: Status quo, origin and evolutionary history. Turczaninowia 22, 5–71 (2019).
    Google Scholar 
    Willner, W. et al. Formalized classification of semi-dry grasslands in central and eastern Europe. Preslia 91, 25–49 (2019).
    Google Scholar 
    Willis, K. J. & McElwain, J. C. The Evolution Of Plants (Oxford University Press, Oxford, 2002).Suc, J.-P. et al. Reconstruction of Mediterranean flora, vegetation and climate for the last 23 million years based on an extensive pollen dataset. Ecol. Mediterr. 44, 53–85 (2018).
    Google Scholar 
    Strani, F. Impact of Early and Middle Pleistocene major climatic events on the palaeoecology of Southern European ungulates. Hist. Biol. 33, 2260–2275 (2021).
    Google Scholar 
    Chytrý, M. et al. A modern analogue of the Pleistocene steppe-tundra ecosystem in southern Siberia. Boreas 48, 36–56 (2019).
    Google Scholar 
    Manafzadeh, S., Staedler, Y. M. & Conti, E. Visions of the past and dreams of the future in the orient: The Irano–Turanian region from classical botany to evolutionary studies. Biol. Rev. 92, 1365–1388 (2017).PubMed 

    Google Scholar 
    Seregin, A. P., Anačkov, G. & Friesen, N. Molecular and morphological revision of the Allium saxatile group (Amaryllidaceae): Geographical isolation as the driving force of underestimated speciation. Bot. J. Linn. Soc. 178, 67–101 (2015).
    Google Scholar 
    Friesen, N. et al. Dated phylogenies and historical biogeography of Dontostemon and Clausia (Brassicaceae) mirror the palaeogeographical history of the Eurasian steppe. J. Biogeogr. 43, 738–749 (2016).
    Google Scholar 
    Seidl, A. et al. Phylogeny and biogeography of the Pleistocene Holarctic steppe and semi-desert goosefoot plant. Krascheninnikovia ceratoides. Flora 262, 151504 (2020).
    Google Scholar 
    Seidl, A. et al. The phylogeographic history of Krascheninnikovia reflects the development of dry steppes and semi-deserts in Eurasia. Sci. Rep. 11, 6645 (2021).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Žerdoner Čalasan, A., Seregin, A. P., Hurka, H., Hofford, N. P. & Neuffer, B. The Eurasian steppe belt in time and space: Phylogeny and historical biogeography of the false flax (Camelina Crantz, Camelineae, Brassicaceae). Flora 260, 151477 (2019).
    Google Scholar 
    Žerdoner Čalasan, A. et al. Pleistocene dynamics of the Eurasian steppe as a driving force of evolution: Phylogenetic history of the genus Capsella (Brassicaceae). Ecol. Evol. 11, 12697–12713 (2021).PubMed 
    PubMed Central 

    Google Scholar 
    Žerdoner Čalasan, A., German, D. A., Hurka, H. & Neuffer, B. A story from the Miocene: Clock-dated phylogeny of Sisymbrium L. (Sisymbrieae, Brassicaceae). Ecol. Evol. 11, 2573–2595 (2021).PubMed 
    PubMed Central 

    Google Scholar 
    Zaveská, E. et al. Multiple auto- and allopolyploidisations marked the Pleistocene history of the widespread Eurasian steppe plant Astragalus onobrychis (Fabaceae). Mol. Phylogenet. Evol. 139, 106572 (2019).PubMed 

    Google Scholar 
    Franzke, A. et al. Molecular signals for Late Tertiary/Early Quaternary range splits of an Eurasian steppe plant: Clausia aprica (Brassicaceae). Mol. Ecol. 13, 2789–2795 (2004).CAS 
    PubMed 

    Google Scholar 
    Friesen, N. et al. Evolutionary history of the Eurasian steppe plant Schivereckia podolica (Brassicaceae) and its close relatives. Flora 268, 151602 (2020).
    Google Scholar 
    Walter, H. & Straka, H. Arealkunde (Floristisch-Historische Geobotanik), second edition (Ulmer, 1970).Zimmermann, W. 50c. Familie Ranunculáceae in Gustav Hegi, Illustrierte Flora Von Mitteleuropa, Band III, Teil 3, second edition (eds. Rechinger, K. H. & Damboldt, J.) 53–341 (Parey, 1965–1974).Meusel, H., Jäger, E. & Weinert, E. Vergleichende Chorologie Der Zentraleuropäischen Flora, Band I (Gustav Fischer, 1965).Hoffmann, M. H. Ecogeographical differentiation patterns in Adonis sect. Consiligo (Ranunculaceae). Plant Syst. Evol. 211, 43–56 (1998).
    Google Scholar 
    Willner, W. et al. A higher-level classification of the Pannonian and western Pontic steppe grasslands (Central and Eastern Europe). Appl. Veg. Sci. 20, 143–158 (2017).PubMed 

    Google Scholar 
    Lange, D. Conservation and Sustainable Use of Adonis vernalis, a Medicinal Plant in International Trade (Landwirtschaftsverlag, Münster, 2000).Denisow, B., Wrzesień, M. & Cwener, A. Pollination and floral biology of Adonis vernalis L. (Ranunculaceae)—A case study of threatened species. Acta Soc. Bot. Pol. 83, 29–37 (2014).
    Google Scholar 
    Mitrenina, E. Y. et al. Karyotype and genome size in Adonis vernalis and Adonis volgensis. Turczaninowia 25, 5–15 (2022).
    Google Scholar 
    Zhai, W. et al. Chloroplast genomic data provide new and robust insights into the phylogeny and evolution of the Ranunculaceae. Mol. Phylogenet. Evol. 135, 12–21 (2019).CAS 
    PubMed 

    Google Scholar 
    Bobrov, E. G. Genus 540. ADONIS L. In Flora Of The U.S.S.R., Volume VII, Ranales And Rhoeadales (ed. Komarov, V. L.) pp. 403–411 (Academy of Sciences of the U.S.S.R., 1970).Pisareva, V. V. et al. Changes in the landscape and climate of Eastern Europe in the Early Pleistocene. Stratigr. Geol. Correl. 27, 475–497 (2019).ADS 

    Google Scholar 
    Vislobokova, I., Tesakov, A. Early And Middle Pleistocene Of Northern Eurasia. In Encyclopedia Of Quaternary Science, Vol 4, 2nd edn (ed. Elias, S. A.) pp. 605–614 (Elsevier, Amsterdam, 2013).Hirsch, H. et al. High genetic diversity declines towards the geographic range periphery of Adonis vernalis, a Eurasian dry grassland plant. Plant Biol. 17, 1233–1241 (2015).CAS 
    PubMed 

    Google Scholar 
    Kirschner, P. et al. Long-term isolation of European steppe outposts boosts the biome’s conservation value. Nat. Commun. 11, 1968 (2020).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Willner, W. et al. Long-term continuity of steppe grasslands in eastern Central Europe: Evidence from species distribution patterns and chloroplast haplotypes. J. Biogeogr. 48, 3104–3117 (2021).
    Google Scholar 
    Molnár, Z., Biró, M., Bartha, S. & Fekete, G. Past trends, present state and future prospects of Hungarian forest-steppes. In Eurasian Steppes. Ecological Problems And Livelihoods In A Changing World (eds Werger, M. J. A. & van Staalduinen, M. A.) pp. 209–252 (Springer, Dordrecht, 2012).
    Google Scholar 
    Liedtke, H. Die Nordischen Vereisungen in Mitteleuropa, 2nd edn. (Zentralausschuß für deutsche Landeskunde, Trier, 1981).Sizikova, A. O. & Zykina, V. S. The dynamics of the Late Pleistocene loess formation, Lozhok section, Ob loess Plateau, SW Siberia. Quat. Int. 365, 4–14 (2015).
    Google Scholar 
    Zykina, V. S. & Zykin, V. S. Pleistocene warming stages in Southern West Siberia: Soils, environment, and climate evolution. Quat. Int. 106–107, 233–243 (2003).
    Google Scholar 
    Shumilovskikh, L., Sannikov, P., Efimik, E., Shestakov, I. & Mingalev, V. V. Long-term ecology and conservation of the Kungur forest-steppe (pre-Urals, Russia): Case study Spasskaya Gora. Biodivers. Conserv. 30, 4061–4087 (2021).
    Google Scholar 
    Markova, A. & Puzachenko, A. Vertebrate records/Late Pleistocene of Northern Asia. In Encyclopedia of Quaternary Science Vol. 4 (ed. Elias, S. A.) 3158–3175 (Elsevier, Amsterdam, 2007).Gómez, C. & Espadaler, X. An update of the world survey of myrmecochorous dispersal distances. Ecography 36, 1193–1201 (2013).
    Google Scholar 
    Albert, A. et al. Seed dispersal by ungulates as an ecological filter: A trait-based meta-analysis. Oikos 124, 1109–1120 (2015).
    Google Scholar 
    Albert, A., Mårell, A., Picard, M. & Baltzinger, C. Using basic plant traits to predict ungulate seed dispersal potential. Ecography 38, 440–449 (2015).
    Google Scholar 
    Popescu, S.-M. et al. Late Quaternary vegetation and climate of SE Europe–NW Asia according to pollen records in three offshore cores from the Black and Marmara seas. Paleobiodivers. Paleoenviron. 101, 197–212 (2021).
    Google Scholar 
    Markova, A. K., Simakova, A. N. & Puzachenko, A. Y. Ecosystems of Eastern Europe at the time of maximum cooling of the Valdai glaciation (24–18 kyr BP) inferred from data on plant communities and mammal assemblages. Quat. Int. 201, 53–59 (2009).
    Google Scholar 
    Kajtoch, Ł et al. Phylogeographic patterns of steppe species in Eastern Central Europe: A review and the implications for conservation. Biodivers. Conserv. 25, 2309–2339 (2016).
    Google Scholar 
    Kropf, M., Bardy, K., Höhn, M. & Plenk, K. Phylogeographical structure and genetic diversity of Adonis vernalis L. (Ranunculaceae) across and beyond the Pannonian region. Flora 262, 151497 (2020).
    Google Scholar 
    Plenk, K., Bardy, K., Höhn, M., Thiv, M. & Kropf, M. No obvious genetic erosion, but evident relict status at the westernmost range edge of the Pontic-Pannonian steppe plant Linum flavum L. (Linaceae) in Central Europe. Ecol. Evol. 7, 6527–6539 (2017).PubMed 
    PubMed Central 

    Google Scholar 
    Magyari, E. K. et al. Late Pleniglacial vegetation in eastern-central Europe: Are there modern analogues in Siberia? Quat. Sci. Rev. 95, 60–79 (2014).ADS 

    Google Scholar 
    Doležel, J., Greilhuber, J. & Suda, J. Estimation of nuclear DNA content in plants using flow cytometry. Nat. Protoc. 2, 2233–2244 (2007).PubMed 

    Google Scholar 
    Shaw, J., Lickey, E. B., Schilling, E. E. & Small, R. L. Comparison of whole chloroplast genome sequences to choose noncoding regions for phylogenetic studies in angiosperms: The tortoise and the hare III. Am. J. Bot. 94, 275–288 (2007).CAS 
    PubMed 

    Google Scholar 
    Shaw, J. et al. The tortoise and the hare II: Relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. Am. J. Bot. 92, 142–166 (2005).CAS 
    PubMed 

    Google Scholar 
    Heckenhauer, J., Barfuss, M. H. J. & Samuel, R. Universal multiplexable matK primers for DNA barcoding of angiosperms. Appl. Plant Sci. 4, 1500137 (2016).
    Google Scholar 
    White, T. J., Bruns, T., Lee, S. & Taylor, J. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In PCR Protocols: A Guide to Methods and Applications (eds Innis, M. A. et al.) pp. 315–322 (Academic Press, Cambridge, 1990).
    Google Scholar 
    Simmons, M. P. & Ochoterena, H. Gaps as characters in sequence-based phylogenetic analyses. Syst. Biol. 49, 369–381 (2000).CAS 
    PubMed 

    Google Scholar 
    Müller, K. SeqState: Primer design and sequence statistics for phylogenetic DNA datasets. Appl. Bioinformat. 4, 65–69 (2005).
    Google Scholar 
    Swofford, D. L. PAUP*. Software. https://paup.phylosolutions.com/.Kozlov, A., Darriba, D., Flouri, T., Morel, B. & A.,. Stamatakis RAxML-NG: A fast, scalable, and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics 35, 4453–4455 (2019).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Bouckaert, R. et al. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 15, e1006650 (2019).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Heled, J. & Drummond, A. J. Calibrated tree priors for relaxed phylogenetics and divergence time estimation. Syst. Biol. 61, 138–149 (2012).PubMed 

    Google Scholar 
    Drummond, A. J., Ho, S. Y. W., Phillips, M. J. & Rambaut, A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 4, e88 (2006).PubMed 
    PubMed Central 

    Google Scholar 
    Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904 (2018).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Rambaut, A. FigTree. Software. http://tree.bio.ed.ac.uk/software/figtree/.Wendler, N. et al. Unlocking the secondary gene-pool of barley with next-generation sequencing. Plant Biotechnol. J. 12, 1122–1131 (2014).ADS 
    CAS 
    PubMed 

    Google Scholar 
    Eaton, D. A. R. & Overcast, I. ipyrad: Interactive assembly and analysis of RADseq datasets. Bioinformatics 36, 2592–2594 (2020).CAS 
    PubMed 

    Google Scholar 
    Weiß, C. L., Pais, M., Cano, L. M., Kamoun, S. & Burbano, H. A. nQuire: A statistical framework for ploidy estimation using next generation sequencing. BMC Bioinform. 19, 122 (2018).
    Google Scholar 
    Corrêa dos Santos, R. A., Goldman, G. H. & Riaño-Pachón, D. M. ploidyNGS: Visually exploring ploidy with next generation sequencing data. Bioinformatics 33, 2575–2576 (2017).
    Google Scholar 
    Frichot, E. & François, O. LEA: An R package for landscape and ecological association studies. Methods Ecol. Evol. 6, 925–929 (2015).
    Google Scholar 
    R Core Team. R. A language and environment for statistical computing. Software. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.Gruber, B., Unmack, P. J., Berry, O. F. & Georges, A. dartR: An R package to facilitate analysis of SNP data generated from reduced representation genome sequencing. Mol. Ecol. Resour. 18, 691–699 (2018).PubMed 

    Google Scholar 
    Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Minh, B. Q. et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).CAS 
    PubMed 

    Google Scholar 
    Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Huson, D. H. & Bryant, D. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23, 254–267 (2005).PubMed 

    Google Scholar 
    Kalinowski, S. T. HP-RARE 1.0: A computer program for performing rarefaction on measures of allelic richness. Mol. Ecol. Notes 5, 187–189 (2005).CAS 

    Google Scholar  More

  • in

    An isolated population reveals greater genetic structuring of the Australian dingo

    Alvares, F. et al. Old Wolrd Canis spp. with taxonomic ambiguity: Workshop conclusions and recommendations Vairao, Portugal, 28th–30th May 2019. Canid News (Online Edition) (2019).Jackson, S. M. et al. Taxonomy of the dingo: It’s an ancient dog. Aust. Zool. 41, 347–357 (2021).
    Google Scholar 
    Stephens, D., Wilton, A. N., Fleming, P. J. S. & Berry, O. Death by sex in an Australian icon: A continent-wide survey reveals extensive hybridization between dingoes and domestic dogs. Mol. Ecol. 24, 5643–5656 (2015).CAS 
    PubMed 

    Google Scholar 
    Cairns, K. M., Shannon, L. M., Koler-Matznick, J., Ballard, J. W. O. & Boyko, A. R. Elucidating biogeographical patterns in Australian native canids using genome wide SNPs. PLoS ONE 13, e0198754 (2018).PubMed 
    PubMed Central 

    Google Scholar 
    Fleming, P. J. S., Ballard, G. & Cutter, N. There is no Dingo dilemma: legislation facilitates culling, containment and conservation of Dingoes in New South Wales. Aust. Zool. 41, 408–416 (2021).
    Google Scholar 
    Corbett, L. K. The Dingo in Australia and Asia. Second edn, (JB Books Australia, 2001).Newsome, T. M. et al. Making a new dog?. Bioscience 67, 374–381 (2017).
    Google Scholar 
    Wang, G.-D. et al. Out of southern East Asia: the natural history of domestic dogs across the world. Cell Res. 26, 21–33 (2016).PubMed 

    Google Scholar 
    Smith, B. The Dingo Debate: Origins, Behaviour and Conservation. (CSIRO Publishing, 2015).Jackson, S. M. et al. The dogma of dingoes-taxonomic status of the dingo: A reply to Smith et al. Zootaxa 4564, 198–212 (2019).
    Google Scholar 
    Zhang, S. J. et al. Genomic regions under selection in the feralization of the dingoes. Nat. Commun. 11, 671 (2020).ADS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Balme, J. & O’Connor, S. Dingoes and Aboriginal social organization in Holocene Australia. J. Archaeol. Sci. Rep. 7, 775–781 (2016).
    Google Scholar 
    Cairns, K. M. What is a dingo – origins, hybridisation and identity. Aust. Zool. 41(3), 322–337 (2021).
    Google Scholar 
    Allen, B. L. & West, P. Influence of dingoes on sheep distribution in Australia. Aust. Vet. J. 91, 261–267 (2013).CAS 
    PubMed 

    Google Scholar 
    Fleming, P. J. S. in Carnivores of Australia: Past, Present and Future (eds A.S. Glen & C.R. Dickman) Ch. 6, 105–149 (CSIRO Publishing, 2014).Stephens, D. The molecular ecology of Australian wild dogs: hybridisation, gene flow and genetic structure at multiple geographic scales, The University of Western Australia, (2011).Cairns, K. M., Nesbitt, B. J., Laffan, S. W., Letnic, M. & Crowther, M. S. Geographic hot spots of dingo genetic ancestry in southeastern Australia despite hybridisation with domestic dogs. Conserv. Genet. 21, 77–90 (2020).CAS 

    Google Scholar 
    Wilton, A. N., Steward, D. J. & Zafiris, K. Microsatellite variation in the Australian dingo. J. Hered. 90, 108–111 (1999).CAS 
    PubMed 

    Google Scholar 
    Allendorf, F. W., Leary, R. F., Spruell, P. & Wenburg, J. K. The problems with hybrids: Setting conservation guidelines. Trends Ecol. Evol. 16, 613–622 (2001).
    Google Scholar 
    Atkinson, J. An account of the state of agriculture & grazing in New South Wales. (J. Cross, 1826).Massy, C. The Australian Merino: The Story of a Nation (Revised and updated). xxii,1262 (Random House Australia, 2007).Cairns, K. M., Brown, S. K., Sacks, B. N. & Ballard, J. W. O. Conservation implications for dingoes from the maternal and paternal genome: Multiple populations, dog introgression, and demography. Ecol. Evol. 7, 9787–9807 (2017).PubMed 
    PubMed Central 

    Google Scholar 
    Kopelman, N. M., Mayzel, J., Jakobsson, M., Rosenberg, N. A. & Mayrose, I. Clumpak: A program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. 15, 1179–1191 (2015).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Driscoll, C., Yamaguchi, N., O’Brien, S. J. & Macdonald, D. W. A suite of genetic markers useful in assessing wildcat (Felis silvestris ssp.)-domestic cat (Felis silvestris catus) admixture. J. Hered. 102(1), S87–S90 (2011).PubMed 
    PubMed Central 

    Google Scholar 
    Bohling, J. H. & Waits, L. P. Factors influencing red wolf–coyote hybridization in eastern North Carolina USA. Biol. Conserv. 184, 108–116 (2015).
    Google Scholar 
    Fleming, P., Corbett, L., Harden, R. & Thomson, P. in Managing the Impacts of Dingoes and Other Wild Dogs. (Bureau of Rural Sciences, Canberra, 2001).Van Veldhuisen, R. Pipe dreams: A history of water supply in the Wimmera-Mallee (Wimmera Mallee Water, 2001).Newsome, A. The distribution of red kangaroos, Megaleia rufa (Desmarest), about sources of persistent food and water in central Australia. Aust. J. Zool. 13, 289–300 (1965).
    Google Scholar 
    James, C. D., Landsberg, J. & Morton, S. R. Provision of watering points in the Australian arid zone: A review of effects on biota. J. Arid Environ. 41, 87–121 (1999).ADS 

    Google Scholar 
    Robinson, J. A., Brown, C., Kim, B. Y., Lohmueller, K. E. & Wayne, R. K. Purging of strongly deleterious mutations explains long-term persistence and absence of inbreeding depression in island foxes. Curr. Biol. 28, 3487-3494.e3484 (2018).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Benazzo, A. et al. Survival and divergence in a small group: The extraordinary genomic history of the endangered Apennine brown bear stragglers. PNAS 114, E9589–E9597 (2017).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Mattucci, F. et al. Genomic approaches to identify hybrids and estimate admixture times in European wildcat populations. Sci. Rep. 9, 11612 (2019).ADS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Thomson, P. C., Rose, K. & Kok, N. E. The behavioural ecology of dingoes in north-western Australia. VI. Temporary extra-terrestrial movements and dispersal. Wildl. Res. 19, 585–595 (1992).
    Google Scholar 
    Newsome, T. M., Ballard, G.-A., Dickman, C. R., Fleming, P. J. S. & van de Ven, R. Home range, activity and sociality of a top predator, the dingo: A test of the Resource Dispersion Hypothesis. Ecography 36, 914–925 (2013).
    Google Scholar 
    Giglio, R. M., Rocke, T. E., Osorio, J. E. & Latch, E. K. Characterizing patterns of genomic variation in the threatened Utah prairie dog: Implications for conservation and management. Evol. Appl. 14, 1036–1051 (2021).PubMed 

    Google Scholar 
    Conroy, G. C. et al. Conservation concerns associated with low genetic diversity for K’gari–Fraser Island dingoes. Sci. Rep. 11, 9503 (2021).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Frankham, R. Do island populations have less genetic variation than mainland populations?. Heredity 78, 311–327 (1997).PubMed 

    Google Scholar 
    Funk, W. C. et al. Adaptive divergence despite strong genetic drift: genomic analysis of the evolutionary mechanisms causing genetic differentiation in the island fox (Urocyon littoralis). Mol. Ecol. 25, 2176–2194 (2016).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Behrendorff, L. Best-practice dingo management: six lessons from K’gari (Fraser Island). Aust. Zool. 41, 521–533 (2021).
    Google Scholar 
    van Eeden, L. M., Smith, B. P., Crowther, M. S., Dickman, C. R. & Newsome, T. M. ‘The dingo menace’: An historic survey on graziers’ management of an Australian carnivore. Pac. Conserv. Biol. 25, 245–256 (2019).
    Google Scholar 
    Whiting, S. D., Long, J. L., Hadden, K. M., Lauder, A. D. K. & Koch, A. U. Insights into size, seasonality and biology of a nesting population of the Olive Ridley turtle in northern Australia. Wildl. Res. 34, 200–210 (2007).
    Google Scholar 
    Banks, S. C., Hoyle, S. D., Horsup, A., Sunnucks, P. & Taylor, A. C. Demographic monitoring of an entire species (the northern hairy-nosed wombat, Lasiorhinus krefftii) by genetic analysis of non-invasively collected material. Anim. Conserv. 6, 101–107 (2003).
    Google Scholar 
    Parker, H. G. et al. Genomic analyses reveal the influence of geographic origin, migration, and hybridization on modern dog breed development. Cell Rep. 19, 697–708 (2017).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Thackway, R. & Cresswell, I. An Interim Biogeographic Regionalisation for Australia: A Framework for Setting Priorities in the National Reserves System Cooperative Program. Version 4, (Australian Nature Conservation Agency, Reserve Systems Unit, 1995).Bureau of Meteorology & CSIRO. (Bureau of Meteorology, CSIRO and Farmlink, http://www.bom.gov.au/climate/climate-guides/guides/01-Mallee-VIC-Climate-Guide.pdf, 2019).Rowan, J. N. & Downes, R. G. in Soil Conservation Authority of Victoria (ed Brookes, A.C.) 1–55 (Govt. Printer, Melbourne, 1963).Longmire, J. L., Maltbie, M. & Baker, R. J. Use of “lysis buffer” in DNA isolation and its implications for museum collections. Occas. Pap. Mus. Tex. Tech. Univ. 163, 1–3 (1997).
    Google Scholar 
    Tatler, J., Prowse, T. A. A., Roshier, D. A., Cairns, K. M. & Cassey, P. Phenotypic variation and promiscuity in a wild population of pure dingoes (Canis dingo). J. Zool. Syst. Evol. Res. 59, 311–322 (2020).
    Google Scholar 
    Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Francis, R. M. pophelper: An R package and web app to analyse and visualize population structure. Mol. Ecol. Resour. 17, 27–32 (2017).CAS 
    PubMed 

    Google Scholar 
    Wang, J. The computer program structure for assigning individuals to populations: Easy to use but easier to misuse. Mol. Ecol. Resour. 17, 981–990 (2017).CAS 
    PubMed 

    Google Scholar 
    Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software structure: A simulation study. Mol. Ecol. 14, 2611–2620 (2005).CAS 
    PubMed 

    Google Scholar 
    Earl, D. A. & von Holdt, B. M. Structure harvester: A website and program for visualizing structure output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
    Google Scholar 
    Verity, R. & Nichols, R. A. Estimating the number of subpopulations (K) in structured populations. Genetics 203, 1827–1839 (2016).PubMed 
    PubMed Central 

    Google Scholar 
    Jombart, T. adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).CAS 
    PubMed 

    Google Scholar 
    Jombart, T., Devillard, S. & Balloux, F. Discriminant analysis of principal components: A new method for the analysis of genetically structured populations. BMC Genet. 11, 94 (2010).PubMed 
    PubMed Central 

    Google Scholar 
    Keenan, K. et al. diveRsity: An R package for the estimation and exploration of population genetics parameters and their associated errors. Methods Ecol. Evol. 4, 782–788 (2013).
    Google Scholar 
    Peakall, R. & Smouse, P. E. genalex 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 6, 288–295 (2006).
    Google Scholar 
    Peakall, R. & Smouse, P. E. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics 28, 2537–2539 (2012).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Lynch, M. & Ritland, K. Estimation of pairwise relatedness with molecular markers. Genetics 152, 1753–1766 (1999).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    Weir, B. S. & Cockerham, C. C. Estimating F-statistics for the analysis of population structure. Evolution 38, 1358–1370 (1984).CAS 
    PubMed 

    Google Scholar 
    Jost, L. GST and its relatives do not measure differentiation. Mol. Ecol. 17, 4015–4026 (2008).PubMed 

    Google Scholar 
    Do, C. et al. NeEstimator v2: Re-implementation of software for the estimation of contemporary effective population size (Ne) from genetic data. Mol. Ecol. Resour. 14, 209–214 (2014).CAS 
    PubMed 

    Google Scholar 
    Shirk, A. J. & Cushman, S. A. sGD: Software for estimating spatially explicit indices of genetic diversity. Mol. Ecol. Resour. 11, 922–934 (2011).CAS 
    PubMed 

    Google Scholar 
    Schnute, J., Boers, N., Haigh, R. & Couture-Beil, A. Introduction to PBSmapping. (2016). More

  • in

    Sentinel2GlobalLULC: A Sentinel-2 RGB image tile dataset for global land use/cover mapping with deep learning

    To build Sentinel2GlobalLULC, we followed two main steps. First, we established a spatio-temporal consensus between 15 global LULC products for 29 LULC classes. Then, we extracted the maximum number of Sentinel-2 RGB images representing each class. Each image is a tile that has 224 × 224 pixels at 10 × 10 m spatial resolution and was built as a cloud-free composite from all the Sentinel-2 images acquired between June 2015 and October 2020. Both tasks were implemented using GEE, an efficient programming, processing and visualisation platform that allowed us to have free manipulation and access to all used LULC products and Sentinel-2 imagery, simultaneously.Finding spatio-temporal agreement across 15 global LULC productsTo establish the spatio-temporal consensus between different LULC products for each one of the 29 LULC classes, we followed four steps: (1)Identification of the LULC products to be used in the consensus, (2)Standardization and harmonization of the LULC legend that was subsequently used to annotate the image tiles, (3)Spatio-temporal aggregation across LULC products, and (4)Spatial reprojection and tile selection based on optimized spatial purity thresholds.Global LULC products selectionThe adopted purity measure for spatio-temporal agreement across the 15 global LULC products we selected from GEE (Table 2) aims to find areas of high consensus to maximize the annotation quality. Spatial and temporal consensus across such rich diversity of LULC products, in terms of spatial resolution, time coverage, satellite source, LULC classes and accuracy, was used as a source of robustness for our subsequent LULC annotation. Products outside GEE were not used due to computing limitations.Table 2 Main characteristics of the 15 global Land-Use and Land-Cover (LULC) products available in Google Earth Engine (GEE) that were combined to find consensus in the global distribution of 29 main LULC classes.Full size tableStandardization and Harmonization of LULC legendsLand cover (LC) data describes the main type of natural ecosystem that occupies an area; either by vegetation types such as shrublands, grasslands and forests, or by other biophysical classes such as permanent snow, bare land and water bodies. Land use (LU) includes the way in which humans modify or exploit an area, such as urban areas or agricultural fields.To build our 29 LULC classes nomenclature, we established a standardization and harmonization approach based on expert knowledge. During this process, we took into account both the needs of different practitioners in the global and regional LULC mapping field and the thematic resolution of the global LULC legends available in GEE. Our nomenclature consists of 23 LC and 6 LU distinct classes identified through specific consensus rules across 15 LULC products (see Table 4). A six-level (L0 to L5) hierarchical structure was adopted in the creation of these 29 LULC classes (Fig. 2). To facilitate the inter-operability of our 29 legends at the finest level L5 across all LULC products and with the widely used FAO’s hierarchical Land Cover Classification System (LCCS)1, we have established an LULC classification system where the 29 classes can be mapped directly to FAO’s LCCS as explained in the table of Supplementary File 1. The LC part in our dataset contains 20 terrestrial ecosystems and 3 aquatic ecosystems. The terrestrial systems are: Barren lands, Grasslands, Permanent snow, Moss and Lichen lands, Close shrublands, Open shrublands, in addition to 12 Forests classes that differed in their tree cover, phenology, and leaf type. The aquatic classes are: Marine water bodies, Continental water bodies, and Wetlands; furthermore, wetlands were divided into 3 classes: Marshlands, Mangroves and Swamps. The LU part is composed of urban areas and 5 coarse cropland types that differed in their irrigation regime and leaf type. In Table 3, you can find the semantic definition of each one of the 29 classes in Sentinel2GlobalLULC. We provided a table in Supplementary File 2, for a more detailed definition of each LULC class.Fig. 2Tree representation of the six-level (L0 to L5) hierarchical structure of the Land-Use and Land-Cover (LULC) classes contained in the Sentinel2GlobalLULC dataset. Outter circular leafs represent the final or most detailed 29 LULC classes (C1 to C29) of level L5. The followed path to define each class is represented through inner ellipses that contain the names of intermediate classes at different levels between the division of the Earth’s surface (square) into LU and LC (level L0) and the final class circle (level L5). All LULC classes belong to three levels at least, except the 12 forest classes that belong to L5 only.Full size imageTable 3 Semantic signification of each one of the 29 Land Use and Land Cover (LULC) classes contained in the Sentinel2GlobalLULC dataset according to the six-level (L0 to L5) hierarchical structure.Full size tableCombining products across time and spaceFor each one of the 29 LULC classes, we combined in space and time the global LULC information among the 15 GEE LULC products. This way, each image was annotated with a LULC class only if all combined products agreed in its corresponding tile (i.e., 100% of agreement in space and time). For each product and LULC type, we first set one or more criteria to create a global mask at the native resolution of the product in which each pixel was classified as 1 or 0 depending on whether it met the criteria for belonging to that LULC type or not, respectively (see first stage in Table 4). For certain LULC classes, some products did not provide any relevant information, so they were not used. For example (Table 4), in Grasslands (C3), Open Shrublands (C4) and Close Shrublands (C5), we combined 14 products, while in UrbanBlUpArea (C29) and Permanent Snow (C23) we only combined 10 and 7 products, respectively.Table 4 First stage of the rule set criteria used to find consensus across the 15 Land-Use and Land-Cover (LULC) products available in Google Earth Engine (GEE) for each of the 29 LULC classes contained in the Sentinel2GlobalLULC dataset.Full size tableThen (see second stage in Table 5), for each LULC type, we calculated the average of all the masks obtained from each product to create a final global probability map from all products with values ranging between 0 and 1. Value 1 meant that all products agreed to assign that pixel to a particular LULC class, while 0 meant that none of the products assigned it to that particular class (Fig. 3). These 0-to-1 values are interpreted as the spatio-temporal purity level of each pixel to belong to a particular LULC class and are provided as metadata with each image.Table 5 Second stage of the rule set criteria used to find consensus across the 15 Land-Use and Land-Cover (LULC) products available in Google Earth Engine (GEE) for each of the 29 LULC classes contained in the Sentinel2GlobalLULC dataset.Full size tableFig. 3Example of the process of building the final global probability map for one of the 29 Land-Use and Land-Cover (LULC) classes (e.g. C1: “Barren”) by means of spatio-temporal agreement of the 15 LULC products available in Google Earth Engine (GEE). The final map is normalized to values between 0 (white, i.e., areas with no presence of C1 in any product) and 1 (black spots, i.e., areas containing or compatible with the presence of C1 in all 15 products), whereas the shades of grey corresponds to the values in between (i.e., areas that did not contain or were not compatible with the presence of C1 in some of the products). This process is divided into two stages: the first stage (the blue part, see details in Table 4) and the second stage (the yellow part, see details in Table 5). LULC products available for several years are represented with superposed rectangles, while single year products are represented with single rectangles. GMP: global probability map, NA: Not Available.Full size imageAs an example of the first stage (see details in Table 4), to specify if a given pixel belongs to Dense Evergreen Needleleaf Forest, we evaluated its tree cover level using “ ≤ “ and “ ≥ “, while for bands containing the leaf type information, we used the equal operator “ = “. For the spatio-temporal combination of multiple criteria we have used the following operators: “AND”,“OR” and “ADD”. For example, we combined the tree cover percentage criteria with the leaf type criteria using “AND” to select forest pixels that met both conditions. To combine many years instances of the same product, we used “ADD”, except for product P13, where we used “AND” to identify permanent water areas only. Whenever we used the “ADD” operator, we normalized pixel values afterwards to bring it back to a probability interval between 0 and 1 using the division by the total number of combined years or criteria.In the second stage (see details in Table 5), we combined for each LULC class the 15 global probability maps previously derived from each product to create a final global probability map (Fig. 3). This combination was carried out using various operators such as “ADD”, “MULTIPLY” and “OR”, depending on the LULC type. When “ADD” was used, the final pixel values were normalized by dividing the final addition value of each pixel by the total number of added products. The “MULTIPLY” operator was mostly used at the end, to remove urban areas from non-urban LULC classes, or to remove water from non-water LULCs. The multiplication operator was also adopted to make sure that a certain criteria was respected in the final probability map. For instance, for the swamp class, we multiplied all pixels in the final stage by a water mask where saline water areas have a value of 0 in order to eliminate mangrove from swamp pixels and vice versa. Finally, we used “OR” operator between different water related products to take advantage of the fact that they complement each other in terms of spatial-temporal coverage and accuracy.In GEE, when two products are aggregated using “ADD”, “MULTIPLY” or any other operator, the output is aggregated at the spatial resolution of the product at the left of the operator. Hence, to maintain the finest spatial resolution in the final probability map, we multiplied everything by product P15 and placed it at the left of the final “MULTIPLY” operation (See Table 5). Hence, all the 29 final probability maps were generated at the P15 spatial resolution of 30 m/pixel (except the urban class C29 which maintained the 30 m/pixel resolution of product P14).Re-projection and Selection of purity thresholdSince our objective was finding pure Sentinel-2 image tiles of 224 × 224 10-m pixels representing each LULC class, we reprojected the 30 m/pixel probability maps to 2240 m/pixel using the spatial mean reducer in GEE. That is, each pixel value at 2240 m resolution was computed using the mean over all the 30m-pixel values contained within it. Hence, the resulting pixel values at 2240 m resolution represent the purity level that each Sentinel-2 image tile of 224 × 224 10-m pixels has. We illustrated the reprojection and selection processes in Fig. 4.Fig. 4Example of the workflow to obtain a Sentinel-2 image tile of 2240 × 2240 m for one of the 29 Land-Use and Land-Cover (LULC) classes (e.g. C1: “Barren”). The process starts with the reprojected final global probability map obtained from stage two (Table 5) and ends with its exportation to the repository of a Sentinel-2 image tile of 224 × 224 pixels. The white rectangle is the only one having a probability value of 1 (Recall that the purity threshold used for Barren was 1, i.e., 100%). The black pixels has a null probability value, while the probability values between 0 and 1 are represented in gray scale levels.Full size imageFor each one of the reprojected maps, we defined a pixel value threshold to decide whether a given 2240 × 2240 m tile was representative of each LULC class or not. Since training DL image classification models needs a large number of high quality (both in terms of image quality and annotation quality) image tiles to reach a good accuracy, when the spatial purity of 100% (full agreement across products in all the pixels of the 224 × 224 tile) resulted in a small number of agreement tiles for a particular class, the purity threshold was decreased for that class until the number of tiles was larger than 1000 or further decreased in less abundant classes to a minimum of 75% of purity. The found purity value is always provided as metadata for each image in the dataset, so the user can always restrict its analysis to those image tiles and classes at any desired purity level. Decreasing the purity threshold down to 75% for the less abundant classes (e.g swamp, mangrove, etc.) was a trade-off between maintaining a good data annotation quality and providing a sufficient number of tiles in each class. In Table 6, we present the number of agreement tiles found at different purity thresholds ranging from 75% to 100% for each LULC class. This spatial purity was not further decreased since machine learning image classification models are known to be robust when the target class is spatially dominant in each training image (it occupies more than 60% of the pixels in the scene)42. On the other hand, when the number of pure tiles for a LULC class was too large to be downloaded (i.e., greater than 14000), we applied a selection algorithm as described in the Supplementary File 3, to download a maximum of 14000 spatially representative images. For this, the world was divided into a one-degree squared cell grid. If a cell contained less than 50 image tiles, we selected them all. If it contained more than 50, we applied that automatic maximum geographic distance algorithm that selected images as far from each other as possible in a number proportional to the number of existing images in that cell. The map in Fig. 6 shows the global distribution of the selected 194877 image tiles contained in Sentinel2GlobalLULC and distributed in 29 LULC classes.Table 6 Summary of the varying number of found and eventually selected Sentinel-2 image tiles of 224 × 224 pixels depending on the different consensus level reached across the 15 Land-Use and Land-Cover (LULC) products available in Google Earth Engine (GEE) for each of the 29 LULC classes contained in the Sentinel2GlobalLULC dataset.Full size tableData extractionSentinel2GlobalLULC provides the user with two types of data: Sentinel-2 RGB images (jpeg and geotif versions) and CSV files with associated metadata. In the following subsections, we describe the process for associating metadata, including the Global Human Modification (GHM) index.Global human modification index extractionAs an additional metadata related to the level of human influence in each image, we calculated for each tile in GEE, the spatial mean of the global human modification index for terrestrial lands43, where 0 means no human modification and 1 means complete transformation. Since the original GHM product was mapped at 1 × 1 km resolution, we reprojected it to 2240 × 2240 m using the same reprojection procedure explained in (Re-projection and Selection of purity threshold).CSV files generationOnce the tiles were selected, for each LULC class we listed the image tiles in descendent order of purity. Metadata included: geographical coordinates of each tile centroid, tile purity value, name and ID of the LULC class, and average GHM index for that tile. Then, we used the geographical coordinates of each tile to identify its exact administrative address geolocation. To implement this reverse geo-referencing operation, we used a free request-unlimited python module called reverse_geocoder. This way, we assigned a country code, two levels of administrative departments, and the locality to each tile.For LULC classes that had more than 14000 pure tiles, we have released the coordinates before and after the distance-based selection in case the user wants to download more tiles or use our consensus coordinates for other purposes.Sentinel-2 RGB images exportationAfter extracting all these pieces of information and grouping them into CSV files, we went back to the geographic center coordinates of each tile and used them to extract the corresponding 224 × 224 Sentinel-2 RGB tiles using GEE. Each exported image was identical to the 2240 × 2240 m area covered by its Sentinel-2 tile.We chose “Sentinel-2 MSI (Multi-Spectral Instrument) product” since it is free and publicly available in GEE at the fine resolution of 10 × 10 m. We chose “Level-1C” (i.e., top-of-atmosphere reflectance) since it provides the longest data availability of Sentinel-2 images without any modification of the data. To build RGB images, we extracted the three bands B4, B3 and B2 that correspond to Red, Green and Blue channels, respectively. More bands available in Sentinel-2 or even in Sentinel-1 images can be incorporated in the future to our dataset. However, computational limitations (i.e., the size of the dataset would be impractical) did not allowed us to handle it as a first goal. In addition, the spatial resolution of the images would be heterogeneous across bands.To minimize the inherent noise due to atmospheric conditions (e.g. clouds, aerosols, smoke, etc.) that could affect the satellite RGB images, every image was built as a temporal aggregation of all images gathered by Sentinel-2 satellites between June 2015 and October 2020. During this aggregation, only the highest quality images in the corresponding image collection were considered, as we firstly discarded all image instances where the cloud probability exceeded 20% according to the metadata provided in their corresponding Sentinel-2 collection. Then, we calculated the 25th-percentile value between all remaining images for each reflectance band (R, G, and B), and built the final image with the obtained 25-percentile values in each pixel for its RGB bands. The 25th-percentile choice was adopted giving its suitability in atmospheric noise reduction44,45,46,47,48.Usually, Sentinel-2 MSI product includes true colour images in JPEG2000 format, except for the “Level-1C” collection used here. The three original bands (B4, B3, and B2) required a saturation mapping of their reflectance values into 0–255 RGB digital values. Thus, we mapped the saturation reflectance of 3558 into 255 to obtain true RGB channels with digital values between 0 and 255. The choice of these mapping numbers was taken from the Sentinel-2 true colour image recommendations section of Sentinel user guidelines. Finally, after exporting the selected tiles for each LULC class as “.tif” images, we converted them into “.jpeg” format using a lossless conversion algorithm.Technical implementationTo implement all our methodology steps, we first created a javascript in GEE for each LULC class. Each script is a multi-task javascript where we implemented a switch command to control which task we want to execute (between the spatio-temporal aggregation task, the spatial reprojection and tiles selection task, or the data exportation task). In each one of these scripts, we selected from GEE LULC datasets repository the 15 LULC products used to build the consensus of that LULC class. Each script was responsible of elaborating the spatio-temporal combination of the selected products and generating the final consensus map for that LULC class as described in the subsection “Combining products across time and space”. Then, it exports the final global probability map as an asset into GEE server storage to make its reprojection faster. In the same script, once the consensus map exportation was done, we imported it from the GEE assets storage and reprojected it to 2240 × 2240 m resolution; then, we exported the new reprojected map into GEE assets storage again to make its analysis and processing faster. Afterwards, we imported the reprojected map into the same script and applied different processing tasks. During this processing phase, many purity threshold values were evaluated. Then, we elaborated in this same script the pure tiles identification and their center coordinates exportation into a CSV file. A distinct GEE script was developed to import, reproject and export the global GHM map. The resulted GHM map was saved as an asset too, then imported and used in each one of the 29 LULC multi-task scripts.A python script was developed separately to read the exported CSV files for each LULC class and apply the reverse geo-referencing on their pure tiles coordinates then add the found geolocalization data (country code, locality…etc) to the original CSV files as new columns. Then, another python script was implemented to read the new resulted CSV files with all their added columns (reverse geo-referencing data, GHM data) and use the center coordinates of each pure tile in that class to export first its corresponding Sentinel-2 satellite geotiff image within GEE through the python API. Finally, after downloading all the selected geotiff images from our Google drive, we created another python script to convert these geotiff images into JPEG format. More