More stories

  • in

    Correction to: Large-scale protein level comparison of Deltaproteobacteria reveals cohesive metabolic groups

    Author notesNina DombrowskiPresent address: Royal Netherlands Institute for Sea Research, Department of Marine Microbiology and Biogeochemistry, AB Den Burg, The NetherlandsKiley W. SeitzPresent address: EMBL Heidelberg, Meyerhofstraße 1, Heidelberg, GermanyThese authors contributed equally: Marguerite V. Langwig, Valerie De Anda.AffiliationsDepartment of Marine Science, University of Texas at Austin, Marine Science Institute, Port Aransas, TX, USAMarguerite V. Langwig, Valerie De Anda, Nina Dombrowski, Kiley W. Seitz, Ian M. Rambo & Brett J. BakerDepartment of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, AustraliaChris GreeningDepartment of Marine Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC, USAAndreas P. TeskeAuthorsMarguerite V. LangwigValerie De AndaNina DombrowskiKiley W. SeitzIan M. RamboChris GreeningAndreas P. TeskeBrett J. BakerCorresponding authorsCorrespondence to
    Marguerite V. Langwig or Brett J. Baker. More

  • in

    Diversity and compositional changes in the gut microbiota of wild and captive vertebrates: a meta-analysis

    Literature search and data retrievalWe performed a systematic literature search on the internet (Google Scholar, Web of Science) using the following keywords: [gut microbiota], [animal microbiome], [gut microbiome 16S] and [captive AND wild AND microbiota]. This search yielded 222 articles on animal microbiomes published between 2014 and 2020. The materials and methods of these articles were analysed to ascertain whether the study met the following criteria: (i) all wild and captive samples were processed using identical procedures, (ii) compared wild and captive animals were phylogenetically closely related (members of the same species or species complex), (iii) captive individuals were born in captivity, or no information was provided about the origin of the captive animals; i.e., wild animals brought into captivity and sampled some time later were excluded, (iv) captive animals that underwent a deliberate selection process (e.g. inbred mice or domestic animals) were also excluded for considering them genetically not comparable to the wild counterparts, and (v) only datasets with sample sizes over 12 individuals were considered for analysis. Raw data were extracted from the databases and repositories indicated in the articles (accession numbers listed in the “Bioinformatic resources”).Bioinformatic sequencing data processingDatafiles from the different studies were (i) stored at the University of Copenhagen’s Electronic Research Data Repository (ERDA), (ii) assigned a unique study identifier and (iii) re-processed in the Danish National Supercomputer for Life Sciences ‘Computerome2’ using a new bioinformatic pipeline we developed for processing data with different characteristics, including sequencing mode, read length and 16S rRNA gene fragment. The entire code can be found in the “Bioinformatic resources”. In short, for each individual dataset, we quality-filtered (mean phred score of q = 25) and (if necessary) trimmed and merged the paired-end reads based on the sequence overlap using AdapterRemoval224. Primers (if present) were trimmed using Cutadapt25, and reads were dereplicated with USEARCH Derep26 using a relative minimum copy number threshold of 0.01% of the total sequencing depth. Reads were then converted into zero-ratio OTUs using the denoising algorithm UNOISE327, and USEARCH was used to map the reads back to the OTUs and create an OTU table. HS-Blast28 was used to assign taxonomy against the non-redundant Silva 132 database29, and taxonomic assignments were filtered using different identity thresholds for each taxonomic level: 97% for genus-level taxonomy, 95% for family-level taxonomy, 92% for order-level taxonomy and 90% for higher taxonomic levels30. To minimise the impact of incorrectly assigned taxa, taxonomic annotations below these identity thresholds were converted into unclassified, and not considered in downstream analyses. This pipeline yielded relative read abundances assigned to different taxa for each individual dataset analysed.Data quality filteringIndividual data files generated by the aforementioned pipeline were aggregated by study and host species into genus-level abundance tables. The two datasets of Sarcophilus harrisii retrieved from two different studies were processed independently. Taxonomic resolution was limited to the genus level to maximise taxonomic annotation rate and minimise biases introduced by the different 16S rRNA gene markers employed in the analysed studies. On the one hand, wild animals’ microbial communities often contain taxa that do not map to any catalogued species with enough molecular similarity to assign species-level annotation. On the other hand, the analysed datasets were generated based on the V4, V3–V4 and V1–V3 regions of the 16S rRNA gene (Supplementary Dataset), which hindered comparability at the ASV or zOTU level. We then proceeded to quality-filter the genus-level abundance tables of each species through filtering individuals by minimum sequencing depth, minimum diversity coverage and taxonomic annotation. Only individual datasets with more than 1000 reads and diversity coverage values over 99% were retained, and final genus-level abundance tables that contained at least five animals in each contrasting group were considered for analysis. Since the studied datasets contained traces of dietary items and host DNA, read counts assigned to taxonomic groups not assigned to Bacteria genera, or not present in the LTPs132_SSU release of the SILVA Living Tree (https://www.arb-silva.de/projects/living-tree) used for measuring the phylogenetic relationships among bacteria, were removed to ensure accurate measurements of phylogenetic diversities. In the cases where one group (either wild or captive) outnumbered the other, samples were randomly selected to ensure even sample sizes.Diversity and compositional analysesDiversity and compositional analyses were carried out in the R statistical environment v.3.6.331 and Python 3.8 based on the Hill numbers framework. The operations explained below were conducted using the R packages ape32, dendextend33, dmetar34, hilldiv35, meta36, metamicrobiomeR37, phylosignal38, phytools39, treedist40, vegan41, and the python package qdiv42. Hereafter functions and their respective packages are displayed as ‘package::function’. Statistical significance level was set at a FDR-adjusted p-value of 0.05. All charts and figures in the manuscript were originally generated either in R (full code of all original figures is included in “Bioinformatic resources”) and subsequently modified in Adobe Illustrator to achieve the desired layout without distorting the dimensions of the quantitative elements.Hill numbersThe Hill numbers framework encompasses the group of diversity measures that quantify diversity in units of equivalent numbers of equally abundant taxa43,44—in our context bacteria genera. Hill numbers provide a general statistical framework that is sufficiently robust and flexible to address a wide range of scientific questions that molecular ecologists regularly try to answer through measurement, estimation, partitioning and comparison of diversities45. To obtain a complete vision of the gut microbiome differences between wild and captive animals, we conducted all our diversity and compositional analyses based on three contrasting Hill numbers based metrics: the so-called dR, which only accounts for richness (i.e., order of diversity 0, whether bacteria taxa were present or not), dRE which considered Richness + Evenness of order of diversity 1 (i.e., the relative abundances of bacteria are proportionally weighed) and dRER, which considered Richness, + Evenness + Regularity (i.e., the phylogenetic relationships among bacteria are accounted for). Detailed explanations of these metrics can be found elsewhere17,46,47.Phylogenetic treesThe dRER metric required a Bacterial phylogenetic tree to compute the relatedness among bacterial taxa. As our datasets contained different fragments of the 16S rRNA gene, and thus we were unable to generate a phylogenetic tree directly from our DNA sequence data, we relied on the SILVA Living Tree, and used the LTPs132_SSU release as the reference phylogenetic tree. In addition, the time-calibrated host phylogeny required by the host phylogenetic signal and phylosymbiosis analyses was generated using Timetree48.Diversity metrics and meta-analysisWe computed individual-based diversity metrics using the function hilldiv::hill_div, and obtained average alpha diversity metrics per species, as well as wild and captive populations per species. We used a Kruskal–Wallis (KW) test as implemented in the function hilldiv::div_test to ascertain whether the mean diversity values varied across analysed host species, and a PERMANOVA (PMV) test using vegan::adonis function based on the pairwise dissimilarity matrix to test whether host species were compositionally distinct.Average alpha diversity metrics of wild and captive populations per species were used to conduct a random-effects-model (REM) meta-analysis with raw effect sizes using the function meta::metacont. We used the Sidik–Jonkman estimator for the between-study variance and the Knapp–Hartung–Sidik–Jonkman adjustment method. The overall effect was calculated using Hedge’s g (SMD) and its 95% confidence interval and p-value. An identical analysis was performed for the entire dataset and two representative subsets of five species, containing only datasets derived from primates and cetartiodactylans. Higgin’s and Thompson’s I2 test, Tau-squared T2 and Cochran’s Q were used for quantifying the heterogeneity between the included species. Due to the high heterogeneity found in the study, we evaluated whether the between-study heterogeneity was caused by outliers with extreme effect sizes, which could be distorting our overall effect. We defined an outlier if the species’s confidence interval did not overlap with the confidence interval of the pooled effect using dmetar::find.outliers function.The function detected three outliers in dR metric (GOGO, PEMA and TUTR), four in dRE (GOGO, PEMA, MOCH, EQKI) and seven in dRER (RHBR, PYNE, PEMA, TUTR, MOCH, CENI and AIME). Even when these outliers were excluded from the analysis the I2 heterogeneity value was substantial for dR (I2 from 79.3 to 70.3%) and moderate for dRE (I2 from 80.1 to 60.0%) and dRER (I2 from 86.9 to 54.2%) and significant for both (Cochran’s Q, p-value  More

  • in

    Possible impacts of the predominant Bacillus bacteria on the Ophiocordyceps unilateralis s. l. in its infected ant cadavers

    Sample collectionSamples were collected from an evergreen broadleaf forest in central Taiwan (Lianhuachi Experimental Forest, Nantou County, 23°55′7″N 120°52′58″E) from January 2017 to March 2018. Permission to collect plants for the study was obtained from the Lianhuachi Research Center, Taiwan Forestry Research Institute, Council of Agriculture, Executive Yuan, Taiwan (Permission no.: 1062272538). The present study complies with the International Union for Conservation of Nature Policy Statement on Research Involving Species at Risk of Extinction and the Convention on the Trade in Endangered Species of Wild Fauna and Flora. Ant cadavers with fungal growth were collected from understory plants with a height of less than 3 m. Ant cadavers infected with O. unilateralis s. l. were removed carefully by cutting the leaf and placing it into a 50-mL conical centrifuge tube, which was then transported to the laboratory. Only cadavers in which the fungal growth stage preceded the development of perithecia, which theoretically has the highest biological activity, were collected (Fig. 1). In total, 24 infected P. moesta and 20 infected P. wolfi samples were collected.Figure 1Ophiocordyceps unilateralis sensu lato-infected (a) Polyrhachis moesta and (b) P. wolfi, with the stroma growing from the ant cadaver. The specimens were collected from the Lianhuachi Research Center, Taiwan and photographed in the laboratory by Wei-Jiun Lin.Full size imageIsolation and cultivation of bacteriaAnts on the leaves were first identified to species and then, using tweezers, each ant was placed carefully into a sterilized 1.5-mL microcentrifuge tube [see details in Lin et al. (2020)15. Samples were shaken one by one in 600 μL of sterilized water for a few seconds at 3000 revolutions/min (rpm) using a vortex mixer (AL-VTX3000L, CAE technology Co., Ltd., Québec, Canada), and were then soaked with 600 μL of 70% ethanol to sterilize the ant’s surface. The ethanol on the samples was washed twice with 600 μL of sterilized water, then vortexed in 400 μL of sterilized water. Next, 200 μL of the supernatant was spread homogeneously onto a Luria–Bertani (LB) agar plate (25 g Luria–Bertani broth and 15 g agar per liter) to confirm the absence of live bacteria.Bacteria from inside the ant host were released by homogenizing the ant host in 200 μL of water and culturing on LB agar plates at 28 °C for 2 days. Bacteria from each of the ant individuals were cultured independently and approximately equal numbers of the isolates were picked randomly with sterile toothpicks, and were suspended in the LB medium supplemented with 15% v/v glycerol and maintained at − 80 °C until the time of examination. In total, 247 bacterial isolates from P. moesta and 241 bacterial isolates from P. wolfi were collected.In addition to the bacterial isolates from the ant bodies, 60 bacterial isolates from soil, leaves, and air in the same forest were collected for the purpose of comparing their resistance to naphthoquinones (see below) by using the aforementioned procedure but without initial cleaning and sterilizing of the sample surface.Bacterial identificationBacteria collected from the ant hosts were identified by gene marker sequencing. Bacterial isolates were cultured in LB medium at 28 °C overnight to reach the log-phase, and genomic DNA was extracted following the methods described in Vingataramin and Frost (2015)20. Conspecies/strains of the bacterial isolates from the same host were determined using the randomly amplified polymorphic DNA (RAPD) method with the primer 5′-GAGGGTGGCGGTTCT-3′. PCR amplification was performed as follows: initial denaturation at 95 °C for 5 min, 40 cycles of amplification including denaturation at 95 °C for 1 min, annealing at 42 °C for 30 s, and extension at 72 °C for 1 min, followed by a final extension at 72 °C for 10 min. PCR products were run in 2% agarose gel and bacterial isolates were characterized by fragment patterns. For each of the ant hosts, bacterial isolates with the same RAPD pattern were considered to be the same strain. In total, 106 and 178 strains were found from P. moesta and P. wolfi, respectively. One of the bacterial isolates was selected at random to represent the strain and coded with “JYCB” followed by a series of numbers (e.g., JYCB191). Taxonomic status of each strain was determined to species by using the V3/V4 region of the 16S rDNA gene. PCR amplification with the primer set (8F: 5′-AGAGTTTGATCCTGGCTCAG-3′ and 1541R: 5′-AAGGAGGTGATCCAGCCGCA-3′)21,22 was performed under the following conditions: initial denaturation at 95 °C for 5 min, 40 cycles of amplification including denaturation at 95 °C for 1 min, annealing at 55 °C for 30 s, and extension at 72 °C for 1 min 45 s, followed by a final extension at 72 °C for 10 min. PCR products were first checked by running a gel, and were then sequenced at Genomics, Inc. (New Taipei City, Taiwan).The sequences of the bacterial strains from each of the ant hosts were first analyzed by the unweighted pair group method with arithmetic mean (UPGMA) analysis and clustered into clades according to the sequence dissimilarity ( More

  • in

    Endophytic bacterial communities are associated with leaf mimicry in the vine Boquila trifoliolata

    1.Wiens, D. Mimicry in plants. Evol. Biol. 11, 365–403 (1978).
    Google Scholar 
    2.Pasteur, G. A classificatory review of mimicry systems. Annu. Rev. Ecol. Syst. 13, 169–199 (1982).
    Google Scholar 
    3.Barrett, S. C. H. Mimicry in plants. Sci. Am. 257, 76–85 (1987).
    Google Scholar 
    4.Barlow, B. A. & Wiens, D. Host-parasite resemblance in Australian mistletoes: The case for cryptic mimicry. Evolution 31, 69–84 (1977).PubMed 

    Google Scholar 
    5.Ehleringer, J. R. et al. Mistletoes: A hypothesis concerning morphological and chemical avoidance of herbivory. Oecologia 70, 234–237 (1986).ADS 
    CAS 
    PubMed 

    Google Scholar 
    6.Canyon, D. V. & Hill, C. J. Mistletoe host-resemblance: A study of herbivory, nitrogen and moisture in two Australian mistletoes and their host trees. Aust. J. Ecol. 22, 395–403 (1997).
    Google Scholar 
    7.Blick, R. A. J., Burns, K. C. & Moles, A. T. Predicting network topology of mistletoe–host interactions: Do mistletoes really mimic their hosts?. Oikos 121, 761–771 (2012).
    Google Scholar 
    8.Gianoli, E. & Carrasco-Urra, F. Leaf mimicry in a climbing plant protects against herbivory. Curr. Biol. 24, 984–987 (2014).CAS 
    PubMed 

    Google Scholar 
    9.Gianoli, E., Saldaña, A., Jiménez-Castillo, M. & Valladares, F. Distribution and abundance of vines along the light gradient in a southern temperate rainforest. J. Veg. Sci. 21, 66–73 (2010).
    Google Scholar 
    10.Gianoli, E. Eyes in the chameleon vine?. Trends Plant Sci. 22, 4–5 (2017).CAS 
    PubMed 

    Google Scholar 
    11.Gianoli, E. & Molina-Montenegro, M. A. Leaf damage induces twining in a climbing plant. New Phytol. 167, 385–390 (2005).PubMed 

    Google Scholar 
    12.González-Teuber, M. & Gianoli, E. Damage and shade enhance climbing and promote associational resistance in a climbing plant. J. Ecol. 96, 122–126 (2008).
    Google Scholar 
    13.Calder, D. M. Mistletoes in focus: An introduction. In The Biology of Mistletoes (eds Calder, D. M. & Bernhardt, P.) 1–18 (Academic Press, 1983).
    Google Scholar 
    14.Cook, M. E., Leigh, A. & Watson, D. M. Hiding in plain sight: Experimental evidence for birds as selective agents for host mimicry in mistletoes. Botany 98, 525–531 (2020).
    Google Scholar 
    15.Atsatt, P. R. Mistletoe leaf shape: A host morphogen hypothesis. In The Biology of Mistletoes (eds Calder, D. M. & Bernhardt, P.) 259–275 (Academic Press, 1983).
    Google Scholar 
    16.Hall, P. J., Badenoch-Jones, J., Parker, C. W., Letham, D. S. & Barlow, B. A. Identification and quantification of cytokinins in the xylem sap of mistletoes and their hosts in relation to leaf mimicry. Aust. J. Plant Physiol. 14, 429–438 (1987).CAS 

    Google Scholar 
    17.Watson, D. M. Mistletoes of Southern Australia (CSIRO, 2019).
    Google Scholar 
    18.Holopainen, J. K. & Blande, J. D. Molecular plant volatile communication. In Sensing in Nature (ed. López-Larrea, C.) 17–31 (Springer Science, 2012).
    Google Scholar 
    19.Baldwin, I. T., Kessler, A. & Halitschke, R. Volatile signaling in plant–plant–herbivore interactions: What is real?. Curr. Opin. Plant Biol. 5, 351–354 (2002).CAS 
    PubMed 

    Google Scholar 
    20.Heil, M. & Karban, R. Explaining evolution of plant communication by airborne signals. Trends Ecol. Evol. 25, 137–144 (2010).PubMed 

    Google Scholar 
    21.Karban, R., Yang, L. H. & Edwards, K. F. Volatile communication between plants that affects herbivory: A meta-analysis. Ecol. Lett. 17, 44–52 (2014).PubMed 

    Google Scholar 
    22.Coyne, J. A. Fantastic and plastic mimicry in a tropical vine. Why Evolution is True Blog. http://whyevolutionistrue.com/2014/04/26/fantastic-and-plastic-mimicry-in-a-tropical-vine (2014).23.Pannell, J. R. Leaf mimicry: Chameleon-like leaves in a Patagonian vine. Curr. Biol. 24, R357–R359 (2014).CAS 
    PubMed 

    Google Scholar 
    24.Baluška, F. & Mancuso, S. Vision in plants via plant-specific ocelli?. Trends Plant Sci. 21, 727–730 (2016).PubMed 

    Google Scholar 
    25.Richardson, A. O. & Palmer, J. D. Horizontal gene transfer in plants. J. Exp. Bot. 58, 1–9 (2007).CAS 
    PubMed 

    Google Scholar 
    26.Bock, R. The give-and-take of DNA: Horizontal gene transfer in plants. Trends Plant Sci. 15, 11–22 (2010).CAS 
    PubMed 

    Google Scholar 
    27.Yoshida, S., Maruyama, S., Nozaki, H. & Shirasu, K. Horizontal gene transfer by the parasitic plant Striga hermonthica. Science 328, 1128 (2010).ADS 
    CAS 
    PubMed 

    Google Scholar 
    28.Christin, P. A. et al. Adaptive evolution of C4 photosynthesis through recurrent lateral gene transfer. Curr. Biol. 22, 445–449 (2012).CAS 
    PubMed 

    Google Scholar 
    29.Gao, C. et al. Horizontal gene transfer in plants. Funct. Integr. Genomics 14, 23–29 (2014).CAS 
    PubMed 

    Google Scholar 
    30.Diao, X., Freeling, M. & Lisch, D. Horizontal transfer of a plant transposon. PLoS Biol. 4, e5 (2006).PubMed 

    Google Scholar 
    31.El Baidouri, M. et al. Widespread and frequent horizontal transfers of transposable elements in plants. Genome Res. 24, 831–838 (2014).PubMed 
    PubMed Central 

    Google Scholar 
    32.Prentice, H. C., Li, Y., Lönn, M., Tunlid, A. & Ghatnekar, L. A horizontally transferred nuclear gene is associated with microhabitat variation in a natural plant population. Proc. R. Soc. B Biol. Sci. 282, 20152453 (2015).
    Google Scholar 
    33.Yu, A. et al. Dynamics and biological relevance of DNA demethylation in Arabidopsis antibacterial defense. Proc. Natl. Acad. Sci. 110, 2389–2394 (2013).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    34.Takahashi, K. Influence of bacteria on epigenetic gene control. Cell. Mol. Life Sci. 71, 1045–1054 (2014).CAS 
    PubMed 

    Google Scholar 
    35.Ramos-Cruz, D., Troyee, A. N. & Becker, C. Epigenetics in plant organismic interactions. Curr. Opin. Plant Biol. 61, 102060 (2021).CAS 
    PubMed 

    Google Scholar 
    36.Lodewyckx, C. et al. Endophytic bacteria and their potential applications. Crit. Rev. Plant Sci. 21, 583–606 (2002).
    Google Scholar 
    37.Ryan, R. P., Germaine, K., Franks, A., Ryan, D. J. & Dowling, D. N. Bacterial endophytes: Recent developments and applications. FEMS Microbiol. Lett. 278, 1–9 (2008).CAS 
    PubMed 

    Google Scholar 
    38.Barrett, S. C. H. Crop mimicry in weeds. Econ. Bot. 37, 255–282 (1983).
    Google Scholar 
    39.McElroy, J. S. Vavilovian mimicry: Nikolai Vavilov and his little-known impact on weed science. Weed Sci. 62, 207–216 (2014).CAS 

    Google Scholar 
    40.Ye, C.-Y. et al. Genomic evidence of human selection on Vavilovian mimicry. Nat. Ecol. Evol. 3, 1474–1482 (2019).PubMed 

    Google Scholar 
    41.Ruiz, E. Lardizabalaceae. In Flora de Chile Vol. 2 (eds Marticorena, C. & Rodríguez, R.) 24–27 (Universidad de Concepción, 2003).
    Google Scholar 
    42.Muñoz-Schick, M. Flora del Parque Nacional Puyehue (Editorial Universitaria, 1980).
    Google Scholar 
    43.Dorsch K. Hydrogeologische Untersuchungen der Geothermalfelder von Puyehue und Cordón Caulle, Chile. PhD thesis (Ludwig-Maximilians-Universität, 2003).44.Valladares, F., Saldaña, A. & Gianoli, E. Costs versus risks: Architectural changes with changing light quantity and quality in saplings of temperate rainforest trees of different shade tolerance. Austral Ecol. 37, 35–43 (2012).
    Google Scholar 
    45.Salgado-Luarte, C. & Gianoli, E. Shade-tolerance and herbivory are associated with RGR of tree species via different functional traits. Plant Biol. 19, 413–419 (2017).CAS 
    PubMed 

    Google Scholar 
    46.Salgado-Luarte, C. & Gianoli, E. Herbivory on temperate rainforest seedlings in sun and shade: Resistance, tolerance and habitat distribution. PLoS One 5, e11460 (2010).ADS 
    PubMed 
    PubMed Central 

    Google Scholar 
    47.Salgado-Luarte, C. & Gianoli, E. Herbivores modify selection on plant functional traits in a temperate rainforest understory. Am. Nat. 180, E42–E53 (2012).PubMed 

    Google Scholar 
    48.Sun, B. Y., Stuessy, T. F., Humaña, A. M., Riveros, G. M. & Crawford, D. J. Evolution of Rhaphithamnus venustus (Verbenaceae), a gynodioecious hummingbird-pollinated endemic of the Juan Fernandez Islands, Chile. Pac. Sci. 50, 55–65 (1996).
    Google Scholar 
    49.Saldaña, A. & Lusk, C. H. Influencia de las especies del dosel en la disponibilidad de recursos y regeneración avanzada en un bosque templado lluvioso del sur de Chile. Rev. Chil. Hist. Nat. 76, 639–650 (2003).
    Google Scholar 
    50.Gut, B. Árboles-Trees Patagonia. Árboles nativos e introducidos en Patagonia (Vázquez Mazzini, 2017).
    Google Scholar 
    51.Sahu, S. K., Thangaraj, M. & Kathiresan, K. DNA extraction protocol for plants with high levels of secondary metabolites and polysaccharides without using liquid nitrogen and phenol. ISRN Mol. Biol. 2012, 205049 (2012).PubMed 
    PubMed Central 

    Google Scholar 
    52.Lundberg, D. S., Yourstone, S., Mieczkowski, P., Jones, C. D. & Dangl, J. L. Practical innovations for high-throughput amplicon sequencing. Nat. Methods 10, 999–1002 (2013).CAS 
    PubMed 

    Google Scholar 
    53.Schloss, P. D. et al. Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 75, 7537–7541 (2009).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    54.Quast, C. et al. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596 (2012).PubMed 
    PubMed Central 

    Google Scholar 
    55.Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naïve bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 73, 5261–5267 (2007).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    56.Borcard, D., Gillet, F. & Legendre, P. Numerical Ecology with R (Springer, 2011).MATH 

    Google Scholar  More

  • in

    Integrate geographic scales in equity, diversity and inclusion

    1.McGill, B. M. et al. Ecol. Evol. 11, 3636–3645 (2021).Article 

    Google Scholar 
    2.Röβler, D. C., Lötters, S. & Da Fonte, L. F. M. Nature 584, 525–525 (2020).
    Google Scholar 
    3.Nuñez, M. A. et al. J. Appl. Ecol. 56, 4–9 (2019).Article 

    Google Scholar 
    4.Nuñez, M. A., Chiuffo, M. C., Pauchard, A. & Zenni, R. D. Trends Ecol. Evol. 36, 766–769 (2021).Article 

    Google Scholar 
    5.Maas, B. et al. Conserv. Lett. 14, e12797 (2021).Article 

    Google Scholar 
    6.Khelifa, R., Mahdjoub, M., Baaloudj, A. & Chaib, S. Facets https://doi.org/10.1139/facets-2021-0073 (in press).7.Haelewaters, D., Hofmann, T. A. & Romero-Olivares, A. L. PLoS Comput. Biol. 17, e1009277 (2021).CAS 
    Article 

    Google Scholar  More

  • in

    Incrimination of shrews as a reservoir for Powassan virus

    Samples of host-seeking nymphal black-legged ticks were collected during 2018–2020 from Massachusetts and Rhode Island sites where DTV is enzootic (Fig. 1). Individual DTV-infected ticks were identified by RT-PCR; all ticks (including virus-negative ticks) were also analyzed for B. burgdorferi infection by PCR. Host bloodmeal remnant identification using assays targeting family and order specific retrotransposons was performed as described11, with the addition of newly described primers (see Table 1). Assays targeted likely mammalian reservoir hosts within our study sites.Fig. 1: Map of the field sites included in this study.Ticks were collected from two sites in Washington County, Rhode Island, MB and Trust, as well as, from three islands off the coast of Massachusetts: Nantucket, Martha’s Vineyard, and Naushon Island.Full size imageTable 1 Primers and probes targeting mammalian retrotransposons used in the study for bloodmeal identification in ticks.Full size tableWe identified 20 nymphal ticks that contained DTV RNA from 13 different sites (prevalence, 0.4–7%, Table 2) and confirmed viral identity by sequencing a 248 bp section of the NS5 gene and 286 bp section of the envelope gene. Cognate viral sequences from these ticks were assigned to the DTV lineage (Fig. 2). Sequences from ticks collected from field sites in close geographic proximity often clustered together. Borrelia burgdorferi prevalence was more variable, ranging from 0 to 21%., and B. burgdorferi infection was not associated with DTV infection (p = 0.5, Fig. 3), as sites with high numbers of ticks infected with spirochetes were not the same as those that had high numbers of DTV-infected ticks (Table 2).Table 2 Infection rate of deer tick virus (DTV) and Borrelia burgdorferi in ticks at each study site.Full size tableFig. 2: Maximum likelihood tree of deer tick virus (DTV) detected in this study.A 248 bp piece of the NS5 gene and the 286 bp piece of the envelope gene were sequenced from each positive tick in the study, as well as the positive shrew. These pieces were concatenated and aligned with deer tick virus (DTV) and Powassan virus (POW) sequences downloaded from GenBank (GenBank numbers are listed on the tree). A maximum likelihood tree was then created using MEGAX.Full size imageFig. 3: Correlation analysis of the percentage of ticks that fed on shrews compared to the percentage of infected ticks at our field sites.The B. burgdorferi (Borr) data are shown in panel a and deer tick virus (DTV) data are shown in panel b. The percentage of DTV (n = 20, p = 0.01), but not B. burgdorferi (n = 128, p = 0.5), in ticks at a site is associated with the percentage of ticks that fed on shrews.Full size imageThe source of the infectious larval bloodmeal was identified from 16 of the 20 DTV-infected ticks (80%), 13 of which were identified as shrews (65%) (Table 3). The other DTV-infected ticks had fed on diverse hosts such as bird, squirrel, and cat. One tick showed evidence of having fed on multiple hosts (shrew and deer). None of the ticks had fed on a mouse. We conclude that in our sites, during the years that we sampled, larval ticks feeding on shrews were more likely to be infected by DTV than by feeding on any other animal. Using the 0.1% estimated rate of transmission of adult female ticks to larval progeny for the related tick-borne encephalitis virus12, we calculated that up to four ticks (95% binomial confidence interval of 0 to 0.4% of ticks) from our study could derive from inheritance. Thus, we cannot exclude this as the source of the single infected ticks derived from a bird, squirrel, and cat. However, more than four infected ticks were derived from shrews, suggesting that inheritance alone cannot explain the apparent association.Table 3 Bloodmeal host identified from deer tick virus-infected ticks from each study site.Full size tableDuring the years that we sampled our study sites, mice did not contribute as many larval bloodmeals as might be expected13,14. The proportion of nymphal ticks that fed on mice ranged from 2 to 20% (median 10.5%) (Table 4). Our previous publication identified sites where the majority of ticks had fed on mice (Nantucket 2018, 100%, and Robin’s Island 2018 and 2019, 91% and 53%, respectively11), but DTV was not identified from these collections. Squirrels, or other Sciuridae, contributed ticks at only two sites (median host contribution, 1%). In contrast, shrews were common hosts at our study sites, with a median host contribution of 40.5% (range, 0–68%). The proportion of nymphal ticks that fed on shrews as larvae at a site was associated with the prevalence of DTV infection in ticks at that site (R2 = 0.44 p = 0.01, Fig. 3b), but not the prevalence of B. burgdorferi (R2 = 0.04, p = 0.5, Fig. 3a). DTV-infected nymphs were highly likely to had fed on a shrew (OR = 139, 95% confidence interval 42–456, but not a mouse, squirrel (or other Sciuridae) or other host (Fig. 4a). By contrast, B. burgdorferi-infected ticks were likely to have fed on mice, but not shrews (OR = 1.1, 95% confidence interval 0.6–1.9) (Fig. 4b). This excludes the hypothesis that shrews were found to have served as virus sources simply because these hosts were the dominant host in these sites.Table 4 The percentage of ticks at each site that tested positive for having fed on either a shrew (Soricidae), mouse (Peromyscus), squirrel (Sciuridae), or all other hosts tested (Odocoileus, Aves, Felis, Arvicolinae, or Lagomorpha).Full size tableFig. 4: The likelihood that an infected tick had fed on either a shrew, mouse, squirrel (or other Sciuridae), or other host.The data for deer tick virus-infected ticks are shown in panel a, and the data for B. burgdorferi-infected ticks are shown in panel b. Data are represented by boxplots of odds ratios (OR) with 95% confidence intervals, and all field sites are combined (n = 20 deer tick virus-infected ticks, n = 128 B. burgdorferi-infected ticks). A line is drawn at OR = 1, and any confidence interval that crosses it is not statistically significant. Sqrl= squirrel (or other Sciuridae).Full size imageThree B. brevicauda shrews were trapped from two of our study sites in September of 2020. DTV was detected in the brain of one shrew. Attempts to isolate virus by suckling mouse inoculation failed. Sequencing of two gene targets demonstrates greatest similarity to virus found in a tick from the same site (Fig. 2) and not to standard laboratory strains.DTV, like other tick-borne encephalitis viruses, may be perpetuated by three mechanisms15. Virus may be inherited by the tick, transovarial transmission16. We found that a greater number of ticks were associated with a specific host from all study sites than expected by vertical transmission, indicating that these ticks were not likely to have inherited the infection. There may be co-feeding or nonsystemic transmission in which an infected tick may serve as the direct source of infection for uninfected ticks attached to the skin around it, with no requirement for hematogenous viral dissemination17. Finally, horizontal transmission, in which a larval tick acquires infection from a viremic vertebrate host, requires a reservoir host that is susceptible to infection and allows for sufficient viremia to infect ticks as well as being sufficiently infested by the tick vector16. We focused solely on host-seeking nymphal ticks because they would only have one bloodmeal source, that of the larvae. Although adult ticks are also infected by DTV, they would have had two opportunities to become infected (a bloodmeal during the larval as well as the nymphal stage) and it would not be possible to determine whether the bloodmeal host that was identified from an adult was the source of the virus. Accordingly, we did not analyze adult ticks. Our analysis thus incriminates horizontal transmission between shrews and larval ticks, but we cannot exclude co-feeding transmission.Shrews (likely Blarina brevicauda, the most common shrew in our study sites; our retrotransposon assay, however, may also detect Sorex spp.) were the larval bloodmeal host for the majority (65%) of DTV-infected ticks. The infected ticks were collected from eight different sites over the course of three field seasons, indicating that the finding is not spatiotemporally specific. Although our sample size is small, the positive association between the proportion of shrew-fed ticks and the prevalence of DTV infection in ticks also supports a general finding; no association was found between DTV-infected ticks and either mouse-fed or Sciuridae-fed ticks. Finally, we detected virus in the brain of a shrew and find that it is genetically similar to virus within ticks from that site. Shrews are thus the main candidate for the vertebrate DTV reservoir but we cannot now rank the contribution of horizontal transmission relative to other modes of perpetuation. Shrews may be more likely to sustain an infectious viremia, or be more likely to simultaneously serve as host to nymphs and larvae (co-feeding), than the other mammals present in our study sites. Virus has been detected from xenodiagnostic ticks removed from skunks, raccoons and opossum in New York18. As with other tick-transmitted infections, contributions to the DTV enzootic cycle are likely to be dependent on local conditions and other hosts than shrews may contribute to maintenance. However, the association of shrews with DTV-infected ticks across multiple transmission seasons and across diverse sites, suggests that additional studies of shrews would be useful. Further investigations, including laboratory transmission studies are necessary to quantify the reservoir capacity of these hosts.Shrews have not previously been suggested as reservoir hosts for DTV or POWV, but they appear to be competent reservoirs for the related TBE virus in Eurasia19,20,21. When DTV was identified, white-footed mice were considered the likely reservoir given that these rodents maintained the tick-transmitted agents of Lyme disease, babesiosis, and human granulocytic anaplasmosis10,22. Shrews were considered to be poorly infested by ticks and thus were considered to have lower reservoir capacity for B. burgdorferi and B. microti23; this suggestion has been reconsidered24,25. Mammal surveys in DTV endemic sites have failed to detect virus or specific antibody in shrews18. Our use of host bloodmeal remnant analysis on infected ticks directly identified the source of the infecting animal reservoir without needing to extrapolate from indirect evidence such as comparative host density, tick infestation indices, and prevalence of pathogen exposure, and could be used to better understand the mode of perpetuation of other high consequence tick-borne pathogens such as the rickettsial agent of Rocky Mountain spotted fever, or those causing American tick-borne hemorrhagic fevers (Bourbon or Heartland virus). More

  • in

    Successful microbial colonization of space in a more dispersed manner

    Simulating competition for space using the “BacGo” modelTo investigate how spatial positioning of populations affects the outcome of microbial competition, we simulated two populations competing for space with a limiting size by building an individual-based model (named “BacGo”). The model was implemented in discrete grid boxes of a 20 × 20 array. As shown in Fig. 1a, our simulations were based on three basic assumptions. First, the two competing populations possess the same inherent growth rate and equal initial cell numbers, thus the only differences between them are their manners of colonizing free space. Second, the newly born daughter cell is located around its mother cell but with a random direction of spatial positioning [34], resulted in a microcolony with different spatial patterning. Lastly, if the selected box has been occupied, the newborn cell will compete for the box against the original occupants of the box and possesses a probability of 50% to survive [37].We first explored the outcome of spatial competition, which started by randomly distributing two populations on the grids with the same initial cell numbers of 10 for each. Based on our basic assumptions and the predictions of competitive exclusion theory [38], we hypothesized that only one population could win the competition and finally occupy all grids. As shown in 20,000 independent simulations with random initial distributions, we discovered that at the end of each simulation, only one population survived (Video S1 and Video S2). The Chi-square test showed no significant difference (P = 0.211) between the simulated winning times (10,177 of 20,000 simulations) and the random winning times (10,051 of 20,000 simulations) of the focus population. This result conformed with our initial assumption that cells possess a probability of 50% to survive in competing with original occupants. When we replicated simulations initiated with the same cell distribution, we found that the winning probabilities for each population changed in line with the initial distributions (Fig. S1). However, the winning probabilities never reached 100% no matter how the initial distribution changes. Together, these results suggested that unknown random factors may affect the final outcome of the competition.Next, we analyzed the dynamics of microbial colonization during our simulations. As summarized in Fig. 1b, we divided the competition process into two stages, the “occupation stage” and the “exclusion stage” (see Methods). To statistically characterize the competitive outcome at t3, we defined the winning asymmetry index, WinR, and the abundance asymmetry index, AbunR (see Methods). As shown in Fig. S2a, we found a strong positive correlation (R2 = 0.740, P  More

  • in

    Photoheterotrophy by aerobic anoxygenic bacteria modulates carbon fluxes in a freshwater lake

    SamplingWater was sampled from Cep lake in Czechia, at a regular sampling site of 10 m depth (48.944 °N, 14.877 °E). The lake originates from sand mining in the 1970–80s. It is a permanent meso-oligotrophic (chlorophyll-a concentrations ranged from 1.4–16.4 µg L−1) seepage reservoir filled with groundwater penetrating from the nearby river Lužnice. The lake area is about 1.16 km2, with the maximum depth about 11–12 m. These characteristics are representative for most of temperate and boreal lakes [16].Samples were collected every four weeks from April till October in 2018, and from April till November in 2019. Ten liters of water were collected from 0.5 m depth using a Ruttner Water Sampler (model 11.003KC Denmark AS). Temperature and oxygen profiles were taken with an EXO1 multi-parameter probe (YSI Inc., Yellow Springs, USA). Water was transported to the laboratory within 2 h from the collection in a closed container made from high-density polyethylene, rinsed three times with the sampled water and stored in a cooled box.NutrientsSamples were filtered through glass fiber filters with 0.4 µm nominal porosity (GF-5, Macherey-Nagel, Düren, Germany). Concentrations of soluble reactive phosphorus (SRP) were determined spectrophotometrically [17, 18]. Concentrations of nitrate and ammonium were measured according to Procházková [19] and Kopáčkek and Procházková [20]. Dissolved organic carbon (DOC) and dissolved nitrogen (DN) were measured with a TOC 5000 A analyzer (Shimadzu, Kyoto, Japan).PigmentsSeston from 1.43 to 3.65 L of water was collected onto GF-5 glass fiber filters (diameter 47 mm, Macherey-Nagel). The filters were dried of excess water by gently pressing in a paper towel, and flush frozen in liquid nitrogen. Pigments were extracted and analyzed by HPLC as described in Piwosz et al. [21].Net primary production (NPP) and community respiration (oxygen measurements)Oxygen concentration was measured with the Winkler method [22]. It was chosen because it allows O2 concentration to be measured directly in the water without the need to consider carbonic equilibrium, which is the case when changes in CO2 concentration is measured [23]. Samples were unfiltered to avoid the removal of particle-associated bacteria and also of free-living AAP bacteria, which tend to be larger than average freshwater bacteria [12]. Glass stoppered Winkler type oxygen bottles (115 mL nominal capacity, VTR glass, Prague, Czechia) were filled with the sampled water directly from the sampler via a rubber tube. Each bottle was first rinsed three times and then filled without the formation of air bubbles. Water was allowed to overflow the neck of the bottle for about 1 min, and the bottle was closed with a glass stopper to avoid air bubbles. The closed bottles were kept in the dark in a cooled box. On the shore, three bottles were selected as T0, and 1.2 mL of manganese (II) chloride solution (concentration 3 mol L−1) was addded, followed by the addition of 1.2 mL of a mixture containing 4 mol L−1 of sodium iodide solution and 8 mol L−1 of sodium hydroxide solution. These samples were processed in the laboratory within 3 h. The remaining bottles were incubated for 24 h at in situ temperature in the IR-box prepared from the MAKROLON IR polycarbonate sheet (4 mm thickness, Professional Plastics, Inc. Fullerton, CA, USA). These panels have a maximum transmittance of 90% in the infrared region 850–2000 nm, 50% at 780 nm, and 0% 10 °C according to our measurements (180 days, Equation 3 in Supplementary File S1). Subsequently, the differences in the Cep Lake’s carbon budget for the surface layer (down to 0.5 m depth) were calculated by multiplying the integrated values by the volume of this layer (Equation 4 and 5 in Supplementary File S1).HCO3
    − incorporationTriplicated water samples (32 mL) were incubated for 3.2–5.2 h in the IR light and the dark at in situ temperature, as described for respiration. Total activity added to each bottle was measured from 1 mL aliquot of the incubated sample that was transferred to a scintillation vial containing 20 μl of 5 mol L−1 NaOH (to prevent a loss of 14C-bicarbonate). Thirty mL of sample was filtered through 2.5 μm nitrate cellulose filters (Pragopor, Prague, Czechia, diameter 25 mm). Five mL of the filtered water was collected and subsequently filtered through a 0.17 μm nitrate cellulose filter. The resulting cell-free filtrate, which contained 14C-DOC was collected. The filtration was done at a low vacuum (0.02 MPa) to avoid cell breakage. The total CO2 assimilation rate was calculated as the sum of all these fractions.The filters were kept in an HCl-saturated atmosphere for 24 h at room temperature in a fume hood. They were placed in scintillation vials and dissolved in 1 mL of ethyl acetate (Penta, Prague, Czechia). Then, 5 mL of Ultima Golt LLT scintillation cocktail (PerkinElmer, Waltham, MA, USA) was added. Five mL of cell-free filtrates were acidified by adding 100 μL 5 mol L−1 HCl to volatilize non-incorporated H14CO3 and incubated 24 h at room temperature in a fume hood. Then, 10 mL of the scintillation cocktail was added. Finally, 5 mL of the scintillation cocktail was added to the total activity samples. Subsequently, the samples were gently mixed and left in the dark for 48 h. The radioactivity in the samples was measured using a Tri-Carb 2810 TR scintillation counter (PerkinElmer).To estimate carbon fluxes (μmol C L–1 h–1), a fraction of the added H14CO3 incorporated or released was corrected for the incubation time and multiplied by the concentration of total dissolved inorganic carbon (DIC). The DIC concentration was calculated based on temperature, pH, and alkalinity measurements (Inolab pH 720, WTW Xylem Inc. Rye Brook, NY, US) determined by Gran titration.Assimilation of organic monomersThe difference between microbial activity in the IR light and dark was also estimated based on assimilation rates of radiolabeled glucose, pyruvate, leucine and thymidine (American Radiolabeled Chemicals, St. Louis, MO, USA). Tritiated glucose (specific activity (SA): 2220 GBq mmol−1), leucine (SA: 4440 GBq mmol−1) and thymidine (SA: 2275.5 GBq mmol−1) were added to 5 mL samples to a final concentration of 5 nmol L−1. 14C-pyruvate (SA: 2.035 GBq mmol−1) was added to a final concentration of 10 nmol L−1. Trichloroacetic acid (TCA) was added to the killed controls to a final concentration of 1%. Samples were incubated for 1 h in the dark and IR light as described for respiration. The incubations were terminated as the killed controls and kept at 4 °C in the dark until processed within 65% except for the samples from 9th May and 29th Aug 2018 (10% each), 1st Aug 2018 (21%), 25th Nov 2018 (1%), and 14th Aug 2019 (22%), Supplementary Fig. S1A). Thus, we decided to concatenate the fastaq files and analyze both fractions together as the total community. This also facilitated statistical analysis, as the activity rates were measured for the whole community without fractionations.Reads quality was evaluated using FastQC v0.11.7 (Babraham Bioinformatics, Cambridge, UK). After primer sequences trimming using Cutadapt [29] (v1.16), the number of reads per sample ranged from 49,354 to 188,942. Subsequent analyses were done in the R/Bioconductor environment using the dada2 package (version 1.14.1) [30]. Forward and reverse reads were truncated to 225 bp and low quality sequences were filtered out with the filterAndTrim function (truncLen = c(225, 225), maxN = 0, maxEE = c(2, 2), truncQ = 2), which reduced the number of reads per sample to range from 30,190 to 143,552. After merging and chimera removal using the removeBimeraDenovo function, 4,893 amplicon sequence variants (ASV) were obtained. Rare ASVs (not seen >3 times in at least 20% of the samples) were removed, which reduced the number of ASVs to 658, and the number of reads to 14,613–69,046 per sample. Taxonomic assignment was done using SILVA 138.1 database [31, 32] released on August 27, 2020. ASVs identified as Chloroplast or Cyanobacteria were excluded from the analyses, giving the final number of 546 ASVs and from 10,819 to 54,799 reads per sample. The bacterial community composition graphs were done using phyloseq [33] and ggplot2 [34] packages.AAP community compositionThe composition of AAP community was analyzed by amplicon sequencing of pufM gene encoding the M subunit of bacterial type-2 reaction centers. This gene is routinely used for diversity studies of AAP bacteria [35].PufM gene amplicons (approx. 245 bp) were prepared using pufM_UniF (5′-GGN AAY YTN TWY TAY AAY CCN TTY CA-3′) and pufM_WAW (5′-AYN GCR AAC CAC CAN GCC CA-3′) primers [36]. PCR was performed in triplicate 20 μL reactions using Phusion™ High-Fidelity DNA Polymerase (Thermo Scientific, USA) with the following reaction conditions: 98 °C for 3 min, 27 cycles at 98 °C for 10 s, 58 °C for 30 s, 72 °C for 30 s, and a final extension at 72 °C for 5 min. The triplicate product reactions for each sample were pooled and gel purified using the kit Wizard SV Gel and PCR Clean-Up System (Promega, USA). The sequencing was performed on the Illumina MiSeq platform (2 × 150 bp) at Macrogen, South Korea.The fastq files were concatenated as described for bacteria communities. The Bray-Curtis similarity between two fractions for each sampling day was >70%, except for the samples from 1st Aug 2018 (47%) and 14th Aug 2019 (18%, Supplementary Fig. S1B).The samples were analyzed as described for bacterial communities. The number of reads per sample ranged from 192,360 to 239,418 after the cutadapt trimming. Forward and reverse reads were truncated to 130 bp, and the number of reads per sample after the quality filtering and denoising ranged from 189,432 to 235,311. Merging the forward and reverse reads with mergePairs function created 12,692 ASVs and reduced the number of reads to 183,136–221,281 per sample. The chimera removal lowered the number of ASVs to 1816, and the number of reads to 159,451–208,679. Rare ASVs (not seen >3 times in at least 20% of the samples) were removed, which resulted in the final 566 ASVs, and a number of reads ranging from 155,915 to 203,021 per sample. A manually curated taxonomic database was used for taxonomic assignment following the naïve Bayesian classifier method [37]. It contained 1580 unique pufM sequences, downloaded from the Fungene repository on May 16, 2019 (http://fungene.cme.msu.edu [38]), from metagenomes from the Římov Reservoir [39, 40] and from the Genome Taxonomy database accessed on September 16, 2020 [41].Statistical analysisLinear mixed-effects models were calculated in R (version 3.6.2) using lme function from the nlme package (version 3.1.143) on untransformed activity data and log10 transformed environmental variables [42]. Models’ parameters were estimated using maximum likelihood method and their significance was tested with ANOVA. Relationships between the activity measures, the environmental variables and the composition of AAP communities were investigated with distance-based linear models (DistLM) [43, 44] in Primer (version 7.0.13) with PERMANOVA + 1 add on (e-Primer, Plymouth, UK) [45]. The sequence reads were transformed with the varianceStabilizingTransformation function of the DESeq2 package [46] (version 1.14.1, blind = FALSE, fitType = “mean”).Data accessibilityThe sequences of 16S and pufM amplicons that support the findings of this study have been deposited in the EMBL database as the BioProject with the accession number PRJEB41596, together with most of the environmental metadata. The scripts and the remaining data supporting the results are included in the Supplementary Material. More