More stories

  • in

    Quaternary landscape dynamics boosted species dispersal across Southeast Asia

    Surface processes model and forcing mechanismsLandscape evolution over the last one million years interval is performed with the open-source modelling code Badlands34. It simulates the evolution of topography induced by sediment erosion, transport, and deposition (Fig. 1a). Amongst the different capabilities available in Badlands, we applied the fluvial incision and hillslope processes, which are described by geomorphic equations and explicitly solved using a finite volume discretisation. In this study, soil properties are assumed to be spatially and temporally uniform over the region, and we do not differentiate between regolith and bedrock. It is worth noting that the role of flexural responses induced by erosion and deposition is also not accounted for. Under these assumptions, the continuity of mass is governed by vertical land motion (U, uplift or subsidence in m/yr), long-term diffusive processes and detachment-limited fluvial runoff-based stream power law:$$frac{partial z}{partial t}=U+kappa {nabla }^{2}z+epsilon {(PA)}^{m}nabla {z}^{n}$$
    (1)
    where z is the surface elevation (m), t is the time (yr), κ is the diffusion coefficient for soil creep34 with different values for terrestrial and marine environments, ϵ is a dimensional coefficient of erodibility of the channel bed, m and n are dimensionless empirically constants, that are set to 0.5 and 1, respectively, and PA is a proxy for water discharge that numerically integrates the total area (A) and precipitation (P) from upstream regions34.Both κ and ϵ depend on lithology, precipitation, and channel hydraulics and are scale dependent34. All our landscape evolution simulations are running over a triangular irregular network of ~18. e6 km2 with a resolution of ~5 km, and outputs are saved every 1000 yr.The detachment-limited fluvial runoff-based stream power law is computed with a ({{{{{{{mathcal{O}}}}}}}}(n))-efficient ordering approach54 based on a single-flow-direction approximation where water is routed down the path of the steepest descent. The flow routing algorithm and associated sediment transport from source to sink depend on surface morphology, and sediment deposition occurs under three circumstances: (1) existence of depressions or endorheic basins, (2) if local slope is less than the aggregational slope in land areas and (3) when sediments enter the marine realm34. Submerged sediments are then transported by diffusion processes defined with a constant marine diffusion coefficient34.All landscape simulations are constrained with different forcing mechanisms, and five scenarios were tested (Supplementary Table 2).First, we impose precipitation estimates from the PaleoClim database38,39,40. These estimates are products from paleoclimate simulations (coupled atmosphere-ocean general circulation model) downscaled at approximately the same resolution as our landscape model (~5 km at the equator). Annual averages of precipitation rates are then used to provide rainfall trends in our simulations based on the ten specific snapshots available (from the mid-Pliocene warm period to late Holocene and present day). Between two consecutive snapshots, we assume that precipitation remains constant for the considered time interval. For exposed regions that are considered flooded in the PaleoClim database, we define offshore precipitations using a nearest neighbour algorithm where closest precipitation estimates are averaged from PaleoClim inland regions. To evaluate the role of precipitation variability on landscape dynamics, we also run a uniform rainfall scenario (2 m/yr obtained by averaging the annual precipitation rates from the PaleoClim database).Secondly, the models are forced with sea level fluctuations known to play a major role in the flooding history of the Sunda Shelf11,13,53. Two sea level curves are tested (Supplementary Fig. 1d). To account for the inherent uncertainty in reconstructed sea level variations, we chose a first curve37 obtained from a sea level stack constructed from five to seven individual reconstructions that agrees with isostatically adjusted coral-based sea level estimates at both 125 and 400 ka. The second one is taken from the global sea level curve reconstruction36 based on benthic oxygen isotope data and has been recently used to reconstruct the subsidence history of Sundaland17,18.The last forcing considered in our study is the tectonic regime. First, we chose to explore a non-tectonic model based on the default assumption of stability for the shelf17. Secondly, we assumed a uniform subsidence rate of −0.25 mm/yr recently derived from a combination of geomorphological observations, coral reef growth numerical simulations and shallow seismic stratigraphy interpretations17. Then, to represent the regional variations in the tectonic regime, we have compiled and digitised a number of calibration points (Supplementary Fig. 1b and Table 1) that were used to produce a subsidence and uplift map by geo-referencing calibration points and available tectonic polygons, and by Gaussian-smoothing and normalising the uplift and subsidence rates between the calibrated range to avoid sharp transitions in regions without observations. The resulting map does not account for fine spatial scale tectonic features such as fault systems43,55 or orogenic and sedimentary related isostatic responses. It rather represents a regional vertical tectonic trend with an overall uplift of Wallacea and NW Borneo regions and long-wavelength subsidence of Sunda Shelf and Singapore Strait17.Landscape evolution model calibrationThe landscape models start during the Calabrian in the Pleistocene Epoch, one million years before the present. At each time interval, the landscape evolves following Eq. (1) and the surface adjusts under the action of rivers and soil creep (Fig. 1a). In addition to surface changes, we extract morphometric characteristics such as drainage basins extents, river profiles lengths (Fig. 3 and Supplementary Fig. 2), distance between main rivers outlet (Supplementary Fig. 3) and tracks the cumulative erosion and deposition over time (Fig. 1b and Supplementary Fig. 1d).For model calibration, we perform a series of steps consisting in adjusting the initial elevation and the erosion–deposition parameters (i.e., κ and ϵ in Eq. (1)) to match with different observations.The initial paleo-surface is obtained by applying the uplift and subsidence rates backwards to calculate the total change in topography for the 1 Myr interval. Then, we test the simulated paleo-river drainages against results from a combination of phylogenic studies9,13 and paleo-river channels and valleys found from seismic and well surveys41,42,44. Iteratively, we modify our paleo-elevation to ensure those main river basins (e.g., Johore, Siam, Mekong, East Sunda) encapsulate the paleo-drainage maps reconstructed using lowland freshwater taxa described in13 (Supplementary Fig. 1a and Table 4) and that the major rivers follow paleo-rivers systems derived from both 2D and 3D seismic interpretations (Fig. 1b).For surface processes parametrisation, we tested different ranges of diffusion and erodibility coefficients and compared the final sediment accumulation across the Sunda Shelf (Fig. 1b) using estimated deposit thicknesses41,42,43,44. The Sunda Shelf is predominantly experiencing deposition over the past 500 kyr and increases in deposition are positively correlated with periods of sea level rise (i.e., Pearson’s coefficients for correlation with sea level above 80%, Supplementary Fig. 1d). After exploring a range of values, we set κ values to 1. e−2 and 8. e−2 m2/yr for terrestrial and marine environments and ϵ between 2.5 and 7.5 e−8yr−1 for the different scenarios to fit with chosen surveys dataset (Supplementary Table 2 and 3).Upon uniform subsidence case (−0.25mm/yr), flooding is limited, and the shelf only undergoes two full marine transgressions ( >60% of the shelf flooded) around 125 ka and during the last 10–20 kyr (Supplementary Fig. 1c). Upon spatially variable tectonics (non-uniform subsidence), partial flooding events are more pervasive, with higher magnitudes and greater temporal durations. Due to the shallow and flat physiography of Sundaland, we also note that even small increases in sea level amplitudes ( 0) and values higher than one and two standard deviations (zsc  > 1 and 2, respectively, Supplementary Fig. 1b). The approach provides a quantitative assessment of flow maps sensitivity to the chosen resistance maps.To gain additional insights into the distribution of connectivity regions across the shelf, we also employed a local spatial autocorrelation indicator, namely the Getis-Ord Gi⋆ index57. This hotspot analysis method assesses spatial clustering of the obtained current density maps, and the resultant z-scores provide spatially and statistically significant high or low clustered areas. The approach consists in looking at each local current value relative to its neighbouring one. From this spatial analysis, we extract both statistically significant hot and cold spots for each combination of resistance surfaces (Supplementary Fig. 5c). To extract statistically significant and persistent biogeographic connectivity areas across the exposed Sunda Shelf, we then combine all hotspots together and define preferential migration pathways as regions having a positive Gi⋆ z-scores for all resistance surfaces combination.We used the function zscore in the SciPy stats package to obtain the z-scores and the ESDA library for the Gi⋆ indicator computation.Modelling assumptions and limitationsThere are a number of important caveats for interpreting our modelling results.First, we made several assumptions related to our transient landscape evolution simulations. A single-flow direction algorithm54 was used to simulate temporal changes in river pathways. Recent work58 has shown that this algorithm might lead to numerical diffusion, fast degradation of knickpoints and underestimation of river captures particularly in flat regions. One way to address this would be to use a multiple flow direction method59 which allow for a better representation of flow distribution across the landscape. In this study, we also assumed a uniform and invariant soil erodibility coefficient for the entire domain and a detachment-limited erosion law. Even though the erodibility coefficient was calibrated independently for each simulation (Supplementary Table 3), soil cover and properties vary notably between Borneo, Sumatra, Java and the Malay Peninsula and soil conditions for the exposed sea floor would have changed significantly over successive flooding events12. Badlands software34 allows for multiple erodibility coefficients representing different soil compositions to be defined, and this functionality could be used to evaluate the impact of differential erosion on physiographic changes. Similarly, several transport-limited laws are also available and could be compared against our detachment-limited simulations.A second set of simplifications lies in the climatic conditions (i.e., rainfall regimes) used to force our simulations. We relied on the PaleoClim database40 which contains nine high-resolution paleoclimate dataset38,39,40 corresponding to specific time periods (4.2–0.3 ka, 8.326–4.2 ka, 11.7–8.326 ka, 12.9–11.7 ka, 14.7–12.9 ka, 17.0–14.7 ka, ca. 130 ka, ca. 787 ka and 3.205 Ma). The climate simulations from which these time periods are extracted do not consider emerged Sunda Shelf for the oldest inter-glacial events which can result in incorrect climatic pattern60. From 0.3–17 ka, precipitation fields in PaleoClim are obtained from the TRaCE21ka transient simulations of the last 21 kyr run with the CCSM3 model40. Although Fordham et al.39 show that precipitation errors range from 10–200% in their modern experiment, the paleoclim dataset provides a statistical downscaling method that includes a bias correction (namely the Change-Factor method, in which the anomaly between the modern simulation and observations is removed from the paleoclimate experiment) allowing the use of the model for paleoclimate studies40. The very same technique is applied for 130 ka and 787 ka fields that were obtained with different GCMs (namely HadCM3 and CCSM2). Given the absence of a million-year long transient climate simulation, we oversimplified the climatic conditions by considering that precipitation distribution and intensity remain constant between two consecutive intervals, generating an artificial stepwise evolution of rainfall through time. To evaluate the sensitivity of physiographic responses on the Sunda Shelf to precipitation, we ran a model with uniform rainfall over 1 Myr (scenario 4). Despite changes in the timing and extent of basins reorganisation (Supplementary Fig. 2 and Fig. 3b), we found limited differences in terms of flooding history and erosion/deposition patterns when compared with scenario 5 (Supplementary Fig. 1c, d and Supplementary Table 2). Recent work60 suggests clear regional responses induced by the emerged Sunda Shelf with seasonal enhancement of moisture convergence and continental precipitation induced by thermal properties of the land surface. This could significantly impact our simulation results. However, and at the time of writing, more continuous high-resolution paleoclimatic simulations considering the shelf as an emerged continental platform were still unavailable. Using high-resolution multi-model outputs would allow to target the uncertainty on climatic maps4 and will surely represent a significant advance for future studies. One approach would have used the orographic rainfall capability61 available in Badlands. The method is better suited to run generic simulations but falls short when applied to real cases as it relies on imposing paleo-environmental boundary conditions (e.g., temporal changes in wind direction and speed, moisture stability frequency or depth of moist layer) difficult to obtain for Earth-like model applied over geological time scales.Finally, our species-agnostic approach assumes an equally weighted cost between the three considered geomorphic features and does not account for additional factors (temperature, vegetation cover, solar radiation to cite a few), which are all important when assessing landscape connectivity for wildlife. Most importantly, we model connectivity at very large scales (5 km resolution). Often, species are highly influenced by microclimates and small-scale topography47. From our regional-scale simulations and hotspot analysis (Fig. 6), higher resolution models focusing on highly connected regions (across the Gulf of Thailand and Siam basin) could be applied to produce more detailed representations of species migration in the region. In addition, current flow field calculations from Circuitscape35 rely on randomly selecting nodes around the region of interest. For connectivity analysis, we used 33 terrestrial points located around the perimeter of the buffered Sundaland area (white contour line in Fig. 1b). Using a selection of nodes in a buffered region allows to reduce the bias in current density estimates46. However, bias might depend on the buffer size compared to the study area as well as the number of nodes selected46,47. Because of memory limitations and the great number of computed grids used to cover the past 500 kyr, we made a trade-off between buffer size and the number of selected points for pairwise calculations. Additional experiments could possibly be tested to evaluate bias in the proposed connectivity maps potentially using a tilling approach to reduce cell number45. More

  • in

    Correction to: Large-scale protein level comparison of Deltaproteobacteria reveals cohesive metabolic groups

    Author notesNina DombrowskiPresent address: Royal Netherlands Institute for Sea Research, Department of Marine Microbiology and Biogeochemistry, AB Den Burg, The NetherlandsKiley W. SeitzPresent address: EMBL Heidelberg, Meyerhofstraße 1, Heidelberg, GermanyThese authors contributed equally: Marguerite V. Langwig, Valerie De Anda.AffiliationsDepartment of Marine Science, University of Texas at Austin, Marine Science Institute, Port Aransas, TX, USAMarguerite V. Langwig, Valerie De Anda, Nina Dombrowski, Kiley W. Seitz, Ian M. Rambo & Brett J. BakerDepartment of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, AustraliaChris GreeningDepartment of Marine Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC, USAAndreas P. TeskeAuthorsMarguerite V. LangwigValerie De AndaNina DombrowskiKiley W. SeitzIan M. RamboChris GreeningAndreas P. TeskeBrett J. BakerCorresponding authorsCorrespondence to
    Marguerite V. Langwig or Brett J. Baker. More

  • in

    Diversity and compositional changes in the gut microbiota of wild and captive vertebrates: a meta-analysis

    Literature search and data retrievalWe performed a systematic literature search on the internet (Google Scholar, Web of Science) using the following keywords: [gut microbiota], [animal microbiome], [gut microbiome 16S] and [captive AND wild AND microbiota]. This search yielded 222 articles on animal microbiomes published between 2014 and 2020. The materials and methods of these articles were analysed to ascertain whether the study met the following criteria: (i) all wild and captive samples were processed using identical procedures, (ii) compared wild and captive animals were phylogenetically closely related (members of the same species or species complex), (iii) captive individuals were born in captivity, or no information was provided about the origin of the captive animals; i.e., wild animals brought into captivity and sampled some time later were excluded, (iv) captive animals that underwent a deliberate selection process (e.g. inbred mice or domestic animals) were also excluded for considering them genetically not comparable to the wild counterparts, and (v) only datasets with sample sizes over 12 individuals were considered for analysis. Raw data were extracted from the databases and repositories indicated in the articles (accession numbers listed in the “Bioinformatic resources”).Bioinformatic sequencing data processingDatafiles from the different studies were (i) stored at the University of Copenhagen’s Electronic Research Data Repository (ERDA), (ii) assigned a unique study identifier and (iii) re-processed in the Danish National Supercomputer for Life Sciences ‘Computerome2’ using a new bioinformatic pipeline we developed for processing data with different characteristics, including sequencing mode, read length and 16S rRNA gene fragment. The entire code can be found in the “Bioinformatic resources”. In short, for each individual dataset, we quality-filtered (mean phred score of q = 25) and (if necessary) trimmed and merged the paired-end reads based on the sequence overlap using AdapterRemoval224. Primers (if present) were trimmed using Cutadapt25, and reads were dereplicated with USEARCH Derep26 using a relative minimum copy number threshold of 0.01% of the total sequencing depth. Reads were then converted into zero-ratio OTUs using the denoising algorithm UNOISE327, and USEARCH was used to map the reads back to the OTUs and create an OTU table. HS-Blast28 was used to assign taxonomy against the non-redundant Silva 132 database29, and taxonomic assignments were filtered using different identity thresholds for each taxonomic level: 97% for genus-level taxonomy, 95% for family-level taxonomy, 92% for order-level taxonomy and 90% for higher taxonomic levels30. To minimise the impact of incorrectly assigned taxa, taxonomic annotations below these identity thresholds were converted into unclassified, and not considered in downstream analyses. This pipeline yielded relative read abundances assigned to different taxa for each individual dataset analysed.Data quality filteringIndividual data files generated by the aforementioned pipeline were aggregated by study and host species into genus-level abundance tables. The two datasets of Sarcophilus harrisii retrieved from two different studies were processed independently. Taxonomic resolution was limited to the genus level to maximise taxonomic annotation rate and minimise biases introduced by the different 16S rRNA gene markers employed in the analysed studies. On the one hand, wild animals’ microbial communities often contain taxa that do not map to any catalogued species with enough molecular similarity to assign species-level annotation. On the other hand, the analysed datasets were generated based on the V4, V3–V4 and V1–V3 regions of the 16S rRNA gene (Supplementary Dataset), which hindered comparability at the ASV or zOTU level. We then proceeded to quality-filter the genus-level abundance tables of each species through filtering individuals by minimum sequencing depth, minimum diversity coverage and taxonomic annotation. Only individual datasets with more than 1000 reads and diversity coverage values over 99% were retained, and final genus-level abundance tables that contained at least five animals in each contrasting group were considered for analysis. Since the studied datasets contained traces of dietary items and host DNA, read counts assigned to taxonomic groups not assigned to Bacteria genera, or not present in the LTPs132_SSU release of the SILVA Living Tree (https://www.arb-silva.de/projects/living-tree) used for measuring the phylogenetic relationships among bacteria, were removed to ensure accurate measurements of phylogenetic diversities. In the cases where one group (either wild or captive) outnumbered the other, samples were randomly selected to ensure even sample sizes.Diversity and compositional analysesDiversity and compositional analyses were carried out in the R statistical environment v.3.6.331 and Python 3.8 based on the Hill numbers framework. The operations explained below were conducted using the R packages ape32, dendextend33, dmetar34, hilldiv35, meta36, metamicrobiomeR37, phylosignal38, phytools39, treedist40, vegan41, and the python package qdiv42. Hereafter functions and their respective packages are displayed as ‘package::function’. Statistical significance level was set at a FDR-adjusted p-value of 0.05. All charts and figures in the manuscript were originally generated either in R (full code of all original figures is included in “Bioinformatic resources”) and subsequently modified in Adobe Illustrator to achieve the desired layout without distorting the dimensions of the quantitative elements.Hill numbersThe Hill numbers framework encompasses the group of diversity measures that quantify diversity in units of equivalent numbers of equally abundant taxa43,44—in our context bacteria genera. Hill numbers provide a general statistical framework that is sufficiently robust and flexible to address a wide range of scientific questions that molecular ecologists regularly try to answer through measurement, estimation, partitioning and comparison of diversities45. To obtain a complete vision of the gut microbiome differences between wild and captive animals, we conducted all our diversity and compositional analyses based on three contrasting Hill numbers based metrics: the so-called dR, which only accounts for richness (i.e., order of diversity 0, whether bacteria taxa were present or not), dRE which considered Richness + Evenness of order of diversity 1 (i.e., the relative abundances of bacteria are proportionally weighed) and dRER, which considered Richness, + Evenness + Regularity (i.e., the phylogenetic relationships among bacteria are accounted for). Detailed explanations of these metrics can be found elsewhere17,46,47.Phylogenetic treesThe dRER metric required a Bacterial phylogenetic tree to compute the relatedness among bacterial taxa. As our datasets contained different fragments of the 16S rRNA gene, and thus we were unable to generate a phylogenetic tree directly from our DNA sequence data, we relied on the SILVA Living Tree, and used the LTPs132_SSU release as the reference phylogenetic tree. In addition, the time-calibrated host phylogeny required by the host phylogenetic signal and phylosymbiosis analyses was generated using Timetree48.Diversity metrics and meta-analysisWe computed individual-based diversity metrics using the function hilldiv::hill_div, and obtained average alpha diversity metrics per species, as well as wild and captive populations per species. We used a Kruskal–Wallis (KW) test as implemented in the function hilldiv::div_test to ascertain whether the mean diversity values varied across analysed host species, and a PERMANOVA (PMV) test using vegan::adonis function based on the pairwise dissimilarity matrix to test whether host species were compositionally distinct.Average alpha diversity metrics of wild and captive populations per species were used to conduct a random-effects-model (REM) meta-analysis with raw effect sizes using the function meta::metacont. We used the Sidik–Jonkman estimator for the between-study variance and the Knapp–Hartung–Sidik–Jonkman adjustment method. The overall effect was calculated using Hedge’s g (SMD) and its 95% confidence interval and p-value. An identical analysis was performed for the entire dataset and two representative subsets of five species, containing only datasets derived from primates and cetartiodactylans. Higgin’s and Thompson’s I2 test, Tau-squared T2 and Cochran’s Q were used for quantifying the heterogeneity between the included species. Due to the high heterogeneity found in the study, we evaluated whether the between-study heterogeneity was caused by outliers with extreme effect sizes, which could be distorting our overall effect. We defined an outlier if the species’s confidence interval did not overlap with the confidence interval of the pooled effect using dmetar::find.outliers function.The function detected three outliers in dR metric (GOGO, PEMA and TUTR), four in dRE (GOGO, PEMA, MOCH, EQKI) and seven in dRER (RHBR, PYNE, PEMA, TUTR, MOCH, CENI and AIME). Even when these outliers were excluded from the analysis the I2 heterogeneity value was substantial for dR (I2 from 79.3 to 70.3%) and moderate for dRE (I2 from 80.1 to 60.0%) and dRER (I2 from 86.9 to 54.2%) and significant for both (Cochran’s Q, p-value  More

  • in

    Possible impacts of the predominant Bacillus bacteria on the Ophiocordyceps unilateralis s. l. in its infected ant cadavers

    Sample collectionSamples were collected from an evergreen broadleaf forest in central Taiwan (Lianhuachi Experimental Forest, Nantou County, 23°55′7″N 120°52′58″E) from January 2017 to March 2018. Permission to collect plants for the study was obtained from the Lianhuachi Research Center, Taiwan Forestry Research Institute, Council of Agriculture, Executive Yuan, Taiwan (Permission no.: 1062272538). The present study complies with the International Union for Conservation of Nature Policy Statement on Research Involving Species at Risk of Extinction and the Convention on the Trade in Endangered Species of Wild Fauna and Flora. Ant cadavers with fungal growth were collected from understory plants with a height of less than 3 m. Ant cadavers infected with O. unilateralis s. l. were removed carefully by cutting the leaf and placing it into a 50-mL conical centrifuge tube, which was then transported to the laboratory. Only cadavers in which the fungal growth stage preceded the development of perithecia, which theoretically has the highest biological activity, were collected (Fig. 1). In total, 24 infected P. moesta and 20 infected P. wolfi samples were collected.Figure 1Ophiocordyceps unilateralis sensu lato-infected (a) Polyrhachis moesta and (b) P. wolfi, with the stroma growing from the ant cadaver. The specimens were collected from the Lianhuachi Research Center, Taiwan and photographed in the laboratory by Wei-Jiun Lin.Full size imageIsolation and cultivation of bacteriaAnts on the leaves were first identified to species and then, using tweezers, each ant was placed carefully into a sterilized 1.5-mL microcentrifuge tube [see details in Lin et al. (2020)15. Samples were shaken one by one in 600 μL of sterilized water for a few seconds at 3000 revolutions/min (rpm) using a vortex mixer (AL-VTX3000L, CAE technology Co., Ltd., Québec, Canada), and were then soaked with 600 μL of 70% ethanol to sterilize the ant’s surface. The ethanol on the samples was washed twice with 600 μL of sterilized water, then vortexed in 400 μL of sterilized water. Next, 200 μL of the supernatant was spread homogeneously onto a Luria–Bertani (LB) agar plate (25 g Luria–Bertani broth and 15 g agar per liter) to confirm the absence of live bacteria.Bacteria from inside the ant host were released by homogenizing the ant host in 200 μL of water and culturing on LB agar plates at 28 °C for 2 days. Bacteria from each of the ant individuals were cultured independently and approximately equal numbers of the isolates were picked randomly with sterile toothpicks, and were suspended in the LB medium supplemented with 15% v/v glycerol and maintained at − 80 °C until the time of examination. In total, 247 bacterial isolates from P. moesta and 241 bacterial isolates from P. wolfi were collected.In addition to the bacterial isolates from the ant bodies, 60 bacterial isolates from soil, leaves, and air in the same forest were collected for the purpose of comparing their resistance to naphthoquinones (see below) by using the aforementioned procedure but without initial cleaning and sterilizing of the sample surface.Bacterial identificationBacteria collected from the ant hosts were identified by gene marker sequencing. Bacterial isolates were cultured in LB medium at 28 °C overnight to reach the log-phase, and genomic DNA was extracted following the methods described in Vingataramin and Frost (2015)20. Conspecies/strains of the bacterial isolates from the same host were determined using the randomly amplified polymorphic DNA (RAPD) method with the primer 5′-GAGGGTGGCGGTTCT-3′. PCR amplification was performed as follows: initial denaturation at 95 °C for 5 min, 40 cycles of amplification including denaturation at 95 °C for 1 min, annealing at 42 °C for 30 s, and extension at 72 °C for 1 min, followed by a final extension at 72 °C for 10 min. PCR products were run in 2% agarose gel and bacterial isolates were characterized by fragment patterns. For each of the ant hosts, bacterial isolates with the same RAPD pattern were considered to be the same strain. In total, 106 and 178 strains were found from P. moesta and P. wolfi, respectively. One of the bacterial isolates was selected at random to represent the strain and coded with “JYCB” followed by a series of numbers (e.g., JYCB191). Taxonomic status of each strain was determined to species by using the V3/V4 region of the 16S rDNA gene. PCR amplification with the primer set (8F: 5′-AGAGTTTGATCCTGGCTCAG-3′ and 1541R: 5′-AAGGAGGTGATCCAGCCGCA-3′)21,22 was performed under the following conditions: initial denaturation at 95 °C for 5 min, 40 cycles of amplification including denaturation at 95 °C for 1 min, annealing at 55 °C for 30 s, and extension at 72 °C for 1 min 45 s, followed by a final extension at 72 °C for 10 min. PCR products were first checked by running a gel, and were then sequenced at Genomics, Inc. (New Taipei City, Taiwan).The sequences of the bacterial strains from each of the ant hosts were first analyzed by the unweighted pair group method with arithmetic mean (UPGMA) analysis and clustered into clades according to the sequence dissimilarity ( More

  • in

    Endophytic bacterial communities are associated with leaf mimicry in the vine Boquila trifoliolata

    1.Wiens, D. Mimicry in plants. Evol. Biol. 11, 365–403 (1978).
    Google Scholar 
    2.Pasteur, G. A classificatory review of mimicry systems. Annu. Rev. Ecol. Syst. 13, 169–199 (1982).
    Google Scholar 
    3.Barrett, S. C. H. Mimicry in plants. Sci. Am. 257, 76–85 (1987).
    Google Scholar 
    4.Barlow, B. A. & Wiens, D. Host-parasite resemblance in Australian mistletoes: The case for cryptic mimicry. Evolution 31, 69–84 (1977).PubMed 

    Google Scholar 
    5.Ehleringer, J. R. et al. Mistletoes: A hypothesis concerning morphological and chemical avoidance of herbivory. Oecologia 70, 234–237 (1986).ADS 
    CAS 
    PubMed 

    Google Scholar 
    6.Canyon, D. V. & Hill, C. J. Mistletoe host-resemblance: A study of herbivory, nitrogen and moisture in two Australian mistletoes and their host trees. Aust. J. Ecol. 22, 395–403 (1997).
    Google Scholar 
    7.Blick, R. A. J., Burns, K. C. & Moles, A. T. Predicting network topology of mistletoe–host interactions: Do mistletoes really mimic their hosts?. Oikos 121, 761–771 (2012).
    Google Scholar 
    8.Gianoli, E. & Carrasco-Urra, F. Leaf mimicry in a climbing plant protects against herbivory. Curr. Biol. 24, 984–987 (2014).CAS 
    PubMed 

    Google Scholar 
    9.Gianoli, E., Saldaña, A., Jiménez-Castillo, M. & Valladares, F. Distribution and abundance of vines along the light gradient in a southern temperate rainforest. J. Veg. Sci. 21, 66–73 (2010).
    Google Scholar 
    10.Gianoli, E. Eyes in the chameleon vine?. Trends Plant Sci. 22, 4–5 (2017).CAS 
    PubMed 

    Google Scholar 
    11.Gianoli, E. & Molina-Montenegro, M. A. Leaf damage induces twining in a climbing plant. New Phytol. 167, 385–390 (2005).PubMed 

    Google Scholar 
    12.González-Teuber, M. & Gianoli, E. Damage and shade enhance climbing and promote associational resistance in a climbing plant. J. Ecol. 96, 122–126 (2008).
    Google Scholar 
    13.Calder, D. M. Mistletoes in focus: An introduction. In The Biology of Mistletoes (eds Calder, D. M. & Bernhardt, P.) 1–18 (Academic Press, 1983).
    Google Scholar 
    14.Cook, M. E., Leigh, A. & Watson, D. M. Hiding in plain sight: Experimental evidence for birds as selective agents for host mimicry in mistletoes. Botany 98, 525–531 (2020).
    Google Scholar 
    15.Atsatt, P. R. Mistletoe leaf shape: A host morphogen hypothesis. In The Biology of Mistletoes (eds Calder, D. M. & Bernhardt, P.) 259–275 (Academic Press, 1983).
    Google Scholar 
    16.Hall, P. J., Badenoch-Jones, J., Parker, C. W., Letham, D. S. & Barlow, B. A. Identification and quantification of cytokinins in the xylem sap of mistletoes and their hosts in relation to leaf mimicry. Aust. J. Plant Physiol. 14, 429–438 (1987).CAS 

    Google Scholar 
    17.Watson, D. M. Mistletoes of Southern Australia (CSIRO, 2019).
    Google Scholar 
    18.Holopainen, J. K. & Blande, J. D. Molecular plant volatile communication. In Sensing in Nature (ed. López-Larrea, C.) 17–31 (Springer Science, 2012).
    Google Scholar 
    19.Baldwin, I. T., Kessler, A. & Halitschke, R. Volatile signaling in plant–plant–herbivore interactions: What is real?. Curr. Opin. Plant Biol. 5, 351–354 (2002).CAS 
    PubMed 

    Google Scholar 
    20.Heil, M. & Karban, R. Explaining evolution of plant communication by airborne signals. Trends Ecol. Evol. 25, 137–144 (2010).PubMed 

    Google Scholar 
    21.Karban, R., Yang, L. H. & Edwards, K. F. Volatile communication between plants that affects herbivory: A meta-analysis. Ecol. Lett. 17, 44–52 (2014).PubMed 

    Google Scholar 
    22.Coyne, J. A. Fantastic and plastic mimicry in a tropical vine. Why Evolution is True Blog. http://whyevolutionistrue.com/2014/04/26/fantastic-and-plastic-mimicry-in-a-tropical-vine (2014).23.Pannell, J. R. Leaf mimicry: Chameleon-like leaves in a Patagonian vine. Curr. Biol. 24, R357–R359 (2014).CAS 
    PubMed 

    Google Scholar 
    24.Baluška, F. & Mancuso, S. Vision in plants via plant-specific ocelli?. Trends Plant Sci. 21, 727–730 (2016).PubMed 

    Google Scholar 
    25.Richardson, A. O. & Palmer, J. D. Horizontal gene transfer in plants. J. Exp. Bot. 58, 1–9 (2007).CAS 
    PubMed 

    Google Scholar 
    26.Bock, R. The give-and-take of DNA: Horizontal gene transfer in plants. Trends Plant Sci. 15, 11–22 (2010).CAS 
    PubMed 

    Google Scholar 
    27.Yoshida, S., Maruyama, S., Nozaki, H. & Shirasu, K. Horizontal gene transfer by the parasitic plant Striga hermonthica. Science 328, 1128 (2010).ADS 
    CAS 
    PubMed 

    Google Scholar 
    28.Christin, P. A. et al. Adaptive evolution of C4 photosynthesis through recurrent lateral gene transfer. Curr. Biol. 22, 445–449 (2012).CAS 
    PubMed 

    Google Scholar 
    29.Gao, C. et al. Horizontal gene transfer in plants. Funct. Integr. Genomics 14, 23–29 (2014).CAS 
    PubMed 

    Google Scholar 
    30.Diao, X., Freeling, M. & Lisch, D. Horizontal transfer of a plant transposon. PLoS Biol. 4, e5 (2006).PubMed 

    Google Scholar 
    31.El Baidouri, M. et al. Widespread and frequent horizontal transfers of transposable elements in plants. Genome Res. 24, 831–838 (2014).PubMed 
    PubMed Central 

    Google Scholar 
    32.Prentice, H. C., Li, Y., Lönn, M., Tunlid, A. & Ghatnekar, L. A horizontally transferred nuclear gene is associated with microhabitat variation in a natural plant population. Proc. R. Soc. B Biol. Sci. 282, 20152453 (2015).
    Google Scholar 
    33.Yu, A. et al. Dynamics and biological relevance of DNA demethylation in Arabidopsis antibacterial defense. Proc. Natl. Acad. Sci. 110, 2389–2394 (2013).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    34.Takahashi, K. Influence of bacteria on epigenetic gene control. Cell. Mol. Life Sci. 71, 1045–1054 (2014).CAS 
    PubMed 

    Google Scholar 
    35.Ramos-Cruz, D., Troyee, A. N. & Becker, C. Epigenetics in plant organismic interactions. Curr. Opin. Plant Biol. 61, 102060 (2021).CAS 
    PubMed 

    Google Scholar 
    36.Lodewyckx, C. et al. Endophytic bacteria and their potential applications. Crit. Rev. Plant Sci. 21, 583–606 (2002).
    Google Scholar 
    37.Ryan, R. P., Germaine, K., Franks, A., Ryan, D. J. & Dowling, D. N. Bacterial endophytes: Recent developments and applications. FEMS Microbiol. Lett. 278, 1–9 (2008).CAS 
    PubMed 

    Google Scholar 
    38.Barrett, S. C. H. Crop mimicry in weeds. Econ. Bot. 37, 255–282 (1983).
    Google Scholar 
    39.McElroy, J. S. Vavilovian mimicry: Nikolai Vavilov and his little-known impact on weed science. Weed Sci. 62, 207–216 (2014).CAS 

    Google Scholar 
    40.Ye, C.-Y. et al. Genomic evidence of human selection on Vavilovian mimicry. Nat. Ecol. Evol. 3, 1474–1482 (2019).PubMed 

    Google Scholar 
    41.Ruiz, E. Lardizabalaceae. In Flora de Chile Vol. 2 (eds Marticorena, C. & Rodríguez, R.) 24–27 (Universidad de Concepción, 2003).
    Google Scholar 
    42.Muñoz-Schick, M. Flora del Parque Nacional Puyehue (Editorial Universitaria, 1980).
    Google Scholar 
    43.Dorsch K. Hydrogeologische Untersuchungen der Geothermalfelder von Puyehue und Cordón Caulle, Chile. PhD thesis (Ludwig-Maximilians-Universität, 2003).44.Valladares, F., Saldaña, A. & Gianoli, E. Costs versus risks: Architectural changes with changing light quantity and quality in saplings of temperate rainforest trees of different shade tolerance. Austral Ecol. 37, 35–43 (2012).
    Google Scholar 
    45.Salgado-Luarte, C. & Gianoli, E. Shade-tolerance and herbivory are associated with RGR of tree species via different functional traits. Plant Biol. 19, 413–419 (2017).CAS 
    PubMed 

    Google Scholar 
    46.Salgado-Luarte, C. & Gianoli, E. Herbivory on temperate rainforest seedlings in sun and shade: Resistance, tolerance and habitat distribution. PLoS One 5, e11460 (2010).ADS 
    PubMed 
    PubMed Central 

    Google Scholar 
    47.Salgado-Luarte, C. & Gianoli, E. Herbivores modify selection on plant functional traits in a temperate rainforest understory. Am. Nat. 180, E42–E53 (2012).PubMed 

    Google Scholar 
    48.Sun, B. Y., Stuessy, T. F., Humaña, A. M., Riveros, G. M. & Crawford, D. J. Evolution of Rhaphithamnus venustus (Verbenaceae), a gynodioecious hummingbird-pollinated endemic of the Juan Fernandez Islands, Chile. Pac. Sci. 50, 55–65 (1996).
    Google Scholar 
    49.Saldaña, A. & Lusk, C. H. Influencia de las especies del dosel en la disponibilidad de recursos y regeneración avanzada en un bosque templado lluvioso del sur de Chile. Rev. Chil. Hist. Nat. 76, 639–650 (2003).
    Google Scholar 
    50.Gut, B. Árboles-Trees Patagonia. Árboles nativos e introducidos en Patagonia (Vázquez Mazzini, 2017).
    Google Scholar 
    51.Sahu, S. K., Thangaraj, M. & Kathiresan, K. DNA extraction protocol for plants with high levels of secondary metabolites and polysaccharides without using liquid nitrogen and phenol. ISRN Mol. Biol. 2012, 205049 (2012).PubMed 
    PubMed Central 

    Google Scholar 
    52.Lundberg, D. S., Yourstone, S., Mieczkowski, P., Jones, C. D. & Dangl, J. L. Practical innovations for high-throughput amplicon sequencing. Nat. Methods 10, 999–1002 (2013).CAS 
    PubMed 

    Google Scholar 
    53.Schloss, P. D. et al. Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 75, 7537–7541 (2009).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    54.Quast, C. et al. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596 (2012).PubMed 
    PubMed Central 

    Google Scholar 
    55.Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naïve bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 73, 5261–5267 (2007).ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    56.Borcard, D., Gillet, F. & Legendre, P. Numerical Ecology with R (Springer, 2011).MATH 

    Google Scholar  More

  • in

    Integrate geographic scales in equity, diversity and inclusion

    1.McGill, B. M. et al. Ecol. Evol. 11, 3636–3645 (2021).Article 

    Google Scholar 
    2.Röβler, D. C., Lötters, S. & Da Fonte, L. F. M. Nature 584, 525–525 (2020).
    Google Scholar 
    3.Nuñez, M. A. et al. J. Appl. Ecol. 56, 4–9 (2019).Article 

    Google Scholar 
    4.Nuñez, M. A., Chiuffo, M. C., Pauchard, A. & Zenni, R. D. Trends Ecol. Evol. 36, 766–769 (2021).Article 

    Google Scholar 
    5.Maas, B. et al. Conserv. Lett. 14, e12797 (2021).Article 

    Google Scholar 
    6.Khelifa, R., Mahdjoub, M., Baaloudj, A. & Chaib, S. Facets https://doi.org/10.1139/facets-2021-0073 (in press).7.Haelewaters, D., Hofmann, T. A. & Romero-Olivares, A. L. PLoS Comput. Biol. 17, e1009277 (2021).CAS 
    Article 

    Google Scholar  More

  • in

    Successful microbial colonization of space in a more dispersed manner

    Simulating competition for space using the “BacGo” modelTo investigate how spatial positioning of populations affects the outcome of microbial competition, we simulated two populations competing for space with a limiting size by building an individual-based model (named “BacGo”). The model was implemented in discrete grid boxes of a 20 × 20 array. As shown in Fig. 1a, our simulations were based on three basic assumptions. First, the two competing populations possess the same inherent growth rate and equal initial cell numbers, thus the only differences between them are their manners of colonizing free space. Second, the newly born daughter cell is located around its mother cell but with a random direction of spatial positioning [34], resulted in a microcolony with different spatial patterning. Lastly, if the selected box has been occupied, the newborn cell will compete for the box against the original occupants of the box and possesses a probability of 50% to survive [37].We first explored the outcome of spatial competition, which started by randomly distributing two populations on the grids with the same initial cell numbers of 10 for each. Based on our basic assumptions and the predictions of competitive exclusion theory [38], we hypothesized that only one population could win the competition and finally occupy all grids. As shown in 20,000 independent simulations with random initial distributions, we discovered that at the end of each simulation, only one population survived (Video S1 and Video S2). The Chi-square test showed no significant difference (P = 0.211) between the simulated winning times (10,177 of 20,000 simulations) and the random winning times (10,051 of 20,000 simulations) of the focus population. This result conformed with our initial assumption that cells possess a probability of 50% to survive in competing with original occupants. When we replicated simulations initiated with the same cell distribution, we found that the winning probabilities for each population changed in line with the initial distributions (Fig. S1). However, the winning probabilities never reached 100% no matter how the initial distribution changes. Together, these results suggested that unknown random factors may affect the final outcome of the competition.Next, we analyzed the dynamics of microbial colonization during our simulations. As summarized in Fig. 1b, we divided the competition process into two stages, the “occupation stage” and the “exclusion stage” (see Methods). To statistically characterize the competitive outcome at t3, we defined the winning asymmetry index, WinR, and the abundance asymmetry index, AbunR (see Methods). As shown in Fig. S2a, we found a strong positive correlation (R2 = 0.740, P  More

  • in

    Photoheterotrophy by aerobic anoxygenic bacteria modulates carbon fluxes in a freshwater lake

    SamplingWater was sampled from Cep lake in Czechia, at a regular sampling site of 10 m depth (48.944 °N, 14.877 °E). The lake originates from sand mining in the 1970–80s. It is a permanent meso-oligotrophic (chlorophyll-a concentrations ranged from 1.4–16.4 µg L−1) seepage reservoir filled with groundwater penetrating from the nearby river Lužnice. The lake area is about 1.16 km2, with the maximum depth about 11–12 m. These characteristics are representative for most of temperate and boreal lakes [16].Samples were collected every four weeks from April till October in 2018, and from April till November in 2019. Ten liters of water were collected from 0.5 m depth using a Ruttner Water Sampler (model 11.003KC Denmark AS). Temperature and oxygen profiles were taken with an EXO1 multi-parameter probe (YSI Inc., Yellow Springs, USA). Water was transported to the laboratory within 2 h from the collection in a closed container made from high-density polyethylene, rinsed three times with the sampled water and stored in a cooled box.NutrientsSamples were filtered through glass fiber filters with 0.4 µm nominal porosity (GF-5, Macherey-Nagel, Düren, Germany). Concentrations of soluble reactive phosphorus (SRP) were determined spectrophotometrically [17, 18]. Concentrations of nitrate and ammonium were measured according to Procházková [19] and Kopáčkek and Procházková [20]. Dissolved organic carbon (DOC) and dissolved nitrogen (DN) were measured with a TOC 5000 A analyzer (Shimadzu, Kyoto, Japan).PigmentsSeston from 1.43 to 3.65 L of water was collected onto GF-5 glass fiber filters (diameter 47 mm, Macherey-Nagel). The filters were dried of excess water by gently pressing in a paper towel, and flush frozen in liquid nitrogen. Pigments were extracted and analyzed by HPLC as described in Piwosz et al. [21].Net primary production (NPP) and community respiration (oxygen measurements)Oxygen concentration was measured with the Winkler method [22]. It was chosen because it allows O2 concentration to be measured directly in the water without the need to consider carbonic equilibrium, which is the case when changes in CO2 concentration is measured [23]. Samples were unfiltered to avoid the removal of particle-associated bacteria and also of free-living AAP bacteria, which tend to be larger than average freshwater bacteria [12]. Glass stoppered Winkler type oxygen bottles (115 mL nominal capacity, VTR glass, Prague, Czechia) were filled with the sampled water directly from the sampler via a rubber tube. Each bottle was first rinsed three times and then filled without the formation of air bubbles. Water was allowed to overflow the neck of the bottle for about 1 min, and the bottle was closed with a glass stopper to avoid air bubbles. The closed bottles were kept in the dark in a cooled box. On the shore, three bottles were selected as T0, and 1.2 mL of manganese (II) chloride solution (concentration 3 mol L−1) was addded, followed by the addition of 1.2 mL of a mixture containing 4 mol L−1 of sodium iodide solution and 8 mol L−1 of sodium hydroxide solution. These samples were processed in the laboratory within 3 h. The remaining bottles were incubated for 24 h at in situ temperature in the IR-box prepared from the MAKROLON IR polycarbonate sheet (4 mm thickness, Professional Plastics, Inc. Fullerton, CA, USA). These panels have a maximum transmittance of 90% in the infrared region 850–2000 nm, 50% at 780 nm, and 0% 10 °C according to our measurements (180 days, Equation 3 in Supplementary File S1). Subsequently, the differences in the Cep Lake’s carbon budget for the surface layer (down to 0.5 m depth) were calculated by multiplying the integrated values by the volume of this layer (Equation 4 and 5 in Supplementary File S1).HCO3
    − incorporationTriplicated water samples (32 mL) were incubated for 3.2–5.2 h in the IR light and the dark at in situ temperature, as described for respiration. Total activity added to each bottle was measured from 1 mL aliquot of the incubated sample that was transferred to a scintillation vial containing 20 μl of 5 mol L−1 NaOH (to prevent a loss of 14C-bicarbonate). Thirty mL of sample was filtered through 2.5 μm nitrate cellulose filters (Pragopor, Prague, Czechia, diameter 25 mm). Five mL of the filtered water was collected and subsequently filtered through a 0.17 μm nitrate cellulose filter. The resulting cell-free filtrate, which contained 14C-DOC was collected. The filtration was done at a low vacuum (0.02 MPa) to avoid cell breakage. The total CO2 assimilation rate was calculated as the sum of all these fractions.The filters were kept in an HCl-saturated atmosphere for 24 h at room temperature in a fume hood. They were placed in scintillation vials and dissolved in 1 mL of ethyl acetate (Penta, Prague, Czechia). Then, 5 mL of Ultima Golt LLT scintillation cocktail (PerkinElmer, Waltham, MA, USA) was added. Five mL of cell-free filtrates were acidified by adding 100 μL 5 mol L−1 HCl to volatilize non-incorporated H14CO3 and incubated 24 h at room temperature in a fume hood. Then, 10 mL of the scintillation cocktail was added. Finally, 5 mL of the scintillation cocktail was added to the total activity samples. Subsequently, the samples were gently mixed and left in the dark for 48 h. The radioactivity in the samples was measured using a Tri-Carb 2810 TR scintillation counter (PerkinElmer).To estimate carbon fluxes (μmol C L–1 h–1), a fraction of the added H14CO3 incorporated or released was corrected for the incubation time and multiplied by the concentration of total dissolved inorganic carbon (DIC). The DIC concentration was calculated based on temperature, pH, and alkalinity measurements (Inolab pH 720, WTW Xylem Inc. Rye Brook, NY, US) determined by Gran titration.Assimilation of organic monomersThe difference between microbial activity in the IR light and dark was also estimated based on assimilation rates of radiolabeled glucose, pyruvate, leucine and thymidine (American Radiolabeled Chemicals, St. Louis, MO, USA). Tritiated glucose (specific activity (SA): 2220 GBq mmol−1), leucine (SA: 4440 GBq mmol−1) and thymidine (SA: 2275.5 GBq mmol−1) were added to 5 mL samples to a final concentration of 5 nmol L−1. 14C-pyruvate (SA: 2.035 GBq mmol−1) was added to a final concentration of 10 nmol L−1. Trichloroacetic acid (TCA) was added to the killed controls to a final concentration of 1%. Samples were incubated for 1 h in the dark and IR light as described for respiration. The incubations were terminated as the killed controls and kept at 4 °C in the dark until processed within 65% except for the samples from 9th May and 29th Aug 2018 (10% each), 1st Aug 2018 (21%), 25th Nov 2018 (1%), and 14th Aug 2019 (22%), Supplementary Fig. S1A). Thus, we decided to concatenate the fastaq files and analyze both fractions together as the total community. This also facilitated statistical analysis, as the activity rates were measured for the whole community without fractionations.Reads quality was evaluated using FastQC v0.11.7 (Babraham Bioinformatics, Cambridge, UK). After primer sequences trimming using Cutadapt [29] (v1.16), the number of reads per sample ranged from 49,354 to 188,942. Subsequent analyses were done in the R/Bioconductor environment using the dada2 package (version 1.14.1) [30]. Forward and reverse reads were truncated to 225 bp and low quality sequences were filtered out with the filterAndTrim function (truncLen = c(225, 225), maxN = 0, maxEE = c(2, 2), truncQ = 2), which reduced the number of reads per sample to range from 30,190 to 143,552. After merging and chimera removal using the removeBimeraDenovo function, 4,893 amplicon sequence variants (ASV) were obtained. Rare ASVs (not seen >3 times in at least 20% of the samples) were removed, which reduced the number of ASVs to 658, and the number of reads to 14,613–69,046 per sample. Taxonomic assignment was done using SILVA 138.1 database [31, 32] released on August 27, 2020. ASVs identified as Chloroplast or Cyanobacteria were excluded from the analyses, giving the final number of 546 ASVs and from 10,819 to 54,799 reads per sample. The bacterial community composition graphs were done using phyloseq [33] and ggplot2 [34] packages.AAP community compositionThe composition of AAP community was analyzed by amplicon sequencing of pufM gene encoding the M subunit of bacterial type-2 reaction centers. This gene is routinely used for diversity studies of AAP bacteria [35].PufM gene amplicons (approx. 245 bp) were prepared using pufM_UniF (5′-GGN AAY YTN TWY TAY AAY CCN TTY CA-3′) and pufM_WAW (5′-AYN GCR AAC CAC CAN GCC CA-3′) primers [36]. PCR was performed in triplicate 20 μL reactions using Phusion™ High-Fidelity DNA Polymerase (Thermo Scientific, USA) with the following reaction conditions: 98 °C for 3 min, 27 cycles at 98 °C for 10 s, 58 °C for 30 s, 72 °C for 30 s, and a final extension at 72 °C for 5 min. The triplicate product reactions for each sample were pooled and gel purified using the kit Wizard SV Gel and PCR Clean-Up System (Promega, USA). The sequencing was performed on the Illumina MiSeq platform (2 × 150 bp) at Macrogen, South Korea.The fastq files were concatenated as described for bacteria communities. The Bray-Curtis similarity between two fractions for each sampling day was >70%, except for the samples from 1st Aug 2018 (47%) and 14th Aug 2019 (18%, Supplementary Fig. S1B).The samples were analyzed as described for bacterial communities. The number of reads per sample ranged from 192,360 to 239,418 after the cutadapt trimming. Forward and reverse reads were truncated to 130 bp, and the number of reads per sample after the quality filtering and denoising ranged from 189,432 to 235,311. Merging the forward and reverse reads with mergePairs function created 12,692 ASVs and reduced the number of reads to 183,136–221,281 per sample. The chimera removal lowered the number of ASVs to 1816, and the number of reads to 159,451–208,679. Rare ASVs (not seen >3 times in at least 20% of the samples) were removed, which resulted in the final 566 ASVs, and a number of reads ranging from 155,915 to 203,021 per sample. A manually curated taxonomic database was used for taxonomic assignment following the naïve Bayesian classifier method [37]. It contained 1580 unique pufM sequences, downloaded from the Fungene repository on May 16, 2019 (http://fungene.cme.msu.edu [38]), from metagenomes from the Římov Reservoir [39, 40] and from the Genome Taxonomy database accessed on September 16, 2020 [41].Statistical analysisLinear mixed-effects models were calculated in R (version 3.6.2) using lme function from the nlme package (version 3.1.143) on untransformed activity data and log10 transformed environmental variables [42]. Models’ parameters were estimated using maximum likelihood method and their significance was tested with ANOVA. Relationships between the activity measures, the environmental variables and the composition of AAP communities were investigated with distance-based linear models (DistLM) [43, 44] in Primer (version 7.0.13) with PERMANOVA + 1 add on (e-Primer, Plymouth, UK) [45]. The sequence reads were transformed with the varianceStabilizingTransformation function of the DESeq2 package [46] (version 1.14.1, blind = FALSE, fitType = “mean”).Data accessibilityThe sequences of 16S and pufM amplicons that support the findings of this study have been deposited in the EMBL database as the BioProject with the accession number PRJEB41596, together with most of the environmental metadata. The scripts and the remaining data supporting the results are included in the Supplementary Material. More