More stories

  • in

    Experimental inoculation trial to determine the effects of temperature and humidity on White-nose Syndrome in hibernating bats

    All methods in this study were approved by the Institutional Animal Care and Use Committee at Texas Tech University (protocol 18032-12). All procedures were performed in accordance with relevant guidelines in the manuscript and the ARRIVE (Animal Research: Reporting of In Vivo Experiments) guidelines (https://arriveguidelines.org/).Experimental design for testing effects of temperature and humidity on Pd infection severity on Perimyotis subflavus
    We randomly assigned bats to seven environmental chambers (Caron, Model 7000-33-1, Marietta, Ohio, USA) in a blocked experimental design, controlling temperature and humidity in each chamber (Fig. 1). In each environmental chamber, we divided bats into two cages (23 × 38 × 50 cm) constructed from mesh fabric (Part FMLF, Seattle Fabrics, Inc., Seattle, Washington, USA), PVC pipe, and plastic sheeting. We stratified random assignment to ensure even distribution of initial body mass and sex across microclimate treatments. In addition to the seven treatments with fixed temperature and humidity conditions, we had two treatments that allowed bats to freely move among temperature or humidity conditions (Fig. 1). One group of bats (n = 14) was free to move among three chambers with a common temperature (8 °C) but different humidity (water vapor pressure deficit (VPD) = 0.05 kPa, 0.10 kPa, or 0.15 kPa, corresponding to 95, 90, and 85% relative humidity (RH))36. A second group of bats (n = 14) was free to move among three chambers with a common VPD condition (0.10 kPa, medium humidity) but different temperatures (5, 8, or 11 °C) (Fig. 1). Because our research questions were focused on comparing the effect of temperature and humidity conditions on disease severity, we did not include sham-inoculated control animals in the experiment. We made this decision to reduce the total number of animals used in the experiment and to maximize replication to test the effects of temperature and humidity on disease.Figure 1Schematic of the experimental design and sample sizes with 7 environmental chambers with fixed temperature and humidity conditions and two sets of connected chambers allowing bats to behaviorally select temperature (left) or humidity conditions (bottom) for the infection trial on tri-colored bats (Perimyotis subflavus). Water loss conditions were based on water vapor pressure deficit (VPD) levels set to 0.05 kPA to produce low potential evaporative water loss (pEWL) for high humidity, 0.10 kPa for medium pEWL and humidity, or 0.15 kPA for high pEWL and low humidity. Numbers are sample sizes of bats assigned to separate cages within each chamber. Bats in the low temperature and high humidity chamber were combined into a single cage after a camera failed at the start of the experiment (top right).Full size imageWe inoculated each bat by spreading 20 µL of Pd solution (5 × 105 conidia µL−1) evenly across both wings, following established protocols8,9,32,37; treatments were conducted blind without knowledge of which bat was being assigned to what group and bats were inoculated in no particular order to reduce the confounding influence on the order of treatment. We used a Pd strain collected by Karen J. Vanderwolf at Trent University from naturally infected Myotis lucifugus. We cultured Pd on Sabouraud Dextrose Agar with chloramphenicol and gentamicin (SabDex) (Part L96359, Fisher Scientific, Houston, Texas, USA) and incubated subcultured plates at 10 °C for 60 days to allow the formation of conidia. We then harvested conidia by flooding plates with phosphate buffered saline solution containing 0.5% Tween20 (PBST). Conidia were resuspended in PBST, enumerated, and diluted to the inoculum concentration8.Microclimate treatment conditionsWe used three temperatures 5, 8, or 11 °C to represent a range of roosting temperatures of P. subflavus in natural hibernacula24,29. We set humidity in environmental chambers to achieve specific levels of water vapor pressure deficit (VPD) between the surface of the bat and the environment because relative humidity varies by temperature36. Higher VPD corresponds to drier air resulting in higher potential evaporative water loss (pEWL). We used three levels of VPD: 0.05, 0.10, or 0.15 kPa corresponding to low pEWL (high humidity), medium pEWL (medium humidity), and high pEWL (low humidity) levels (Fig. 1). We verified the ambient temperature and relative humidity in each chamber at 10-min intervals (Hobo Model U23-001, Onset Computer Corporation, Bourne, Massachussetts, USA). For bats in the connected chambers that could behaviorally select their temperature and humidity conditions, we quantified the number of days bats spent in each condition38.Animal handling and data collectionWe used 98 (42 females, 56 males) tricolored bats collected on 10 December 2018 from culverts in Mississippi and transported directly to Texas Tech University39. We took morphometric measurements (body mass ± 0.1 g, forearm length ± 0.1 mm) and used quantitative magnetic resonance (QMR; Echo-MRI-B, Echo Medical Systems, Houston, Texas, USA) to determine pre-hibernation fat at the start of the experiment39,40. As an indicator of pre-hibernation stress, we collected a fur sample from the dorsal intrascapular region to quantify fur cortisol concentration with a commercial ELISA kit, following the manufacturer’s protocol (Arbor Assays, Michigan, USA) (see Supplemental Methods). Fur is moulted once per year in the late summer period41 and therefore fur cortisol reflects the level of circulating cortisol during the period of fur growth prior to hibernation. We attached a uniquely marked, modified datalogger42 (DS1925L iButton, Maxim Integrated, San Jose, California, USA) to the back of each bat using ostomy cement to record skin temperature39. Prior to inoculation, we swabbed bats with a sterile polyester swab (Fisherbrand synthetic tipped applicators 23-400-116) five times on forearm and five times on muzzle to determine if any bats were naturally infected with Pd at time of collection. Swabs were stored in RNAlater at  − 20 °C until testing using quantitative polymerase chain reaction (qPCR) at Northern Arizona University43.During the experiment, we provided ad libitum drinking water in each cage but did not provide food. We secured a motion-activated infrared camera (Model HT5940T, Speco Technologies, New York, New York, USA) above each cage to monitor bats throughout the experiment. Because one camera failed at the start of the experiment, we combined bats in that treatment chamber into a single cage (Fig. 1) and replicated this disturbance among all chambers. We monitored bats without disturbance by reviewing video recordings daily. Three bats died of unknown cause before the end of the experiment and were removed from analyses.After 83 days of hibernation, we terminated the experiment and bats were removed from cages and processed to determine body condition using QMR39. We took respirometry measurements on a subset of animals38, and swabbed for Pd as described above. We photographed the left ventral wing using ultraviolet (UV) transillumination (368-nm wavelength and 2-s exposure) to detect and measure florescence associated with Pd infection37,44. For histology, we removed the wing section from the fifth digit and the body and rolled wing tissue around dental wax dowels and 10% neutral buffered formalin. We collected a 90–110 µL blood sample in lithium-heparin-treated capillary tubes for immediate analysis of blood chemistry with a handheld analyzer (i-STAT1 Vet Scan, Abaxis, Union City, California, USA). Using an EC8+ cartridge, we measured sodium, potassium, chloride, anion gap, glucose, BUN (urea nitrogen), hematocrit, hemoglobin, pH, pCO2, TCO2, HCO3, and base excess (Table S1). We quantified arousals from torpor as reported by McGuire et al.39. All bats were handled and euthanized under Animal Care and Use Committee permit 18032-12 at Texas Tech University.Infection and disease metricsWe used several metrics to determine pathogen and disease presence and severity37: presence and amount of the pathogen, Pd, on a bat were determined by qPCR43, and presence of the disease, WNS, was determined via detection of orange-yellow florescence under UV light characteristic of Pd infection44 and histological presence of characteristic lesions and pustules with fungal hyphae45,46. Three types of cutaneous infection were described histologically, including characteristic cupping erosions with fungal hyphae, neutrophilic pustules with fungal hyphae, and fungal hyphae in the stratum corneum with dermal necrosis. Any bats with any of these three conditions noted were scored as WNS positive by histology. Presence and quantity of DNA of Pd was tested by qPCR at Northern Arizona University. All samples were run in duplicate and considered positive if at least one run was positive below a cycle threshold (Ct) of 40 and quantified using a quantification curve from serial dilutions (nanograms of Pd using the equation load = 10((22.049-Ct value)/3.34789), r2 = 0.986)47. Load values were averaged across multiple runs and then converted to attograms by multiplying loads in nanograms by 109.Statistical analysesWe used three different response variables (Pd prevalence, Pd loads, and WNS prevalence by histology) to determine whether infection status varied by microclimate treatment conditions. Low sample sizes of positive infection status by UV detection (n = 4) precluded use in statistical analyses (Table 1). We used generalized linear models with binomial distribution for analyses of Pd prevalence and WNS prevalence and a linear mixed effects model with Gaussian errors for Pd loads. Although the experiment was designed with replication at the cage level to account for cage effects, we were unable to include cage as a random effect because of the low numbers of bats that had signs of Pd or WNS infection. We analyzed whether infection status (i.e., Pd prevalence, Pd load, or WNS prevalence) varied by sex and cortisol separately from an a priori candidate model set (Table 2) to cope efficiently with small sample sizes. We first asked whether infection response varied by sex to determine if bats could be pooled in subsequent analyses. We analyzed separately whether infection response varied by pre-hibernation cortisol at the start of the experiment on the subset of animals for which we had cortisol measurements (n = 83). We then used an information-theoretic approach comparing a candidate set of models with Akaike Information Criterion (AIC)48 using initial fat mass as an individual covariate and temperature and humidity treatment conditions as categorical treatment groups to assess the effect of microclimate on infection response (Table 2). Bats behaviorally selecting their temperature and humidity conditions were assigned to a temperature or humidity treatment level if a bat spent  > 89% of captive days at that condition or was otherwise placed in an ‘inconstant condition’ treatment group. For WNS prevalence, we used the bias reduction method implemented in package brglm49 to deal with complete separation present in the data (in some treatments all bats were scored as negative for WNS) (Table 1; Fig. 2).Table 1 Signs of Pd infection or WNS disease for tri-colored bats (Perimyotis subflavus) exposed to different temperature and humidity regimes.Full size tableTable 2 Model selection results for model comparisons of humidity and temperature and pre-hibernation fat mass on Pd prevalence, Pd load, and WNS prevalence.Full size tableFigure 2Signs of Pseudogymnoascus destructans (Pd) infection or white-nose syndrome (WNS) disease for tri-colored bats (Perimyotis subflavus) exposed to different temperature and humidity regimes. (A) Fraction of bats with Pd detected by qPCR; (B) Fraction of bats with signs of WNS disease by histology, and (C) Mean quantity of Pd on bats at the end of the experiment. There was no statistical support for differences between temperature or humidity treatments for any response metrics. Points are estimated means and vertical lines show binomial standard error for prevalence and standard errors for Pd load.Full size imageBecause this was the first captive hibernation experiment with P. subflavus, we investigated the effects of temperature and humidity on the hibernation physiology of the species38,39 and how physiological markers (e.g., blood chemistry) may be associated with disease. To determine if physiological indicators were related to infection status at the end of the experiment, we compared total number of torpor arousal bouts during the experiment and 13 different blood chemistry metrics from blood samples taken at the end of the experiment and used t-test comparisons (at α = 0.05) for each metric between Pd/WNS positive and negative bats. We designated bats as Pd/WNS positive if a bat tested positive for either Pd or WNS by qPCR, UV, or histology. We used Program R version 3.6.2 to conduct all analyses.Experimental design for testing effects of temperature and humidity on Pd growth on substratesWe used five environmental chambers (CARON, Model 7000-33-1, Marietta, Ohio, USA) to test for the effects of temperature and humidity on fungal growth on natural and artificial substrates (Fig. S1). Our experimental design comprised a reduced temperature series and humidity gradient than what we used for the experiment on bats. In the humidity gradient, temperature was held constant at 8 °C, with 85%, 90%, and 95% RH representing our low, medium, and high humidity treatments, respectively. In the temperature series, vapor pressure deficit (VPD) was held constant across the low (5 °C), medium (8 °C), and high (11 °C) temperatures (VPD = nominally 0.01 kPa, range (0.105–0.107). The chamber set to 8 °C and 90% humidity (VPD = 0.107 kPa) was common to both series.Media plate inoculation and fungal growth measurementWe constructed modified plate lids to prevent contamination while allowing humidity to equilibrate across the plate lid. We drilled 14 equidistant holes (5.5 mm diameter) into each plate lid and hot glued a piece of circular filter paper to the top of the lid. Lids were then disinfected thoroughly with a hydrogen peroxide wipe before being placed in a disinfected, sealed storage container.We prepared Pd inoculum as described above for the infection trial on bats. We inoculated 30 SabDex plates with 100 µL of inoculum at a concentration of 20 conidia µL−1 by serial dilution with a starting concentration of 2.0 × 104 conidia µL−1 diluted four times by a factor of 10. We used sterile, individually wrapped 1-µL plastic inoculation loops to spread the inoculum evenly across the surface of the plates, added the modified plate lids, and immediately transferred plates into environmental chambers. We included six replicate plates in each of the five microclimate conditions.We took weekly digital photographs (Nikon, Model 26524, Tokyo, Japan) of each plate for the 5-week duration of the experiment (Fig. 3A). Our camera was mounted on a tripod to ensure consistent placement of plates relative to the camera. Each photo included a ruler, which was used to calibrate measurements made in ImageJ (Version 2.0.0-rc-69/1.52p, National Institutes of Health, Bethesda, Maryland, USA). One observer made all measurements for consistency. We used the freehand selection tool to trace the boundary of each fungal colony using a drawing tablet (Wacom, Model CTL-490, Kazo, Saitama, Japan). From these selections, we obtained the total surface area growth as the sum of all area selection (in cm2).Figure 3Examples demonstrate the process of measuring and estimating fungal growth of Pseudogymnoascus destructans (Pd) on media plates in temperature and humidity treatment conditions. (A) Examples of fungal growth on media plates measured at days 7, 14, 21, 28, and 34 from two of the treatment conditions (11 °C, 92% RH and 5 °C, 88% RH). (B) Examples of estimating maximum growth rate and latency variables from fungal growth measurements in panel A. We fit a sigmoidal curve to describe fungal growth (thick solid black line) to estimate the inflection point of the curve (vertical solid line). We calculated the slope (solid red line) at the inflection point of the curve to estimate maximum growth rate, and the days until total growth area reached 2.5 cm2 (dashed red lines) as an estimate of latency.Full size imageWe modelled the growth of Pd on each plate as a sigmoidal curve (Fig. 3B), which we fit using the SSlogis and nls functions in Program R v. 3.6.350. The model fitting function provides an estimate of the inflection point of the curve, and we calculated the slope at the inflection point to estimate the maximum growth rate. We also estimated the latency to rapid fungal growth on the plates by determining the date at which the total area of fungus on the plate reached 2.5 cm2 as an arbitrary threshold.We also quantified growth of individual colonies. To avoid biasing growth rate estimates, we excluded colonies that intercepted another colony by choosing independent colonies at the final time point and tracking them backwards through time. If there were fewer than 10 independent colonies at the final time point, we added additional unimpeded colonies with each earlier time point until the total number of colonies reached 10. We modelled growth of individual colonies following the same procedure as for total area of growth on the plate, with an arbitrary threshold of 0.05 cm2 for latency calculations. We used linear mixed models to test for the effects of temperature and humidity on maximum growth rate or latency, including plate as a random factor to account for measuring multiple colonies per plate.Rock inoculation and fungal growth measurementTo evaluate fungal growth and persistence on a natural substrate, we inoculated pieces of sandstone flagstone. We etched a 4 × 6 sampling grid, composed of 5 × 5 cm squares, onto the surface of each sandstone rock (Texas Rock and Flagstone, Lubbock, Texas, USA), where each square served as a sampling unit (Fig. S2). Each row represented a time series for a single replicate, while each column was composed of replicates for the respective time point. Rocks were then autoclaved at 121 °C for 40 min and stored individually in a disinfected, sealed container until inoculation. At the time of inoculation, we evenly spread 200 µL of inoculum (2.5 × 104 conidia µL−1) across each sampling square and immediately transferred the rock to an environmental chamber.We measured fungal growth at days 0, 14, 28, and 56. We used a sterile cotton swab to collect fungal DNA from each sampling square. Swabs were moistened with RNAlater and rolled horizontally, vertically, and diagonally across the surface of the sampling square to ensure contact with the total surface area. One researcher collected all swabs to maximize consistency among swabs collected throughout the experiment. Swabs were placed in RNAlater and stored at − 20 °C until shipped to Northern Arizona University for qPCR analysis43. We quantified fungal loads for each swab sample from qPCR using the quantification curve provided above and normalized fungal loads to the value at day zero for each rock respectively. We then used linear models to test for effects of temperature and humidity on changes in fungal load (log transformed) over time.To evaluate viability of Pd, we swabbed the entire inoculated surface of each rock at the end of the experiment and vortexed the swabs in RNAlater for one minute to release fungal DNA from the swab. We then applied 100 µL of RNAlater fungal solution from each rock to a respective SabDex media plate, using a sterile inoculation loop. After 2 weeks of incubation at 11 °C and 92% RH, we visually assessed plates for presence of fungal growth to determine viability of Pd collected from rocks at the end of the growth experiment. More

  • in

    Exploring how functional traits modulate species distributions along topographic gradients in Baxian Mountain, North China

    1.Díaz, S., Cabido, M. & Casanoves, F. Functional implications of trait-environment linkages in plant communities. Ecolog. Assem. Rules Perspect. Adv. Retreat. 26, 338–362 (1999).
    Google Scholar 
    2.Ordoñez, J. C. et al. A global study of relationships between leaf traits, climate and soil measures of nutrient fertility. Glob. Ecol. Biogeogr. 18(2), 137–149. https://doi.org/10.1111/j.1466-8238.2008.00441.x (2009).Article 

    Google Scholar 
    3.Westoby, M., Falster, D. S., Moles, A. T., Vesk, P. A. & Wright, I. J. Plant ecological strategies: some leading dimensions of variation between species. Annu. Rev. Ecol. Syst. 33(1), 125–159 (2002).
    Google Scholar 
    4.Brown, A. M. et al. The fourth-corner solution–using predictive models to understand how species traits interact with the environment. Methods Ecol. Evol. 5(4), 344–352. https://doi.org/10.1111/2041-210X.12163 (2014).Article 

    Google Scholar 
    5.Jamil, T., Ozinga, W. A., Kleyer, M. & ter Braak, C. J. F. Selecting traits that explain species–environment relationships: a generalized linear mixed model approach. J. Veg. Sci. 24(6), 988–1000 (2013).
    Google Scholar 
    6.Pollock, L. J., Morris, W. K. & Vesk, P. A. The role of functional traits in species distributions revealed through a hierarchical model. Ecography 35(8), 716–725 (2012).
    Google Scholar 
    7.Elith, J. & Leathwick, J. R. Species distribution models: ecological explanation and prediction across space and time. Annu. Rev. Ecol. Evol. Syst. 40, 677–697 (2009).
    Google Scholar 
    8.Moeslund, J. E., Arge, L., Bøcher, P. K., Dalgaard, T. & Svenning, J.-C. Topography as a driver of local terrestrial vascular plant diversity patterns. Nord. J. Bot. 31(2), 129–144. https://doi.org/10.1111/j.1756-1051.2013.00082.x (2013).Article 

    Google Scholar 
    9.Burnett, B. N., Meyer, G. A. & McFadden, L. D. Aspect-related microclimatic influences on slope forms and processes, Northeastern Arizona. J. Geophys. Res. Earth Surf. 113(3), 129. https://doi.org/10.1029/2007JF000789 (2008).Article 

    Google Scholar 
    10.Hais, M., Chytrý, M. & Horsák, M. Exposure-related forest-steppe: a diverse landscape type determined by topography and climate. J. Arid Environ. 135, 75–84. https://doi.org/10.1016/j.jaridenv.2016.08.011 (2016).ADS 
    Article 

    Google Scholar 
    11.Holden, Z. A. & Jolly, W. M. Modeling topographic influences on fuel moisture and fire danger in complex terrain to improve wildland fire management decision support. Forest Ecol. Manag. 262(12), 2133–2141. https://doi.org/10.1016/j.foreco.2011.08.002 (2011).Article 

    Google Scholar 
    12.Dyer, J. M. Assessing topographic patterns in moisture use and stress using a water balance approach. Landscape Ecol. 24(3), 391–403. https://doi.org/10.1007/s10980-008-9316-6 (2009).Article 

    Google Scholar 
    13.Lan, G., Hu, Y., Cao, M. & Zhu, H. Topography related spatial distribution of dominant tree species in a tropical seasonal rain forest in China. Forest Ecol. Manag. 262(8), 1507–1513. https://doi.org/10.1016/j.foreco.2011.06.052 (2011).Article 

    Google Scholar 
    14.Punchi-Manage, R. et al. Effects of topography on structuring local species assemblages in a Sri Lankan mixed dipterocarp forest. J. Ecol. 101(1), 149–160. https://doi.org/10.1111/1365-2745.12017 (2013).Article 

    Google Scholar 
    15.Rubino, D. L. & McCarthy, B. C. Evaluation of coarse woody debris and forest vegetation across topographic gradients in a southern Ohio forest. Forest Ecol. Manag. 183(1), 221–238. https://doi.org/10.1016/S0378-1127(03)00108-7 (2003).Article 

    Google Scholar 
    16.Sefidi, K., Esfandiary Darabad, F. & Azaryan, M. Effect of topography on tree species composition and volume of coarse woody debris in an Oriental beech (Fagus orientalis Lipsky) old growth forests, northern Iran. IForest-Biogeosciences and Forestry 9(4), 658 (2016).
    Google Scholar 
    17.Liu, J., Yunhong, T. & Slik, J. F. Topography related habitat associations of tree species traits, composition and diversity in a Chinese tropical forest. Forest Ecol. Manag. 330, 75–81 (2014).
    Google Scholar 
    18.Díaz, S. et al. The global spectrum of plant form and function. Nature 529(7585), 167 (2016).ADS 
    PubMed 

    Google Scholar 
    19.Westoby, M. A leaf-height-seed (LHS) plant ecology strategy scheme. Plant Soil 199(2), 213–227 (1998).CAS 

    Google Scholar 
    20.King, D. A. The adaptive significance of tree height. Am. Nat. 135(6), 809–828 (1990).
    Google Scholar 
    21.Koch, G. W., Sillett, S. C., Jennings, G. M. & Davis, S. D. The limits to tree height. Nature 428(6985), 851–854 (2004).ADS 
    CAS 
    PubMed 

    Google Scholar 
    22.Mäkelä, A. Implications of the pipe model theory on dry matter partitioning and height growth in trees. J. Theor. Biol. 123(1), 103–120 (1986).ADS 

    Google Scholar 
    23.King, D. Tree dimensions: maximizing the rate of height growth in dense stands. Oecologia 51(3), 351–356 (1981).ADS 
    PubMed 

    Google Scholar 
    24.Hoch, G., Popp, M. & Körner, C. Altitudinal increase of mobile carbon pools in Pinus cembra suggests sink limitation of growth at the Swiss treeline. Oikos 98(3), 361–374. https://doi.org/10.1034/j.1600-0706.2002.980301.x (2002).CAS 
    Article 

    Google Scholar 
    25.Körner, C. A re-assessment of high elevation treeline positions and their explanation. Oecologia 115(4), 445–459 (1998).ADS 
    PubMed 

    Google Scholar 
    26.Hoch, G. & Körner, C. Growth and carbon relations of tree line forming conifers at constant vs. variable low temperatures. J. Ecol. 97(1), 57–66. https://doi.org/10.1111/j.1365-2745.2008.01447.x (2009).Article 

    Google Scholar 
    27.Hoch, G. & Körner, C. Global patterns of mobile carbon stores in trees at the high-elevation tree line. Glob. Ecol. Biogeogr. 21(8), 861–871. https://doi.org/10.1111/j.1466-8238.2011.00731.x (2012).Article 

    Google Scholar 
    28.Shi, P., Körner, C. & Hoch, G. A test of the growth-limitation theory for alpine tree line formation in evergreen and deciduous taxa of the eastern Himalayas. Funct. Ecol. 22(2), 213–220. https://doi.org/10.1111/j.1365-2435.2007.01370.x (2008).Article 

    Google Scholar 
    29.Nagelmüller, S., Hiltbrunner, E. & Körner, C. Low temperature limits for root growth in alpine species are set by cell differentiation. AoB Plants https://doi.org/10.1093/aobpla/plx054 (2017).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    30.Hendrickson, L., Ball, M. C., Wood, J. T., Chow, W. S. & Furbank, R. T. Low temperature effects on photosynthesis and growth of grapevine. Plant Cell Environ. 27(7), 795–809. https://doi.org/10.1111/j.1365-3040.2004.01184.x (2004).CAS 
    Article 

    Google Scholar 
    31.Körner, C. & Hoch, G. A test of treeline theory on a montane permafrost island. Arct. Antarct. Alp. Res. 38(1), 113–119 (2006).
    Google Scholar 
    32.Muller-Landau, H. C. The tolerance–fecundity trade-off and the maintenance of diversity in seed size. Proc. Natl. Acad. Sci. 107(9), 4242–4247 (2010).ADS 
    PubMed 
    PubMed Central 

    Google Scholar 
    33.Lloret, F., Casanovas, C. & Peñuelas, J. Seedling survival of Mediterranean shrubland species in relation to root: shoot ratio, seed size and water and nitrogen use. Funct. Ecol. 13(2), 210–216. https://doi.org/10.1046/j.1365-2435.1999.00309.x (1999).Article 

    Google Scholar 
    34.Quero, J. L., Villar, R., Marañón, T., Zamora, R. & Poorter, L. Seed-mass effects in four Mediterranean Quercus species (Fagaceae) growing in contrasting light environments. Am. J. Bot. 94(11), 1795–1803. https://doi.org/10.3732/ajb.94.11.1795 (2007).Article 
    PubMed 

    Google Scholar 
    35.Hallett, L. M., Standish, R. J. & Hobbs, R. J. Seed mass and summer drought survival in a Mediterranean-climate ecosystem. Plant Ecol. 212(9), 1479. https://doi.org/10.1007/s11258-011-9922-2 (2011).Article 

    Google Scholar 
    36.McFadden, I. R. et al. Disentangling the functional trait correlates of spatial aggregation in tropical forest trees. Ecology 100(3), e02591. https://doi.org/10.1002/ecy.2591 (2019).Article 
    PubMed 

    Google Scholar 
    37.Moles, A. T. & Westoby, M. Seedling survival and seed size: a synthesis of the literature. J. Ecol. 92(3), 372–383. https://doi.org/10.1111/j.0022-0477.2004.00884.x (2004).Article 

    Google Scholar 
    38.Shipley, B. et al. Predicting habitat affinities of plant species using commonly measured functional traits. J. Veg. Sci. 28(5), 1082–1095. https://doi.org/10.1111/jvs.12554 (2017).Article 

    Google Scholar 
    39.Willson, C. J. & Jackson, R. B. Xylem cavitation caused by drought and freezing stress in four co-occurring Juniperus species. Physiol. Plant. 127(3), 374–382 (2006).CAS 

    Google Scholar 
    40.Peguero-Pina, J. J. et al. Hydraulic traits are associated with the distribution range of two closely related Mediterranean firs, Abies alba Mill. and Abies pinsapo Boiss. Tree Physiol. 31(10), 1067–1075 (2011).PubMed 

    Google Scholar 
    41.Tyree, M. & Sperry, J. Vulnerability of xylem to cavitation and embolism. Ann. Rev. Plant Biol 40, 19–36 (1989).
    Google Scholar 
    42.Wubbels, J. (2010). Tree Species Distribution in Relation to Stem Hydraulic Traits and Soil Moisture in a Mixed Hardwood Forest in Central Pennsylvania.43.Perez-Harguindeguy, N. et al. Corrigendum to: new handbook for standardised measurement of plant functional traits worldwide. Aust. J. Bot. 64(8), 715–716 (2016).
    Google Scholar 
    44.Oliveira, R. S. et al. Embolism resistance drives the distribution of Amazonian rainforest tree species along hydro-topographic gradients. New Phytol. 221(3), 1457–1465 (2019).PubMed 

    Google Scholar 
    45.Ahrens, C. W., Rymer, P. D. & Tissue, D. T. Intra-specific trait variation remains hidden in the environment. New Phytol. 2, 1183–1185 (2021).
    Google Scholar 
    46.Siefert, A. et al. A global meta-analysis of the relative extent of intraspecific trait variation in plant communities. Ecol. Lett. 18(12), 1406–1419 (2015).PubMed 

    Google Scholar 
    47.Benito Garzón, M., Alía, R., Robson, T. M. & Zavala, M. A. Intra-specific variability and plasticity influence potential tree species distributions under climate change. Glob. Ecol. Biogeogr. 20(5), 766–778 (2011).
    Google Scholar 
    48.Henn, J. J. et al. Intraspecific trait variation and phenotypic plasticity mediate alpine plant species response to climate change. Front. Plant Sci. 9, 1548 (2018).PubMed 
    PubMed Central 

    Google Scholar 
    49.Zhang, B. et al. Species responses to changing precipitation depend on trait plasticity rather than trait means and intraspecific variation. Funct. Ecol. 34(12), 2622–2633 (2020).
    Google Scholar 
    50.Xu, H., Wang, H., Prentice, I. C., Harrison, S. P. & Wright, I. J. Coordination of plant hydraulic and photosynthetic traits: confronting optimality theory with field measurements. New Phytol. 2, 90387 (2021).
    Google Scholar 
    51.Yang, Y. et al. Quantifying leaf-trait covariation and its controls across climates and biomes. New Phytol. 221(1), 155–168 (2019).CAS 
    PubMed 

    Google Scholar 
    52.Li, X., Lu, H., Yu, L. & Yang, K. Comparison of the spatial characteristics of four remotely sensed leaf area index products over China: Direct validation and relative uncertainties. Remote Sens. 10(1), 148 (2018).ADS 

    Google Scholar 
    53.Peel, M. C., Finlayson, B. L. & McMahon, T. A. Updated world map of the Köppen-Geiger climate classification. Sci. Rep. 3, 1069 (2007).
    Google Scholar 
    54.Gittleman, J. L. & Kot, M. Adaptation: statistics and a null model for estimating phylogenetic effects. Syst. Zool. 39(3), 227–241 (1990).
    Google Scholar 
    55.Reich, P. B., Wright, I. J. & Lusk, C. H. Predicting leaf physiology from simple plant and climate attributes: a global GLOPNET analysis. Ecol. Appl. 17(7), 1982–1988 (2007).PubMed 

    Google Scholar 
    56.Leishman, M. R., Wright, I. J., Moles, A. T. & Westoby, M. The evolutionary ecology of seed size. Seeds Ecol. Regener. Plant Commun. 2, 31–57 (2000).
    Google Scholar 
    57.Kattge, J. et al. TRY plant trait database–enhanced coverage and open access. Glob. Change Biol. 26(1), 119–188 (2020).ADS 

    Google Scholar 
    58.Wang, H. et al. The China plant trait database: toward a comprehensive regional compilation of functional traits for land plants. Ecology 99(2), 1039 (2018).
    Google Scholar 
    59.Knapp, B. O., Wang, G. G., Clark, S. L., Pile, L. S. & Schlarbaum, S. E. Leaf physiology and morphology of Castanea dentata (Marsh.) Borkh., Castanea mollissima Blume, and three backcross breeding generations planted in the southern Appalachians, USA. New Forests 45(2), 283–293 (2014).
    Google Scholar 
    60.Chen, L. et al. Seed dispersal and seedling recruitment of trees at different successional stages in a temperate forest in northeastern China. J. Plant Ecol. 7(4), 337–346 (2014).
    Google Scholar 
    61.Marchi, S., Tognetti, R., Minnocci, A., Borghi, M. & Sebastiani, L. Variation in mesophyll anatomy and photosynthetic capacity during leaf development in a deciduous mesophyte fruit tree (Prunus persica) and an evergreen Sclerophyllous Mediterranean shrub (Olea europaea). Trees 22(4), 559 (2008).CAS 

    Google Scholar 
    62.Gelman, A. Scaling regression inputs by dividing by two standard deviations. Stat. Med. 27(15), 2865–2873 (2008).MathSciNet 
    PubMed 

    Google Scholar 
    63.Miller, J. E. D., Damschen, E. I. & Ives, A. R. Functional traits and community composition: a comparison among community-weighted means, weighted correlations, and multilevel models. Methods Ecol. Evol. 10(3), 415–425. https://doi.org/10.1111/2041-210X.13119 (2019).Article 

    Google Scholar 
    64.Chung, Y., Rabe-Hesketh, S., Dorie, V., Gelman, A. & Liu, J. A nondegenerate penalized likelihood estimator for variance parameters in multilevel models. Psychometrika 78(4), 685–709 (2013).MathSciNet 
    PubMed 
    MATH 

    Google Scholar 
    65.Boyd, K., Costa, V. S., Davis, J., & Page, C. D. (2012). Unachievable region in precision-recall space and its effect on empirical evaluation. in Proceedings of the International Conference on Machine Learning. International Conference on Machine Learning, 2012, 349. NIH Public Access.66.Sofaer, H. R., Hoeting, J. A. & Jarnevich, C. S. The area under the precision-recall curve as a performance metric for rare binary events. Methods Ecol. Evol. 10(4), 565–577 (2019).
    Google Scholar 
    67.Grau, J., Grosse, I. & Keilwagen, J. PRROC: computing and visualizing precision-recall and receiver operating characteristic curves in R. Bioinformatics 31(15), 2595–2597 (2015).CAS 
    PubMed 
    PubMed Central 

    Google Scholar 
    68.Keilwagen, J., Grosse, I. & Grau, J. Area under precision-recall curves for weighted and unweighted data. PloS One 9(3), e92209 (2014).ADS 
    PubMed 
    PubMed Central 

    Google Scholar 
    69.Saito, T. & Rehmsmeier, M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PloS One 10(3), e0118432 (2015).PubMed 
    PubMed Central 

    Google Scholar 
    70.R Core Team (2019). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.71.Schmitt, S. et al. Topography consistently drives intra-and inter-specific leaf trait variation within tree species complexes in a Neotropical forest. Oikos 129(10), 1521–1530 (2020).
    Google Scholar  More

  • in

    Easy computation of the Bayes factor to fully quantify Occam’s razor in least-squares fitting and to guide actions

    How many parameters best describe data in muon spectroscopy?Here we find that the Bayes factor demands the inclusion of more physically-meaningful parameters than the BIC or significance tests. Figure 1a presents some data that might reasonably be fitted with as few as three or as many as 22 physically-meaningful parameters. We find that the Bayes factor encourages the inclusion of all these parameters until the onset of over-fitting. Even though many of them have fitted values that fail significance tests (i.e. are consistent with zero), their omission distorts the fitting results severely.Figure 1Full size imageFigure 1a shows an anti-level-crossing spectrum observed in photo-excited muon-spin spectroscopy26 from an organic molecule27. The data are presented in Fig. 2a of Ref.27 and are given in the SI. These spectra are expected to be Lorentzian peaks. Theory permits optical excitation to affect the peak position, the width and the strength (photosensitivity). In the field region over which the measurements are carried out, there is a background from detection of positrons, which has been subtracted from the data presented27. Wang et al.27 did not attempt to fit the data rigorously; they did report a model-independent integration of the data, which demonstrated a change in area and position.The model that we fit hypothesises one or more Lorentzian peaks, with optional photosensitivity on each fitting parameter and with optional linear backgrounds y = a + bx underlying the peaks, described by the full equation given in the SI, equation (S3). To do a single LS fit to all the data, we extend the data to three dimensions, (x gauss, y asymmetry, z) where z = 0 for data in the dark and z = 1 for photoexcited data. Including all the data in a single LS fit in this way, rather than fitting the dark and photoexcited data separately, simplifies both setting up the fit and doing the subsequent analysis.Figure 1b shows the evolution of the SBIC and the lnBF as the number of fitting parameters in the model is increased. Starting with a single Lorentzian peak, three parameters are required, peak position P, width W and intensity A. Three photosensitivity parameters ΔLP, ΔLW and ΔLA are then introduced successively to the fit, (open and small data points for n = 3–6). The SBIC decreases and the lnMLI scarcely increases. It is only with the inclusion of one background term (n = 7) that any figure of merit shows any substantial increase. There is no evidence here for photosensitivity. The weak peak around 7050 G does not seem worth including in a fit, as it is evidenced by only two or three data points and is scarcely outside the error bars. However, a good fit with two peaks (P1 ~ 7210 G, P2 ~ 7150 G, the subscripts 1 and 2 in accordance with the site labelling of Fig. 2a of Ref.27) can be obtained with just five parameters (P1, P2, A1, A2, W). This gives substantial increases in the SBIC and lnMLI, further increased when W1 and W2 are distinguished and then when the single background term and the three photosensitivity parameters ΔLP2, ΔLW2 and ΔLA2 are successively included (solid or large data points for n = 5–10 in Fig. 1b). The SBIC reaches its maximum here, at n = 10, and then decreases substantially when the other three photosensitivity parameters and the other three background terms are included. These additional parameters fail significance tests as well as decreasing the SBIC (Fig. 1b). Conventionally, the n = 10 fit would be accepted as best. The outcome would be reported as two peaks, with significant photo-sensitivities ΔLP2, ΔLW2 and ΔLA2 for all three of the 7150 G peak parameters, but no photosensitivity for the 7210 G peak (Table 1).Table 1 Photosensitivity results of fitting the data of Fig. 1a with 10, 16 and 19 parameters. Parameter units as implied by Fig. 1a.Full size tableThe Bayes factor gives a very different outcome. From 10 to 16 parameters, the Bayes factor between any two of these seven models is close to unity (Fig. 1b). That is, they have approximately equal probability. The Bayes factor shows that what the conventional n = 10 analysis would report is false. Specifically, it is not the case that ΔLP2, reported as − 14 ± 4 G, has a roughly ({raise0.5exhbox{$scriptstyle 2$} kern-0.1em/kern-0.15em lower0.25exhbox{$scriptstyle 3$}}) probability of lying between − 10 and − 18 G. That is not consistent with the roughly equal probability that it lies in the n = 16 range (− 24 ± 8 G). Table 1 shows that at n = 16, ΔLP2 is the only photosensitivity parameter to pass significance tests. ΔLA2, which had the highest significance level at n = 10, is now the parameter most consistent with zero. The other four are suggestively (about 1({raise0.5exhbox{$scriptstyle 1$} kern-0.1em/kern-0.15em lower0.25exhbox{$scriptstyle 2$}})σ) different from zero.Since the Bayes factor has already radically changed the outcome by encouraging more physically-meaningful parameters, it is appropriate to try the 7050 G peak parameters in the fit. With only 28 data-points, we should be alert to over-fitting. We can include P3 and A3 (n = 18), and ΔLP3 (n = 19), but W3 and ΔLA3 do cause overfitting. Figure 1b shows substantial increases of both the SBIC and the lnMLI for n = 18 to n = 20, where the twentieth parameter is in fact ΔLA3. The symptom of over-fitting that we observe here is an increase in the logarithm of the Occam Factor (lnMLI − lnL), the values of which decrease, − 26.9, − 33.5, − 34.8, and then increase, − 33.4, for n = 16, 18, 19 and 20 respectively. Just as lnL must increase with every additional parameter, so should the Occam factor decrease, as the prior parameter volume should increase more with a new parameter than the posterior parameter volume. So we stop at n = 19. The outcome, Table 1, is that the uncertainties on the n = 16 parameters have decreased markedly. This is due to the better fit, with a substantial increase in lnL corresponding to reduced residuals on all the data. The 7210 G peak 2 now has photosensitivities on all its parameters, significant to at least the 2σ or p value ~ 0.05 level. And the photosensitivities ΔLW2 and ΔLA2, both so significant at n = 10, and already dwindling in significance at n = 16, are both now taking values quite consistent with zero. In the light of Table 1, we see that stopping the fit at n = 10 results in completely incorrect results—misleading fitted values, with certainly false uncertainties.Discriminating between models for the pressure dependence of the GaAs bandgapThe main purpose of this example is to show how the Bayes factor can be used to decide between two models which have equal goodness of fit to the data (equal values of lnL and BIC, as well as p values, etc.). This illustrates the distinction it makes between physically-meaningful and physically meaningless parameters. This example also shows how ML fitting can be used together with the Bayes factor to obtain better results. For details, see SI §7.Figure 2 shows two datasets for the pressure dependence of the bandgap of GaAs (data given in the SI). The original authors published quadratic fits, ({E}_{g}(P)={E}_{0}+bP+c{P}^{2}), with b = 10.8 ± 0.3 meV kbar−1 (Goñi et al.28) and 11.6 ± 0.2 meV kbar−1 (Perlin et al.29). Other reported experimental and calculated values for b ranged from 10.02 to 12.3 meV kbar−130. These discrepancies of about ± 10% were attributed to experimental errors in high-pressure experimentation. However, from a comparison of six such datasets, Frogley et al.30 were able to show that the discrepancies arose from fitting the data with the quadratic formula. The different datasets were reconciled by using the Murnaghan equation of state and supposing the band-gap to vary linearly with the density (see SI, §7, equations (S4) and (S5)30. The curvature c of the quadratic is constant, while the curvature of the density, due to the pressure dependence Bʹ of the bulk modulus B0, decreases with pressure—and the six datasets were recorded over very different pressure ranges, as in Fig. 2. So the fitted values of c, c0, were very different, and the correlation between b and c resulted in the variations in b0.Here, using the Bayes factor, we obtain the same result from a single dataset, that of Goñi et al.28 The two fits are shown in Fig. 2. They are equally good, with values of lnL and SBIC the same to 0.01. The key curvature parameters, c and ({text{B}}^{prime }), are both returned as non-zero by 13.5σ (SI, §7, Table S1), consequently both with p-values less than 10−18. However, c is a physically-meaningless parameter. The tightest constraint we have for setting its range is the values previously reported, ranging from 0 to 60 μeV kbar−2, so we use Δc = 100 μeV kbar−2. In contrast, ({text{B}}^{prime }) is known for GaAs to be 4.4931. For many other materials and from theory the range 4–5 is expected, so we use (Delta {text{B}}^{prime } = 1). The other ranges are same for both models (see SI §7). This difference gives a lnBF of 3.8 in favour of the Murnaghan model against the quadratic, which is strong evidence for it. Moreover, the value of ({text{B}}^{prime }) returned is 4.47 ± 0.33, in excellent agreement with the literature value. Had it been far out of range, the model would have to be rejected. The quadratic model is under no such constraint; indeed, a poor fit might be handled by adding cubic and higher terms ad lib. This justifies adding about 5 to lnBF (see “Background in fitting a carbon nanotube Raman spectrum” section), giving a decisive preference to the Murnaghan model, and the value of b it returns, 11.6 ± 0.3. Note the good agreement with the value from Perlin et al.29 If additionally we fix ({mathrm{B}}^{prime}) at its literature value of 4.4931, lnBF is scarcely improved, because the Occam factor against this parameter is small, but the uncertainty on the pressure coefficient, Ξ/B0, is much improved.When we fit the Perlin data, the Murnaghan fit returns ({text{B}}^{prime }) = 6.6 ± 2.4. This is outside range, and indicates that this data cannot give a reliable value—attempting it is over-fitting. However, it is good to fit this data together with the Goñi data. The Perlin data, very precise but at low pressures only, complement the Goñi data with their lower precision but large pressure range. We notice also that the Perlin data has a proportion of outlier data points. Weighted or rescaled LS fitting can handle the different precisions, but it cannot handle the outliers satisfactorily. Maximum Likelihood fitting handles both issues. We construct lnL using different pdfs P(r) for the two datasets, and with a double-Gaussian pdf for the Perlin data (see equation (S6) in the SI §7). Fixing ({text{B}}^{prime }) at 4.49, fitting with the same Ξ/B0 returns 11.42 ± 0.04 meV kbar−1. Separate Ξ/B0 parameters for the two datasets give an increase of lnL of 4.6, with values 11.28 ± 0.06 and 11.60 ± 0.04 meV kbar−1—a difference in b of 0.32 ± 0.07 meV kbar−1, which is significant at 4½σ. This difference could be due to systematic error, e.g. in pressure calibration. Or it could be real. Goñi et al.28 used absorption spectroscopy to measure the band-gap; Perlin et al.29 used photoluminescence. The increase of the electron effective mass with pressure might give rise to the difference. In any case, it is clear that high-pressure experimentation is much more accurate than previously thought, and that ML fitting exploits the information in the data much better than LS fitting.Figure 2GaAs band-gap. Data for Eg(P) in GaAs from Goñi et al.28 (
    ) and from Perlin et al.29 (
    ) are shown after subtraction of the straight line E0 + 8.5P to make the curvature more visible. The Perlin data is expanded × 10 on both axes for clarity. Two least-squares fits to the Goñi data are shown, polynomial (dashed red line) and Murnaghan (solid blue line). (Figure prepared using Mathematica 12.0, www.wolfram.com/mathematica/).Full size imageBackground in fitting a carbon nanotube Raman spectrumThis example demonstrates how the Bayes Factor provides a quantitative answer to the problem, whether we should accept a lower quality of fit to the data if the parameter set is intuitively preferable. It also provides a simple example of a case where the MLI calculated by Eq. (1) is in error and can readily be corrected (see SI §8 Fig. S3).The dataset is a Raman spectrum of the radial breathing modes of a sample of carbon nanotubes under pressure32. The whole spectrum at several pressures is shown with fits in Fig. 1 of Ref.32. The traditional fitting procedure used there was to include Lorentzian peaks for the clear peaks in the spectra, and then to add broad peaks as required to get a good fit, but without quantitative figures of merit and without any attempt to explain the origin of the broad peaks, and therefore with no constraints on their position, widths or intensities. The key issue in the fitting was to get the intensities of the peaks as accurately as possible, to help understand their evolution with pressure. Here, we take a part of the spectrum recorded at 0.23 GPa (the data is given in the SI.) and we monitor the quality of fit and the Bayes factor while parameters are added in four models. This part of the spectrum has seven sharp pseudo-Voigt peaks (Fig. 3a; the two strong peaks are clearly doublets). With seven peak positions Pi, peak widths Wi and peak intensities Ai, and a factor describing the Gaussian content in the pseudo-Voigt peak shape, there are already 22 parameters (for details, see SI §8). This gives a visibly very poor fit, with lnL = − 440, SBIC = − 510 and lnMLI = − 546. The ranges chosen for these parameters for calculating the MLI (see SI §8) are not important because they are used in all the subsequent models, and so they cancel out in the Bayes factors between the models.Figure 3Carbon nanotube Raman spectrum. In (a), the carbon nanotube Raman spectrum is plotted (black datapoints) with a fit (cyan solid line) using the Fourier model. The residuals for four good fits are shown, × 10 and displaced successively downwards (Fourier, Polynomial, Peaks and Tails; all at lnL about − 60, see text). The backgrounds are shown, × 8 (long dashed, chain-dotted, short dashed and solid, respectively. In (b), the evolution of the MLIs is shown against the number of parameters for these four models. (Figure prepared using Mathematica 12.0, www.wolfram.com/mathematica/).Full size imageTo improve the fit, in the Fourier model we add a Fourier background (y=sum {c}_{i}mathrm{cos}ix+{s}_{i}mathrm{sin}ix) (i = 0,..) and in the Polynomial model, we add (y=sum {a}_{i}{x}^{i}) (i = 0,..) for the background. In both, the variable x is centred (x = 0) at the centre of the fitted spectrum and scaled to be ± π or ± 1 at the ends. In the Peaks model we add extra broad peaks as background, invoking extra parameter triplets (Pi, Wi, Ai). These three models all gave good fits; at the stage shown in Fig. 3a they gave lnL values of − 65, − 54 and − 51 and BIC values of − 156, − 153 and − 148 respectively. Thus there is not much to choose between the three models, but it is noteworthy that they give quite different values for the intensities of the weaker peaks, with the peak at 265 cm−1 at 20.5 ± 1.1, 25.5 ± 1.3 and 27 ± 1.7 respectively (this is related to the curvature of the background function under the peak). So it is important to choose wisely.A fourth model was motivated by the observation that the three backgrounds look as if they are related to the sharp peaks, rather like heavily broadened replicas (see Fig. 3a). Accordingly, in the fourth model, we use no background apart from the zeroth term c0 or a0 to account for dark current). Instead, the peak shape is modified, giving it stronger, fatter tails than the pseudo-Voigt peaks (Tails model). This was done by adding to the Lorentzian peak function a smooth function approximating to exponential tails on both sides of the peak position (for details, see SI §8) with widths and amplitudes as fitting parameters. What is added may be considered as background and is shown in Fig. 3a. This model, at the stage of Fig. 3a, returned lnL = − 62, BIC = − 146, and yet another, much smaller value of 15.5 ± 1.0 for the intensity of the 265 cm−1 peak.The Tails model is intuitively preferable to the other three because it does not span the data space—e.g. if there was really were broad peaks at the positions identified by the Peaks model, or elsewhere, the Tails model could not fit them well. That it does fit the data is intuitively strong evidence for its correctness. The Bayes factor confirms this intuition quantitatively. At the stage of Fig. 3a, the lnMLI values are − 251, − 237 and − 223 for the Fourier, Poly and Peaks models, and − 211 for the Tails model. This gives a lnBF value of 12 for the Tails model over the Peaks model—decisive—and still larger lnBF values for these models over the Fourier and Poly models.All models can be taken further, with more fitting parameters. More Fourier or polynomial terms or more peaks can be added, and for the Tails model more parameters distinguishing the tails attached to each of the seven Lorentizian peaks. In this way, the three background models can improve to a lnL ~ − 20; the Tails model does not improve above lnL ~ − 50. However, as seen in Fig. 3b, the MLIs get worse with too many parameters, except when over-fitting occurs, as seen for the Poly model at 35 parameters. The Tails model retains its positive lnBF  > 10 over the other models.The other models can have an indefinite number of additional parameters—more coefficients or more peaks, to fit any data set. It is in this sense that they span the data space. The actual number used is therefore itself a fitting parameter, with an uncertainty perhaps of the order of ± 1, and a range from 0 to perhaps a quarter or a half of the number of data points m. We may therefore penalise their lnMLIs by ~ ln 4 m−1 or about − 5 for a few hundred data points. This takes Tails to a lnBF  > 15 over the other models—overwhelmingly decisive. This quantifies the intuition that a model that is not guaranteed to fit the data, but which does, is preferable to a model that certainly can fit the data because it spans the data space. It quantifies the question, how much worse a quality of fit should we accept for a model that is intuitively more satisfying. Here we accept a loss of − 30 on lnL for a greater gain of + 45 in the Occam factor. It quantifies the argument that the Tails model is the most worthy of further investigation because the fat tails probably have a physical interpretation worth seeking. In this context, it is interesting that in Fig. 3a fat tails have been added only to the 250, 265 and 299 cm−1 peaks; adding fat tails to the others did not improve the fit; however, a full analysis and interpretation is outside the scope of this paper. In the Peaks model it is not probable (though possible) that the extra peaks would have physical meaning. In the other two models it is certainly not the case that their Fourier or polynomial coefficients will have physical meaning. More

  • in

    Resolving the structure of phage–bacteria interactions in the context of natural diversity

    SamplingEnvironmental samplingSamples were collected from the littoral marine zone at Canoe Cove, Nahant, Massachusetts, USA, on 22 August (ordinal day 222), 18 September (261) and 13 October (286) 2010, during the course of the three month Nahant Collection Time Series sampling11.Bacterial isolation and characterizationBacterial isolationBacterial strains were isolated from water samples using a fractionation-based approach7 as previously described19,20. In brief, seawater was passed first through a 63um plankton net and then sequentially through 5um (Whatman 111113 or Sterlitech PCT5047100), 1um (Whatman 111110 or Sterlitech PCT1047100), and 0.2um (Whatman 111106) hydrophilic polycarbonate filters; material recovered on the filters was resuspended by shaking for 20 min; dilution series of resuspended cells were filtered onto 0.2um polyethersulfone filters (Pall 66234) in a carrier solution of artificial seawater (40 g Sigma Sea Salts, S9883; 0.2um filtered), and filters placed directly onto Vibrio-selective MTCBS plates (BD Difco TCBS Agar 265020, supplemented with with 10 g NaCl per liter to 2% final w/v). Colonies (96) from each of three replicates of each size fraction were selected from the dilution plates with the fewest numbers of colonies (1,152 isolates per isolation day). Colonies were purified by serial passage, first onto TSB-II (Difco Tryptic Soy Broth, 1.5% BD Difco Bacto Agar 214010, amended with 15 g NaCl to 2% w/v), second onto MTCBS, finally onto TSB-II again. Colonies were inoculated into 1 ml of 2216 Marine Broth (BD Difco 279110) in 96-well 2 ml culture blocks and allowed to grow, shaking at room temperature, for 48 h. Glycerol stocks were prepared by combining 100 ul of culture with 100 ul of 50% glycerol (BDH 1172-4LP) in 96-well microtiter plates and sealed with adhesive aluminum foil for preservation at −80 °C.Bacterial hsp60 gene sequencingTo obtain hsp60 gene sequences for isolates, Lyse and Go (LNG) (Pierce, Thermo Scientific 78882) treatments of subsamples of the same overnights cultures used in the bait assay (described below) were used directly as template in PCR amplification reactions. PCR reactions were prepared in 30 ul volumes, as follows: 1 ul LNG template, 3 ul 10x buffer, 3 ul 2 mM dNTPs, 3 ul 2um hsp60-F primer, 3 ul 2um hsp60-R primer, 0.3 ul NEB Taq, 16.7 ul PCR-grade HOH; with hsp60-F (H279) primer sequence: 5′-GAA TTC GAI III GCI GGI GAY GGI ACI ACI AC-3′, and hsp60-R (H280) primer sequence: 5′-CGC GGG ATC CYK IYK ITC ICC RAA ICC IGG IGC YTT-3′61 (Supplementary Table 1). PCR thermocycling conditions were as follows: initial denaturation at 94 °C for 2 min; 35 cycles of 94 °C for 1 min, 37 C for 1 min, 72 °C for 1 min; final annealing at 72 °C for 6 min; hold at 10 °C. PCR products were cleaned up by isopropyl alcohol (IPA) precipitation, as follows: addition of 100 ul 75% IPA to 30 ul PCR reaction product, gentle inversion mixing followed by 25 min incubation at RT, 30 min centrifugation at 2800 rcf, addition of 50 ul 70% IPA with gentle inversion wash, centrifugation at 2000 rcf, inversion on paper towels to remove IPA, 10 min centrifugation at 700 rcf, air drying in PCR hood for 30 min, resuspension in 30 ul PCR HOH. PCR products were Sanger sequenced (Genewiz, Inc.) using hsp60-R primer, as follows: 5 ul of 5 um hsp60-R primer, 7 ul nuclease free water, 3 ul DNA template. For a subset of strains hsp60 sequences were obtained from subsequently determined whole-genome sequences. Hsp60 sequences were aligned to the hsp60 sequence previously published for Vibrio 1S_84 and trimmed to 422 bases using Geneious (https://www.geneious.com/). Accession numbers for these 1287 strains are provided in Supplementary Data 1, where they are identified as baxSet1287.Bacterial hsp60 phylogeniesA phylogenetic tree of relationships among bacterial isolates screened in the bait assay (described below) was produced based on a 422 bp fragment of the hsp60 gene, derived either from Sanger or whole genome sequences; with E. coli K12 serving as the outgroup. Sequences from each of the three days of isolation were aligned using muscle v.3.8.3162 with default settings (muscle -in $seqsALL -out $seqsALL.muscleAln), and a single tree including all 1287 sequences from all the days was generated using FastTree v.2.1.863 (FastTree -gtr -gamma -nt -spr 4 -slow  $seqsALL.muscleAln.fasttree). For presentation in Fig. 1 three sub-trees including only nodes from each day were produced using PareTree v.1.0.264 (java -jar PareTree1.0.2.jar -t O -del notDay222.txt -f $seqsALL.$round.muscleAln.fasttree.DAY222). Trees were visualized using iTOL65 and painted with metadata for each of the strains, including: sensitivity to killing in agar overlay by co-occurring phage predators collected on the same day and, for the subset of strains that were genome sequenced and also included in the host range matrix, the bacterial species, based on concatenated ribosomal protein analysis using RiboTree66 as described below. Isolation days for each of the strains included in these analyses are provided in Supplementary Data 1, where these strains are identified as baxSet1287.Bacterial genome sequencing and assignment to populationsTo assign genome-sequenced bacterial isolates used in the host range assay to species, we use the RiboTree tool66 to produce a phylogeny based on concatenated single copy ribosomal proteins as in23. We include strains of previously described Vibrionaceae in preliminary analyses as reference strains and assign species names to new isolates based on clustering with named representatives, as well as provide placeholder names for newly identified clades with no previously described representatives. Trees were visualized using iTOL65 and the representation including only those strains included in the host range assay is shown in Supplementary Fig. 1; population assignments and accession numbers for this set of 294 genomes, which also includes a small number of previously isolated bacterial strains that were included in the host range assay (described below), are provided in Supplementary Data 1, where they are identified as baxSet294.Viral isolation and characterizationWe have previously described features of the viruses of the Nahant Collection20, as well as approaches used for the standardization of their genome assemblies19, additional details are provided below.Viral sample collectionThe iron chloride flocculation approach was used to generate 1000-fold concentrated viral samples from 0.2 um-filtered seawater, as follows. For each isolation day, triplicate 4 L seawater samples were filtered through 0.2 um polyethersulfone cartridge filters (Millipore, Sterivex, SVGP0150) into collection bottles, spiked with 400 uL of FeCl3 solution (10 gL−1 Fe; as 4.83 g FeCl3•6H2O (Mallinckrodt 5029) into 100 ml H2O), and allowed to incubate at room temperature for at least 1 h. Virus-containing flocs were then recovered from the sample by filtration onto 90 mm 0.2 um polycarbonate filters (Millipore, Isopore, GTTP09030) under gentle vacuum in a 90 mm glass cup-frit system (e.g Kontes funnel 953755-0090, fritted base 953752-0090, and clamp 953753-0090); once liquid was fully passed, the funnel was removed and, with the vacuum pump left on, the filters were folded into quarters, removed from the fritted base, and inserted into a 7 ml borosilicate glass vial. A volume of 4 ml of oxalate-EDTA solution (prepared from stock solution as 10 ml 2 M Mg2EDTA (J.T. Baker, JTL701-5), 10 ml 2.5 M Tris-HCl (Promega PAH5123), 25 ml 1 M oxalic acid (Mallinckrodt 2752); adjusted to pH 6 with 10 M NaOH (J.T. Baker, 3722-01); final volume 100 ml; used within 7 days of preparation and maintained at room temperature in the dark) was added to the vial and the sample allowed to dissolve at room temperature for at least 30 min before transfer to storage at 4 °C. A reagent used in this original formulation (JT-Baker 7501 Mg2EDTA) is no longer available and an updated recipe is provided elsewhere67.Bait assay and associated viral plaque archivalIn order to obtain estimates of co-occurring phage predator loads at bacterial strain level resolution, and generate plaques from which to isolate phages, we exposed 1440 purified bacterial isolates to phage concentrates from their same day of isolation (1334 yielded lawns sufficient to evaluate for plaques, and hsp60 sequences could be determined for 1287 of these). Bacterial strains screened included 480 isolates from each ordinal day, representing 120 strains from each of 4 size-fractionation classes (0.2 um, 1.0 um, 5.0 um, 63 um) details of isolation origin are provided for each strain in Supplementary Data 1, and description of naming conventions is as previously described19. For the bait assay each strain was mixed in agar overlay with seawater concentrates containing viruses (15 ul concentrate, equivalent to 15 ml unconcentrated seawater assuming 100% recovery efficiency; derived from pooling of three replicate virus concentrates from each day). We note that recoveries were not tested for individual samples and that previous tests14 of recovery efficiency have shown that resuspension of iron flocculates in oxalate solution yields initial recoveries of approximately 50% (49 ± 3% and 55 ± 11% for a marine sipho- and myo-virus respectively, at 24 h post re-suspension) and shows low decay rate over time (47 ± 5% and 73 ± 16% for a marine sipho- and myo-virus respectively, at 38 days post re-suspension). All of our assays were performed approximately 8–9 months post-sampling from oxalate concentrates stored at 4 °C. Agar overlays were performed based on the previously described Tube-free method13, as follows. Bacterial strains were prepared for agar overlay plating by streaking out from glycerol stocks onto 2216MB agar plates with 1.5% agar (Difco, BD Bacto, 214010), and allowed to grow for 2 days at room temperature. Strains were then inoculated into 1 ml 2216MB in a 96-well culture block and incubated 24 h at room temperature shaking at 275 rpm on a VWR DS500E orbital shaker. Immediately prior to use in direct plating the OD600 was measured in 96-well microtiter plates and subsamples were taken for Lyse and Go (LNG) processing for DNA (10 ul culture, 10 ul LNG). Phage concentrates were prepared for plating by pooling 1.2 ml from each of the concentrate replicates into a 7 ml borosilicate scintillation vial. Cultures were transferred from overnight culture blocks to 96-well PCR plates in 100 ul volume and 15 ul of pooled phage concentrate was added to cultures one row at a time, with each row plated in agar overlay before adding phage concentrate to the next row of bacterial cultures. Mixed samples of 100 ul bacterial overnight culture and 15 ul pooled phage concentrate were transferred to the surface of bottom agar plates (2216MB, 1% agar, 5% glycerol, 125 ml L−1 of chitin supplement [40 g L−1 coarsely ground chitin, autoclaved, 0.2 um filtered]). A 2.5 ml volume of 52 °C molten top agar (2216MB, 0.4% agar, 5% glycerol BDH 1172-4LP) was added to the surface of the bottom agar and swirled around to incorporate and evenly disperse the mixed bacterial and phage sample into an agar overlay lawn. Agar overlay lawns were held at room temperature for 14–16 days and observed for plaque formation. Glycerol was incorporated into this assay to facilitate detection of plaques68. Chitin supplement was incorporated into this assay to facilitate detection of phages interacting with receptors upregulated in response to chitin degradation products. A variety of preliminary tests exploring potential optimizations to agar compositions for direct plating indicated that the addition of chitin did not negatively impact recovery of plaques with control phage strains tested. After approximately 2 weeks, plaques on agar overlay lawns were cataloged and described with respect to plaque morphology and plaques were picked for storage based on the previously described Archiving Plaques method13, as follows. All plaques were archived from plates containing less than 25 plaques, on plates with larger numbers of plaques a random subsample of plaques from each distinct morphology were archived. A polypropylene 96-well PCR plate was filled with 200 ul aliquots of 0.2 um filtered 2216MB, agar plugs were collected from plates using a 1 ml barrier pipette tip and ejected into the 2216MB, skipping one well between each sample to minimize potential for cross-contamination, for a final count of 48 phage plugs per plate. Plaque plugs were soaked at 4 °C for several hours to allow elution of phage particles into the media. After soaking, 96-well plates were centrifuged at 2,000 rcf for 3 min before proceeding to the next step. Plug soaks were then processed for two independent storage treatments. For storage at 4 °C, plates were processed by transferring 150 ul of eluate from each well to a 0.2 um filtration plate (Millipore, Multiscreen HTS GV 0.22um Filter Plate MSGVS22) and gently filtered under vacuum to remove bacteria, the cell-free filtrates containing eluted phage particles from each plaque plug were stored at 4 °C. For storage at −20 °C, 50 ul of 50% glycerol was added to the residual ~50 ul of the plug elution, often still containing the agar plug. In this way all plaques were characterized and many plaques from each strain were archived in two independent sets of conditions. Total plaque counts for all strains included in the bait assay are represented in Fig. 1, and provided in Supplementary Data 1, where they are identified as baxSet1287. Notes on limitations to the assay: Water temperatures on each of the three isolations days were 13.8 °C, 16.3 °C, and 14.2 °C, for days 222, 261, and 286; as bait assays were performed at room temperature (approximately 22 °C) some phages requiring lower temperatures may not have yielded plaques. The majority of plates were evaluated for plaque formation twice, on day 1 and day 13, thus any plaques appearing after day 1 and disappearing before day 13 – for example as a result of overgrowth of lysogens—are likely to have been missed in these assays.Viral purificationA subset of plaques archived during the bait assay was selected for phage purification, genome sequencing, and host range characterization. This subset included single randomly-selected representatives from each plaque-positive bacterial strain. Minor details of the purification and lysate preparation varied across samples but were largely as follows. Phages were purified from inocula derived primarily from −20 °C plaque archives, and secondarily from 4 °C archives when primary attempts with −20 °C stocks failed to produce plaques. Three serial passages were performed using Molten Streaking for Singles13 method. Agar overlay lawns for passages were prepared by aliquoting 100 ul of host overnight culture (4 ml 2216MB, colony inoculum from streak on 2216MB with 1.5% Bacto Agar, shaken overnight at RT at 250 rpm on VWR DS500E orbital shaker) onto a standard size bottom agar plate and adding 2.5 ml of molten 52 °C top agar as in the bait assay, swirling to disperse the host into the top agar and form a lawn, and streaking-in phage with a toothpick either from the plaque archive or directly from well-separated plaques in overlays from the previous step in serial purification. Following plaque formation on the third serial passage plate plaque plugs were picked using barrier tip 1 ml pipettes and ejected into 250 ul of 2216MB to elute overnight at 4 °C. Plaque eluates were spiked with 20 ul of host culture and grown with shaking for several hours to generate a primary small-scale lysate. Small scale primary lysates were centrifuged to pellet cells and titered by drop spot assay to estimate optimal inoculum volume to achieve confluent lysis in a 150 mm agar overlay plate lysate. Plate lysates were generated by mixing 250 ul of overnight host culture with primary lysate and plating in 7.5 ml agar overlay. After development of confluent lysis of lawns as compared against negative control without phage addition, the lysates were harvested by addition of 25 ml of 2216MB, shredding of the agar overlay with a dowel, and collection of the broth and top agar. Freshly harvested lysates were stored at 4 °C overnight for elution of phage particles, the following day lysates were centrifuged at 5,000 rcf for 20 min and the supernatant filtered through a 0.2 um Sterivex filter into a 50 ml tube and stored at 4 °C.Viral genome sequencingSequencing of Nahant Collection viruses was described in previous work19, and was performed as follows. For DNA extraction approximately 18 ml of phage lysate was concentrated using a 30 kD centrifugal filtration device (Millipore, Amicon Ultra Centrifugal Filters, Ultracel 30 K, UFC903024) and washed with 1:100 2216MB to reduce salt concentrations inhibitory to downstream nuclease treatments. Concentrates were brought to approximately 500 ul using 1:100 diluted 2216MB and then treated with DNase I and RNase A (Qiagen RNase A 100 mg mL−1) for 65 min at 37 °C to digest unencapsidated nucleic acids. Nuclease treated concentrates were extracted using an SDS, KOAc, phenol-chloroform extraction and resuspended in EB Buffer (Qiagen 19086) for storage at 20 °C. Phage genomic DNA was sheared by sonication in preparation for genome library preparation. DNA concentrations of extracts were determined using PicoGreen (Invitrogen, Quant-iT PicoGreen dsDNA Reagent and Kits P7589) in a 96-well format and samples brought to 5 ug in 100 ul final volume of PCR-grade water diluent for sonication. Samples were sonicated in batches of 6 for 6 cycles of 5 min each, at an interval of 30 s on/off on the Low Intensity setting of the Biogenode Bioruptor to enrich for a fragment size of ~300 bp. Illumina constructs were prepared from sheared DNA as follows: end repair of sheared DNA (NEB, Quick Blunting Kit, E1201L), 0.72×/0.21× dSPRI (AMPure XP SPRI Beads) size selection to enrich for ~300 bp sized fragments, ligation (NEB, Quick Ligation Kit, M2200L) of Illumina adapters and unique pairs of forward and reverse barcodes for each sample, SPRI (AMPure XP SPRI Beads) clean up, nick translation (NEB, Bst DNA polymerase, M0275L), and final SPRI (AMPure XP SPRI Beads) clean up (Rodrigue et al., 2010). Constructs were enriched by PCR using PE primers following qPCR-based normalization of template concentrations. Enrichment PCRs were prepared in octuplicate 25 ul volumes, with the recipe: 1 ul Illumina construct template, 5 ul 5x Phusion polymerase buffer (NEB, 5X Phusion HF Reaction Buffer, B0518S), 0.5 ul 10 mM dNTPs (NEB, dNTP Mix (1 mM; 0.5 ml), N1201AA), 0.25 ul 40 uM IGA-PCR-PE-F primer, 0.25 ul 40 uM IGA-PCR-PE-R primer, 0.25 ul Phusion polymerase (NEB, Phusion High Fidelity DNA Pol, M0530L), 17.75 ul PCR-grade water. PCR thermocycling conditions were as follows: initial denaturation at 98 °C for 20 sec; batch dependent number of cycles (range of 12–28) of 98 °C for 15 sec, 60 °C for 20 see, 72 °C for 20 sec; final annealing at 72 °C for 5 min; hold at 10 °C. For each sample 8 replicate enrichment PCR reactions were pooled and purified by 0.8x SPRI beads (AMPure XP) clean up. Each sample was then checked by Bioanalyzer (2100 expert High Sensitivity DNA Assay) to confirm the presence of a unimodal distribution of fragments with a peak between 350–500 bp. Sequencing of phage genomes was distributed over 4 paired-end sequencing runs as follows: HiSeq library of 18 samples pooled with 18 external samples, 3 MiSeq libraries each containing ~100 multiplexed phage genomes. Accession numbers for all sequenced phage genomes are provided in Supplementary Data 1, where they are identified as phageSet283; the subset of phages used in the majority of analyses in this work are identified as phageSet248 and exclude non-independent isolates derived from the same plaque, as well as well as identical phages isolated from multiple independent plaques from the same host strain in the bait assay.Viral protein clusteringTo characterize and annotate groups of proteins in assembled viral genomes in the Nahant Collection19, proteins were clustered using MMseqs2 v. 2.2339469 with default parameter settings, the 21,937 proteins reported in the GenBank files associated with each of the 283 Nahant Collection phage genomes were clustered into 5,929 clusters including 2,978 singletons. MMseqs2 cluster assignments for each protein sequence are provided in Supplementary Data 6.Viral protein cluster annotationAll proteins were annotated using InterProScan70 v.5.39–77.0; eggNOG-mapper71,72 v.2 using both automated and viral HMM selection options; Meta-iPVP73; and with best matches to 9518 Viral Orthologous Groups74 HMM profiles (obtained at http://dmk-brain.ecn.uiowa.edu/pVOGs/downloads.html); search was performed with hmmer, requiring a bitscore of 50 or greater (highest e-value 5.80E-13), as follows: hmmsearch -o $out_dir/$hmm_group.$hmmfile.$prots_short_name.hmm.out -tblout $out_dir/$hmm_group.$hmmfile.$prots_short_name.hmm.tbl.out -noali -T 50 $hmmfile $prots_dir/$prots_file. Annotations for viral protein clusters are provided in Supplementary Data 6.Receptor binding proteins (RBPs) were annotated as follows. RBPs were defined here to include both globular and fibrous host interacting proteins and general protein annotations were reviewed for similarity to known phage receptor binding proteins and supplemented with Phyre275, HHpred, and literature review76. Annotated RBPs were mapped onto phage genome diagrams and additional RBPs were annotated based on gene order conservation with phages in the same genus for which RBPs were already identified; annotated RBPs were then used to iteratively search against all Nahant Collection phage proteins using the jackhmmer search tool in the HMMER77 v.3.2.1 package (jackhmmer -cpu 16 -N 3 -E 0.00001 -incE 0.01 -incdomE 0.01 -o $run.$1.vs.$2.jackhmmer.iters-$iters.out -tblout $run.$1.vs.$2.jackhmmer.iters-$iters.tbl.out -domtblout $run.$1.vs.$2.jackhmmer.iters-$iters.dom.tbl.out $queryFASTAS $subjectFASTAS) and new hits were manually reviewed. All annotations were performed on a protein-cluster level and annotations of proteins and protein clusters as “adsorption – RBP” are indicated in Supplementary Data 6.Recombinases were annotated as follows: Homologs of single strand annealing protein recombinases in the Rad52, Rad51 and Gp2.5 superfamilies in the Nahant Collection phages were identified as described below. First, iterative HMM searches were performed against the Nahant Collection phage proteins using as seeds 194 recombinases identified in Lopes et al.44 (excluding RecET fusion protein YP_512292.1; http://biodev.extra.cea.fr/virfam/table.aspx), these represent 6 families of SSAP recombinases (UvsX, Sak4, Sak, RedB, ERF, and Gp2.5); searches were performed using the jackhmmer function of HMMER v.3.1.2 (jackhmmer -cpu 16 -N 5 -E 0.00001 -incE 0.01 -incdomE 0.01 -o $run.$1.vs.$2.jackhmmer.out -tblout $run.$1.vs.$2.jackhmmer.tbl.out -domtblout $run.$1.vs.$2.jackhmmer.dom.tbl.out $queryFASTAS $subjectFASTAS) – this yielded 156 proteins. Second, all hits were plotted onto genome diagrams for all phages in the collection and additional candidate recombinases identified based on gene neighborhood comparisons (Supplementary Data 9) – this step identified 4 additional protein clusters (mmseqs 297, 149, 2211, and 600), totaling 224 proteins. Third, all proteins clusters were curated by manual review of annotations made using InterProScan70, EggNOG-mapper71, and Phyre275 (annotations provided in Supplementary Data 6) to identify potential false positives (none identified), and references to recombinases in annotations. Where these annotation methods did not provide additional support, sequences were evaluated for additional support using HHpred78 (hhsearch -cpu 8 -i../results/full.a3m -d /cluster/toolkit/production/databases/hh-suite/mmcif70/pdb70 -o../results/2058109.hhr -oa3m../results/2058109.a3m -p 20 -Z 250 -loc -z 1 -b 1 -B 250 -ssm 2 -sc 1 -seq 1 -dbstrlen 10000 -norealign -maxres 32000 -contxt /cluster/toolkit/production/bioprogs/tools/hh-suite-build-new/data/context_data.crf) as implemented on the MPI Bioinformatics Toolkit webserver (mmseq 2896 and 5138 both gave >99% probability hits to DNA repair protein Rad52 with PDB ID 5JRB_G), or JackHMMER (-E 1 -domE 1 -incE 0.01 -incdomE 0.03 -mx BLOSUM62 -pextend 0.4 -popen 0.02 -seqdb uniprotkb) as implemented on the EMBL-EBI webserver (mmseq 2990 showed hits to diverse RedB family RecT-like sequences at e-value ≤1e-05). Following this third step, there were 3 protein clusters for which support was limited, these were included in the final dataset as putative SSAP recombinases but are highlighted here. Protein cluster mmseq 297 (present in 21 phages in 6 genera): was always encoded by genes adjacent to genes in protein cluster mmseq 3923, which was itself a recombinase associated exonuclease that was found either adjacent to mmseq 297 or to the well-supported putative SSAP recombinase mmseq 3721 (sometimes separated by one gene from mmseq 3721). Protein cluster mmseq 600 (present in 2 phages in 2 genera): was encoded adjacent to a protein cluster annotated as a recombination associated exonuclease; iterative HHMER searches of a mmseq 600 cluster representative (AUR82881.1) against Viruses in UniProtKB using jackhmmer yielded hits to proteins in mmseq 297 in iteration 3. Protein cluster mmseq 2990 (present in 1 phage): was encoded adjacent to two small proteins encoding putative recombination associated exonucleases and was in the same genomic position relative to neighboring genes as putative recombinases in related phages in the genus. Finally, all putative SSAP recombinase genes were assigned to a recombinase family by clustering based on 2 iterations of all-by-all HMM jackhmmer sequence similarity searches of all candidates and the reference seed set of Lopes44 (jackhmmer -cpu 16 -N 2 -E 0.00001 -incE 0.01 -incdomE 0.01 -o $run.$1.vs.$2.jackhmmer.out -tblout $run.$1.vs.$2.jackhmmer.tbl.out -domtblout $run.$1.vs.$2.jackhmmer.dom.tbl.out $queryFASTAS $subjectFASTAS); similarities were were visualized using Cytoscape v.3.3.0 using the “Edge-weighted Spring Embedded Layout” based on jackhmmer score, clusters were identified using the ClusterMaker2 v.1.2.1 Cytoscape plugin with the MCL cluster option and all settings at default and Granularity=2.5. Proteins in 3 mmseq clusters (149, 297, 600) did not fall into MCL clusters with recombinases from the annotated seed set and therefore are described as “unknown” rather than being assigned to a family of recombinases. All final assignments of genes to a recombinase superfamily and family, as well as all associated annotations, are provided in Supplementary Data 6 (sheet A.prots_overview column anno_Recombinase_manual). Additional details regarding seed sequences and MCL cluster assignments associated with recombinase analyses are provided in Supplementary Data 7 which contains a main descriptor sheet (00.readme), an overview of the 224 Nahant phages with recombinases (sheet 01.NahantPhageRecombinases_224), a table of InterPro domains associated with each of the reference and Nahant recombinases, with specific mmseqs and MCL clusters (sheet 02.IPR_annos_Lopes+Nahant), a list of all references used (sheet 03.List1_LOPES_ALL.noETfusion), the output of the iterative jackhmmer search with seeds against all Nahant Collection proteins (sheet 04.List1_vs_NahantProts), the output of the all-by-all jackhmmer search for 194 references and 224 putative Nahant recombinases (sheet 05.Lopes+Nahant224_v_self2iter), and information on the assignment of all Nahant and reference proteins to MCL clusters as shown in Fig. 6 (06.Recombinase_assign_by_MCL).All proteins were assigned to one of three broad categories – structural, other (non-structural), or no prediction – based on manual review of annotations derived from: NCBI product ID, Virfam21, PhANNs79, pVOGs74, eggNOG-mapper72, Phyre275, the MPI Bioinformatics implementation of HHpred78, and targeted annotations of predicted receptor binding proteins and recombinases (see descriptions for targeted annotations in Methods, above). Protein clusters (mmseq groups) were reviewed for conflicting calls and ultimately all proteins within each protein cluster (mmseqsID) were assigned to a single category. All assignments, and annotations on which they were based, are provided in Supplementary Data 6.The approach for assigning annotations to these broad categories was as follows: Step 1) All genes identified as putative recombinases through targeted annotations were assigned as “other”. Step 2) All genes identified as putative receptor binding proteins through targeted annotations were assigned as “structural”. Step 3) Genes not assigned to a category in steps 1 and 2, and which were identified by Virfam as “head-neck-tail” associated were assigned as follows: Genes annotated by Virfam as a terminase (TerL) were assigned as “other”; genes annotated by Virfam as a major capsid protein (MCP), portal (portal), adaptor (Ad1, Ad2, Ad3), head-closure (Hc1, Hc2, Hc3), tail completion (Tc1, Tc2), major tail protein (MTP), neck (Ne), or sheath (sheath) were assigned as “structural”. Step 4) Genes not assigned to a category in steps 1–3, were assigned as “structural” or “other” (non-structural) if identified as such by PhANNs with a confidence of ≥95%. Cases where conflicting annotations were observed between PhANNs and other annotations were flagged for review in subsequent steps. Step 5) Genes with annotations of VOG0263 (DNA transfer protein); terminal protein, any reference to internal virion protein, DNA circularization protein, and MuF-like proteins were assigned as “other”; in the case of conflict the Step 5 annotation superseded the prior annotations. Step 6) Genes with annotation as a terminase (large subunit, small subunit, and unspecified) by any of the tools (requiring ≥ 90% confidence if based on Phyre2) were assigned as “other”. Step 7) All genes lacking support across annotations were assigned as “no prediction”, high confidence Phyre2 predictions qualitatively judged as inappropriate were disregarded. Step 8) Genes flagged in Step 4 were reviewed and assigned as “structural” when containing any structural related genes (i.e. those listed in Step 3 and any others identifiable as structural based on words in the annotations and consensus across tools, e.g. containing the word baseplate, capsid, coat, head, spike, tail, whisker, fibritin). Additional targeted annotation by HHpred was used to facilitate assignment to “structural” (known structural proteins as described for Step 3 and in the aforementioned list), “other” (non-structural), “no prediction” (e.g. no assignable function based on available annotations and a PhANNs confidence of More

  • in

    Physical simulation study on grouting water plugging of flexible isolation layer in coal seam mining

    Analysis of the roof failure characteristics of coal seamBefore mining, fracturing was conducted on a portion of gritstone in the lower section of the Naoro Formation and then entered the mining stage. Figure 9 shows the influence law of coal roof rupture under different periodic pressures. With mining of the #2 coal seam working face, the direct roof of the coal seam partially broke and collapsed, forming gangue in the goaf. There is a clear separation between the direct and basic roof. When the working face advanced to 228.2 mm, the old roof ruptured, and the working face started to enter the periodic pressure-bearing stage. As the working face advanced to 592.9 mm, the roof exhibited the fourth periodic pressure. The overlying layer roof in the excavation area was affected by the upper bearing arch pressure, leading to the collapsed rock to not completely contact the upper roof. With the increasing distance of coal seam mining, the roof developed significant subsidence, and the influence range of the bedrock boundary caused by the mining was still in the isolation layer fracturing zone. The bedrock influence boundary angle reached 73.57°, and the rock fracture angle was 56.95°. When the working face advanced to 726.5 mm, the fifth periodic pressure on the roof occurred. The bedrock layer in the upper right of the workings was near the right boundary of the first isolated coal seam rupture. Then, coal mining was suspended, and a second isolated seam fracturing process was conducted. The bedrock influence boundary angle reached 73.57°, and the rock rupture angle was 56.95°.Figure 9Influence law of coal roof rupture during different periodic pressure.Full size imageWhen the processing was advanced to 798.4 mm, the bedrock layer in the upper right of the processed area became close to the right boundary of the second isolated seam fracturing. After the third isolated layer fracturing process, the rock impact boundary angle reached 75.33°, and the rock fracture angle was 50.39°. Proceeding to 1031.6 mm, eighth periodic pressure was generated on the roof. The falling gangue in the mined-out area was in contact with the roof, with the bedrock impact boundary angle reaching 74.77° and the rock fracture angle reaching 57.06°. Thereafter, the bedrock layer of the roof gradually entered the full-scale mining stage. As the working face continues to advance, the bedrock impact boundary caused by coal seam mining should be in isolated coal seam fractures. When the bedrock layer at the working face is close to the right boundary of the isolation layer fracturing, the next isolation layer fracturing should be performed.Analysis of roof stress evolution lawFigure 10 illustrates the change law of the roof support pressure when mining of the working face, in which the roof support pressure curve is the stress change minus the initial value of the sensor before mining. After the excavation of the working face, the surrounding rock will exhibit stress redistribution. The increase in tangential stress in front of the working face or on both sides is called the support pressure. The peak value of the support pressure generally occurs on the front of the working face. As the working face advanced to 228.2 mm, the direct roof gradually broke and collapsed with mining. Due to the redistribution of surrounding rock stress, the stress fluctuation at the open cut was clear. In front of the working face, the overlying rock stress was redistributed due to mining, and the vertical pressure peak area appeared, with a stress increment of 0.03 MPa. When the working face advanced to 360.8 mm, the first cycle pressure on the roof occurred. The falling gangue in the mine-out area gradually approached its upper strata, and the peak support pressure increments reached 0.05 MPa. During the advancement of the working face to 592.9 mm, the direct roof continued to collapse. The gangue at the cuttings was gradually compacted with the roof, and the stresses gradually restored to stability. Coal seam mining led to the decompression of the floor, and the vertical stress maximum reduction at the working face was 0.045 MPa. The peak vertical pressure in front of the working face shifted to the right as mining progressed. When degradation reached 726.5 mm, the fifth periodic pressure on the roof occurred. Figure 10b shows that the fracture of the isolation layer had no apparent effect on the change in roof stress. Within 560 mm from the open excavation, the mine-out area gangue gradually compacted with the roof. Vertical pressure changes between the fourth and fifth periodic pressures are slight and practically nonsignificant.Figure 10Vertical pressure variation law with coal mining. (a) First pressure and First periodic pressure difference. (b) Fourth and First periodic pressure difference. (c) Eighth and Ninth periodic pressure difference. (d) Eleventh and Twelfth periodic pressure difference. (e) Variation laws of vertical pressure with mining.Full size imageWhen the mining reached 1031.6 mm, the directly caving gangue completely filled the goaf and was compacted with the roof. The upper roof of the caving rock was supported again, and the compaction range of the mining area extended to 821 mm. As the working face advanced to 1338.9 mm, the peak vertical pressure appeared at 1400 mm, with a maximum increment of 0.375 MPa. The compaction range of the mining area extends to 1200 mm. Then, the fractured isolation layer can be grouted. The subsequent working face advances until the end of mining, and the rock movement above the mine-out zone will exhibit a periodic “falling-filling-cutting-compaction” process. Fracture grouting of the flexible isolation layer has no significant effect on the vertical stress changes, and the stress unloading area and the peak vertical pressure will continue to change with mining. Nevertheless, consideration needs to be given to the adequacy of the gangue falling from the roof for isolation layer grouting.Roof displacement and development pattern of water-conducting fracture zoneFigure 11 shows the development law of the roof water-conducting fissures in the roof of the coal seam during different pressure periods, where the illustration shows the von Mises equivalent strain. Figure 12 shows the development trend of the water-conducting fracture zone height. From the whole observation, although the isolation layer is treated by fracturing before back mining, it has less influence on the displacement and deformation of the overlying rock layer because it is restricted by the surrounding rock of the model. When the working face was mined to 228.2 mm, the upper roof of the mining face collapsed, and the first periodic pressure occurred on the roof. The roof displacement reached the Yan’an Group mudstone layer, and the roof collapse height was only 104.3 mm. As the mining advanced, the roof fractures in the mining-out area continued to develop upwards. When the working face was mined to 360.8 mm, the first cycle pressure on the roof occurred, and the roof collapse height extended upwards to the siltstone of the Yan’an Formation, with a collapse range of 117.6 mm. At this point, only a small displacement change occurred around the direct roof, and the flexible isolation layer was basically not affected by any impact.Figure 11Development regularity of roof water-conducting fissures during different period pressure.Full size imageFigure 12Development height curve of water-conducting fracture zone.Full size imageFrom the second cycle pressure onwards, the development trend accelerated significantly, and the collapsed height rose rapidly to 210.9 mm. When the working face advanced to 537.1 mm, the third cycle pressure occurred on the roof. The collapsed Yan’an Formation mudstone layer was further pressurized by its upper layers and collapsed to a height of 344.7 mm. The roof displacement had spread to the coarse sandstone of the Naoro Formation, but the height of the water-conducting fracture zone had not reached the bottom of the isolation layer. When the workings reached 592.9 mm, the roof collapsed again, showing the fourth periodic pressure. The water-conducting fissure zone continues to develop upwards to 355.3 mm, which passes through the fissure isolation layer and reaches the gritstone at the top of the isolation layer. The fractured isolation layer is in an “activated” state.When the working face reached 1031.6 mm, fallen gangue completely filled the mining-out area and compacted with the roof, and eighth periodic pressure occurred on the roof. The height of the water-conducting fracture zone developed to 496.8 mm, which was lower than the height of the water-conducting fracture zone of 565.8 mm at the seventh periodic pressure. After that, the old roof collapsed as a cantilevered beam. The development height of the water-conducting fracture zone was allegedly less than 565.8 mm. Afterwards, the roof fracturing direction was consistent with the direction of working face advancement, from left to right. Displacement and fracture of the overlying rock layer were mainly caused by the overall downwards sliding of the upper rock seam due to the collapse of the bottom rock seam. At different heights of the coal seam roof, the degree of displacement damage decreased with increasing height.When the working face reached 1178.7 mm, the roof covering the open cut stabilized. The fractured isolation layers in the 1st ~ 13th groups were grouted, and then the coal was mined only after the slurry had completely solidified and reached a certain strength. The eleventh periodic pressure occurred on the roof, with a water-conducting fracture height of 367.6 mm at this time. When the working face was advanced to 1471.9 mm and 1645.2 mm, the roof had twelfth and fourteenth periodic pressures, and the heights of the water-conducting fracture zone were 332.0 mm and 416.0 mm, respectively. Then, the 14th ~ 15th and 16th ~ 17th group isolation layers of the upper coal seam were grouted while fracturing the right isolation layer. However, the disruption of displacement towards the extent of the development had a relatively small impact, mainly on the roof rock layer above the mining face. Table 2 indicates the development height of the water-conducting fracture zone and the fracture and grouting sequence of the isolated layer.Table 2 Development pattern of water-conducting fracture zone and fracture and grouting sequence of isolated layer.Full size tableDuring the mining process, damage to the water-conducting fissure zone was always a major factor in the displacement of the roof slab. Nonetheless, after fracturing and grouting measures, the effects of the damage were significantly reduced such that the damage to the roof rock was contained within the flexible isolation layer. After grouting, the enhanced strength of the isolation layer ensured that mining was carried out normally. During the mining period, four grouting reforms were made, and the isolation layer was fractured six times, with the maximum development height of the water-conducting fracture zone located at the seventh periodic pressure, reaching 565.8 mm.Analysis of water flow evolution law of overburden roofTo analyse the seepage law of the overburden roof, seven water flow monitoring lines were arranged from the top of the flexible isolation layer to the direct roof of the coal seam. The No. 1 water flow monitoring line was placed in the position of the third group of the isolation layer, which is initially located outside the deformation range of bedrock disturbed by mining and outside the stop line. The flow line was mainly used to monitor the influence of the rock disturbance boundary above the open cut on isolated seam fracturing and grouting. No 2–3 water flow monitoring lines were placed at the isolation layer positions of Group 12 and Group 14, which were initially located near the maximum height of the water-conducting fracture zone and were mainly used to monitor the change laws of the water-conducting fracture zone with mining impact. Monitoring Lines 4–6 were placed in isolation layers No. 17, No. 22 and No. 26 to study the impact of water flow changes with mining disturbance and the advanced influence scope. Water flow monitoring line No. 7 was placed in the thirtieth group of isolated layers, which was originally outside the cut-off line. As shown in Fig. 13, white arrows are water flow vectors in mL/min. Fracturing the 1–18 isolation layers before mining, the water tank hot water was injected into the flexible isolation layer such that the iodized salt in the flexible isolation layer was completely dissolved, and the infrared monitor showed the yellow area in the image. At this point, the water flow monitoring Lines 1–3 and 5–7 show yellow status, indicating that after the fracturing of the isolation layer, the aquifer water flows downwards along the fracture. The lower part of monitoring Line 4 was compacted at the top of the coal seam, indicating that the cracks between the roof and the aquifer had not been communicated. Therefore, the water flow rate was 0 mL/min until the sixth periodic pressure. Mining was then undertaken on the working face. The No. 1 monitoring line was therefore less affected by mining due to its layout outside the stop line, and there was no significant change in water flow before the first grouting.Figure 13Water flow evolution of the overburden roof with coal mining.Full size imageAs shown in Fig. 13, when the working face progressed to second periodic pressure, with the collapse of the coal seam, the stress of the surrounding rock was redistributed, the height of the water flowing fractured zone of the roof increased, and the water flow of the No. 2 monitoring line increased from the initial 9.1 mL/min to 14.0 mL/min. As the working face was advanced above the No. 2 monitoring line, the fifth periodic of pressure were generated in the roof. The development height of the roof water flowing fractured zone reached 504.4 mm. The roof was separated and collapsed, the cracks in the monitoring line communicated with each other, and the rock stress was released. The water flow in the No. 2 monitoring line increased significantly. Monitoring line No. 3 was affected by advanced mining, resulting in the coal seam roof’s increased rock fissures, the water flow path and resistance were reduced, and the water flow reached 48.3 mL/min. At the same time, the influence range of working face bedrock was close to the boundary of the first fracturing of the flexible isolation layer, and Groups 20–22 of isolation lays had been fractured.When mining started at the sixth periodic pressure, the roof water-conducting fracture zone gradually reached the maximum height and penetrated the fractured isolation layer, and the fracture of the roof rock increased. Lines No. 2 and No. 3 reached 44.4 mL/min and 85.6 mL/min, respectively. In fact, the encounter may indicate that the confined water of the gritstone aquifer was released, and the water flow of the working face increased. Then, the working face progressed, and the collapsed gangue above the mining-out area was compacted into the bedrock roof. The stress in the goaf did not change significantly, and the cracks in the strata decreased. The No. 2 and No. 3 water flows of the monitoring line gradually dropped. During this period, the change law of monitoring Lines 4–7 was similar to that of No. 2 and No. 3. During coal seam mining, the roof underwent a process of fracture, collapse, compaction and full mining, and the water flow monitoring line also went through a process of rising and then falling.When the working face was advanced to the eleventh periodic pressure, the grouting transformation of isolation layers 1–12 was conducted. The slurry was injected into the flexible isolation layer by hand pressure pump along the grouting pipe. After the slurry solidified, the colour of the No. 1 and No. 2 monitoring lines gradually became shallow, and the water flow gradually decreased under infrared observation. As the extraction of the coal seam progressed and the flexible insulation layer was broken and grouted, the colour of observation Lines 1–4 turned black in the infrared observation until the fourth grouting of insulation layer 18–19, and the water flow rate all showed 0 mL/min. However, the lower strata of the flexible isolation layer were not yet stabilized, so monitoring Lines 5–7 did not undergo any grouting transformation and still had a large water flow until the end of mining. Flow metre and infrared observations show that the destruction and grouting of the flexible isolation layer had a noticeable effect on the seepage characteristics of the overburden. In particular, after the grouting of the isolation layer, the slurry filled and solidified rapidly, the water flow decreased rapidly, and the water plugging effect of flexible isolation layer grouting was remarkable.Discussion and analysisDuring coal seam mining, the fracturing of the flexible isolation layer should be based on the premining overtopping influence range; that is, when the boundary line of bedrock influence extends to the range of the flexible isolation layer reached by the fracturing area of the flexible isolation layer, the next fracturing should continue. The average boundary angle range of the bedrock was 76.7°, and the field angle should not be less than 73.57°. The grouting of the flexible isolation layer considers the full mining degree of the coal seam. When there is no significant change in stress in the mined area, grouting of the flexible isolation layer at the top of the goaf is conducted. According to the simulation experiment in this paper, the full mining distance of the working face is 1338.9 mm, and the actual distance on site is 187.446 m. It is calculated that the distance between the fracture of the flexible isolation layer should be no less than 854.8 mm away from the working face, and the actual distance on site is 119.672 m. After the working face enters full mining, the shortest distance between the fracturing grouting range of the flexible isolation layer and the working face is not less than 242.6 mm, and the actual distance on site is 33.964 m.As seen from the previous analysis, with the advancement of the working face, the bedrock influence boundary angle of the coal seam does not change significantly, which only plays a guiding role in the fracturing sequence of the flexible isolation layer. The fracturing of the flexible isolation layer had an clear influence on the seepage of water-rich bedrock at the bottom of the Zhiluo Formation. The water-flowing fractured zone formed in the process of coal seam mining promoted the release of fractured water in the water-rich bedrock at the bottom of the Zhiluo Formation. The higher the height of the water-flowing fractured zone is, the greater the seepage of the water-rich bedrock. Coal seam mining had little effect on the seepage characteristics of the water-rich bedrock layer at the bottom of the Zhiluo Formation in the range of not disturbed by mining and advanced influence.In accordance with the stress sensor data, when the working face passed a certain distance, the bottom plate of the extraction area was compacted by the falling gangue, and the sensor pressure data did not change with the mining face. At this time, the grouting of the fracturing area of the flexible isolation layer corresponding to the above goaf was not affected by the mining face. For example, the stress in the goaf of 1200 mm had no clear change. Therefore, the first grouting was conducted in the fracturing area. After the solidification of the grouting slurry, the water flow of monitoring lines No. 1 and No. 2 decreased significantly. This minimized the impact on the original geological environment and at the same time reduced the goaf water drainage of the working face. The sealing effect of the isolation layer has an important influence on promoting water-retaining coal mining.The experimental application of the flexible isolation layer has realized its feasibility from the physical simulation test method in this paper. The realization of a flexible isolation layer requires premining fracturing and postmining isolation grouting. At present, premining fracturing can be achieved by directional drilling technology. There are also examples of roof separation grouting for postmining flexible isolation layer grouting28,29. Therefore, there is no technical bottleneck in field applications. Moreover, there is still a certain distance from the specific engineering application. According to the results of this study, it is predicted that the implementation of a flexible isolation layer will have great significance for water conservation coal mining in western China, which can reduce soil erosion and protect surface ecology. More

  • in

    Portugal leads with Europe’s largest marine reserve

    CORRESPONDENCE
    18 January 2022

    Portugal leads with Europe’s largest marine reserve

    Filipe Alves

     ORCID: http://orcid.org/0000-0003-3752-2745

    0
    ,

    João G. Monteiro

     ORCID: http://orcid.org/0000-0002-3401-6495

    1
    ,

    Paulo Oliveira

    2
    &

    João Canning-Clode

     ORCID: http://orcid.org/0000-0003-2143-6535

    3

    Filipe Alves

    MARE Marine and Environmental Sciences Centre, Madeira, Portugal.

    View author publications

    You can also search for this author in PubMed
     Google Scholar

    João G. Monteiro

    MARE Marine and Environmental Sciences Centre, Madeira, Portugal.

    View author publications

    You can also search for this author in PubMed
     Google Scholar

    Paulo Oliveira

    Institute of Forests and Nature Conservation (IFCN, IP-RAM), Funchal, Madeira, Portugal.

    View author publications

    You can also search for this author in PubMed
     Google Scholar

    João Canning-Clode

    Smithsonian Environmental Research Center, Maryland, USA.

    View author publications

    You can also search for this author in PubMed
     Google Scholar

    Twitter

    Facebook

    Email

    Marine conservation is central to the United Nations’ Sustainable Development Goals 13 (climate action) and 14 (life below water). Portugal has now created the largest marine reserve with full protection in Europe and the North Atlantic, an achievement that other nations could follow.

    Access options

    Access through your institution

    Change institution

    Buy or subscribe

    /* style specs start */
    style{display:none!important}.LiveAreaSection-193358632 *{align-content:stretch;align-items:stretch;align-self:auto;animation-delay:0s;animation-direction:normal;animation-duration:0s;animation-fill-mode:none;animation-iteration-count:1;animation-name:none;animation-play-state:running;animation-timing-function:ease;azimuth:center;backface-visibility:visible;background-attachment:scroll;background-blend-mode:normal;background-clip:borderBox;background-color:transparent;background-image:none;background-origin:paddingBox;background-position:0 0;background-repeat:repeat;background-size:auto auto;block-size:auto;border-block-end-color:currentcolor;border-block-end-style:none;border-block-end-width:medium;border-block-start-color:currentcolor;border-block-start-style:none;border-block-start-width:medium;border-bottom-color:currentcolor;border-bottom-left-radius:0;border-bottom-right-radius:0;border-bottom-style:none;border-bottom-width:medium;border-collapse:separate;border-image-outset:0s;border-image-repeat:stretch;border-image-slice:100%;border-image-source:none;border-image-width:1;border-inline-end-color:currentcolor;border-inline-end-style:none;border-inline-end-width:medium;border-inline-start-color:currentcolor;border-inline-start-style:none;border-inline-start-width:medium;border-left-color:currentcolor;border-left-style:none;border-left-width:medium;border-right-color:currentcolor;border-right-style:none;border-right-width:medium;border-spacing:0;border-top-color:currentcolor;border-top-left-radius:0;border-top-right-radius:0;border-top-style:none;border-top-width:medium;bottom:auto;box-decoration-break:slice;box-shadow:none;box-sizing:border-box;break-after:auto;break-before:auto;break-inside:auto;caption-side:top;caret-color:auto;clear:none;clip:auto;clip-path:none;color:initial;column-count:auto;column-fill:balance;column-gap:normal;column-rule-color:currentcolor;column-rule-style:none;column-rule-width:medium;column-span:none;column-width:auto;content:normal;counter-increment:none;counter-reset:none;cursor:auto;display:inline;empty-cells:show;filter:none;flex-basis:auto;flex-direction:row;flex-grow:0;flex-shrink:1;flex-wrap:nowrap;float:none;font-family:initial;font-feature-settings:normal;font-kerning:auto;font-language-override:normal;font-size:medium;font-size-adjust:none;font-stretch:normal;font-style:normal;font-synthesis:weight style;font-variant:normal;font-variant-alternates:normal;font-variant-caps:normal;font-variant-east-asian:normal;font-variant-ligatures:normal;font-variant-numeric:normal;font-variant-position:normal;font-weight:400;grid-auto-columns:auto;grid-auto-flow:row;grid-auto-rows:auto;grid-column-end:auto;grid-column-gap:0;grid-column-start:auto;grid-row-end:auto;grid-row-gap:0;grid-row-start:auto;grid-template-areas:none;grid-template-columns:none;grid-template-rows:none;height:auto;hyphens:manual;image-orientation:0deg;image-rendering:auto;image-resolution:1dppx;ime-mode:auto;inline-size:auto;isolation:auto;justify-content:flexStart;left:auto;letter-spacing:normal;line-break:auto;line-height:normal;list-style-image:none;list-style-position:outside;list-style-type:disc;margin-block-end:0;margin-block-start:0;margin-bottom:0;margin-inline-end:0;margin-inline-start:0;margin-left:0;margin-right:0;margin-top:0;mask-clip:borderBox;mask-composite:add;mask-image:none;mask-mode:matchSource;mask-origin:borderBox;mask-position:0% 0%;mask-repeat:repeat;mask-size:auto;mask-type:luminance;max-height:none;max-width:none;min-block-size:0;min-height:0;min-inline-size:0;min-width:0;mix-blend-mode:normal;object-fit:fill;object-position:50% 50%;offset-block-end:auto;offset-block-start:auto;offset-inline-end:auto;offset-inline-start:auto;opacity:1;order:0;orphans:2;outline-color:initial;outline-offset:0;outline-style:none;outline-width:medium;overflow:visible;overflow-wrap:normal;overflow-x:visible;overflow-y:visible;padding-block-end:0;padding-block-start:0;padding-bottom:0;padding-inline-end:0;padding-inline-start:0;padding-left:0;padding-right:0;padding-top:0;page-break-after:auto;page-break-before:auto;page-break-inside:auto;perspective:none;perspective-origin:50% 50%;pointer-events:auto;position:static;quotes:initial;resize:none;right:auto;ruby-align:spaceAround;ruby-merge:separate;ruby-position:over;scroll-behavior:auto;scroll-snap-coordinate:none;scroll-snap-destination:0 0;scroll-snap-points-x:none;scroll-snap-points-y:none;scroll-snap-type:none;shape-image-threshold:0;shape-margin:0;shape-outside:none;tab-size:8;table-layout:auto;text-align:initial;text-align-last:auto;text-combine-upright:none;text-decoration-color:currentcolor;text-decoration-line:none;text-decoration-style:solid;text-emphasis-color:currentcolor;text-emphasis-position:over right;text-emphasis-style:none;text-indent:0;text-justify:auto;text-orientation:mixed;text-overflow:clip;text-rendering:auto;text-shadow:none;text-transform:none;text-underline-position:auto;top:auto;touch-action:auto;transform:none;transform-box:borderBox;transform-origin:50% 50% 0;transform-style:flat;transition-delay:0s;transition-duration:0s;transition-property:all;transition-timing-function:ease;vertical-align:baseline;visibility:visible;white-space:normal;widows:2;width:auto;will-change:auto;word-break:normal;word-spacing:normal;word-wrap:normal;writing-mode:horizontalTb;z-index:auto;-webkit-appearance:none;-moz-appearance:none;-ms-appearance:none;appearance:none;margin:0}.LiveAreaSection-193358632{width:100%}.LiveAreaSection-193358632 .login-option-buybox{display:block;width:100%;font-size:17px;line-height:30px;color:#222;padding-top:30px;font-family:Harding,Palatino,serif}.LiveAreaSection-193358632 .additional-access-options{display:block;font-weight:700;font-size:17px;line-height:30px;color:#222;font-family:Harding,Palatino,serif}.LiveAreaSection-193358632 .additional-login >li:not(:first-child)::before{transform:translateY(-50%);content:”;height:1rem;position:absolute;top:50%;left:0;border-left:2px solid #999}.LiveAreaSection-193358632 .additional-login >li:not(:first-child){padding-left:10px}.LiveAreaSection-193358632 .additional-login >li{display:inline-block;position:relative;vertical-align:middle;padding-right:10px}.BuyBoxSection-683559780{display:flex;flex-wrap:wrap;flex:1;flex-direction:row-reverse;margin:-30px -15px 0}.BuyBoxSection-683559780 .box-inner{width:100%;height:100%}.BuyBoxSection-683559780 .readcube-buybox{background-color:#f3f3f3;flex-shrink:1;flex-grow:1;flex-basis:255px;background-clip:content-box;padding:0 15px;margin-top:30px}.BuyBoxSection-683559780 .subscribe-buybox{background-color:#f3f3f3;flex-shrink:1;flex-grow:4;flex-basis:300px;background-clip:content-box;padding:0 15px;margin-top:30px}.BuyBoxSection-683559780 .subscribe-buybox-nature-plus{background-color:#f3f3f3;flex-shrink:1;flex-grow:4;flex-basis:100%;background-clip:content-box;padding:0 15px;margin-top:30px}.BuyBoxSection-683559780 .title-readcube{display:block;margin:0;margin-right:20%;margin-left:20%;font-size:24px;line-height:32px;color:#222;padding-top:30px;text-align:center;font-family:Harding,Palatino,serif}.BuyBoxSection-683559780 .title-buybox{display:block;margin:0;margin-right:29%;margin-left:29%;font-size:24px;line-height:32px;color:#222;padding-top:30px;text-align:center;font-family:Harding,Palatino,serif}.BuyBoxSection-683559780 .title-asia-buybox{display:block;margin:0;margin-right:5%;margin-left:5%;font-size:24px;line-height:32px;color:#222;padding-top:30px;text-align:center;font-family:Harding,Palatino,serif}.BuyBoxSection-683559780 .asia-link{color:#069;cursor:pointer;text-decoration:none;font-size:1.05em;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:1.05em6}.BuyBoxSection-683559780 .access-readcube{display:block;margin:0;margin-right:10%;margin-left:10%;font-size:14px;color:#222;padding-top:10px;text-align:center;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:20px}.BuyBoxSection-683559780 .access-asia-buybox{display:block;margin:0;margin-right:5%;margin-left:5%;font-size:14px;color:#222;padding-top:10px;text-align:center;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:20px}.BuyBoxSection-683559780 .access-buybox{display:block;margin:0;margin-right:30%;margin-left:30%;font-size:14px;color:#222;opacity:.8px;padding-top:10px;text-align:center;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:20px}.BuyBoxSection-683559780 .usps-buybox{display:block;margin:0;margin-right:30%;margin-left:30%;font-size:14px;color:#222;opacity:.8px;text-align:center;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:20px}.BuyBoxSection-683559780 .price-buybox{display:block;font-size:30px;color:#222;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;padding-top:30px;text-align:center}.BuyBoxSection-683559780 .price-from{font-size:14px;padding-right:10px;color:#222;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:20px}.BuyBoxSection-683559780 .issue-buybox{display:block;font-size:13px;text-align:center;color:#222;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:19px}.BuyBoxSection-683559780 .no-price-buybox{display:block;font-size:13px;line-height:18px;text-align:center;padding-right:10%;padding-left:10%;padding-bottom:20px;padding-top:30px;color:#222;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif}.BuyBoxSection-683559780 .vat-buybox{display:block;margin-top:5px;margin-right:20%;margin-left:20%;font-size:11px;color:#222;padding-top:10px;padding-bottom:15px;text-align:center;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;line-height:17px}.BuyBoxSection-683559780 .button-container{display:flex;padding-right:20px;padding-left:20px;justify-content:center}.BuyBoxSection-683559780 .button-container >*{flex:1px}.BuyBoxSection-683559780 .button-container >a:hover,.Button-505204839:hover,.Button-1078489254:hover,.Button-2808614501:hover{text-decoration:none}.BuyBoxSection-683559780 .readcube-button{background:#fff;margin-top:30px}.BuyBoxSection-683559780 .button-asia{background:#069;border:1px solid #069;border-radius:0;cursor:pointer;display:block;padding:9px;outline:0;text-align:center;text-decoration:none;min-width:80px;margin-top:75px}.BuyBoxSection-683559780 .button-label-asia,.ButtonLabel-3869432492,.ButtonLabel-3296148077,.ButtonLabel-1566022830{display:block;color:#fff;font-size:17px;line-height:20px;font-family:-apple-system,BlinkMacSystemFont,”Segoe UI”,Roboto,Oxygen-Sans,Ubuntu,Cantarell,”Helvetica Neue”,sans-serif;text-align:center;text-decoration:none;cursor:pointer}.Button-505204839,.Button-1078489254,.Button-2808614501{background:#069;border:1px solid #069;border-radius:0;cursor:pointer;display:block;padding:9px;outline:0;text-align:center;text-decoration:none;min-width:80px;max-width:320px;margin-top:10px}.Button-505204839 .readcube-label,.Button-1078489254 .readcube-label,.Button-2808614501 .readcube-label{color:#069}
    /* style specs end */Subscribe to nature+Get immediate online access to the entire Nature family of 50+ journals$29.99monthlySubscribeSubscribe to JournalGet full journal access for 1 year$199.00only $3.90 per issueSubscribeAll prices are NET prices. VAT will be added later in the checkout.Tax calculation will be finalised during checkout.Buy articleGet time limited or full article access on ReadCube.$32.00BuyAll prices are NET prices.

    Additional access options:

    Log in

    Learn about institutional subscriptions

    Nature 601, 318 (2022)
    doi: https://doi.org/10.1038/d41586-022-00093-8

    Competing Interests
    The authors declare no competing interests.

    Related Articles

    See more letters to the editor

    Subjects

    Conservation biology

    Ecology

    Sustainability

    Latest on:

    Ecology

    Wind power versus wildlife: root mitigation in evidence
    Correspondence 11 JAN 22

    Two million species catalogued by 500 experts
    Correspondence 11 JAN 22

    EU Nature Restoration Law needs ambitious and binding targets
    Correspondence 11 JAN 22

    Sustainability

    Sustainability at the crossroads
    Editorial 21 DEC 21

    The UN must get on with appointing its new science board
    Editorial 08 DEC 21

    Battery-powered trains offer a cost-effective ride to a cleaner world
    Research Highlight 22 NOV 21

    Jobs

    JUNIOR PROFESSOR IN MECHANICAL ENGINEERING: DIGITISATION FOR SMART OPERATIONS AND MAINTENANCE OF MACHINES

    KU Leuven
    Leuven, Belgium

    Director of IOCB Prague

    Institute of Organic Chemistry and Biochemistry of the CAS (IOCB Prague)
    Prague, Czech Republic

    Director of IOCB Prague

    Institute of Organic Chemistry and Biochemistry of the CAS (IOCB Prague)
    Prague, Czech Republic

    Chair, Department of Physiology

    Tulane University School of Medicine (SOM)
    New Orleans, LA, United States More

  • in

    Fishing activity before closure, during closure, and after reopening of the Northeast Canyons and Seamounts Marine National Monument

    Data and softwareThis analysis used two main data sources: (1) annual (through 2020) summaries of landings by species and by region provided by the Atlantic Coastal Cooperative Statistics Program (ACCSP), and (2) vessel-tracking data provided by Global Fishing Watch. The ACCSP is a cooperative state-federal program of U.S. states and the District of Columbia; it was established in 1995 to be the principal source of fisheries-dependent information on the Atlantic Coast of the United States. For the ACCSP data, I obtained annual landings by species for the North Atlantic region, Mid Atlantic region, and South Atlantic region (excluding landings from the Gulf of Mexico). The weekly cumulative landings data was obtained from the NOAA Fisheries Greater Atlantic Quota Monitoring website. Global Fishing Watch is an organization that provides access to information on commercial fishing activities, in particular information on the identity and location of fishing vessels34. Many large vessels use a system known as the Automatic Identification System (AIS) to avoid collisions at sea, broadcast their location to port authorities and other vessels, and to view other vessels in their vicinity. Vessels fitted with AIS transceivers can be observed by AIS base stations and by satellites fitted with AIS receivers. The US Coast Guard requires all vessels larger than 65 feet to have an AIS receiver onboard. Global Fishing Watch obtains AIS data for fishing vessels and enables users with Internet access to monitor fishing activity globally, and to view individual vessel tracks. They also partner with academic researchers to provide more fine-scale data.To obtain the vessel-tracking data for the relevant fisheries, I reviewed NOAA databases of squid and mackerel permits (2019 version; vessels with squid permits are automatically issued a Butterfish permit), and the Atlantic tuna permits (2020 version) and matched each permitted vessel to its unique Maritime Mobile Service Identity (MMSI) number, which is associated with Global Fishing Watch tracking information. I was able to identify 84% (187/224) of squid/butterfish permitted vessels (I focused on the SMB1A (Tier 1) permit category associated with the vast majority ( approx. 99%) of squid catch35), 100% of Tier 1 and Tier 2 mackerel-permitted vessels (56/56 vessels), and 74% of active tuna longline vessels (100/135 vessels). “Active” is defined as having reported successfully setting pelagic longline gear at least once between 2006 and 201236. This translates to a total of 17.55 million observations on fishing vessel locations for all three fisheries. I drop any observations that are missing either a latitude or longitude entry. For the squid and mackerel fisheries, I drop any observations with unusual longitudes ((ge 0^{circ }) and ( More

  • in

    Iterative data-driven forecasting of the transmission and management of SARS-CoV-2/COVID-19 using social interventions at the county-level

    1.Ebrahim, S. H., Ahmed, Q. A., Gozzer, E., Schlagenhauf, P. & Memish, Z. A. Covid-19 and community mitigation strategies in a pandemic. BMJ 368, m1066. https://doi.org/10.1136/bmj.m1066 (2020).Article 
    PubMed 

    Google Scholar 
    2.Ebrahim, S. H. et al. All hands on deck: A synchronized whole-of-world approach for COVID-19 mitigation. Int. J. Infect. Dis. 98, 208–215. https://doi.org/10.1016/j.ijid.2020.06.049 (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    3.Kantner, M. & Koprucki, T. Beyond just “flattening the curve”: Optimal control of epidemics with purely non-pharmaceutical interventions. J. Math. Ind. https://doi.org/10.1186/s13362-020-00091-3 (2020).MathSciNet 
    Article 
    PubMed 
    PubMed Central 
    MATH 

    Google Scholar 
    4.Kupferschmidt, K. The lockdowns worked-but what comes next?. Science 368, 218–219. https://doi.org/10.1126/science.368.6488.218 (2020).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    5.Byambasuren, O. et al. Estimating the seroprevalence of SARS-CoV-2 infections: Systematic review. medRxiv. https://doi.org/10.1101/2020.07.13.20153163 (2020).Article 

    Google Scholar 
    6.Fontanet, A. & Cauchemez, S. COVID-19 herd immunity: Where are we?. Nat. Rev. Immunol. 20, 583–584. https://doi.org/10.1038/s41577-020-00451-5 (2020).CAS 
    Article 
    PubMed 

    Google Scholar 
    7.Chowdhury, R. et al. Dynamic interventions to control COVID-19 pandemic: A multivariate prediction modelling study comparing 16 worldwide countries. Eur. J. Epidemiol. 35, 389–399. https://doi.org/10.1007/s10654-020-00649-w (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    8.Giordano, G. et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat. Med. 26, 855–860. https://doi.org/10.1038/s41591-020-0883-7 (2020).CAS 
    Article 
    PubMed 

    Google Scholar 
    9.Kissler, S. M., Tedijanto, C., Goldstein, E., Grad, Y. H. & Lipsitch, M. Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period. Science 368, 860. https://doi.org/10.1126/science.abb5793 (2020).ADS 
    CAS 
    Article 
    PubMed 

    Google Scholar 
    10.Prem, K. et al. The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: A modelling study. Lancet Public Health 5, e261–e270. https://doi.org/10.1016/S2468-2667(20)30073-6 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    11.Leung, K., Wu, J. T., Liu, D. & Leung, G. M. First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: A modelling impact assessment. Lancet 395, 1382–1393. https://doi.org/10.1016/S0140-6736(20)30746-7 (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    12.Peng, L., Yang, W., Zhang, D., Zhuge, C. & Hong, L. Epidemic analysis of COVID-19 in China by dynamical modeling. medRxiv. https://doi.org/10.1101/2020.02.16.20023465 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    13.Read, J. M., Bridgen, J. R. E., Cummings, D. A. T., Ho, A. & Jewell, C. P. Novel coronavirus 2019-nCoV: Early estimation of epidemiological parameters and epidemic predictions. medRxiv. https://doi.org/10.1101/2020.01.23.20018549 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    14.Roda, W. C., Varughese, M. B., Han, D. & Li, M. Y. Why is it difficult to accurately predict the COVID-19 epidemic?. Infect. Dis. Model 5, 271–281. https://doi.org/10.1016/j.idm.2020.03.001 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    15.Wu, J. T., Leung, K. & Leung, G. M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: A modelling study. Lancet 395, 689–697. https://doi.org/10.1016/S0140-6736(20)30260-9 (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    16.Perc, M., Gorišek Miksić, N., Slavinec, M. & Stožer, A. Forecasting COVID-19. Front. Phys. https://doi.org/10.3389/fphy.2020.00127 (2020).Article 

    Google Scholar 
    17.Er, S., Yang, S. & Zhao, T. COUnty aggRegation mixup AuGmEntation (COURAGE) COVID-19 prediction. Sci. Rep. 11, 14262. https://doi.org/10.1038/s41598-021-93545-6 (2021).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    18.Hunter, E., Mac Namee, B. & Kelleher, J. An open-data-driven agent-based model to simulate infectious disease outbreaks. PLoS One. https://doi.org/10.1371/journal.pone.0208775 (2018).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    19.Venkatramanan, S. et al. Using data-driven agent-based models for forecasting emerging infectious diseases. Epidemics 22, 43–49. https://doi.org/10.1016/j.epidem.2017.02.010 (2018).Article 
    PubMed 

    Google Scholar 
    20.Brett, T. S. & Rohani, P. Transmission dynamics reveal the impracticality of COVID-19 herd immunity strategies. Proc. Natl. Acad. Sci. U. S. A. https://doi.org/10.1073/pnas.2008087117 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    21.Britton, T., Ball, F. & Trapman, P. A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. Science 369, 846–849. https://doi.org/10.1126/science.abc6810 (2020).ADS 
    MathSciNet 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    22.Beven, K. Environmental Modelling: An Uncertain Future? (CRC Press, 2010).
    Google Scholar 
    23.Dietze, M. C. Prediction in ecology: A first-principles framework. Ecol. Appl. 27, 2048–2060. https://doi.org/10.1002/eap.1589 (2017).Article 
    PubMed 

    Google Scholar 
    24.Dietze, M. C. et al. Iterative near-term ecological forecasting: Needs, opportunities, and challenges. Proc. Natl. Acad. Sci. 115, 1424. https://doi.org/10.1073/pnas.1710231115 (2018).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    25.Keenan, T. F., Carbone, M. S., Reichstein, M. & Richardson, A. D. The model-data fusion pitfall: Assuming certainty in an uncertain world. Oecologia 167, 587–597. https://doi.org/10.1007/s00442-011-2106-x (2011).ADS 
    Article 
    PubMed 

    Google Scholar 
    26.Niu, S. et al. The role of data assimilation in predictive ecology. Ecosphere 5, art65. https://doi.org/10.1890/ES13-00273.1 (2014).Article 

    Google Scholar 
    27.White, E. P. et al. Developing an automated iterative near-term forecasting system for an ecological study. Methods Ecol. Evol. 10, 332–344. https://doi.org/10.1111/2041-210X.13104 (2019).Article 

    Google Scholar 
    28.Luo, Y. et al. Ecological forecasting and data assimilation in a data-rich era. Ecol. Appl. 21, 1429–1442. https://doi.org/10.1890/09-1275.1 (2011).Article 
    PubMed 

    Google Scholar 
    29.White, B. G. et al. Short-term forecast validation of six models. Weather Forecast. 14, 84–108. https://doi.org/10.1175/1520-0434(1999)014%3C0084:STFVOS%3E2.0.CO;2 (1999).ADS 
    Article 

    Google Scholar 
    30.Calvetti, D., Hoover, A. P., Rose, J. & Somersalo, E. Metapopulation network models for understanding, predicting, and managing the coronavirus disease COVID-19. Front. Phys. https://doi.org/10.3389/fphy.2020.00261 (2020).Article 

    Google Scholar 
    31.O’Sullivan, D., Gahegan, M., Exeter, D. J. & Adams, B. Spatially explicit models for exploring COVID-19 lockdown strategies. T Gis 24, 967–1000. https://doi.org/10.1111/tgis.12660 (2020).Article 

    Google Scholar 
    32.James, N., Menzies, M. & Bondell, H. Understanding spatial propagation using metric geometry with application to the spread of COVID-19 in the United States. EPL (Europhys. Lett.) 135, 48004. https://doi.org/10.1209/0295-5075/ac2752 (2021).ADS 
    CAS 
    Article 

    Google Scholar 
    33.Li, D. et al. Identifying US County-level characteristics associated with high COVID-19 burden. BMC Public Health 21, 1007. https://doi.org/10.1186/s12889-021-11060-9 (2021).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    34.Bisset, K. R. et al. INDEMICS: An interactive high-performance computing framework for data-intensive epidemic modeling. ACM Trans. Model Comput. Simul. https://doi.org/10.1145/2501602 (2014).MathSciNet 
    Article 
    PubMed 
    PubMed Central 
    MATH 

    Google Scholar 
    35.Chao, D. L., Halloran, M. E., Obenchain, V. J. & Longini, I. M. Jr. FluTE, a publicly available stochastic influenza epidemic simulation model. PLoS Comput. Biol. 6, e1000656. https://doi.org/10.1371/journal.pcbi.1000656 (2010).ADS 
    MathSciNet 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    36.Marathe, M. V. & Ramakrishnan, N. Recent advances in computational epidemiology. IEEE Intell. Syst. 28, 96–101. https://doi.org/10.1109/MIS.2013.114 (2013).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    37.Dowd, M. A sequential Monte Carlo approach for marine ecological prediction. Environmetrics 17, 435–455. https://doi.org/10.1002/env.780 (2006).MathSciNet 
    Article 

    Google Scholar 
    38.Gu, F. On-demand data assimilation of large-scale spatial temporal systems using sequential Monte Carlo methods. Simul. Model. Pract. Theory 85, 1–14. https://doi.org/10.1016/j.simpat.2018.03.007 (2018).Article 

    Google Scholar 
    39.Michael, E. et al. Continental-scale, data-driven predictive assessment of eliminating the vector-borne disease, lymphatic filariasis, in sub-Saharan Africa by 2020. BMC Med. 15, 176. https://doi.org/10.1186/s12916-017-0933-2 (2017).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    40.Poole, D. & Raftery, A. E. Inference for deterministic simulation models: The Bayesian melding approach. J. Am. Stat. Assoc. 95, 1244–1255. https://doi.org/10.1080/01621459.2000.10474324 (2000).MathSciNet 
    Article 
    MATH 

    Google Scholar 
    41.Singh, B. K. & Michael, E. Bayesian calibration of simulation models for supporting management of the elimination of the macroparasitic disease, Lymphatic Filariasis. Parasites Vectors 8, 522. https://doi.org/10.1186/s13071-015-1132-7 (2015).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    42.Sisson, S. A., Fan, Y. & Tanaka, M. M. Sequential Monte Carlo without likelihoods. Proc. Natl. Acad. Sci. 104, 1760. https://doi.org/10.1073/pnas.0607208104 (2007).ADS 
    MathSciNet 
    CAS 
    Article 
    PubMed 
    PubMed Central 
    MATH 

    Google Scholar 
    43.Spear, R. C., Hubbard, A., Liang, S. & Seto, E. Disease transmission models for public health decision making: Toward an approach for designing intervention strategies for Schistosomiasis japonica. Environ. Health Perspect. 110, 907–915. https://doi.org/10.1289/ehp.02110907 (2002).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    44.Taylor, S. D. & White, E. P. Automated data-intensive forecasting of plant phenology throughout the United States. Ecol. Appl. 30, e02025. https://doi.org/10.1002/eap.2025 (2020).Article 
    PubMed 

    Google Scholar 
    45.Beaulieu-Jones, B. K. & Greene, C. S. Reproducibility of computational workflows is automated using continuous analysis. Nat. Biotechnol. 35, 342–346. https://doi.org/10.1038/nbt.3780 (2017).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    46.Delgoshaei, P., Austin, M. A. & Pertzborn, A. J. A semantic framework for modeling and simulation of cyber-physical systems. Int. J. Adv. Sys. Measure. 7, 223–237 (2014).
    Google Scholar 
    47.Dong, E., Du, H. & Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20, 533–534. https://doi.org/10.1016/S1473-3099(20)30120-1 (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    48.Henkel, R., Wolkenhauer, O. & Waltemath, D. Combining computational models, semantic annotations and simulation experiments in a graph database. Database https://doi.org/10.1093/database/bau130 (2015).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    49.Merkel, D. Docker: Lightweight linux containers for consistent development and deployment. Linux J. 2014, 2 (2014).
    Google Scholar 
    50.Nakamura, K., Higuchi, T. & Hirose, N. Sequential data assimilation: Information fusion of a numerical simulation and large scale observation data. J. UCS 12, 608–626. https://doi.org/10.3217/jucs-012-06-0608 (2006).Article 

    Google Scholar 
    51.Stodden, V. & Miguez, S. Best practices for computational science: Software infrastructure and environments for reproducible and extensible research. J. Open Res. Softw. https://doi.org/10.5334/jors.ay (2014).Article 

    Google Scholar 
    52.Unacast. Social distancing scoreboard. https://www.unacast.com/covid19/social-distancing-scoreboard (2020).53.Willem, L. et al. SOCRATES: An online tool leveraging a social contact data sharing initiative to assess mitigation strategies for COVID-19. BMC Res. Notes 13, 293. https://doi.org/10.1186/s13104-020-05136-9 (2020).CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    54.Iboi, E. A., Ngonghala, C. N. & Gumel, A. B. Will an imperfect vaccine curtail the COVID-19 pandemic in the U.S.?. Infect. Dis. Model 5, 510–524. https://doi.org/10.1016/j.idm.2020.07.006 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    55.Badr, H. S. et al. Association between mobility patterns and COVID-19 transmission in the USA: A mathematical modelling study. Lancet Infect. Dis. https://doi.org/10.1016/S1473-3099(20)30553-3 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    56.Contreras, S., Villavicencio, H. A., Medina-Ortiz, D., Biron-Lattes, J. P. & Olivera-Nappa, A. A multi-group SEIRA model for the spread of COVID-19 among heterogeneous populations. Chaos Solitons Fractals 136, 109925. https://doi.org/10.1016/j.chaos.2020.109925 (2020).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    57.Mossong, J. et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 5, e74. https://doi.org/10.1371/journal.pmed.0050074 (2008).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    58.Chen, R. Markov Chain Monte Carlo Vol. Volume 7 Lecture Notes Series, Institute for Mathematical Sciences, National University of Singapore 147–182 (Co-Published with Singapore University Press, 2005).59.Doucet, A., Godsill, S. & Andrieu, C. On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput. 10, 197–208. https://doi.org/10.1023/A:1008935410038 (2000).Article 

    Google Scholar 
    60.Fearnhead, P. & Kunsch, H. R. Particle filters and data assimilation. Annu. Rev. Stat. Appl. 5, 421–449. https://doi.org/10.1146/annurev-statistics-031017-100232 (2018).MathSciNet 
    Article 

    Google Scholar 
    61.Gu, F., Butt, M., Ai, C., Shen, X. & Xiao, J. Proceedings of the Conference on Summer Computer Simulation 1–10 (Society for Computer Simulation International, 2015).62.Florida Agency for Health Care Administration. https://ahca.myflorida.com/ (2020).63.Polonsky, J. A. et al. Outbreak analytics: A developing data science for informing the response to emerging pathogens. Philos. Trans. R. Soc. B. https://doi.org/10.1098/rstb.2018.0276 (2019).Article 

    Google Scholar 
    64.Gambhir, M. et al. Geographic and ecologic heterogeneity in elimination thresholds for the major vector-borne helminthic disease, lymphatic filariasis. BMC Biol. 8, 22. https://doi.org/10.1186/1741-7007-8-22 (2010).Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    65.Spear, R. C. & Hubbard, A. Modelling Parasite Transmission and Control 99–111 (Springer, 2010).66.James, N. & Menzies, M. COVID-19 in the United States: Trajectories and second surge behavior. Chaos Interdiscip. J. Nonlinear Sci. 30, 091102. https://doi.org/10.1063/5.0024204 (2020).MathSciNet 
    CAS 
    Article 

    Google Scholar 
    67.Chang, S. et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature 589, 82–87. https://doi.org/10.1038/s41586-020-2923-3 (2021).ADS 
    CAS 
    Article 
    PubMed 
    PubMed Central 

    Google Scholar 
    68.James, N. & Menzies, M. Efficiency of communities and financial markets during the 2020 pandemic. Chaos Interdiscip. J. Nonlinear Sci. 31, 083116. https://doi.org/10.1063/5.0054493 (2021).MathSciNet 
    Article 

    Google Scholar 
    69.Yilmazkuday, H. Stay-at-home works to fight against COVID-19: International evidence from Google mobility data. J. Hum. Behav. Soc. Environ. 31, 210–220. https://doi.org/10.1080/10911359.2020.1845903 (2021).Article 

    Google Scholar 
    70.Brienen, N. C., Timen, A., Wallinga, J., Van Steenbergen, J. E. & Teunis, P. F. The effect of mask use on the spread of influenza during a pandemic. Risk Anal. 30, 1210–1218. https://doi.org/10.1111/j.1539-6924.2010.01428.x (2010).Article 
    PubMed 
    PubMed Central 

    Google Scholar  More