Population genomics and antimicrobial resistance dynamics of Escherichia coli in wastewater and river environments
Escherichia coli is amongst the dominant pan-aminoglycoside-resistant bacteria from wastewater and river waterA total of 168 bacteria highly resistant to aminoglycosides were obtained by growing collected samples in MacConkey media supplemented with high aminoglycoside concentrations, from both WWTPs (66 isolates) and rivers (102 isolates) sampled at the same period of time and located in the Barcelona region. Most of the isolates were identified as Escherichia coli, being the predominant Enterobacteriaceae species in WWTPs (50 isolates, 75.76%) and the second most prevalent in rivers (20 isolates, 19.61%). Klebsiella pneumoniae was the predominant bacterium identified in river environments (50 isolates, 49.02%), although its presence in WWTP samples was scarce (7 isolates, 10.61%). The remaining aminoglycoside-resistant species, members of genus Enterobacter, Citrobacter and Aeromonas among others, represented a low fraction in WWTPs (9 isolates, 13.64%) comparing to river environments (32 isolates, 31.37%), showing a higher bacterial diversity of natural over WWTP niches (Fig. 1, Supplementary Data 1).Fig. 1: Geographical distribution of sampled points in the region of Barcelona (Spain).The location of Barcelona city is indicated by black circled city icon. The Llobregat river is presented in light blue and the Cardener river in dark blue. The three river sampled locations are indicated by black circled river icon. The two sampled WWTPs are indicated by black circled droplet icon. The collection area of the El Prat WWTP is highlighted in dark green, whereas the area of the Gavà WWTP is highlighted in pale green, collecting the wastewater of 2,000,000 and 370,000 inhabitants, respectively. Total number of pan-aminoglycoside-resistant bacteria collected from each sampling location is indicated in the center of the sunburn diagrams. Inner rings represent the proportion of different pan-aminoglycoside-resistant bacterial species identified in each sampling location, indicating the total number of E. coli, K. pneumoniae and other species, and the number of E. coli isolates selected for later Illumina (I) and Nanopore sequencing (N). Outer rings show the 16S-RMTase gene harbored by these bacteria.Full size imageAntimicrobial resistance in E. coli varies between wastewater and river water environmentsResistance to diverse antibiotic compounds was related to the sampling origin and the species of the isolates. All isolates were highly resistant to 4,6-DOS aminoglycosides, since they were selected according to their resistance to this family of antibiotics. The 16S-RMTase gene identified in most wastewater isolates was rmtB (55 isolates, 83.33%), whereas armA was the responsible in the remainder (11 isolates, 16.67%). On the contrary, 100% of isolates from river environments harbored the armA gene (Fig. 1, Supplementary Data 1). No other 16S-RMTase gene was detected neither in wastewater nor river isolates. Bacteria from wastewater samples showed higher MIC values to all tested antibiotics (e.g., 77.3% isolates resistant to cefotaxime) than bacteria from freshwater environments (e.g., 0% isolates resistant to cefotaxime), even when comparing isolates belonging to the same species. This was also the case for E. coli, where isolates from WWTPs were among the most resistant (e.g., 90% isolates resistant to cefotaxime) while those from rivers were among the most susceptible ones (e.g., 0% isolates resistant to cefotaxime), highlighting key differences between bacteria from these different ecological niches (Supplementary Data 1).Distinct E. coli populations exist in wastewater and river water environmentsPulsed-field gel electrophoresis (PFGE) of E. coli isolates revealed the existence of two highly prevalent pulsotypes present in the two WWTPs (31 isolates, 62%), and a total of 5 characterized pulsotypes (≥90% similarity in the PFGE pattern) from wastewater samples. Interestingly, 15 isolates (30%) from both WWTPs could not be typed by PFGE. In contrast, river environments showed higher clonal diversity, with up to 10 different pulsotypes characterized, despite the lower number of E. coli isolates recovered from these samples (Supplementary Fig. 1). Of the 70 E. coli isolates, 43 were selected for further analyses and sequencing by Illumina technology (25 from WWTPs and 18 from river environments), ensuring wide representation in relation to the origins, clonal relatedness, and antibiogram resistance profiles.Multi-locus sequence typing (MLST) results were highly correlated with PFGE profiles. Almost all E. coli isolates from wastewater samples (23 isolates, 92%) belonged to two predominant sequence types (STs) (Fig. 2). The most prevalent one, ST1196/ST632 (Warwick/Pasteur MLST scheme), encompassed the two main pulsotypes identified by PFGE. ST1196 is an increasingly prevalent ST related to OXA-48-carbapenemase production and associated with the mcr-1 colistin-resistance gene expansion in clinical settings9, hospital wastewater10 and companion animals11. Minimal inhibitory concentration (MIC) values of the isolates belonging to this ST showed resistance to numerous antibiotic classes, including clinically critical antibiotics such as 3rd generation cephalosporins and colistin6. The other predominant ST in WWTPs, ST224/ST479, comprised all the isolates which were non-typeable by PFGE. ST224 is a pandemic multi-drug resistant ST previously associated with NDM-, CTX-M- and KPC-carbapenemase production, found in both clinical human12 and animal13 samples, but also in natural environments14. ST224 E. coli isolates identified exhibited a common resistance profile, including resistance to 3rd generation cephalosporins. However, two isolates from river samples belonging to this same ST showed a different resistance pattern, revealing the influence of the niche in the bacterial resistance phenotype, even within the same ST. One of the E. coli isolates from El Prat WWTP was also identified as ST131, a clonal group present in multiple environments with a plethora of resistance mechanisms and virulence factors, representing a major public health concern15. River-related E. coli showed a higher number of E. coli STs compared to wastewater isolates, comprising up to 6 different E. coli STs (Fig. 2). All river E. coli STs presented similar MIC values for most of the tested antibiotics, which were lower than those exhibited by wastewater E. coli. ST607 (Warwick MLST scheme) was the most prevalent ST found in rivers. This ST has been scarcely reported, although it has been already detected in river sediments16. MICs of plazomicin for all isolates, both from wastewater and river environments, were ≥512 mg/L, demonstrating the high-level resistance to aminoglycosides conferred by 16S-RMTases, even to an aminoglycoside that has not yet been approved for clinical use in the EU (Fig. 2).Fig. 2: Sequenced E. coli data.The source of the isolates is specified by different colors in the genomic SNP-tree branches, as well as the related sequence type. Level of resistance to all tested antibiotics is shown in a gradient of colors: PLZ (plazomicin), GEN (gentamicin), AMP (ampicillin), FOT (cefotaxime), TAZ (ceftazidime), MERO (meropenem), CHL (chloramphenicol), TMP (trimethoprim), AZI (azithromycin), COL (colistin), CIP (ciprofloxacin), NAL (nalidixic acid), TET (tetracycline), TGC (tigecycline), and SMX (sulfamethoxazole). The presence and absence of antibiotic resistance genes and plasmid incompatibility groups are indicated by circle and triangle symbols, respectively. The presence and absence of specific 16S-RMTase genes are indicated by star symbols.Full size imageResistome analysis results were highly correlated with resistance phenotypic profiles. Thus, isolates from WWTPs harbored an heterogenous antibiotic resistance gene content depending on the E. coli ST, such as the strong association between the mcr-1 gene and isolates belonging to ST119617. This heterogenicity was even present between isolates belonging to ST1196, which exhibited different resistance levels to chloramphenicol and tetracycline depending on the presence of cmlA1/floR and tet(A), respectively. However, all wastewater isolates showed a common high-level resistance to β-lactam compounds, including third generation cephalosporins, which could be attributed to the presence of blaCTX-M-55 (ST1196) and blaCMY-2 (ST224) (Fig. 2). On the contrary, all isolates from river environments possessed a uniform antibiotic resistance gene content, despite comprising different STs (Fig. 2). Almost all of them (17 isolates, 94.4%) were susceptible to β-lactams, including the isolates belonging to ST224, contrasting with wastewater isolates, which have been under anthropogenic pressure, such as clinical treatments with a combination of aminoglycosides and β-lactams18, that led to this resistance associations. Likewise, plasmidome analysis revealed that the total plasmid content, based on the plasmid incompatibility groups, was heterogeneously distributed among isolates from WWTPs according to E. coli STs and closely correlated with the resistance gene content (an average of 6.36 different plasmid replicons per isolate). ST1196 isolates carried a higher plasmid content, exhibiting different plasmid profiles among them. The only plasmid incompatibility group carried by all wastewater isolates was IncFII type, specifically a pC15-1a-like plasmid (Fig. 2). Considering river E. coli STs, the total plasmid content was generally lower (an average of 3.72 different plasmid replicons per isolate) and more uniform compared to wastewater STs, similar to the pattern of the resistome (these differences are addressed in the E. coli pan-genome structure and genome, plasmid and antibiotic resistance gene diversities section). The most prevalent plasmid incompatibility group found among them was IncHI2A, present in all river isolates except ST224, which harbored a completely different plasmid content (Fig. 2).
E. coli STs have a similar genomic complexity in wastewater and river water, but the diversity of plasmids and resistance genes is higher in wastewater STsThe pan-genome of all 43 E. coli isolates, with the independence of the origin, was constituted by a total of 13,819 different genes. The pan-genome was distributed in a common core-genome of 3109 genes (22.5%) and a variable accessory-genome of 10,710 genes (77.5%), covering a considerable E. coli diversity, considering that the estimation of the global E. coli core-genome comprises around 1500 genes19 (Supplementary Fig. 2). Attending to the pan-genome configuration depending on the water source, 3340 out of a total of 9099 genes (36.71%) were included in the core-genome of wastewater isolates, and the core-genome of river isolates was formed by 3410 out of 9927 genes (34.35%), showing that river isolates presented a larger total gene pool and a smaller relative core-genome comparing to WWTP isolates. The genes conforming the pan-genomes of wastewater and river water E. coli were statistically different, considering the genes that were present and absent in each niche from the total pan-genome (Jaccard, P-value = 0.001) (Fig. 3a), indicating that the different environments, and/or upstream environments from which they have seeded, led to the selection of specific genomic populations, even between members of E. coli ST224 originating from the two different environments (Jaccard, P-value = 0.049). The genomic diversity of the whole E. coli population from river water was significantly higher than the one found in wastewater (Jaccard, P-value = 3.599 × 10−8) (Supplementary Fig. 3a). However, considering the number of different STs constituting the genomic pool of each environment, the genetic diversity of an E. coli ST from wastewater was statistically similar to the diversity of an E. coli ST from freshwater (Jaccard, P-value = 0.3123) (Fig. 3b). This model, which took into account the number of STs from each source in the diversity analysis, was previously checked by random sampling analysis, obtaining similar results. Essentially, the diversity of the total E. coli population was dependent on the number of different STs that defined the population, but the diversity of a specific ST was independent of this factor and more suitable to estimate the variability of specific bacterial clones according to the origin. Likewise, the diversity of plasmid content, taking into account the different plasmid incompatibility groups identified, was distinctive for each water type (Jaccard, P-value = 0.001) (Fig. 3c). Thus, the ecological niche also influenced the presence of specific plasmid types. The level of complexity of the E. coli plasmid pool circulating in each aquatic environment was statistically similar (Jaccard, P-value = 0.2447) (Supplementary Figure 3b). However, the number of different plasmid types carried by each E. coli ST was significantly higher in WWTPs than in rivers (Jaccard, P-value = 1.037 × 10−14) (Fig. 3d). Both wastewater and river E. coli populations also showed a great divergence in the antibiotic resistance gene content (Jaccard, P-value = 0.001) (Fig. 3e), showing different resistance mechanisms to particular antimicrobial classes. Furthermore, this diversity was significantly higher in the total E. coli population from wastewater comparing to the E. coli population from river water (Jaccard, P-value = 5.093 × 10−14) (Supplementary Fig. 3c), even when the latter possessed a higher total genomic diversity. Thus, the sum of genetic resistance determinants carried by each E. coli ST from wastewater environments was much higher than for each E. coli ST from natural effluents (Jaccard, P-value More
