Whole genome sequence of Vibrio cholerae NB-183 isolated from freshwater in Ontario, Canada harbors a unique gene repertoire

Lawal, Opeyemi U.; Bryan, Noah; Soni, Mitra; Chen, Yanhong; Precious, Melinda; Parreira, Valeria R.; Goodridge, Lawrence

doi:10.1186/s12863-024-01204-2

Data Note
Open access
Published: 15 February 2024

Whole genome sequence of Vibrio cholerae NB-183 isolated from freshwater in Ontario, Canada harbors a unique gene repertoire

Opeyemi U. Lawal¹,
Noah Bryan^1,2,
Mitra Soni¹,
Yanhong Chen¹,
Melinda Precious¹,
Valeria R. Parreira¹ &
…
Lawrence Goodridge¹

BMC Genomic Data volume 25, Article number: 18 (2024) Cite this article

843 Accesses
Metrics details

Abstract

Objective

Vibrio cholerae is an enteric pathogen that poses a significant threat to global health. It causes severe dehydrating diarrheal disease cholera in humans. V. cholerae could be acquired either from consuming contaminated seafood or direct contact with polluted waters. As part of a larger program that assesses the microbial community profile in aquatic systems, V. cholerae strain NB-183 was isolated and characterized using a combination of culture- and whole-genome sequencing-based approaches.

Data description

Here we report the assembled and annotated whole-genome sequence of a V. cholerae strain NB-183 isolated from a recreational freshwater lake in Ontario, Canada. The genome was sequenced using short-read Illumina systems. The whole-genome sequencing yielded 4,112,549 bp genome size with 99 contigs with an average genome coverage of 96× and 47.42% G + C content. The whole genome-based comparison, phylogenomic and gene repertoire indicates that this strain harbors multiple virulence genes and biosynthetic gene clusters. This genome sequence and its associated datasets provided in this study will be an indispensable resource to enhance the understanding of the functional, ecological, and evolutionary dynamics of V. cholerae.

Peer Review reports

Objective

V. cholerae is a causative agent of human diarrheal disease cholera and poses a significant threat to global health [1, 2]. While V. cholerae is naturally found in aquatic systems [1, 3,4,5], its persistence in this environment is attributed to specific stress response and adaptation mechanisms that include biofilm formation on an array of surfaces, survival in different environmental conditions, as well as interaction with other organisms in such environment [2]. V. cholerae is also a foodborne pathogen that could be acquired either from consuming undercooked or raw seafoods or a direct contact with polluted waters [4, 5]. As part of a larger study that assesses the microbial community profile and tracks antimicrobial resistant and pathogenic clones of bacteria in aquatic systems [6,7,8,9], we isolated V. cholerae strain NB-183 from a recreational freshwater lake. The objective of this study is to report the characterized V. cholerae NB-183 strain using whole genome sequencing-based approach. We also provided an extensive genetic background and gene content analysis of this strain.

Water sample was collected in the Fall of 2023 from a recreational freshwater Kettle Lake in Ontario, Canada (43.9486°N, 79.4352°W). To concentrate and detect bacteria in the water sample, 3 mL of nanobeads (Ceres Nanosciences) was added to 5 L of lake water sample and stirred at room temperature for 30 min. Thereafter, beads were collected using a 5-micron sock filter. An aliquot (100 uL) was spread onto MacConkey agar plates and incubated overnight at 37 °C. Pure colonies of differing morphologies were transferred onto a fresh tryptic soy agar (TSA) plate. Phenotypic identification was performed using VITEK® (bioMérieux Canada).

Data description

DNA extraction and whole-genome sequencing of a pure colony was performed as previously described [6, 7]. Briefly, genomic DNA was extracted using the DNeasy blood and tissue kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. DNA libraries were prepared using the DNA prep tagmentation kit and IDT for DNA/RNA unique dual (UD) indexes from Illumina. 2 × 150-bp paired-end sequencing was performed on the Illumina MiniSeq system. Raw reads were preprocessed with FastQC v0.11.9 (https://github.com/s-andrews/FastQC) and trimmed using Trimmomatic v0.39 [10]. Reads with Phred scores of ≥ 30 was assembled de novo using SKESA v2.4.0 [11]. The assembly quality and genome completeness was assessed using QUAST v5.2 [12] and BUSCO v5.5 [13]. Sequence type (ST) assignment was performed using the multilocus sequence type (MLST) database [14]. Genome annotation was performed using the NCBI Prokaryotic Genome Annotation Pipeline v6.6 [15]. Resistome and virulome were identified using CARD [16] and VFDB [17, 18], respectively using minimum coverage of 70% and minimum identity of 90%. Plasmids were identified using MOB-suite v3.1 [19] while PHASTER [20] was used to detect phage regions in the draft genome. Biosynthetic gene clusters were assessed using web-based antiSMASH v7 [21]. Default parameters were used for all bioinformatics pipelines except where otherwise stated.

A total of 1,477,996 paired-end reads were obtained from sequencing isolate NB-183 (Dataset 1, [22]). While the VITEK result was inconclusive, isolate NB-183 was identified as V. cholerae using the k-mer-based species identification with Kraken2 database [23] and refseq_masher using Mash MinHash (https://github.com/phac-nml/refseq_masher). The whole-genome sequencing of NB-183 isolate yielded 99 contigs (N₅₀ = 125,765 bp) from 4,112,549 bp genome size with genome coverage of 96×, 47.42% G + C content, and BUSCO single-copy completeness of 99.2% (v5.5.0) [13] (Table 1; Datafile 1–2 [24, 25]). Whole genome-based comparison using OrthoANI program [26] showed that NB-183 had average nucleotide identity (ANI) of 98% with V. cholerae N16961 (Datafile 1 [24]). In addition, the MLST using the pubMLST database showed that NB-183 had a novel pyrC allele and unique allele profile that had been assigned a new sequence type 1668 (Datafile 3, [27]).

Table 1 Overview of datafiles/datasets

Full size table

V. cholerae NB-183 strain contained 3,645 protein-coding sequences (CDS), 66 pseudo genes, and 60 RNAs (Dataset 2, [28]). Among the CDS were antibiotic resistance genes blaCARB-7 and almEFG operon that encode resistance to penams and polymyxin [16, 29], respectively. In addition, 127 virulence genes were identified in the whole-genome, including genes encoding type II secretion system (T2SS) essential components (espCDEFGHIJKLMN), type VI secretion system (T6SS)-associated genes (hcp-1, hcp-2), toxins (rtxBCD, toxA), among others. Of note, cholera toxin structural genes (ctxA and ctxB) and toxin co-regulated pilus gene (tcpA) were absent in V. cholerae NB-183 (Datafile 4, [30]). No plasmid was detected, but two intact phages (NBp1 and phage NBp2) were identified in the NB-183 genome (Datafile 5, [31]). One of these phages (phage NBp2, size = 6.8 Kbp) was highly similar (97% coverage and nucleotide identity) to Vibrio phage VCY-NC_016162.1 [32]. Likewise, six different biosynthetic gene clusters were identified, with only two showing high homology (Blastp = 100%) to two BGCs (vibriobactin and piscibactin) that were identified previously in Vibrio species (Datafile 6, [33]). The remaining four BGCs had low homology (Blastp = 0–33%) to those in the Minimum Information about a Biosynthetic Genes Cluster (MIBiG database).

Limitations

This data note was limited to the description of draft genome of a V. cholerae strain isolated from a freshwater sample. Further analysis on a larger collection is needed to source attribute the strain and assess the widespread and significance of the unique biosynthetic gene clusters identified.

Data availability

The genomic sequence data described in this Data note has been deposited and freely accessible at DDBJ/ ENA/GenBank. The raw reads were deposited under SRA accession number SRR26980564. The genome annotation version described here is version JAXIPZ000000000.1. Associated datafiles are available on Figshare: Sequence quality metrics and average nucleotide identity [24, 25]; multilocus sequence typing and virulome profile [27, 30]; and phage and biosynthetic gene cluster regions [31, 33].

Abbreviations

TSA:: Tryptic Soy Agar
ANI:: Average Nucleotide Identity
BGC:: Biosynthetic Gene Cluster
CDS:: Coding Sequences
DDH:: DNA-DNA Hybridization
MIBiG:: Minimum Information about a Biosynthetic Genes Cluster
MLST:: Multilocus Sequence Typing
PGAP:: Prokaryotic Genome Annotation Pipeline
SRA:: Sequence Reads Archive
ST:: Sequence Type
T2SS:: Type II Secretion System
T6SS:: Type VI Secretion System

References

Bhandari M, Jennison AV, Rathnayake IU, Huygens F. Evolution, distribution and genetics of atypical Vibrio cholerae – A review. Infect Genet Evol. 2021;89:104726.
Article CAS PubMed Google Scholar
Lutz C, Erken M, Noorian P, Sun S, McDougald D. Environmental reservoirs and mechanisms of persistence of Vibrio cholerae. Front Microbiol. 2013;4.
Jiang SC. Vibrio cholerae in recreational beach waters and tributaries of Southern California. In: Porter JW, editor. The Ecology and etiology of newly emerging Marine diseases. Dordrecht: Springer Netherlands; 2001. pp. 157–64.
Chapter Google Scholar
Baker-Austin C, Oliver JD, Alam M, Ali A, Waldor M, Qadri F, et al. Vibrio spp. infections. Nat Reviews Disease Primers. 2018;4:1–19.
Article Google Scholar
Deng Y, Xu L, Chen H, Liu S, Guo Z, Cheng C, et al. Prevalence, virulence genes, and antimicrobial resistance of Vibrio species isolated from diseased marine fish in South China. Sci Rep. 2020;10:14329.
Article CAS PubMed PubMed Central ADS Google Scholar
Bryan N, Anderson R, Lawal OU, Parreira VR, Goodridge L. Draft genome sequence of Bacillus anthracis N1, isolated from a Recreational Freshwater Kettle Lake in Ontario, Canada. Microbiol Resour Announc. 2023;:e01262–22.
Bryan N, Anderson R, Lawal OU, Parreira VR, Goodridge L. Draft genomes sequences of Exiguobacterium sp. N5 isolated from Recreational Freshwater Kettle Lake in Ontario. Microbiol Resource Announcements. 2022; Under review.
Botschner W, Davidson H, Lawal OU, Parreira VR, Goodridge L. Draft genome sequences of two Proteus mirabilis isolates recovered from a municipal wastewater treatment plant in Ontario, Canada. Microbiol Resour Announc. 2023;:e00559–23.
Lawal OU, Zhang L, Parreira VR, Brown RS, Chettleburgh C, Dannah N et al. Metagenomics of Wastewater Influent from Wastewater Treatment Facilities across Ontario in the era of emerging SARS-CoV-2 variants of concern. Microbiol Resour Announc. 2022;:e00362–22.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Article CAS PubMed PubMed Central Google Scholar
Souvorov A, Agarwala R, Lipman DJ. SKESA: strategic k-mer extension for scrupulous assemblies. Genome Biol. 2018;19:153.
Article PubMed PubMed Central Google Scholar
Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–5.
Article CAS PubMed PubMed Central Google Scholar
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–2.
Article PubMed Google Scholar
Jolley KA, Maiden MCJ, BIGSdb. Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11:595.
Article PubMed PubMed Central Google Scholar
Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016;44:6614–24.
Article CAS PubMed PubMed Central Google Scholar
Alcock BP, Raphenya AR, Lau TTY, Tsang KK, Bouchard M, Edalatmand A, et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2020;48:D517–25.
CAS PubMed Google Scholar
Chen L, Zheng D, Liu B, Yang J, Jin QVFDB. 2016: Hierarchical and refined dataset for big data analysis – 10 years on. Nucleic Acids Research. 2016;44:D694–7.
Liu B, Zheng D, Zhou S, Chen L, Yang J. VFDB 2022: a general classification scheme for bacterial virulence factors. Nucleic Acids Res. 2022;50:D912–7.
Article CAS PubMed Google Scholar
Robertson J, Nash JHE. MOB-suite: software tools for clustering, reconstruction and typing of plasmids from draft assemblies. Microb Genomics. 2018;4:1–7.
Article Google Scholar
Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44:W16–21.
Article CAS PubMed PubMed Central Google Scholar
Blin K, Shaw S, Kloosterman AM, Charlop-Powers Z, van Wezel GP, Medema MH, et al. antiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res. 2021;49:W29–35.
Article CAS PubMed PubMed Central Google Scholar
NCBI Sequence Read. Archive https://identifiers.org/ncbi/insdc.sra:SRR26980564. 2023.
Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken2. Genome Biol. 2019;20:1–13.
Article Google Scholar
Lawal OU, Bryan N, Soni M, Chen Y, Precious M, Parreira VR et al. Table S1, NB-183 – sequence metrics. https://doi.org/10.6084/m9.figshare.24724467. 2023.
Lawal OU, Bryan N, Soni M, Chen Y, Precious M, Parreira VR et al. File 1, NB-183 – genome completeness metrics with BUSCO. https://doi.org/10.6084/m9.figshare.24724443. 2023.
Lee I, Ha S-M, Baek M, Kim DW, Yi H, Chun J. VicPred: a Vibrio cholerae genotype Prediction Tool. Front Microbiol. 2021;12:691895.
Article PubMed PubMed Central Google Scholar
Lawal OU, Bryan N, Soni M, Chen Y, Precious M, Parreira VR et al. Table S2, NB-183 – multilocus sequence typing. https://doi.org/10.6084/m9.figshare.24844686. 2023.
Nucleotide NCBI. https://identifiers.org/nucleotide:JAXIPZ000000000. 2023.
Henderson JC, Herrera CM, Trent MS. AlmG, responsible for polymyxin resistance in pandemic Vibrio cholerae, is a glycyltransferase distantly related to lipid a late acyltransferases. J Biol Chem. 2017;292:21205–15.
Article CAS PubMed PubMed Central Google Scholar
Lawal OU, Bryan N, Soni M, Chen Y, Precious M, Parreira VR et al. Table S3, NB-183 – virulence gene profile. https://doi.org/10.6084/m9.figshare.25103531. 2024.
Lawal OU, Bryan N, Soni M, Chen Y, Precious M, Parreira VR et al. Figure S1, NB-183 – phage regions. https://doi.org/10.6084/m9.figshare.24724509. 2023.
Xue H, Xu Y, Boucher Y, Polz MF. High frequency of a Novel Filamentous phage, VCYϕ, within an environmental Vibrio cholerae Population. Appl Environ Microbiol. 2012;78:28–33.
Article CAS PubMed PubMed Central ADS Google Scholar
Lawal OU, Bryan N, Soni M, Chen Y, Precious M, Parreira VR et al. Figure S2, NB-183 – Biosynthetic gene cluster regions. https://doi.org/10.6084/m9.figshare.24724506. 2023.

Download references

Funding

This project was funded through support from the Canada First Research Excellence Fund.

Author information

Authors and Affiliations

Canadian Research Institute for Food Safety (CRIFS), University of Guelph, Guelph, ON, N1G 2W1, Canada
Opeyemi U. Lawal, Noah Bryan, Mitra Soni, Yanhong Chen, Melinda Precious, Valeria R. Parreira & Lawrence Goodridge
Bayview Secondary School, 10077 Bayview Ave, Richmond Hill, ON, L4C 2L4, Canada
Noah Bryan

Authors

Opeyemi U. Lawal
View author publications
You can also search for this author in PubMed Google Scholar
Noah Bryan
View author publications
You can also search for this author in PubMed Google Scholar
Mitra Soni
View author publications
You can also search for this author in PubMed Google Scholar
Yanhong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Melinda Precious
View author publications
You can also search for this author in PubMed Google Scholar
Valeria R. Parreira
View author publications
You can also search for this author in PubMed Google Scholar
Lawrence Goodridge
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

OL, NB, MS, YC, MP, VP conducted the sampling, isolation, and whole-genome sequencing; OL performed the bioinformatics, and data analysis, and wrote the original draft of the manuscript. LG conceived the project and provided funding and resources. OL, VP and LG supervised the study. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Opeyemi U. Lawal or Lawrence Goodridge.

Ethics declarations

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Lawal, O.U., Bryan, N., Soni, M. et al. Whole genome sequence of Vibrio cholerae NB-183 isolated from freshwater in Ontario, Canada harbors a unique gene repertoire. BMC Genom Data 25, 18 (2024). https://doi.org/10.1186/s12863-024-01204-2

Download citation

Received: 16 December 2023
Accepted: 31 January 2024
Published: 15 February 2024
DOI: https://doi.org/10.1186/s12863-024-01204-2

Whole genome sequence of Vibrio cholerae NB-183 isolated from freshwater in Ontario, Canada harbors a unique gene repertoire

Abstract

Objective

Data description

Objective

Data description

Limitations

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics declarations

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

BMC Genomic Data

Contact us

Whole genome sequence of Vibrio cholerae NB-183 isolated from freshwater in Ontario, Canada harbors a unique gene repertoire

Abstract

Objective

Data description

Objective

Data description

Limitations

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics declarations

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomic Data

Contact us