Skip to main content

Genome assembly of Ottelia alismoides, a multiple-carbon utilisation aquatic plant

Abstract

Objectives

Ottelia Pers. is in the Hydrocharitaceae family. Species in the genus are aquatic, and China is their centre of origin in Asia. Ottelia alismoides (L.) Pers., which is distributed worldwide, is a distinguishing element in China, while other species of this genus are endemic to China. However, O. alismoides is also considered endangered due to habitat loss and pollution in some Asian countries. Ottelia alismoides is the only submerged macrophyte that contains three carbon dioxide-concentrating mechanisms, i.e. bicarbonate (HCO3) use, crassulacean acid metabolism and the C4 pathway. In this study, we present its first genome assembly to help illustrate the various carbon metabolism mechanisms and to enable genetic conservation in the future.

Data description

Using DNA and RNA extracted from one O. alismoides leaf, this work produced  73.4 Gb HiFi reads,  126.4 Gb whole genome sequencing short reads and  21.9 Gb RNA-seq reads. The de novo genome assembly was 6,455,939,835 bp in length, with 11,923 scaffolds/contigs and an N50 of 790,733 bp. Genome assembly completeness assessment with Benchmarking Universal Single-Copy Orthologs revealed a score of 94.4%. The repetitive sequence in the assembly was 4,875,817,144 bp (75.5%). A total of 116,176 genes were predicted. The protein sequences were functionally annotated against multiple databases, facilitating comparative genomic analysis.

Peer Review reports

Objective

Ottelia Pers., an aquatic plant genus that includes approximately 24 extant species, is the second largest genus in the family Hydrocharitaceae [1, 2]. China is the centre for Ottelia in Asia. There are 10 Ottelia species in China, all of which are endemic, except O. alismoides [1, 2]. Ottelia alismoides (L.) Pers. is an annual or perennial herb that can be submersed or floating in fresh or salt water [1,2,3,4]. It is distributed worldwide, including Africa, Australia and Asia [4]. Molecular phylogeny analysis indicates that O. alismoides is the ancestor of the other Ottelia species in China [1, 2]. Due to the loss and deterioration of aquatic habitats due to anthropogenic activities, it has been endangered in both China and Japan [2, 4]. However, it is listed as a noxious weed in America [5]. One particular property in O. alismoides is that it is the only submerged macrophyte that contains three carbon dioxide-concentrating mechanisms, i.e. bicarbonate (HCO3) use, crassulacean acid metabolism (CAM) and the C4 pathway [6, 7]. It can be used to treat water pollution [3, 8] and has as medicinal value, such as cancer and tuberculosis treatment [3, 9]. Therefore, our work provides a draft genome of O. alismoides to help depict the genetic bases of its different carbon usages and metabolism related to variable biochemical medicines for its conservation, management and utility in the future.

Data description

Leaf samples from one O. alismoides individual planted in the South China Botanical Gard in Guangzhou, China, were collected. For genome assembly and annotation, three sequencing libraries were constructed using total RNA and genomic DNA extracted from the samples. Genomic DNA was extracted using the cetyltrimethylammonium bromide method, and total RNA was extracted using the TRNzol Universal RNA Extraction Kit (Tiangen, Beijing, China). The quality and quantity of DNA/RNA were assessed using the NanoDrop™ One microvolume UV-Vis Spectrophotometer (Thermo Fisher Scientific, California, USA) and gel electrophoresis. The PacBio Sequel II sequencer was used for circular consensus long read whole genomic sequencing (WGS), which is also known as HiFi sequencing. A MGI DNBSEQ-T7 sequencer was used for short-read WGS and RNA sequencing (RNA-seq), both under 150 bp paired-end mode. Using sequencing data, different programmes were applied to perform the analyses. In these analyses, the default parameters of the programmers were used unless otherwise mentioned.

The WGS short reads were trimmed with Sickle v1.33 [10] under the parameters of “-q 30 -l 80”. KmerGenie v1.7044 [11] was then used to estimate the O. alismoides genome size with the trimmed reads under the parameters of “-k 141 --diploid”. After removing adapters in HiFi reads by HiFiAdapterFilt v2.0.0 [12], hifiasm v0.19.6 [13] was used to assemble the O. alismoides genome. Duplicated sequences were further removed by Redundans 0.14a [14] and Purge_dups v1.2.5 [15]. Using RNA-seq data, the assembly was scaffolded with P_RNA_scaffolder [16], and the scaffolds were gap closed by TGS-GapClose 1.2.1 [17]. The completeness of the final assembly was assessed by BUSCO v5.7.0 [18] using the Embryphyta odb10 2020-09-10 database.

The assembly was parsed through RED v2.0 [19] and EDTA v2.1.0 [20] for repeat sequence identification. After combining the RED and EDTA results, the repeated sequences were then soft-masked in the assembly. Braker3 v.3.0.6 [21] was applied for initial gene prediction aided with transcriptome data and reference protein sequences (Data file 1) [22]. The braker results were then input into the Funannotate pipeline v1.8.16 [23] under the “funannotate prediction” command with the parameters “--max_intronlen 1000000”. The predicted genes were functionally annotated against multiple databases using the “funannotate annotate” command.

The sequencing libraries produced  73.4 Gb raw data for HiFi sequencing (Data file 2) [24],  126.4 Gb for WGS short read sequencing (Data file 3) [25] and  21.9 Gb for RNA-seq (Data file 4) [26]. The estimated genome size of O. alismoides was 6,863,432,158 bp, while the assembly was 6,455,939,835 bp with 11,923 scaffolds/contigs (N50 = 790,733 bp) (Data file 5) [27]. The BUSCO assessment indicated a completeness of 94.4% (Data file 6) [28]. EDTA and RED identified 3,695,203,717 bp (57.2%) (Data file 7) [29] and 4,138,710,098 bp (64.1%) (Data file 8) [30] of repetitive sequences, respectively, in the genome. Their combination was 4,875,817,144 bp, accounting for 75.5% of the genome (data file 9) [31]. A total of 116,176 genes were predicted (Data files 10–12) [32,33,34], and their annotation is shown in Data files 13 and 14 [35, 36].

Limitations

The current assembled genome is still fragmented and could be further improved by increasing HiFi sequencing data and combining ultra-long Nanopore sequencing and Hi-C data.

Table 1 Overview of all data files/data sets

Data availability

Raw sequenced reads have been uploaded to the NCBI Sequence Read Archive under accession number SRR27887122 for HiFi reads, SRR27887124 for short-WGS sequencing reads, SRR27887123 for RNA-seq reads, and JAZKJV000000000 for the assembled genome. Please further see Table 1 in the manuscript for details and references of the results of the annotations submitted to figshare.

References

  1. Li Z-Z, Lehtonen S, Martins K, Gichira AW, Wu S, Li W, Hu G-W, Liu Y, Zou C-Y, Wang Q-F, Chen J-M. Phylogenomics of the aquatic plant genus Ottelia (Hydrocharitaceae): implications for historical biogeography. Mol Phylogenet Evol. 2020;152:106939. https://doi.org/10.1016/j.ympev.2020.106939.

    Article  PubMed  Google Scholar 

  2. Li ZZ, Peng S, Wang QF, Li W, Liang SC, Chen JM. (2023) Cryptic diversity of the genus Ottelia in China. Biodiver. Sci. 2023;31:22394. https://doi.org/10.17520/biods.2022394.

  3. Sumithira G, Kavya V, Ashma A, Kavinkumar MC. A review of ethanobotanical and phytopharmacology of Ottelia alismoides (L.). PERS. Int J Res Pharmacol Pharmacotherapeutics. 2017;6(3):302–11. https://doi.org/10.61096/ijrpp.v6.iss3.2017.302-311.

    Article  Google Scholar 

  4. Wagutu GK, Tengwer MC, Jiang W, Li W, Fukuoka G, Wang G, Chen Y. Genetic diversity and population structure of Ottelia alismoides (Hydrocharitaceae), a vulnerable plant in agro-ecosystems of Japan. Glob Ecol Conserv. 2021;28:e01676. https://doi.org/10.1016/j.gecco.2021.e01676.

    Article  Google Scholar 

  5. USDA. https://plants.usda.gov/home/plantProfile?symbol=OTAL. Accessed 1 Dce 2023.

  6. Han S, Xing Z, Li W, Huang W. Response of anatomy and CO-concentrating mechanisms to variable CO in linear juvenile leaves of heterophyllous Ottelia alismoides: comparisons with other leaf types. Environ Exp Bot. 2020;179:104–94. https://doi.org/10.1016/j.envexpbot.2020.104194.

    Article  CAS  Google Scholar 

  7. Wang W, Yuan L, Zhou J, Zhu X, Liao Z, Yin L, Li W, Jiang HS. Inorganic carbon utilization: a target of silver nanoparticle toxicity on a submerged macrophyte. Environ Pollut. 2022;318:120906. https://doi.org/10.1016/j.envpol.2022.120906.

    Article  CAS  PubMed  Google Scholar 

  8. Mullaia P, Vishalib S, Sobiyaa E. Studies on the application of algae biomass as an adsorbent in the treatment of industrial Azadirachtin insecticide wastewater. Desalin Water Treat. 2022;251:7–17. https://doi.org/10.5004/dwt.2022.27888.

    Article  CAS  Google Scholar 

  9. Katoh T. Enantioselective total synthesis of otteliones a and B, novel and powerful antitumor agents from the freshwater plant Ottelia alismoides. Nat Prod Commun. 2013;8(7):973–80.

    CAS  PubMed  Google Scholar 

  10. Joshi NA, Fass JN, Sickle. A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33) [Software]. (2011) https://github.com/najoshi/sickle. Accessed 24 Aug 2022.

  11. Chikhi R, Medvedev P. Informed and automated k-mer size selection for genome assembly. Bioinformatics. 2014;30:31–7. https://doi.org/10.1093/bioinformatics/btt310.

    Article  CAS  PubMed  Google Scholar 

  12. Sim SB, Corpuz RL, Simmonds TJ, Geib SM. HiFiAdapterFilt, a memory efficient read processing pipeline, prevents occurrence of adapter sequence in PacBio HiFi reads and their negative impacts on genome assembly. BMC Genom. 2022;23:157. https://doi.org/10.1186/s12864-022-08375-1.

    Article  Google Scholar 

  13. Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021;18:170–5. https://doi.org/10.1038/s41592-020-01056-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Pryszcz LP, Gabaldón T. Redundans: an assembly pipeline for highly heterozygous genomes. Nucleic Acids Res. 2016;44:e113. https://doi.org/10.1093/nar/gkw294.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Guan DF, McCarthy SA, Wood J, Howe K, Wang YD. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics. 2020;36:2896–8. https://doi.org/10.1093/bioinformatics/btaa025.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Zhu BH, Xiao J, Xue W, Xu G-C, Sun M-Y, Li J-T. P_RNA_scaffolder: a fast and accurate genome scaffolder using paired-end RNA-sequencing reads. BMC Genom. 2018;19:175. https://doi.org/10.1186/s12864-018-4567-3.

    Article  CAS  Google Scholar 

  17. Xu M, Guo L, Gu S, Wang O, Zhang R, Peters BA, Fan G, Liu X, Xu X, Deng L, Zhang Y. TGS-GapCloser: a fast and accurate gap closer for large genomes with low coverage of error-prone long reads. Gigascience. 2020;9(9):giaa094. https://doi.org/10.1093/gigascience/giaa094.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Seppey M, Manni M, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness. Methods Mol Biol. 2019;1962:227–45. https://doi.org/10.1007/978-1-4939-9173-0_14.

    Article  CAS  PubMed  Google Scholar 

  19. Girgis HZ. Red: an intelligent, rapid, accurate tool for detecting repeats de-novo on the genomic scale. BMC Bioinform. 2015;16:227. https://doi.org/10.1186/s12859-015-0654-5.

    Article  Google Scholar 

  20. Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 2019;20:275. https://doi.org/10.1186/s13059-019-1905-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Gabriel L, Brůna T, Hoff KJ, Ebel M, Lomsadze A, Borodovsky M, Stanke M. BRAKER3: fully automated genome annotation using RNA-Seq and protein evidence with GeneMark-ETP, AUGUSTUS and TSEBRA. BioRxiv. 2023. https://doi.org/10.1101/2023.06.10.544449.

  22. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25498324.v1.

    Article  Google Scholar 

  23. Palmer J, Funannotate. Eukaryotic Genome Annotation Pipeline. https://github.com/nextgenusfs/funannotate. Accessed 20 Sep 2022.

  24. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. NCBI Seq Read Archive. 2024. https://identifiers.org/ncbi/insdc.sra:SRR27887122.

  25. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. NCBI Seq Read Archive. 2024. https://identifiers.org/ncbi/insdc.sra:SRR27887124.

  26. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. NCBI Seq Read Archive. 2024. https://identifiers.org/ncbi/insdc.sra:SRR27887123.

  27. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. NCBI Seq Read Archive. 2024. https://identifiers.org/nucleotide:JAZKJV000000000.1.

  28. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25498345.v1.

  29. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499017.v1. Figshare.

    Article  Google Scholar 

  30. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499059.v1. Figshare.

    Article  Google Scholar 

  31. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499077.v1.

    Article  Google Scholar 

  32. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499086.v2.

    Article  Google Scholar 

  33. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499185.v1.

    Article  Google Scholar 

  34. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499230.v1.

    Article  Google Scholar 

  35. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499305.v1.

    Article  Google Scholar 

  36. Wang Z-F, Wu L-F, Chen L, Zhu W-G, Yu E-P, Xu F-X, Cao H-L. Genome assembly of Ottelia alismoides, a multiple-carbon utilization aquatic plant. Figshare. 2024. https://doi.org/10.6084/m9.figshare.25499254.v1.

    Article  Google Scholar 

Download references

Acknowledgements

We thank the reviewers for their time, expertise, and helpful suggestions to improve our manuscript.

Funding

The study is supported by the Key-Area Research and Development Program of Guangdong Province (2022B1111230001) and its sub-project (2022B1111230001-2-5); The Project of Department of Natural Resources of Guangdong Province: Monitoring and Evaluation of Nature Reserves in Guangdong Province; Guangdong Provincial Forestry Bureau Project — Planning of the Provincial Plant Ex Situ Protection System and National Key Protected Plant Ex Situ Protection and Propagation; the National Natural Science Foundation of China (No. 32370406, 31970188).

Author information

Authors and Affiliations

Authors

Contributions

Z-F W collected the samples, generated the sequencing data, analyzed the data and wrote the manuscript. L C, W-G Z, E-P Y, F-X X collected the samples and wrote the manuscript. Z-F W, L-F W and H-L C conceived and designed the project. All of the authors have read and approved the final version of this manuscript.

Corresponding authors

Correspondence to Zheng-Feng Wang or Hong-Lin Cao.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, ZF., Wu, LF., Chen, L. et al. Genome assembly of Ottelia alismoides, a multiple-carbon utilisation aquatic plant. BMC Genom Data 25, 48 (2024). https://doi.org/10.1186/s12863-024-01230-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12863-024-01230-0

Keywords