Skip to main content

ONT read assembly of the black rhino genome

Abstract

Objectives

The black rhinoceros (Diceros bicornis) is an endangered mammal for which a captive breeding program is part of the conservation effort. Black rhinos in zoo’s often suffer from chronic infections and heamochromatosis. Furthermore, breeding is hampered by low male fertility. To aid a research project studying these topics, we sequenced and assembled the genome of a captive male black rhino using ONT sequencing data only.

Data description

This work produced over 100 Gb whole genome sequencing reads from whole blood. These were assembled into a 2.47 Gb draft genome consisting of 834 contigs with an N50 of 29.53 Mb. The genome annotation was lifted over from an available genome annotation for black rhino, which resulted in the retrieval of over 99% of gene features. This new genome assembly will be a valuable resource in for conservation genetic research in this species.

Keywords

Black rhinoceros, Diceros bicornis, Genome assembly, Long reads, Whole genome sequencing.

Peer Review reports

Objective

Ongoing habitat loss, overexploitation, climate change and other factors have severely reduced population size of many large mammals. In addition to preventing and reversing these causal factors of biodiversity loss, ex-situ captive breeding programs can be an effective method for boosting small populations. Captive breeding programs are being attempted to augment the wild population of several species and prevent their extinction [1, 2].

Genomic resources play an increasingly important role in conservation-related captive breeding projects [2]. For example, measurement of genetic diversity and relatedness between individuals may inform choice of breeding individuals [1]. Genomic resources may also aid in managing health and fertility of captive animals [2].

The Eastern black rhinoceros Diceros bicornis is critically endangered in the wild. The total number of individuals is estimated at 3,142, many of which live in captivity [3]. A captive breeding program aimed at boosting population size is currently ongoing [4]. Several aspects of this effort benefit from genomic resources, including genetics-informed choice of breeding animals, microbiome research in relation to health issues and the genetics of male fertility. Several genome assemblies are available for black rhino, but these are either from a female or produced using short reads, preventing the study of Y-chromosomal regions that may affect male fertility. We sequenced the genome of a male black rhino using ONT long reads with the aim of generating long contigs that can be assigned to the sex chromosomes.

We present here the results of the sequencing, assembly and annotation. A preliminary identification of sex-chromosomal regions in this genome assembly is outlined in [5].

Data description

A blood sample was collected from the Eastern black rhino ‘Vungu’ held at Blijdorp Zoo (Rotterdam, The Netherlands) during routine medical examination. DNA was extracted from whole blood using the Nanobind CBB kit (Circulomics) following the manufacturer’s instructions. ONT sequencing libraries were prepared using ligation sequencing kit v9 (SQK-LSK109) and v10 (SQK-LSK110) and sequenced on a Minion device with both R9 and R10 flowcells. FAST5 data was basecalled using the Guppy basecaller version 3.3.3. This resulted in 5.7 million reads with an average read length of 18 kb (Table 1, Data set 1, [6]). The sequence reads were assembled de novo using Flye with the parameters --nano-raw = ONT regular reads, pre-Guppy5, −i 2 = number of polishing iterations [7]. This resulted in a draft genome assembly of 2.47 Gb, consisting of 834 contigs with a contig N50 of 29.5 Mb. The draft genome assembly was annotated by lifting over the annotation of accession GCA_013634535.1 [8], which is the only Black rhino genome assembly for which an annotation is currently available. Over 99.9% of gene features were retrieved in the new genome assembly.

Table 1 Overview of data files/data sets

Limitations

The genome assembly is still fragmented and can be further improved using Hi-C, bionano or other techniques.

Data availability

The data described in this Data note can be freely and openly accessed on NCBI SRA under Bioproject ID PRJNA777872 [6], NCBI GenBank under accession number JANTPW000000000 [9] and figshare (https://doi.org/10.6084/m9.figshare.22699801) [10]. Please see Table 1 for details and links to the data.

Abbreviations

NCBI:

National Center for Biotechnology Information

ONT:

Oxford Nanopore Technologies

References

  1. Witzenberger KA, Hochkirch A. Ex situ conservation genetics: a review of molecular studies on the genetic consequences of captive breeding programmes for endangered animal species. Biodivers Conserv. 2011;20:1843–61.

    Article  Google Scholar 

  2. Supple MA, Shapiro B. Conservation of biodiversity in the genomics era. Genome Biol. 2018;19:131.

    Article  PubMed  PubMed Central  Google Scholar 

  3. IUCN Red List. https://www.iucnredlist.org/. Accessed 6 Jul 2023.

  4. Foose TJ, Wiese RJ. Population management of rhinoceros in captivity. Int Zoo Yearbook. 2006;40:174–96.

    Article  Google Scholar 

  5. Kraaijeveld K, Bossers K, Petrusevksi N, Pieterman S, Bruins-van Sonsbeek L, Wittink F. Long-read assembly of the black rhinoceros genome reveals Y-chromosomal sequences. https://doi.org/10.5281/zenodo.8289411.

  6. NCBI Sequence Read Archive. 2021. https://identifiers.org/ncbi/bioproject:PRJNA777872.

  7. Kolmogorov M, Yuan J, Lin Y, Pevzner P. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019;37:540–6.

    Article  CAS  PubMed  Google Scholar 

  8. Moodley Y, Westbury MV, Russo I-RM, Gopalakrishnan S, Rakotoarivelo A, Olsen R-A, et al. Interspecific Gene Flow and the evolution of specialization in Black and White Rhinoceros. Mol Biol Evol. 2020;37:3105–17.

    Article  CAS  PubMed  Google Scholar 

  9. Genbank. https://www.ncbi.nlm.nih.gov/nuccore/JANTPW010000000, 31 juli 2023.

  10. Kraaijeveld K. Annotation Vungu_v1.fa. 2023. https://doi.org/10.6084/m9.figshare.22699801.

Download references

Funding

This project was funded by SIA Raak Publiek (SVB RAAK.PUB06.042). The funders had no role in the design of this study, during its execution, analyses, interpretation of the data, or writing the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

KK analyzed the data and wrote the manuscript, KB performed the genome assembly, NP performed the DNA extraction and sequencing, SP performed the basecalling and annotation liftover, LBR provided samples for the DNA extractions and FW coordinated the project and designed the work. All authors reviewed the manuscript. All authors declare no Competing interest.

Corresponding author

Correspondence to Floyd Wittink.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kraaijeveld, K., Bossers, K., Petrusevski, N. et al. ONT read assembly of the black rhino genome. BMC Genom Data 25, 27 (2024). https://doi.org/10.1186/s12863-024-01214-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12863-024-01214-0