Chromosome-level genome assembly with telomeric repeats at scaffold ends for Rhabdosargus sarba

Sci Data. 2025 Jul 11;12(1):1202. doi: 10.1038/s41597-025-05324-x.

Abstract

Rhabdosargus sarba, the goldlined seabream, is a euryhaline marine fish of great aquaculture potential. Genome sequencing and assembly of R. sarba was carried utilizing a multi-platform sequencing strategy that included long-read sequencing (PacBio HiFi), short-read sequencing (Illumina), and chromatin interaction mapping (Hi-C). The final genome assembly size after scaffolding was 764.59 Mb in 31 scaffolds with an N50 length of 33.98 Mb. Repeat profiling of primary assembly showed that 28.71% of the genome comprises of repeat elements. Gene prediction utilising the evidence from ab initio prediction and transcriptome data revealed 26,913 protein encoding genes and functional annotation and pathway analysis showed their participation in 332 pathways. This genome is an excellent resource for future research on genetic improvement and molecular breeding programmes for R. sarba.

Publication types

  • Dataset

MeSH terms

  • Animals
  • Genome*
  • Repetitive Sequences, Nucleic Acid*
  • Telomere*