Squalomix

Squalomix - shark genomics and beyond

What is ‘Squalomix’?

Our project Squalomix aims at unveiling the unique evolutionary trajectory of sharks and rays and is sustained by tight collaborations involving an academic sequencing core facility and multiple public aquariums. By means of multimodal omics and molecular evolutionary approaches, it tackles various biological questions including morphological evolution, sensory systems, and reproduction including sex determination. The project overview is presented in this introductory movie prepared for Biodiversity Genomics 2020 Conference held online in October 2020. Very importantly, we don’t stick to performing de novo sequencing and are rather open to collaborating with other projects for contributing complimentary samples and analyses in order to maximize existing resources. Please find more details in our position paper.

For updates, follow us on X.

logo

Why sharks (and rays)?

The taxon Elasmobranchii (sharks and rays) is one of the most long-established evolutionary lineages of vertebrates severely underrepresented in molecular-level research. This limitation is attributable to the long-standing absence of their whole genome sequence information, resulting from the elusiveness and low fecundity of those animals, as well as the large and repetitive genomes. Their peculiar body fluid composition has also hindered routine cell culturing that is required for chromosome analysis which serves as an indispensable reference for genome sequencing.

Access to our products

Data archive gateway
BLAST-based sequence search
NCBI BioProject entry
Gene expression profiles based on published RNA-seq data (cloudy catshark; brownbanded bamboo shark)
Zebra shark genome browser powered by JBrowse2

Citation: Nishimura et al. ‘Squalomix: shark and ray genome analysis consortium and its data sharing platform’ F1000Res, 2022.

Squalomix trailer movie on YouTube

Species

We first chose two egg-laying species, brownbanded bamboo shark (Chiloscyllium punctatum) and cloudy catshark (Scyliorhinus torazame), to facilitate sustainable sampling without sacrifices of adults as well as to fuel laboratory experimentation. Please see this post at Nature Research Community for more details of our species choice. Currently, we are working on >10 more species for transcriptome or/and genome sequencing, and the speices list as of April 2021 is provided below (also, the list as of September 2020 available in the introductory movie linked above).

Status of sequencing data acquisition
T, transcriptome reads
S, illumina short genomic reads
X, 10x Genomics Chromium linked reads
N, Oxford nanopore reads
P, PacBio CLR reads
H, PacBio HiFi reads
C, Hi-C reads (or reads with Hi-C derivatives like Omni-C)
#, Our flagship species

Last updated on July 4, 2024

Species T S X N P H C
#Brownbanded bamboo shark Chiloscyllium punctatum O O   O O O O
Whale shark Rhincodon typus O O O O     O
#Cloudy catshark Scyliorhinus torazame O O       O O
#Zebra shark Stegostoma tigrinum (fasciatum) O O       O O
#Megamouth shark Megachasma pelagios O O O     O O
Blue shark Prionace glauca O O       O O
Short fin mako Isurus oxyrinchus   O       O O
Porbeagle shark Lamna nasus           O  
Basking shark Cetorhinus maximus   O O        
Coral catshark Atelomycterus marmoratus O            
Sandbar shark Carcharhinus plumbeus O            
Ocellate spot skate Okamejei kenojei O            
#Red stingray Hemitrygon akajei O O       O O
Japanese bullhead shark Heterodontus japonicus O            
Zebra bullhead shark Heterodontus zebra O            
Japanese sleeper ray Narke japonica O            
Frilled shark Chlamydoselachus anguineus O         O O
#Brown guitarfish Rhinobatos schlegelii O         O O
Shark ray Rhina ancylostoma O         O  
#Slendertail lanternshark Etmopterus molleri O         O  
Japanese sawshark Pristiophorus japonicus O            
Japanese angelshark Squatina japonica O         O O
Shortspine spurdog Squalus mitsukurii O            
White-bellied softnose skate Rhinoraja longicauda O         O O
Silver chimaera Chimaera phantasma O       O   O
Spotted ratfish Hydrolagus colliei             O
Greenland shark Somniosus microcephalus             O
Yellow-spotted fanray Platyrhina tangi           O O
Goblin shark Mitsukurina owstoni              
Naru eagle ray Aetobatus narutobiei              

This list includes some species for which our collaborators take an initiative in genome sequencing. Also see the species list on Genomes on a Tree (GoaT).

History

The chronological information about our activity before and after the launch of the project is included below. Previously, we reported interim results of our genome analysis on three shark species (brownbanded bamboo shark, cloudy catshark, and whale shark), together with tissue-level gene expression profiles and epigenomic marks of the chromatin regulator CTCF (Hara et al., 2018. Nat. Ecol. Evol. 2: 1761-1771). Please refer to the introductory paragraphs of this article in 2018 for the information about ealirer genomic studies.

Year-Month Event
2024 Oct Presentation about our activity at Virtual Conference Biodiversity Genomics 2024
2024 Oct Presentation about our activity at International Workshop on Chondrichthyan Development and Genomics
2023 Dec Our activity introduced at MBSJ conference in Kobe
2023 Aug Our original paper reporting first elasmobranch sex chromosome DNA sequences (in zebra shark and whale shark) published in Genome Res.
2023 Apr Our activity introduced at on-site conference 9th International Symposium on the Biology of Vertebrate Sex Determination
2023 Mar Our original paper reporting retained egg yolk protein (vitellogenin) repertoires in viviparous sharks published in GBE
2023 Mar Our original paper reporting unique ‘blue-shift’ mechanism of whale shark rhodopsin published in PNAS
2022 Oct Presentation about our activity at Virtual Conference Sharks International 2022
2022 Oct Presentation about our activity at Virtual Conference Biodiversity Genomics 2022
2022 Sep Our position paper published in F1000Res
2022 Feb Project briefing for Japanese researchers and related communities
2022 Jan Squalomix introduced as a part of the PNAS Special Feature for EBP
2021 Sep Presentation about our activity at Virtual Conference Biodiversity Genomics 2021
2021 Jun Release of the preprint of our position paper at Authorea
2021 May Our project introduced in 2021 PacBio Japan Virtual User Group Meeting
2021 Apr Project briefing for (potential) Japanese collaborators working at aquariums
2021 Jan Project briefing for (potential) international collaborators
2020 Nov Squalomix joined Earth BioGenome Project (EBP) as an affiliated project
2020 Nov Presentation about our activity at Virtual Annual Conference of Oceania Chondrichthyan Society (OCS)
2020 Oct Presentation about our activity at Virtual Conference Biodiversity Genomics 2020
2020 Sep The project Squalomix officially launched
2019 Jul Presentation about our activity at on-site American Elasmobranch Society (AES) Meeting in Snowbird
2019 Jun Presentation about our activity at on-site 5th International Whale Shark Conference (IWSC) in Exmouth
2018 Dec Presentation about our activity at on-site biennial meeting for Japanese Society of Elasmobranch Studies
2018 Oct Publication of our first landmark paper for shark genome analysis

Original genome size data

The contents of this section have been moved to the repository C-value.

Original karyotype data

Species Cell source Karyotype Reference
Chiloscyllium punctatum cultured embryonic fibroblasts 2n = 106 Uno et al., 2020
Chiloscyllium plagiosum cultured embryonic fibroblasts 2n = 106 Uno et al., 2020
Rhincodon typus cultured lymphocytes 2n = 102 Uno et al., 2020
Stegostoma tigrinum cultured lymphocytes 2n = 102 Uno et al., 2020

Our technical basis

For our previous publication in 2018, we produced all short read sequence data in-house at the DNA Analysis Facility operated inside the Laboratory for Phyloinformatics, RIKEN BDR. Our genome assembly also employed mate-pair reads produced with our protocol guide ‘iMate’.

We released the ‘iconHi-C’ (inexpensive and controllable Hi-C) protocol for adaptable Hi-C data acquisition (Kadota et al., 2018. GigaScience 9: giz158) and performed Hi-C scaffolding to obtain chromosome-scale sequences for shark genomes. The obtained sequences are validated with original high-fidelity karyotype data provided by our cell culture protocol for elasmobranchs (Uno et al., 2020. Commun. Biol. 3: 652).

Most recently, a technical note for Hi-C scaffolding using YaHS, based on our internal benchmarking by Taiki Niwa, has been made available.

Efforts by other parties

There are other parties working on chondrichthyan genes and genomes and other kinds of molecular data production. Included below are links to some of those efforts.
Entries at NCBI Genomes for the taxon Chondrichthyes
Inventory at GenomeSync for the taxon Chondrichthyes
Published studies focusing at least partly on cartiaginous fishes at PubMed (The search criterion may not be specific enough)
Entries at GoaT (Genomes on a tree)

Other useful resources

Shark References
Eschmeyer’s Catalog of Fishes

Squalomix data use policy

Newly formulated on October 13, 2024

We release our sequence data upon reaching the stage of manuscript preparation based on the generated data. However, if required by funding agencies, we will release the data earlier. In either case, we expect researchers outside the Squalomix consortium to respect our right to first present and publish analyses using the data we generate and release. Exceptions to this policy will be considered when researchers contact us to coordinate collaborations prior to data presentation and a formal agreement for collaboration is reached.

Unlike many genome sequencing consortia, our work is primarily supported by academic, discovery-driven grants, and we have limited personnel dedicated to data finalization and release. Please understand that we do not necessarily release data as openly as other consortia.

There have been numerous instances where genome assemblies of cartilaginous fish released by other consortia, particularly VGP, have been used without proper acknowledgment in publications and preprints, disregarding the policies set by each of those consortia. Our consortium takes such cases seriously and expects the broader cartilaginous fish genomics community to uphold higher standards of data usage ethics.