Resources

A major task of our lab is to provide publicly available tools and resources to study alternative splicing and transcriptomics. We are currently developing and maintaining several tools, organized around the VastDB framework. This framework is especially designed to aid biomedical researchers without a strong computational background. It offers tools and resources to: (i) quantify AS and identify differentially spliced AS events using RNA-seq data (vast-tools), (ii) perform multiple genomic and sequence analyses for investigating AS events (Matt), (iii) assess relative intron splicing order around exons of interest (Insplico), (iv) identify AS events with genomic and regulatory conservation among species (ExOrthist), and (v) help with the biological interpretation of the results, and, ultimately, with the identification of interesting AS events to design wet-lab experiments (VastDB and PastDB).


Summary of the VastDB framework (from Gohr, Mantica et al, 2022)

(i) vast-tools: Vertebrate Alternative Splicing and Transcription Tools is a toolset for profiling and comparing alternative splicing events and gene expression from RNA-Seq data. It is currently available for 30 species, and we are constantly adding new species. Availability: https://github.com/vastgroup/vast-tools.

(ii) Matt: it is a Linux command-line tool-kit for analyzing genomic sequences with focus on the downstream analysis of alternative splicing events. Being a POSIX-style command-line tool-kit, Matt is run on the terminal of any Linux-like system. Availability: https://gitlab.com/aghr/matt.

(iii) Insplico: it is the first standalone tool to quantify inclusion as well as intron splicing order around exons of interest that works with both short and long read sequencing technologies. Availability: https://gitlab.com/aghr/insplico.

(iv) ExOrthist: Nextflow-based software enabling inference of exon homologs and orthogroups, visualization of evolution of exon-intron structures, and assessment of conservation of alternative splicing patterns. ExOrthist evaluates exon sequence conservation and considers the surrounding exon-intron context to derive genome-wide multi-species exon homologies at any evolutionary distance. Availability: https://github.com/biocorecrg/ExOrthist.

(v.a) VastDB: Vertebrate Alternative Splicing and Transcription Data Base provides genome-wide alternative splicing and gene expression profiles for dozens of cell and tissue types and developmental stages for (currently) human, mouse, rat, cow, chicken, zebrafish and fruitfly obtained from hundreds of carefully selected independent experiments profiled in a standardized using vast-tools. The combination of resources will allow the users to easily put their alternative splicing event of interest into a wide regulatory and evolutionary context. Furthermore, VastDB includes multiple features such as impact on protein sequence and domains, evolutionary conservation (through ExOrthist) and automatic primer design for validation. Availability: http://vastdb.crg.eu.

(v.b) PastDB: Plant Alternative Splicing and Transcription Data Base. Sister of VastDB, this resource offers similar data for the plant model Arabidopsis thaliana, including a broad compendium of special datasets related to biotic and abiotic stress. Availability: http://pastdb.crg.eu.