malszycki/SON_SRRM2_speckles

Description:

Analysis of SRRM2, SON, PRPF8, SRRM1, RBM25, Pinin and Coilin orthologues

Contents:

  • data - input data files
    • genbank - primary sequence dataset in genbank format
    • genbank_final - final sequence dataset in genbank format
    • mobiDB - fasta sequence datasets and MobiDB-Lite results
    • NCBI_ortholog - lists of vertebrate orthologues in .csv format from the NCBI Orthologs resource
    • orthofinder - lists of accession numbers for additional species obtained through OrthoFinder2 analysis
  • results - results in tabular format
  • fig - figures

Notebooks:

  • Data_preparation.ipynb : dataset preparation and length comparison. Vertebrate SRRM2, SON, PRPF8, SRRM1, RBM25, Pinin and Coilin orthologous protein datasets were downloaded from the NCBI Orthologs resource and supplemented with orthologues predicted for invertebrate species using OrthoFinder. To visualize the results, the protein lengths in each dataset were plotted as a swarm plot.

  • Disorder_prediction.ipynb : disorder prediction using IUPred2A and MobiDB-Lite. Results are visualized as heatmaps drawn with Matplotlib in two variants: sorted by phylogeny (TimeTree) and sorted by protein length.
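
Example (illustrative only, not taken from the notebooks): a minimal sketch of how per-residue disorder scores could be arranged into rows for the length-sorted heatmap variant. The function names, the example species labels and scores, and the 0.5 cutoff (the threshold conventionally used for IUPred scores) are all assumptions made for this sketch; the notebooks may do this differently.

```python
from typing import Dict, List, Sequence, Tuple

# Assumption: scores above 0.5 are treated as disordered.
DISORDER_THRESHOLD = 0.5


def disorder_fraction(scores: Sequence[float],
                      threshold: float = DISORDER_THRESHOLD) -> float:
    """Return the fraction of residues whose score exceeds the threshold."""
    if not scores:
        return 0.0
    return sum(score > threshold for score in scores) / len(scores)


def heatmap_rows(per_protein: Dict[str, Sequence[float]],
                 pad: float = float("nan")) -> Tuple[List[str], List[List[float]]]:
    """Sort proteins by length and pad rows to a common width.

    Shorter proteins are right-padded with NaN so that every row has the
    same number of columns, as a rectangular heatmap requires.
    """
    ordered = sorted(per_protein.items(), key=lambda item: len(item[1]))
    width = max((len(scores) for _, scores in ordered), default=0)
    labels = [name for name, _ in ordered]
    matrix = [list(scores) + [pad] * (width - len(scores))
              for _, scores in ordered]
    return labels, matrix


# Hypothetical toy input: species label -> per-residue disorder scores.
example = {
    "species_A": [0.9, 0.8, 0.2, 0.1],
    "species_B": [0.7, 0.3],
    "species_C": [0.6, 0.6, 0.4],
}
labels, matrix = heatmap_rows(example)
```

The resulting `matrix` could then be passed to a plotting call such as `matplotlib.pyplot.imshow`, with `labels` used as row tick labels. Sorting by phylogeny instead would only require replacing the sort key with an externally supplied species order.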
