From dad2adb8e0bac7435ec0de429a5fa7efe5d0a262 Mon Sep 17 00:00:00 2001 From: Michal Malszycki Date: Sun, 26 Apr 2020 23:36:49 +0200 Subject: [PATCH] Update README.md --- README.md | 20 ++++++++++---------- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/README.md b/README.md index 1832e11..c59f7dc 100644 --- a/README.md +++ b/README.md @@ -2,17 +2,17 @@ Analysis of SRRM2, SON, PRPF8, SRRM1, RBM25, Pinin and Coilin orthologues # Contents: -- data - generally data files - - genbank - primary sequence dataset in genbank format - - genbank_final - final sequence dataset in genbank format - - mobiDB - fasta sequence datasets and MobiDB-Lite results - - NCBI_ortholog - vertebrate orthologues lists in .csv from NCBI ortholog resource - - orthofinder - list of accession numbers for additional species obtained through Orthofinder2 analysis -- results - results in tabular format -- fig - figures +- `data` - generally data files + - `genbank` - primary sequence dataset in genbank format + - `genbank_final` - final sequence dataset in genbank format + - `mobiDB` - fasta sequence datasets and MobiDB-Lite results + - `NCBI_ortholog` - vertebrate orthologues lists in .csv from NCBI ortholog resource + - `orthofinder` - list of accession numbers for additional species obtained through Orthofinder2 analysis +- `results` - results in tabular format +- `fig` - figures Notebooks: -- Data_preparation.ipynb : dataset preparation and length comparison. Vertebrate SRRM2, SON, PRPF8, SRRM1, RBM25, Pinin and Coilin orthologous protein datasets were downloaded from NCBI’s orthologs and supplemented with orthologues predicted for invertebrate species using OrthoFinder. To visualize the results, protein lengths in each dataset were plotted on swarmplot. +- `Data_preparation.ipynb` : dataset preparation and length comparison. Vertebrate SRRM2, SON, PRPF8, SRRM1, RBM25, Pinin and Coilin orthologous protein datasets were downloaded from NCBI’s orthologs and supplemented with orthologues predicted for invertebrate species using OrthoFinder. To visualize the results, protein lengths in each dataset were plotted on swarmplot. -- Disorder_prediction.ipynb: Disorder prediction using IUPred2a and MobiDB-Lite. Visualized using Matplotlib's heatmap in two variants: sorted by phylogeny (TimeTree) and sorted by protein length. +- `Disorder_prediction.ipynb` : Disorder prediction using IUPred2a and MobiDB-Lite. Visualized using Matplotlib's heatmap in two variants: sorted by phylogeny (TimeTree) and sorted by protein length.