Skip to content
forked from loosolab/UROPA

Development repository of the Universal RObust Peak Annotator

License

Notifications You must be signed in to change notification settings

kaischmid/UROPA

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code
This branch is 22 commits behind loosolab:master.

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
November 14, 2016 14:23
January 30, 2019 09:21
January 30, 2019 13:32
January 30, 2019 13:32
January 30, 2019 13:32
September 7, 2017 14:48
November 14, 2016 14:18
October 27, 2016 09:02
January 30, 2019 13:32

Universal RObust Peak Annotator

Anaconda-Server Badge Docker Badge PyPI version

The Universal RObust Peak Annotator (UROPA) is a command line based tool, intended for genomic region annotation. Based on a configuration file, different target features can be prioritized with multiple integrated queries. These can be sensitive for feature type, distance, strand specificity, feature attributes (eg. protein_coding) or the anchor position relative to the feature. UROPA can incorporate reference annotation files (GTF) from different sources, like Gencode, Ensembl, or RefSeq, as well as custom reference files produced by the user.

Features

  • Detect the most appropriate annotation with flexible parameter keys that allow robustness and simple customization, such as

    • feature type
    • feature anchor
    • feature direction relative to peak location
    • filter for attribute values, e.g. “protein_coding”
    • strand specificity
  • Utilization of all available GTF files as annotation database

  • One run with variable sets of parameters by multiple queries

  • Graduated annotation due to priorization

  • Different easily-readable output tables (allhits, finalfits, besthits).

  • Visual summary for annotation evaluation

  • Preparation of custom annotation files

Documentation

A detailed description of how to apply UROPA to your data can be found here.

Installation and Command-line usage

Conda package manager

We recommend to install UROPA using the conda package manager. Make sure to have conda installed, e.g. via

  • Miniconda
    • download the Miniconda installer for Python 3
    • run bash Miniconda3-latest-Linux-x86_64.sh to install Miniconda
    • Answer the question "Do you wish the installer to prepend the Miniconda install location to PATH in your /home/.../.bashrc ?" with yes OR do PATH=dir/to/miniconda3:$PATH after installation process

The UROPA installation is now as easy as

conda create --name uropa

conda activate uropa

conda install python uropa.

Biocontainers / Docker

If you have a running Docker environment, you can pull a biocontainer with UROPA and all dependencies via

  • docker pull quay.io/biocontainers/uropa:latest_tag using the latest tag from the taglist, e.g. 1.2.1--py27r3.3.2_0
  • docker pull loosolab/uropa

Installation from source

You can also install UROPA from the source PyPI package. Note that this comes without the R dependencies for auxillary scripts:

pip install uropa

To fulfill all other dependencies, follow the instructions below:

  • R/Rscript (v3.3.0 or higher; follow instructions on url)
    • install required packages step by step:
     install.packages(c("ggplot2", "devtools", "gplots", "gridExtra", "jsonlite", "VennDiagram", "getopt", "tidyr", "UpSetR"))
     source("https://bioconductor.org/biocLite.R")
     biocLite(c("RBGL", "graph"))

In order to plot the Chow-Ruskey plot with uropa_summary.R, install the modified Vennerable package from our fork:

library(devtools)
install_github("jenzopr/Vennerable")

Usage

To effectively use UROPA, make yourself familiar with the command-line options:

Command-line usage

$ uropa                   
Usage: uropa [options]          

Available options:

	-h, --help             	print this help message and further details on the configuration file
	-i, --input            	filename of configuration file [mandatory]
	-p, --prefix           	prefix for output files, can include subdirectories [basename of --input]
	-r, --reformat         	create an additional compact and line-reduced table as result file
	-s, --summary          	additional visualisation of results in graphical format will be created
	-t n, --threads n      	multiprocessed run: n = number of threads to run annotation process
	-add-comments          	show comment lines in output files explaining the columns
	-l, --log              	log file name for messages and warnings
	-d, --debug            	print verbose messages (for debugging purposes)
	-v, --version          	print the version and exit

Biocontainer usage

Running UROPA from a docker container can be done using the following command:

sudo docker run --rm -v <path-to-input-files-on-HOST>:<path-to-container-mnt> UROPA:LATEST uropa <UROPA-Paramters> -p <path-to-container-mnt>/'your-file-prefix'

-v parameter mounts a HOST folder into your docker CONTAINER. This folder should contain the input files for UROPA and also the result files will be stored here. No files will be stored in the container!

--rm removes/closes the container after the run

Make sure to use the uropa -p option specifying the output directory and prefix, otherwise results are lost in the container environment.

How to cite

Kondili M, Fust A, Preussner J, Kuenne C, Braun T, and Looso M. UROPA: a tool for Universal RObust Peak Annotation. Scientific Reports 7 (2017), doi: 10.1038/s41598-017-02464-y

Contribute

Support

If you have any questions please feel free to contact Mario Looso (mario.looso@mpi-bn.mpg.de).

License

The project is licensed under the MIT License.

About

Development repository of the Universal RObust Peak Annotator

Resources

License

Stars

Watchers

Forks

Languages

  • Python 68.7%
  • R 31.3%