Version 1.0 #94

renewiegandt · 2019-03-28T14:32:56Z

No description provided.

…es; if compareBed.sh is not executable chmod +x is called

Estimation motifs

… second gtf file

Estimation motifs

msbentsen · 2019-04-07T09:00:22Z

Have you tried to run the test data? :-)

run pipeline.nf --bigwig ./demo/buenrostro50k_chr1_fp.bw --bed ./demo/buenrostro50k_chr1_peaks.bed --genome_fasta ./demo/hg38_chr1.fa --motif_db ./demo/jaspar_vertebrates.meme --out ./demo/buenrostro50k_chr1_out/ --organism hg38
gives me:

N E X T F L O W  ~  version 19.01.0
Launching `pipeline.nf` [adoring_cuvier] - revision: 47924cb4dd

        Usage: nextflow run pipeline.nf --bigwig [BigWig-file] --bed [BED-file] --genome_fasta [FASTA-file] --motif_db [MEME-file] --config [UROPA-config-file]

        Required arguments:
                --bigwig                 Path to BigWig-file
                --bed                    Path to BED-file
                --genome_fasta           Path to genome in FASTA-format
                --motif_db               Path to motif-database in MEME-format
                --config                 Path to UROPA configuration file
                --gtf_annotation        Path to gtf annotation file
                --organism               Input organism [hg38 | hg19 | mm9 | mm10]
                --out                    Output Directory (Default: './out/')

        Optional arguments:

                --help [0|1]            1 to show this help message. (Default: 0)
                --gtf_merged            Path to gtf-file. If path is set the process which creates a gtf-file is skipped.
                --tfbs_path             Path to directory with tfbsscan output. If given tfbsscan will be skipped.

                Footprint extraction:
                --window_length INT     This parameter sets the length of a sliding window. (Default: 200)
                --step INT              This parameter sets the number of positions to slide the window forward. (Default: 100)
                --percentage INT        Threshold in percent (Default: 0)
                --min_gap INT           If footprints are less than X bases apart the footprints will be merged (Default: 6)

                Filter motifs:
                --min_size_fp INT       Minimum sequence length threshold. Smaller sequences are discarded. (Default: 10)
                --max_size_fp INT       Maximum sequence length threshold. Discards all sequences longer than this value. (Default: 200)
                --tfbsscan_method [moods|fimo] Method used by tfbsscan. (Default: moods)

                Cluster:
                Sequence preparation/ reduction:
                --kmer INT              K-mer length (Default: 10)
                --aprox_motif_len INT   Motif length (Default: 10)
                --motif_occurrence FLOAT        Percentage of motifs over all sequences. Use 1 (Default) to assume every sequence contains a motif.
                --min_seq_length Interations    Remove all sequences below this value. (Default: 10)
                Clustering:
                --global INT            Global (=1) or local (=0) alignment. (Default: 0)
                --identity FLOAT        Identity threshold. (Default: 0.8)
                --sequence_coverage INT Minimum aligned nucleotides on both sequences. (Default: 8)
                --memory INT            Memory limit in MB. 0 for unlimited. (Default: 800)
                --throw_away_seq INT    Remove all sequences equal or below this length before clustering. (Default: 9)
                --strand INT            Align +/+ & +/- (= 1). Or align only +/+ (= 0). (Default: 0)

                Motif estimation:
                --min_seq INT           Sets the minimum number of sequences required for the FASTA-files given to GLAM2. (Default: 100)
                --motif_min_key INT     Minimum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 8)
                --motif_max_key INT     Maximum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 20)
                --iteration INT         Number of iterations done by GLAM2. More Iterations: better results, higher runtime. (Default: 10000)
                --tomtom_treshold FLOAT Threshold for similarity score. (Default: 0.01)
                --best_motif INT        Get the best X motifs per cluster. (Default: 3)
                --gap_penalty INT       Set penalty for gaps in GLAM2 (Default: 1000)
                --seed Set seed for GLAM2 (Default: 123456789)
                Moitf clustering:
                --cluster_motif Boolean If 1 pipeline clusters motifs. If its 0 it does not. (Defaul: 0)
                --edge_weight INT       Minimum weight of edges in motif-cluster-graph (Default: 5)
                --motif_similarity_thresh FLOAT Threshold for motif similarity score (Default: 0.00001)

                Creating GTF:
                --tissues List/String   List of one or more keywords for tissue-/category-activity, categories must be specified as in JSON
                                        config
                Evaluation:
                --max_uropa_runs INT     Maximum number UROPA runs running parallelized (Default: 10)
        All arguments can be set in the configuration files

Nextflow log contains:

Apr-07 10:53:03.160 [main] DEBUG nextflow.cli.Launcher - $> nextflow run pipeline.nf --bigwig ./demo/buenrostro50k_chr1_fp.bw --bed ./demo/buenrostro50k_chr1_peaks.bed --genome_fasta ./demo/hg38_chr1.fa --motif_db ./demo/jaspar_vertebrates.meme --out ./demo/buenrostro50k_chr1_out/ --organism hg38
Apr-07 10:53:03.397 [main] INFO  nextflow.cli.CmdRun - N E X T F L O W  ~  version 19.01.0
Apr-07 10:53:03.438 [main] INFO  nextflow.cli.CmdRun - Launching `pipeline.nf` [adoring_cuvier] - revision: 47924cb4dd
Apr-07 10:53:03.470 [main] DEBUG nextflow.config.ConfigBuilder - Found config local: /mnt/agnerds/mette.bentsen/masterJLU2018/nextflow.config
Apr-07 10:53:03.471 [main] DEBUG nextflow.config.ConfigBuilder - Parsing config file: /mnt/agnerds/mette.bentsen/masterJLU2018/nextflow.config
Apr-07 10:53:03.559 [main] DEBUG nextflow.config.ConfigBuilder - Applying config profile: `standard`
Apr-07 10:53:04.656 [main] DEBUG nextflow.Session - Session uuid: c9cc6bb9-e1ff-44b2-a947-31a9c0c98840
Apr-07 10:53:04.657 [main] DEBUG nextflow.Session - Run name: adoring_cuvier
Apr-07 10:53:04.659 [main] DEBUG nextflow.Session - Executor pool size: 64
Apr-07 10:53:04.743 [main] DEBUG nextflow.cli.CmdRun - 
  Version: 19.01.0 build 5050
  Modified: 22-01-2019 11:19 UTC (12:19 CEST)
  System: Linux 4.9.0-8-amd64
  Runtime: Groovy 2.5.5 on OpenJDK 64-Bit Server VM 10.0.2+13
  Encoding: UTF-8 (UTF-8)
  Process: 10062@KI-V0290 [172.16.12.72]
  CPUs: 64 - Mem: 62.9 GB (18.2 GB) - Swap: 4 GB (4 GB)
Apr-07 10:53:04.821 [main] DEBUG nextflow.Session - Work-dir: /mnt/agnerds/mette.bentsen/masterJLU2018/work [cifs]
Apr-07 10:53:05.206 [main] DEBUG nextflow.Session - Session start invoked
Apr-07 10:53:05.216 [main] DEBUG nextflow.processor.TaskDispatcher - Dispatcher > start
Apr-07 10:53:05.217 [main] DEBUG nextflow.script.ScriptRunner - > Script parsing
Apr-07 10:53:06.454 [main] DEBUG nextflow.script.ScriptRunner - > Launching execution
Apr-07 10:53:06.482 [main] INFO  nextflow.Nextflow - 
	Usage: nextflow run pipeline.nf --bigwig [BigWig-file] --bed [BED-file] --genome_fasta [FASTA-file] --motif_db [MEME-file] --config [UROPA-config-file]

	Required arguments:
		--bigwig		 Path to BigWig-file
		--bed			 Path to BED-file

renewiegandt · 2019-04-12T11:49:53Z

Have you tried to run the test data? :-)

No... I added a new file to the demo folder and added the new required parameter to the demo run/call. It should be working now.

Estimation motifs

Added nextflwo to masterenv.yml

Estimation motifs

renewiegandt added 30 commits January 23, 2019 13:15

Update parameter names in create_gtf.config

0f88bad

Update parameter names in footprint_extraction.config

4bef518

Update parameter names in moitif_estimation.config

0f0aa58

pipeline.nf: Added check for unknown parameters; update parameter nam…

7e90339

…es; if compareBed.sh is not executable chmod +x is called

Removed debugging code

e2494a0

Merge pull request #91 from loosolab/estimation_motifs

987bf7f

Estimation motifs

get_motif_seq.R: Removed redundant code

170ca1e

Rename get_motif_seq.R

565d4b8

pipeline.nf: changed output dir of logs

e470bed

compareBed.sh: rename stats to log

b931ea7

Added venn.R to 3.2_evaluation

35d6f0c

Update uropa.config

98654e1

Added uropa to the enviroment yaml

2186cc6

Added uropa and venn process; mergeing gtf files; added parameter for…

3793c70

… second gtf file

Added yaml-file for meme-suite enviroment

1a95d4c

Added meme_env parameter

011b99a

Fixed channel for venn; added conda meme-env

10a3df3

added parameter gap penalty to 2.2config

4c08eeb

merge_similar_clusters.R: bugfix

9bf3ed9

fixed label_cluster; minor changes

50ec488

merge_similar_clusters: bugfix; improved cluster plot

e11e0af

pipeline: added uropa summary to output

a46476f

merge_similar_clusters: fixed filenames

51dcfc0

pipeline: Added seed for glam2

f1eb5f1

Added skript png_to_pdf

c034878

Better output directory names

fae46c6

Minor formating changes of output file

cdc73a2

Update parameters in README

89ae057

Added new parameters to config files

f8c572b

Minor changes

5d9fb6c

renewiegandt added 6 commits March 26, 2019 17:33

Minor changes

07d492c

Merge pull request #92 from loosolab/estimation_motifs

d6b44ea

Estimation motifs

Renamed log file

183483f

Update: new parameters

9b5cd47

sorting gtf; bug fix; renaming gtf parameter

bce6e1b

Merge pull request #93 from loosolab/estimation_motifs

8f9fbee

Estimation motifs

renewiegandt requested a review from msbentsen April 5, 2019 10:40

Added TODO

2e14753

Update test run

5caf87e

renewiegandt added 13 commits April 12, 2019 14:44

Improved error handling if parameter is missing

f210c5e

Merge branch 'dev' into estimation_motifs

66c11ff

Added error message for missing required parameters

3caf997

Reworked check on required parameters

1b4d9c3

Merge pull request #95 from loosolab/estimation_motifs

a58cc18

Estimation motifs

Added nextflwo to masterenv.yml

8069aba

Merge pull request #96 from loosolab/estimation_motifs

3ef6756

Added nextflwo to masterenv.yml

Add fixed channel for jellyfish package

f7da052

Update documentation for installation

99cefb6

Merge pull request #97 from loosolab/estimation_motifs

a3f074c

Estimation motifs

Update README.md

bc25457

Update README.md

a36e060

Update README.md

db3c470

Version 1.0 #94

Version 1.0 #94

renewiegandt commented Mar 28, 2019

msbentsen commented Apr 7, 2019

renewiegandt commented Apr 12, 2019

Version 1.0 #94

Are you sure you want to change the base?

Version 1.0 #94

Conversation

renewiegandt commented Mar 28, 2019

msbentsen commented Apr 7, 2019

renewiegandt commented Apr 12, 2019