# HG changeset patch # User iuc # Date 1525435190 14400 # Node ID 04f5c3d7448ed3157454fb7b46aa76f61312e823 # Parent bdfc22c1c3e366cb896c473451dc146c9a38a79e planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/repeat_masker commit cbba6947f1380751e1db3fa5b043af630523fd86 diff -r bdfc22c1c3e3 -r 04f5c3d7448e repeatmasker.xml --- a/repeatmasker.xml Wed May 02 20:18:11 2018 -0400 +++ b/repeatmasker.xml Fri May 04 07:59:50 2018 -0400 @@ -1,4 +1,4 @@ - + RepeatMasker @@ -11,7 +11,7 @@ export REPEATMASKER_LIB_DIR=\$(pwd)/lib && for file in \$(ls \$RM_LIB_PATH) ; do ln -s \$RM_LIB_PATH/\$file lib/\$file ; done && #if $repeat_source.source_type == "repbase": - cp '${repeat_source.repbase_file}' lib/RMRBSeqs.embl && + cp '${repeat_source.repbase_file}' 'lib/${repeat_source.repbase_file_name}' && #end if ln -s '${input_fasta}' rm_input.fasta && RepeatMasker -dir \$(pwd) @@ -26,31 +26,31 @@ #end if #end if -parallel \${GALAXY_SLOTS:-1} - '${gff}' - '${ignore_n_stretches}' - '${advanced.is_only}' - '${advanced.is_clip}' - '${advanced.no_is}' - '${advanced.rodspec}' - '${advanced.primspec}' - '${advanced.nolow}' - '${advanced.noint}' - '${advanced.norna}' - '${advanced.alu}' - '${advanced.div}' - '${advanced.search_speed}' - '${advanced.frag}' - '${advanced.maxsize}' - #if $advanced.gc is not None: - '${advanced.gc}' + ${gff} + ${excln} + ${advanced.is_only} + ${advanced.is_clip} + ${advanced.no_is} + ${advanced.rodspec} + ${advanced.primspec} + ${advanced.nolow} + ${advanced.noint} + ${advanced.norna} + ${advanced.alu} + ${advanced.div} + ${advanced.search_speed} + -frag ${advanced.frag} + ## -maxsize ${advanced.maxsize} + #if str($advanced.gc): + -gc ${advanced.gc} #end if - '${advanced.gccalc}' - '${advanced.nocut}' - '${advanced.keep_alignments}' - '${advanced.invert_alignments}' - '${advanced.xout}' - '${advanced.xsmall}' - '${advanced.poly}' + ${advanced.gccalc} + ${advanced.nocut} + ${advanced.keep_alignments} + ${advanced.invert_alignments} + ${advanced.xout} + ${advanced.xsmall} + ${advanced.poly} rm_input.fasta && #if $advanced.is_only != '-is_only': mv rm_input.fasta.masked '${output_masked_genome}' && @@ -79,6 +79,7 @@ + @@ -126,7 +127,7 @@ - +
@@ -145,8 +146,9 @@ - - + + + @@ -203,6 +205,17 @@ + + + + + + + + + + + =20 X/Ns in query were excluded in % calcs + The query species was assumed to be homo RepeatMasker Combined Database: Dfam_Consensus-20170127 diff -r bdfc22c1c3e3 -r 04f5c3d7448e test-data/small_repbase.fasta.log --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/small_repbase.fasta.log Fri May 04 07:59:50 2018 -0400 @@ -0,0 +1,10 @@ +SW score % div. % del. % ins. query sequence pos in query: begin end (left) repeat class/family pos in repeat: begin end (left) ID + +18 0.0 0.0 0.0 scaffold_1 613 632 (13588) (GT)n Simple_repeat 1 20 (0) 1 +16 18.3 2.2 2.2 scaffold_1 780 824 (13396) (ATAATA)n Simple_repeat 1 45 (0) 2 +12 23.9 4.5 0.0 scaffold_1 2231 2274 (11946) (CAGA)n Simple_repeat 1 46 (0) 3 +15 18.4 10.2 0.0 scaffold_1 4853 4901 (9319) (TC)n Simple_repeat 1 54 (0) 4 +13 19.1 1.8 7.7 scaffold_1 6230 6284 (7936) (TAATTAA)n Simple_repeat 1 52 (0) 5 +15 28.3 0.0 3.5 scaffold_1 6548 6606 (7614) (GACA)n Simple_repeat 1 57 (0) 6 +67 2.9 1.4 0.0 scaffold_1 11981 12050 (2170) (CT)n Simple_repeat 1 71 (0) 7 +19 15.4 2.8 0.0 scaffold_1 12078 12113 (2107) (CT)n Simple_repeat 1 37 (0) 7 diff -r bdfc22c1c3e3 -r 04f5c3d7448e test-data/small_repbase.fasta.stats --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/small_repbase.fasta.stats Fri May 04 07:59:50 2018 -0400 @@ -0,0 +1,60 @@ +================================================== +file name: rm_input.fasta +sequences: 1 +total length: 14220 bp (14220 bp excl N/X-runs) +GC level: 39.94 % +bases masked: 378 bp ( 2.66 %) +================================================== + number of length percentage + elements* occupied of sequence +-------------------------------------------------- +Retroelements 0 0 bp 0.00 % + SINEs: 0 0 bp 0.00 % + Penelope 0 0 bp 0.00 % + LINEs: 0 0 bp 0.00 % + CRE/SLACS 0 0 bp 0.00 % + L2/CR1/Rex 0 0 bp 0.00 % + R1/LOA/Jockey 0 0 bp 0.00 % + R2/R4/NeSL 0 0 bp 0.00 % + RTE/Bov-B 0 0 bp 0.00 % + L1/CIN4 0 0 bp 0.00 % + LTR elements: 0 0 bp 0.00 % + BEL/Pao 0 0 bp 0.00 % + Ty1/Copia 0 0 bp 0.00 % + Gypsy/DIRS1 0 0 bp 0.00 % + Retroviral 0 0 bp 0.00 % + +DNA transposons 0 0 bp 0.00 % + hobo-Activator 0 0 bp 0.00 % + Tc1-IS630-Pogo 0 0 bp 0.00 % + En-Spm 0 0 bp 0.00 % + MuDR-IS905 0 0 bp 0.00 % + PiggyBac 0 0 bp 0.00 % + Tourist/Harbinger 0 0 bp 0.00 % + Other (Mirage, 0 0 bp 0.00 % + P-element, Transib) + +Rolling-circles 0 0 bp 0.00 % + +Unclassified: 0 0 bp 0.00 % + +Total interspersed repeats: 0 bp 0.00 % + + +Small RNA: 0 0 bp 0.00 % + +Satellites: 0 0 bp 0.00 % +Simple repeats: 7 378 bp 2.66 % +Low complexity: 0 0 bp 0.00 % +================================================== + +* most repeats fragmented by insertions or deletions + have been counted as one element + Runs of >=20 X/Ns in query were excluded in % calcs + + +The query species was assumed to be anopheles genus +RepeatMasker Combined Database: Dfam_Consensus-20170127 + +run with rmblastn version 2.2.27+ +