Mercurial > repos > yufei-luo > s_mart
changeset 58:5f5c9b74c2dd
Uploaded
author | m-zytnicki |
---|---|
date | Fri, 07 Feb 2014 11:53:36 -0500 |
parents | 36eaa579fdb6 |
children | 2a4884ba3e5c |
files | SMART/Java/Python/mySql/MySqlExonTable.pyc SMART/Java/Python/mySql/MySqlTranscriptTable.pyc SMART/galaxy/.removeIntrons.xml.swp SMART/galaxy/clusterizeBySlidingWindows.xml SMART/galaxy/removeExonLines.sh SMART/galaxy/removeIntrons.xml commons/core/writer/MySqlTranscriptWriter.pyc commons/core/writer/TranscriptWriter.pyc |
diffstat | 8 files changed, 20 insertions(+), 42 deletions(-) [+] |
line wrap: on
line diff
--- a/SMART/galaxy/clusterizeBySlidingWindows.xml Fri Feb 07 08:29:48 2014 -0500 +++ b/SMART/galaxy/clusterizeBySlidingWindows.xml Fri Feb 07 11:53:36 2014 -0500 @@ -4,25 +4,8 @@ <requirement type="set_environment">PYTHONPATH</requirement> </requirements> <command interpreter="python"> - ../Java/Python/clusterizeBySlidingWindows.py -i $formatType.inputFileName - #if $formatType.FormatInputFileName == 'bed': - -f bed - #elif $formatType.FormatInputFileName == 'gff': - -f gff - #elif $formatType.FormatInputFileName == 'gff2': - -f gff2 - #elif $formatType.FormatInputFileName == 'gff3': - -f gff3 - #elif $formatType.FormatInputFileName == 'sam': - -f sam - #elif $formatType.FormatInputFileName == 'gtf': - -f gtf - #end if - -s $size - -e $overlap - -o $outputFileGff - $normalize - $strands + #set $inputFiles = ",".join(["%s" % (s) for s in $formatType.inputFileName]) + ../Java/Python/clusterizeBySlidingWindows.py -i $inputFiles -f $formatType.FormatInputFileName -s $size -e $overlap -o $outputFileGff $strands #if $OptionTag.tag == "Yes": -g $OptionTag.value @@ -33,14 +16,10 @@ #end if #if $OptionWriteTag.writeTag == "Yes": - -w $OptionWriteTag.value + #set $outputTags = ",".join(["%s" % (t["value"]) for t in $OptionWriteTag.writeTags]) + -w $outputTags #end if - $strand - $plot $plotPng - $excel $excelOutput - - </command> <inputs> @@ -52,31 +31,34 @@ <option value="gff3">gff3</option> <option value="sam">sam</option> <option value="gtf">gtf</option> + <option value="bam">bam</option> </param> <when value="bed"> - <param name="inputFileName" format="bed" type="data" label="Input File"/> + <param name="inputFileName" format="bed" type="data" multiple="true" label="Input File"/> </when> <when value="gff"> - <param name="inputFileName" format="gff" type="data" label="Input File"/> + <param name="inputFileName" format="gff" type="data" multiple="true" label="Input File"/> </when> <when value="gff2"> - <param name="inputFileName" format="gff2" type="data" label="Input File"/> + <param name="inputFileName" format="gff2" type="data" multiple="true" label="Input File"/> </when> <when value="gff3"> - <param name="inputFileName" format="gff3" type="data" label="Input File"/> + <param name="inputFileName" format="gff3" type="data" multiple="true" label="Input File"/> </when> <when value="sam"> - <param name="inputFileName" format="sam" type="data" label="Input File"/> + <param name="inputFileName" format="sam" type="data" multiple="true" label="Input File"/> </when> <when value="gtf"> - <param name="inputFileName" format="gtf" type="data" label="Input File"/> + <param name="inputFileName" format="gtf" type="data" multiple="true" label="Input File"/> + </when> + <when value="bam"> + <param name="inputFileName" format="bam" type="data" multiple="true" label="Input File"/> </when> </conditional> <param name="size" type="text" value="50000" label="Size option" help="Size of the regions."/> <param name="overlap" type="text" value="50" label="Overlap option" help="Overlap between two consecutive regions."/> - <param name="normalize" type="boolean" truevalue="-m" falsevalue="" checked="false" label="Normalize option for only GFF3 file format" help="(only work if the tag nbOccurrences is set)"/> <param name="strands" type="boolean" truevalue="-2" falsevalue="" checked="false" label="Consider the two strands separately"/> <conditional name="OptionTag"> @@ -91,7 +73,6 @@ </when> </conditional> - <conditional name="OptionsOperation"> <param name="operation" type="select" label="combine tag value with given operation"> <option value="Yes">Yes</option> @@ -110,14 +91,15 @@ </when> </conditional> - <conditional name="OptionWriteTag"> - <param name="writeTag" type="select" label="write a new tag in output file"> + <param name="writeTag" type="select" label="use given tag in output file"> <option value="Yes">Yes</option> <option value="No" selected="true">No</option> </param> <when value="Yes"> - <param name="value" type="text" value="nbElements" label="write tag option" help="print the result in the given tag (default usually is 'nbElements')"/> + <repeat name="writeTags" title="Output Tag" min="1"> + <param name="value" type="text" value="nbElements" label="write tag option" help="print the result in the given tag (default usually is 'nbElements')"/> + </repeat> </when> <when value="No"> </when> @@ -132,10 +114,8 @@ <help> Sliding windows are a convenient ways to clusterize data mapped on the genome. There are two important parameters of a sliding window: the size of the window and the size of the overlap. -By default, sliding windows count the number of reads in each window. However, you can basically merge any information which is contained in the tags. You can compute the average, sum, median, max or min of the tags for each window. For instance, every window can contain the average cluster size, if you merge clusters instead of reads. +By default, sliding windows count the number of reads in each window of each input file. However, you can merge any information which is contained in the tags. You can compute the average, sum, median, max or min of the tags for each window. For instance, every window can contain the average cluster size, if you merge clusters instead of reads. The output file is a GFF3 file, where each element is a window. There is a special tag for each window, whose name is **nbElements** if you counted the number of transcripts per sliding window. However, if you performed a **min** (resp. **max**, **sum**, **median**, **average**) operation on the tags **value** of the transcripts, then the tag of the window will be **minValue** (resp. **maxValue**, **sumValue**, **medValue**, **avgValue**). You can also specify the name of your tag (which is actually advised: **nbReadsInSample1** will always be more informative than **nbElements**). - -You also have different option, which can select the *n* % highest regions, or the regions with at least *n* features in it, or even the regions with at least *n* unique features. This last option is useful when you want to cluster the reads which have mapped only once, for instance. </help> </tool>
--- a/SMART/galaxy/removeExonLines.sh Fri Feb 07 08:29:48 2014 -0500 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,2 +0,0 @@ -#!/bin/bash -sed '/exon/d' $1
--- a/SMART/galaxy/removeIntrons.xml Fri Feb 07 08:29:48 2014 -0500 +++ b/SMART/galaxy/removeIntrons.xml Fri Feb 07 11:53:36 2014 -0500 @@ -6,7 +6,7 @@ <command interpreter="python"> ../Java/Python/RemoveExons.py -i $formatType.inputFileName -f $formatType.FormatInputFileName -o $outputFile </command> <inputs> <conditional name="formatType"> - <param name="FormatInputFileName1" type="select" label="query File Format"> + <param name="FormatInputFileName" type="select" label="query File Format"> <option value="bed">bed</option> <option value="gff">gff</option> <option value="gff2">gff2</option>