view combineprobforallelecombination.xml @ 0:70f8259b0b30 draft

Uploaded
author arkarachai-fungtammasan
date Wed, 01 Apr 2015 16:48:58 -0400
parents
children
line wrap: on
line source

<tool id="combineproballelecom" name="Combine probability to generate read profile " version="2.0.0">
  <description>from the same allele combination</description>
  <command interpreter="python2.7">combinedprobforallelecombination.py  $input > $output </command>

  <inputs>
    <param name="input" type="data" label="Select microsatellite length profile" />
 
  </inputs>
  <outputs>
    <data name="output" format="tabular" />
  </outputs>
  <tests>
    <!-- Test data with valid values -->
    <test>
      <param name="input" value="probvalueforhetero_out.txt"/>
      <output name="output" file="combineprob_out.txt"/>
    </test>
    
  </tests>
  <help>


.. class:: infomark

**What it does**

- This tool will combine probability that the allele combination can generated any read profile in the input. This is the last step to calculate probability to detect heterozygous for each allele combination and each depth.

**Citation**

When you use this tool, please cite **Fungtammasan A, Ananda G, Hile SE, Su MS, Sun C, Harris R, Medvedev P, Eckert K, Makova KD. 2015. Accurate Typing of Short Tandem Repeats from Genome-wide Sequencing Data and its Applications, Genome Research**
 
**Input**

The input format is the same as output from **Evaluate the probability of the allele combination to generate read profile** tool.

- Column 1 = location of microsatellite locus. 
- Column 2 = length profile (length of microsatellite in each read that mapped to this location in comma separated format). 
- Column 3 = motif of microsatellite in this locus. The input file can contain more than three column. 
- Column 4 = homozygous/heterozygous label.
- Column 5 = log based 10 of (the probability of homozygous/the probability of heterozygous)
- Column 6 = Allele for most probable homozygous form.
- Column 7 = Allele 1 for most probable heterozygous form.
- Column 8 = Allele 2 for most probable heterozygous form.
- Column 9 = Probability of the allele combination to generate given read profile.
- Column 10 = Number of possible rearrangement of given read profile.
- Column 11 = Probability of the allele combination to generate read profile with any rearrangement (Product of column 9 and column 10)
- Column 12 = Read depth

Only column 2,3,4,7,8,11 were used in calculation. 

**Output**


The output will contain the following header and column
 
- Line 1 header: read_depth	allele	heterozygous_prob	motif
- Column 1 = read depth
- Column 2 = allele combination
- Column 3 = probability to detect heterozygous of that allele combination
- Column 4 = motif




</help>
</tool>