0
|
1 <tool id="readdepth2seqdepth" name="Convert informative read depth to sequencing depth" version="1.0.0">
|
|
2 <description>for flank-based mapping of microsatellites</description>
|
|
3 <command interpreter="python2.7">sequencingdepthconversion_G.py $repeatlength $flanksize $readlength $infodepth $probprediction > $output </command>
|
|
4
|
|
5 <inputs>
|
|
6 <param name="repeatlength" type="integer" value="10" label="Repeat length (bp)" />
|
|
7 <param name="flanksize" type="integer" value="20" label="Required flank bases on each side in mapping" />
|
|
8 <param name="readlength" type="integer" value="100" label="Read length (treat all read as single end read)" />
|
|
9 <param name="infodepth" type="integer" value="5" label="Required read depth" />
|
|
10 <param name="probprediction" type="float" value="0.9" label="Proportion of genome that need certain level of read depth" />
|
|
11 </inputs>
|
|
12 <outputs>
|
|
13 <data format="input" name="output" />
|
|
14
|
|
15 </outputs>
|
|
16 <tests>
|
|
17 <!-- Test data with valid values -->
|
|
18 <test>
|
|
19 <param name="repeatlength" value="10"/>
|
|
20 <param name="flanksize" value="20" />
|
|
21 <param name="readlength" value="100" />
|
|
22 <param name="infodepth" value="5" />
|
|
23 <param name="probprediction" value="0.9" />
|
|
24 <output name="output" file="readdepth2seqdepth.out"/>
|
|
25 </test>
|
|
26
|
|
27 </tests>
|
|
28 <help>
|
|
29
|
|
30
|
|
31 .. class:: infomark
|
|
32
|
|
33 **What it does**
|
|
34
|
|
35 This tool is used to convert informative read depth (specified by user) to sequencing depth when the microsatellites is mapped using TRFM pipeline.
|
|
36 The locus specific sequencing depth is the sequencing depth that will make a certain loci have certain read depth based on uniform mapped of read. It is calculated as: ::
|
|
37
|
|
38 yrequired = ( X * L ) / (L - (2F+r-1))
|
|
39
|
|
40 Where X = read depth, L = read length, F = the number of flanked bases required on each flanking regions, r = the expected repeat length of microsatellite of interest.
|
|
41
|
|
42 The genome wide sequencing depth is the sequencing depth that will make certain percentage of genome (e.g. 90 percent or 95 percent) to have certain locus specific sequencing depth. It's calculated using numerical guessing to find smallest lambda that: ::
|
|
43
|
|
44 0.90 (or other proportion specified by user) < = P(Y=0) + P(Y=1) + …+ P(Y=yrequired-1)
|
|
45
|
|
46 P(Y=y) = (lambda^(y) * e ^(-lambda)) /y!
|
|
47
|
|
48 y = specific level of sequencing depth. Lambda = genome wide sequencing depth
|
|
49
|
|
50
|
|
51 **Citation**
|
|
52
|
|
53 When you use this tool, please cite **Fungtammasan A, Ananda G, Hile SE, Su MS, Sun C, Harris R, Medvedev P, Eckert K, Makova KD. 2015. Accurate Typing of Short Tandem Repeats from Genome-wide Sequencing Data and its Applications, Genome Research**
|
|
54
|
|
55
|
|
56 </help>
|
|
57 </tool>
|