comparison gbsx_demultiplexer.xml @ 9:8a457a81f492 draft

Uploaded
author koen
date Wed, 13 Jan 2016 10:51:44 -0500
parents 9dc99f2deb3d
children 32ee036c156a
comparison
8:ac7b4d03ab2f 9:8a457a81f492
1 <tool id="gbsx_demultiplexer" name="GBSX: Demultiplexer" force_history_refresh="True" version="1.1.5">
2 <description>demultiplexes inline barcodes from GBS and RAD experiments</description>
3 <requirements>
4 <requirement type="set_environment">GBSX_PATH</requirement>
5 <requirement type="package" version="1.1.5">GBSX</requirement>
6 </requirements>
7 <command interpreter="sh">
8 #if $singleOrPaired.SR_PE_select=="SR"
9 gbsx_demultiplexer.sh $SR_Read 0 $info_file $barcodeMismatch $enzymeMismatch $rad_gbs_select $demultiplex_stats
10 #else
11 gbsx_demultiplexer.sh $PE_Read1 $PE_Read2 $info_file $barcodeMismatch $enzymeMismatch $rad_gbs_select $demultiplex_stats
12 #end if
13 </command>
14 <inputs>
15 <conditional name="singleOrPaired">
16 <param name="SR_PE_select" type="select" label="How did you sequence?">
17 <option value="SR">Single Read</option>
18 <option value="PE">Paired End</option>
19 </param>
20 <when value="SR">
21 <param name="SR_Read" format="fastq" type="data" label="Single Read Fastq File" />
22 </when>
23 <when value="PE">
24 <param name="PE_Read1" format="fastq" type="data" label="Paired End Fastq File Read 1" />
25 <param name="PE_Read2" format="fastq" type="data" label="Paired End Fastq File Read 2" />
26 </when>
27 </conditional>
28 <param name="info_file" format="tabular" type="data" label="Info File" help="see help below" />
29 <param name="barcodeMismatch" size="4" type="integer" value="1" min="0" max="3" label="Number of mismatches allowed in the barcode:" />
30 <param name="enzymeMismatch" size="4" type="integer" value="1" min="0" max="3" label="Number of mismatches allowed in the enzyme cutsite:" />
31
32
33 <param name="rad_gbs_select" size="4" type="select" label="Experiment type:" help="see help below" >
34 <option value="false">GBS</option>
35 <option value="true">RAD</option>
36 </param>
37 </inputs>
38 <outputs>
39 <data format="tabular" name="demultiplex_stats" label="GBSX: Demultiplexer: Demultiplex stats" />
40 <data format="txt" name="report">
41 <discover_datasets pattern="__designation__" ext="fastq" directory="gbsx_demultiplex" visible="true" />
42 </data>
43 </outputs>
44
45 <tests>
46 <test>
47 <param name="SR_Read" value="gbsx_demultiplex_data.R1.fastq"/>
48 <param name="PE_Read1" value="gbsx_demultiplex_data.R1.fastq"/>
49 <param name="PE_Read2" value="gbsx_demultiplex_data.R2.fastq"/>
50 <param name="info_file" value="gbsx_demultiplex_barcode_list.tsv"/>
51 <param name="barcodeMismatch" value="1"/>
52 <param name="enzymeMismatch" value="1"/>
53 <param name="rad_gbs_select" value="false"/>
54 <output name="demultiplex_stats" file="gbsDemultiplex.stats"/>
55 </test>
56 </tests>
57
58 <help>
59
60 .. class:: infomark
61
62 **Overview**
63
64 Genotyping by Sequencing is an emerging technology for cost-effective variant discovery and genotyping. However, current analysis tools do not fulfill all experimental design and analysis needs.
65
66 GBSX is a package of tools that serves two purposes. First, it aids in experimental design, including the choice of enzymes and the design of barcodes. Second, it provides a first analysis step that demultiplexes samples using inline barcodes, producing fastq files that can easily be plugged into existing variant analysis pipelines.
67
68 ----
69
70 **What it does**
71
72 This toolkit demultiplexes the samples using inline barcodes, for GBS and RAD experiments.
73
74 For the demultiplexing, the original barcode_list.txt file is adjusted: the first column holds the sample names, the second column holds the generated barcodes, and the last column holds the restriction enzyme.
75
76 Only the most used options are included. For more options, please use the command line version of GBSX.
77
78 --------
79
80 **Help**
81
82 This program demultiplexes fastq files obtained from sequencing with inline barcodes, as used in GBS, RAD, and similar protocols.
83
84 You can demultiplex Single Read and Paired End experiments. Select the sequencing method with the 'How did you sequence?' parameter. Paired End libraries must be split into 2 files, following the regular Read1 and Read2 fastq file definitions.
85
86 Info File: this is a tab-delimited file without headings, with three or more columns: sample name (only letters, numbers and underscores are allowed), barcode sequence, enzyme name, second enzyme name (optional, can be an empty string), and second barcode (optional, can be an empty string). This can be the output of the GBSX: Barcode Generator.
87
88 Number of mismatches allowed: permitting mismatches in the barcode or in the enzyme cutsite makes it possible to recover reads that contain sequencing errors.
89
90 Experiment type: GBS is the Genotyping by Sequencing protocol, where the sequenced reads are inlineBarcode-RestrictionEnzyme-insert-RestrictionEnzyme-CommonAdaptor. RAD is the Restriction site Associated DNA protocol, where the sequenced reads are inlineBarcode-RestrictionEnzyme-insert-CommonAdaptor.
91
92 Note: if you want to demultiplex data containing inline barcodes, without restriction enzymes, you should use the RAD parameter and give NA as Restriction Enzyme. The sequenced reads are inlineBarcode-insert-CommonAdaptor.
93
94 The Common Adaptor used is: AGATCGGAAGAGCG
95
96 After the tool has finished running, the history needs to be refreshed.
97
98 ----
99
100 **Created by:**
101
102 Koen Herten,
103
104 Genomics Core Leuven,
105
106 Belgium
107
108 ----
109
110 **Project links:**
111
112 https://github.com/GenomicsCoreLeuven/GBSX
113
114 ----
115
116 **Integrated by:**
117
118 Koen Herten
119
120 Genomics Core
121
122 Center for Human Genetics
123
124 UZ – KU Leuven
125
126 Herestraat 49 PO box 602
127
128 B-3000 Leuven, Belgium
129
130 If you use this tool in Galaxy, please cite:
131
132 </help>
133 <citations>
134 <citation type="doi">10.1186/s12859-015-0514-3</citation>
135
136 <citation type="bibtex">@ARTICLE{Herten2015,
137 author = {Herten, Koen and Hestand, Matthew S. and Vermeesch, Joris R. and Van Houdt, Jeroen KJ},
138 doi = {10.1186/s12859-015-0514-3},
139 issn = {1471-2105},
140 journal = {BMC Bioinformatics},
141 number = {1},
142 pages = {1--6},
143 title = {GBSX: a toolkit for experimental design and demultiplexing genotyping by sequencing experiments},
144 url = {http://dx.doi.org/10.1186/s12859-015-0514-3},
145 volume = {16},
146 year = {2015}
147 }</citation>
148 </citations>
149
150 </tool>
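
The Info File layout described in the help text (tab-delimited, no headings, three required columns plus two optional ones) can be checked before the tool is run. The following is a minimal sketch of such a check and is not part of GBSX: the function name `parse_info_file`, the column keys, and the example rows are hypothetical, and "letters" is assumed here to mean ASCII letters.

```python
import csv
import io
import re

# Per the help text, sample names may contain only letters, numbers and
# underscores. Restricting "letters" to ASCII is an assumption of this sketch.
SAMPLE_NAME = re.compile(r"[A-Za-z0-9_]+")

# Hypothetical key names for the five columns listed in the help text.
COLUMNS = ("sample", "barcode", "enzyme", "second_enzyme", "second_barcode")


def parse_info_file(handle):
    """Parse a headerless, tab-delimited info file into a list of dicts."""
    rows = []
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for line_no, fields in enumerate(reader, start=1):
        if not fields:
            continue  # skip blank lines
        if len(fields) < 3:
            raise ValueError(
                f"line {line_no}: expected at least 3 columns, got {len(fields)}"
            )
        # Columns 4 and 5 are optional; pad with empty strings when absent.
        padded = (fields + ["", ""])[:5]
        if not SAMPLE_NAME.fullmatch(padded[0]):
            raise ValueError(f"line {line_no}: invalid sample name {padded[0]!r}")
        rows.append(dict(zip(COLUMNS, padded)))
    return rows


if __name__ == "__main__":
    # Illustrative rows only; barcodes and sample names are made up.
    example = "sample_1\tACGT\tApeKI\nsample_2\tTGCA\tNA\t\t\n"
    for row in parse_info_file(io.StringIO(example)):
        print(row)
```

The second example row uses NA as the enzyme, which the help text describes as the way to demultiplex inline barcodes without a restriction enzyme. Rows with fewer than three columns or with a sample name containing other characters raise a `ValueError` that names the offending line.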