annotate GetHaplotypesFromPhasedVCF/getHaplotypesFromPhasedVCF.xml @ 12:88748d846a20 draft

planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
author dereeper
date Tue, 14 Aug 2018 08:21:55 -0400
parents c6640c49fd01
children
rev   line source
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
1 <tool id="getHaplotypesFromPhasedVCF" name="Get Haplotypes From Phased VCF" version="2.0.0">
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
2 <description>Get Haplotypes From Phased VCF</description>
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
3
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
4 <requirements>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
5 <requirement type="binary">perl</requirement>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
6 </requirements>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
7 <stdio>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
8 <exit_code range="1:" />
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
9 </stdio>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
10 <command interpreter="perl">
12
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
11 GetHaplotypesFromPhasedVCF.pl $input $output_label &amp;&amp; mv ${output_label}.distinct_haplotypes.txt $output_distinct &amp;&amp; mv ${output_label}.haplo.fas $output_haplo &amp;&amp; mv ${output_label}.distinct_haplotypes.fa $output_distinct_fasta
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
12 </command>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
13 <inputs>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
14 <param type="data" name="input" format="vcf" label="Phased VCF" />
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
15 <param type="text" name="output_label" label="Output_label" value='Haplotypes' />
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
16 </inputs>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
17 <outputs>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
18 <data name="output_distinct" format="txt" label="${output_label}.distinct_haplotypes.txt"/>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
19 <data name="output_haplo" format="fasta" label="${output_label}.haplo.fas"/>
12
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
20 <data name="output_distinct_fasta" format="fasta" label="${output_label}.distinct_haplotypes.fa"/>
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
21 </outputs>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
22 <tests>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
23 <test>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
24 <param name="input" value="getHaplotypesFromPhasedVCF-input.vcf"/>
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
25 <output name="output_distinct" file="getHaplotypesFromPhasedVCF-result.distinct_haplotypes.txt" compare="sim_size" delta="0"/>
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
26 <output name="output_haplo" file="getHaplotypesFromPhasedVCF-result.haplo.fas" compare="sim_size" delta="0"/>
12
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
27 <output name="output_distinct_fasta" file="getHaplotypesFromPhasedVCF-result.distinct_haplotypes.fas" compare="sim_size" delta="0"/>
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
28 </test>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
29 </tests>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
30 <help><![CDATA[
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
31
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
32 .. class:: infomark
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
33
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
34 **Authors** Dereeper Alexis (alexis.dereeper@ird.fr), IRD, South Green platform
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
35
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
36 | **Please cite** "SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations", **Dereeper A. et al.**, Nucl. Acids Res. (1 July 2015) 43 (W1).
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
37
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
38 .. class:: infomark
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
39
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
40 **Galaxy integration** Provided by Southgreen & Andres Gwendoline (Institut Français de Bioinformatique) & Marcon Valentin (IFB & INRA)
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
41
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
42 .. class:: infomark
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
43
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
44 **Support** For any questions about Galaxy integration, please send an e-mail to alexis.dereeper@ird.fr
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
45
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
46 ---------------------------------------------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
47
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
48 ==============================
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
49 Get Haplotypes From Phased VCF
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
50 ==============================
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
51
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
52 -----------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
53 Description
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
54 -----------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
55
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
56 | Extracts the haplotypes of each individual from a phased VCF file and lists the distinct haplotypes found.
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
57
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
58 ----------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
59 Input file
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
60 ----------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
61
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
62 VCF file
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
63 Phased VCF file
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
64
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
65 ---------
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
66 Parameter
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
67 ---------
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
68
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
69 Output file basename
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
70 Prefix for the output files
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
71
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
72 ------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
73 Output files
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
74 ------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
75
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
76
12
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
77 Distinct Haplotypes text file
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
78 File describing distinct haplotypes
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
79
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
80 Fasta file
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
81 Fasta file with haplotypes
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
82
12
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
83 Distinct Haplotypes fasta file
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
84 Fasta file with distinct haplotypes
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
85
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
86
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
87 ---------------------------------------------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
88
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
89 ---------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
90 Working example
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
91 ---------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
92
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
93 Input file
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
94 ==========
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
95
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
96 VCF file
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
97 ---------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
98
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
99 ::
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
100
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
101 ##fileformat=VCFv4.1
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
102 ##FILTER=<ID=LowQual,Description="Low quality">
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
103 ##FORMAT=<ID=AD,Number=.,Type=Integer,Description="Allelic depths for the ref and alt alleles in the order listed">
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
104 [...]
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
105 #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT AZUCENA
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
106 Chr1 4299 . G A . PASS AR2=1;DR2=1;AF=0.168 GT:DS:GP 0|0:0:1,0,0
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
107
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
108
10
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
109 Parameter
c6640c49fd01 planemo upload commit 475f4d7d8442a0d75e103af326ae5881c4d2a4ac
dereeper
parents: 6
diff changeset
110 =========
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
111
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
112 Output name -> haplotypes
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
113
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
114
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
115 Output files
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
116 ============
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
117
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
118 haplotypes.distinct_haplotypes.txt
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
119 ----------------------------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
120
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
121 ::
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
122
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
123 ===Chr10===
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
124 haplo1:2:CIRAD403_1,CIRAD403_2,
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
125 TTTAAGAAATTCCTATATAGGTCTTCTAAGCGTATCTATTAACAT
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
126 haplo2:2:MAHAE_1,MAHAE_2,
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
127 TAAATCTTGGTGCTGATCTGATATTTAATGCGT
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
128
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
129
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
130 haplotypes.haplo.fas
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
131 --------------------
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
132
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
133 ::
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
134
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
135 >Chr10_AZUCENA_1
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
136 TTTAAGAAATTCCTATATAGGTCTTCTAAGCGTATCTATTAACAT
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
137 >Chr10_AZUCENA_2
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
138 TAAATCTTGGTGCTGATCTGATATTTAATGCGT
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
139
12
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
140 haplotypes.distinct_haplotypes.fa
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
141 ----------------------------------
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
142
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
143 ::
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
144
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
145 >haplo1|2
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
146 CAATTTATATATACTTGTATATAACCACAACGAGAGAGTTTTACCT
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
147 TTTATAAAAAATAAATAATGTATTACGGCTAATATAGCAATCTTTT
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
148 AAAATAAATCTATATTTAAATGACTATGGAATTACTAATCACAATA
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
149 ACAGGATCTTGTTATTTTTAGCTTGTGTACTTATAATGATCCGATG
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
150 >haplo2|2
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
151 GCTACTTAAATATCTAGCATTAATCCACAACGAGAGGCTCTTACCT
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
152 TTAAAAAAGGGTCATCGCCTATAGGTTAGATAATCGACACATATAA
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
153 TTATAAGAAATTATATATAATTTTTAATCTAGTTCATTCTTGTGCA
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
154 TCATTATGTTATATAATAATAAACGTAACAAATATTGATACTACTC
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
155
88748d846a20 planemo upload commit 11382afe87364aaafb19973470d5066229a6e34f
dereeper
parents: 10
diff changeset
156
6
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
157 ]]></help>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
158 <citations>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
159 <!-- [HELP] As DOI or BibTex entry -->
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
160 <citation type="bibtex">@article{Dereeper03062015,
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
161 author = {Dereeper, Alexis and Homa, Felix and Andres, Gwendoline and Sempere, Guilhem and Sarah, Gautier and Hueber, Yann and Dufayard, Jean-François and Ruiz, Manuel},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
162 title = {SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
163 year = {2015},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
164 doi = {10.1093/nar/gkv351},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
165 abstract ={SNiPlay is a web-based tool for detection, management and analysis of genetic variants including both single nucleotide polymorphisms (SNPs) and InDels. Version 3 now extends functionalities in order to easily manage and exploit SNPs derived from next generation sequencing technologies, such as GBS (genotyping by sequencing), WGRS (whole genome re-sequencing) and RNA-Seq technologies. Based on the standard VCF (variant call format) format, the application offers an intuitive interface for filtering and comparing polymorphisms using user-defined sets of individuals and then establishing a reliable genotyping data matrix for further analyses. Namely, in addition to the various scaled-up analyses allowed by the application (genomic annotation of SNP, diversity analysis, haplotype reconstruction and network, linkage disequilibrium), SNiPlay3 proposes new modules for GWAS (genome-wide association studies), population stratification, distance tree analysis and visualization of SNP density. Additionally, we developed a suite of Galaxy wrappers for each step of the SNiPlay3 process, so that the complete pipeline can also be deployed on a Galaxy instance using the Galaxy ToolShed procedure and then be computed as a Galaxy workflow. SNiPlay is accessible at http://sniplay.southgreen.fr.},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
166 URL = {http://nar.oxfordjournals.org/content/early/2015/06/03/nar.gkv351.abstract},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
167 eprint = {http://nar.oxfordjournals.org/content/early/2015/06/03/nar.gkv351.full.pdf+html},
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
168 journal = {Nucleic Acids Research}
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
169 }
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
170
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
171 </citation>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
172
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
173 </citations>
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
174
ebb0ac9b6fa9 planemo upload
gandres
parents:
diff changeset
175 </tool>
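
The help text above describes the outputs (one sequence per haplotype copy of each individual, plus groups of identical haplotypes) but not how they are derived from the phased genotypes. The following is a minimal Python sketch of that idea, not the logic of `GetHaplotypesFromPhasedVCF.pl` itself, which is not shown here. The function names are hypothetical, and the sketch assumes tab-separated VCF records, `GT` as the first FORMAT field, diploid `a|b` genotypes, and `N` for missing alleles.

```python
def haplotypes_from_phased_vcf(lines):
    """Return {chrom: {"SAMPLE_1": seq, "SAMPLE_2": seq, ...}} from VCF lines.

    Assumption: every genotype is phased ("a|b"); one allele per variant
    site is appended to each haplotype copy, in file order.
    """
    samples = []
    haplos = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]  # sample names follow the FORMAT column
            continue
        chrom, ref, alt = fields[0], fields[3], fields[4]
        alleles = [ref] + alt.split(",")
        per_chrom = haplos.setdefault(chrom, {})
        for sample, call in zip(samples, fields[9:]):
            gt = call.split(":")[0]  # assumption: GT is the first subfield
            if "|" not in gt:
                raise ValueError(f"unphased genotype {gt!r} for {sample}")
            for copy, idx in enumerate(gt.split("|"), start=1):
                name = f"{sample}_{copy}"
                # assumption: a missing allele (".") is written as N
                base = "N" if idx == "." else alleles[int(idx)]
                per_chrom[name] = per_chrom.get(name, "") + base
    return haplos


def distinct_haplotypes(per_chrom):
    """Group haplotype names by identical sequence, in first-seen order."""
    groups = {}
    for name, seq in per_chrom.items():
        groups.setdefault(seq, []).append(name)
    return groups


if __name__ == "__main__":
    demo = [
        "##fileformat=VCFv4.1",
        "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2",
        "Chr1\t10\t.\tG\tA\t.\tPASS\t.\tGT\t0|1\t0|1",
        "Chr1\t20\t.\tC\tT\t.\tPASS\t.\tGT\t0|1\t0|1",
    ]
    for chrom, per_chrom in haplotypes_from_phased_vcf(demo).items():
        print(f"==={chrom}===")
        groups = distinct_haplotypes(per_chrom)
        for i, (seq, names) in enumerate(groups.items(), start=1):
            print(f"haplo{i}:{len(names)}:{','.join(names)}")
            print(seq)
```

The demo prints groups in a layout modelled on the `distinct_haplotypes.txt` example in the help section; the sample names `S1` and `S2` are made up for illustration.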