annotate snpeff_to_peptides.xml @ 1:80eff5812b2a default tip

Check for both classic and sequence_ontology name for EFF: "NON_SYNONYMOUS_CODING" or "missense_variant"
author Jim Johnson <jj@umn.edu>
date Thu, 06 Nov 2014 13:57:31 -0600
parents fcb7188fa0d2
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
1 <?xml version="1.0"?>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
2 <tool id="snpeff_to_peptides" name="SnpEff to Peptide fasta" version="0.0.1">
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
3 <description> to create a Search DB fasta for variant SAP peptides</description>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
4 <command interpreter="python">snpeff_to_peptides.py --input "$snpeff_vcf" --protein_fasta "$all_pep_fasta" --output "$peptide_variant_fasta"
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
5 #if $leading_aa_num:
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
6 --leading_aa_num $leading_aa_num
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
7 #end if
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
8 #if $trailing_aa_num:
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
9 --trailing_aa_num $trailing_aa_num
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
10 #end if
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
11 </command>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
12 <inputs>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
13 <param name="snpeff_vcf" type="data" format="vcf" label="SnpEff generated VCF file with NON_SYNONYMOUS_CODING annotations"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
14 <param name="all_pep_fasta" type="data" format="fasta,tabular" label="Ensembl all_pep.fa"
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
15 help="An Ensembl all_pep.fa file corresponding to the genome build used for SnpEff (May be converted to tabular fasta format)"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
16 <param name="leading_aa_num" type="integer" value="30" min="0" optional="true" label="Preceeding AAs"
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
17 help="The number of Amino Acids to include before the variant position (leave blank to include all)"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
18 <param name="trailing_aa_num" type="integer" value="30" min="0" optional="true" label="Following AAs"
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
19 help="The number of Amino Acids to include after the variant position (leave blank to include all)"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
20 </inputs>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
21 <stdio>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
22 <exit_code range="1:" level="fatal" description="Error" />
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
23 </stdio>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
24 <outputs>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
25 <data name="peptide_variant_fasta" metadata_source="all_pep_fasta" format="fasta"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
26 </outputs>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
27 <tests>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
28 <test>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
29 <param name="snpeff_vcf" value="snpeff.vcf" ftype="vcf" dbkey="hg19"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
30 <param name="all_pep_fasta" value="all_pep.fa" ftype="fasta" dbkey="hg19"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
31 <param name="leading_aa_num" value="10"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
32 <param name="trailing_aa_num" value="10"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
33 <output name="peptide_variant_fasta" file="peptides_10_10.fa"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
34 </test>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
35 <test>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
36 <param name="snpeff_vcf" value="snpeff.vcf" ftype="vcf" dbkey="hg19"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
37 <param name="all_pep_fasta" value="all_pep.tabular" ftype="tabular" dbkey="hg19"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
38 <param name="leading_aa_num" value="10"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
39 <param name="trailing_aa_num" value="10"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
40 <output name="peptide_variant_fasta" file="peptides_10_10.fa"/>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
41 </test>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
42 </tests>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
43 <help>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
44 **SnpEff to Peptide Fasta**
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
45
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
46 This generates a fasta file of peptide sequences with SAPs ( Single Amino acid Polymorphisms )
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
47 from the NON_SYNONYMOUS_CODING EFF annnotations from the SnpEff_ application.
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
48 The SnpEff VCF may be filtered or annotated using SnpSift.
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
49
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
50 The following is appended to the fasta ID line: snp_location:chr:position codon_change:nnn/nnn sap:AposA
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
51
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
52 For VCF entry::
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
53
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
54 chr1 22846709 . G A 9.31 . DP=2;VDB=0.0174;AF1=1;AC1=2;DP4=0,0,1,1;MQ=20;FQ=-33;EFF=NON_SYNONYMOUS_CODING(MODERATE|MISSENSE|Gtg/Atg|V885M|1127|ZBTB40|protein_coding|CODING|ENST00000374651|12|1) PL 40,6,0
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
55
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
56 The peptide fasta entry that matches transcript ID: ENST00000374651 would be::
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
57
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
58 >ENSP00000363782 pep:known chromosome:GRCh37:1:22778472:22853855:1 gene:ENSG00000184677 transcript:ENST00000374651 gene_biotype:protein_coding transcript_biotype:protein_coding
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
59
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
60 The ID of the output peptide fasta ID would be::
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
61
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
62 >ENSP00000363782 pep:sap chromosome:GRCh37:1:22778472:22853855:1 gene:ENSG00000184677 transcript:ENST00000374651 gene_biotype:protein_coding transcript_biotype:protein_coding snp_location:chr1:22846709 codon_change:Gtg/Atg sap:V885M
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
63
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
64
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
65 .. _SnpEff: http://snpeff.sourceforge.net/index.html
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
66
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
67 **Citation**
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
68
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
69 SnpEff citation:
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
70 "A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.", Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. Fly (Austin). 2012 Apr-Jun;6(2):80-92. PMID: 22728672 [PubMed - in process]
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
71
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
72 SnpSift citation:
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
73 "Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift", Cingolani, P., et. al., Frontiers in Genetics, 3, 2012.
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
74
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
75 </help>
fcb7188fa0d2 Uploaded
jjohnson
parents:
diff changeset
76 </tool>