annotate README.rst @ 0:3a11830963e3 default tip

Initial upload
author Jim Johnson <jj@umn.edu>
date Mon, 17 Mar 2014 15:59:57 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
1 This package contains a Galaxy workflow for the detection and incorporation of single amino acid polymorphism (SAP) into a custom proteomics search database.
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
2
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
3 The workflow aligns RNA-Seq reads to the organism's genome using Tophat, call SNPs using SAMtools mpileup command, and annotates the SNPs that reside within protein-coding regions using SNPeff.
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
4 Galaxy tool "SNPeff to Peptide Fasta" generates peptide sequences with the polymorhisms.
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
5
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
6 See http://www.galaxyproject.org for information about the Galaxy Project.
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
7
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
8
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
9 Availability
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
10 ============
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
11
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
12 This workflow is available to download and/or install from the main
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
13 Galaxy Tool Shed:
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
14
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
15 http://toolshed.g2.bx.psu.edu/view/galaxyp/proteomics_rnaseq_sap_db_workflow
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
16
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
17
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
18 Reference Data
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
19 ==============
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
20
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
21 For Human RNAseq data this workflow was tested with a genome build named "GRCh37_canon" using reference data from:
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
22
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
23 * ftp://ftp.ensembl.org/pub/release-73/fasta/homo_sapiens/dna/Homo_sapiens.GRCh37.73.dna.chromosome.[1-9XY]*.fa.gz
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
24 * ftp://ftp.ensembl.org/pub/release-73/fasta/homo_sapiens/pep/Homo_sapiens.GRCh37.73.pep.all.fa.gz
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
25 * ftp://ftp.ensembl.org/pub/release-73/gtf/homo_sapiens/Homo_sapiens.GRCh37.73.gtf.gz
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
26
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
27
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
28 For Mouse RNAseq data this workflow was tested with a genome build named "GRCm38_canon" using reference data from:
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
29
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
30 * ftp://ftp.ensembl.org/pub/release-73/fasta/mus_musculus/dna/Mus_musculus.GRCm38.73.dna.chromosome.[1-9XY]*.fa.gz
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
31 * ftp://ftp.ensembl.org/pub/release-73/fasta/mus_musculus/pep/Mus_musculus.GRCm38.73.pep.all.fa.gz
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
32 * ftp://ftp.ensembl.org/pub/release-73/gtf/mus_musculus/Mus_musculus.GRCm38.73.gtf.gz
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
33
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
34
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
35 Genome builds "GRCh37_canon" and "GRCm38_canon" were built according to instructions in: https://wiki.galaxyproject.org/Admin/NGS%20Local%20Setup
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
36 The builds used only the standard chromosomesi sequences from the reference fasta; the other sequences were filtered out.
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
37 The GTF was filtered to retain only those entries that referenced the standard chromosomes.
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
38
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
39
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
40 Dependencies
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
41 ============
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
42
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
43 These dependencies should be resolved automatically via the Galaxy Tool Shed:
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
44
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
45 * http://toolshed.g2.bx.psu.edu/view/devteam/tophat
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
46 * http://toolshed.g2.bx.psu.edu/view/devteam/samtools_mpileup
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
47 * http://toolshed.g2.bx.psu.edu/view/nilesh/bcftools
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
48 * http://toolshed.g2.bx.psu.edu/view/iuc/snpeff
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
49 * http://toolshed.g2.bx.psu.edu/view/jjohnson/snpeff_to_peptides
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
50
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
51 History
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
52 =======
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
53
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
54 ======= ======================================================================
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
55 Version Changes
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
56 ------- ----------------------------------------------------------------------
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
57 v0.0.1 - Initial release to Tool Shed (March, 2014)
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
58 ======= ======================================================================
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
59
3a11830963e3 Initial upload
Jim Johnson <jj@umn.edu>
parents:
diff changeset
60