Mercurial > repos > peterjc > rxlr_venn_workflow
view README.rst @ 5:4726b640f221 draft default tip
Uploaded revision to update the citation
author | peterjc |
---|---|
date | Fri, 25 Oct 2013 10:20:46 -0400 |
parents | 3c280e01b920 |
children |
line wrap: on
line source
This is package is a Galaxy workflow for comparing three RXLR prediction methods with a Venn Diagram, and creates a FASTA file of any proteins passing all three methods. See http://www.galaxyproject.org for information about the Galaxy Project. Availability ============ This workflow is available to download and/or install from the main Galaxy Tool Shed: http://toolshed.g2.bx.psu.edu/view/peterjc/rxlr_venn_workflow Test releases (which should not normally be used) are on the Test Tool Shed: http://testtoolshed.g2.bx.psu.edu/view/peterjc/rxlr_venn_workflow Development is being done on github here: https://github.com/peterjc/pico_galaxy/tree/master/workflows/rxlr_venn_workflow Sample Data =========== This workflow was developed and run on several *Phytophthora* species. For example, try the "Phyca11" protein set for *Phytophthora capsici*: http://genome.jgi-psf.org/Phyca11/download/Phyca11_filtered_proteins.fasta.gz You can upload this directly into Galaxy via this URL. Galaxy will handle removing the gzip compression to give you the FASTA protein file which has 19,805 protein sequences. The expected results: * 89 RXLRs using Whisson et al. (2007) * 124 RXLRs using Win et al. (2007) * 162 RXLRs using Bhattacharjee et al. (2006) Of these, only 79 sequences pass all three of the RXLR prediction tools, while 19643 have no RXLR matches at all. .. image:: Phyca11_example_output.png :height: 400px :width: 400px Citation ======== If you use this workflow directly, or a derivative of it, in work leading to a scientific publication, please cite: Cock, P.J.A. and Pritchard, L. (2014). Galaxy as a platform for identifying candidate pathogen effectors. Chapter 1 in "Plant-Pathogen Interactions: Methods and Protocols (Second Edition)"; P. Birch, J. Jones, and J.I. Bos, eds. Methods in Molecular Biology. Humana Press, Springer. ISBN 978-1-62703-985-7. http://www.springer.com/life+sciences/plant+sciences/book/978-1-62703-985-7 For the associated RXLR Galaxy tool, please cite: Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013). Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology. PeerJ 1:e167 http://dx.doi.org/10.7717/peerj.167 For the three underlying methods, please cite: Whisson, S.C., Boevink, C.V., Moleleki, L., et al. (2007) A translocation signal for delivery of oomycete effector proteins into host plant cells. Nature 450:115-118. http://dx.doi.org/10.1038/nature06203 Win, J., Morgan, W., Bos, J., et al. (2007) Adaptive evolution has targeted the C-terminal domain of the RXLR effectors of plant pathogenic oomycetes. The Plant Cell 19:2349-2369. http://dx.doi.org/10.1105/tpc.107.051037 Bhattacharjee, S., Luisa Hiller, N., Liolios, K., et al. (2006) The malarial host-targeting signal is conserved in the Irish potato famine pathogen. PLoS Pathogens 2(5):e50. http://dx.doi.org/10.1371/journal.ppat.0020050 Dependencies ============ These dependencies should be resolved automatically via the Galaxy Tool Shed: * http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp * http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id * http://toolshed.g2.bx.psu.edu/view/peterjc/venn_list However, at the time of writing those Galaxy tools have their own dependencies required for this workflow which require manual installation (SignalP v3.0, HMMER v2.0, and the R/Bioconductor package limma). History ======= ======= ====================================================================== Version Changes ------- ---------------------------------------------------------------------- v0.0.1 - Initial release to Tool Shed (May, 2013) - Expanded README file to include example data v0.0.2 - Updated versions of the tools used, inclulding core Galaxy Filter tool to avoid warning about new ``header_lines`` parameter. - Added link to Tool Shed in the workflow annotation explaining there is a README file with sample data, and a requested citation. - Bundle sample output image for display in this README file. ======= ====================================================================== Developers ========== This workflow is under source code control here: https://github.com/peterjc/pico_galaxy/tree/master/workflows/rxlr_venn_workflow To prepare the tar-ball for uploading to the Tool Shed, I use this: $ tar -cf rxlr_venn_workflow.tar.gz README.rst repository_dependencies.xml rxlr_venn_workflow.ga Phyca11_example_output.png Check this, $ tar -tzf rxlr_venn_workflow.tar.gz README.rst repository_dependencies.xml rxlr_venn_workflow.ga Phyca11_example_output.png