view README.txt @ 1:3c57a8767bb8 draft default tip

Uploaded v1.0.1, updates the help text, renamed readme file to display on the ToolShed.
author peterjc
date Wed, 05 Jun 2013 13:40:56 -0400
parents
children
line wrap: on
line source

#Created 07/01/2011 - Konrad Paszkiewicz, Exeter Sequencing Service, University of Exeter, UK
Revisions 2013 by Peter Cock, The James Hutton Institute, UK

The attached is a crude wrapper script for Interproscan. Typically this is useful when one wants to produce an annotation which is not based on sequence 
similarity. E.g after a denovo transcriptome assembly, each transcript could be translated and run through this tool.

Prerequisites:

1. A working installation of Interproscan on your Galaxy server/cluster.

Limitations:

Currently it is setup to work with PFAM only due to the heavy computational demands Interproscan makes. 

Input formats:

The standard interproscan input is either genomic or protein sequences. In the case of genomic sequences Interproscan will of run an ORF 
prediction tool. However this tends to lose the ORF information (e.g. start/end co-ordinates) from the header. As such the requirement here is to input ORF 
sequences (e.g. from EMBOSS getorf) and to then replace any spaces in the FASTA header with underscores. This workaround generally preserves the relevant 
positional information.