view otu_clustering/cluster_otu.xml @ 0:52043d3d71f5 draft

Uploaded
author qfab
date Wed, 28 May 2014 21:22:08 -0400
parents
children 5d05b34a5fdf
line wrap: on
line source

<tool id="cluster_otu" name="Cluster OTU" version="1.0.0">
  <description>Cluster OTUs</description>
  <command>
   usearch -cluster_otus $input -otus $output -otuid $radius 2&gt;1
  </command>

<inputs>
  <param name="input" type="data" format="fasta" label="Input sequence file"
         help="Size annotation is required, see description below!"/>
  <param name="radius" type='float' value='0.97' label='Minimum cluster radius'
         help="default=0.97, corresponds to minimum of 97% sequence identity" />
</inputs>

<outputs>
  <data name='output' format="fasta" />
</outputs>

<help>
===========
Description
===========

Performs OTU clustering using the USEARCH-Tool-Suite_.

.. _USEARCH-Tool-Suite: http://www.drive5.com/usearch/

-----

-----
Input
-----

FASTA file with size annotation in the input line, e.g. >sequenceXXXX;size=1443;

.. class:: warningmark 

Size annotation is required.

----------
Parameters
----------

radius
  (Usearch v7.0.1002 and earlier) specifies the OTU 'radius' as the minimum identity between an OTU member sequence and the representative sequence as a fractional identity 0.0 to 1.0. Default is 0.97, which corresponds to a minimum identity of 97%.

------
Output
------

FASTA output of the representative sequences.

.. class:: warningmark 

If there is a "size=XX" annotation in the identifier line of the output file, please note this is NOT the number of sequences in this cluster. The size annotation is then coming from the input file.

-----

=========
Resources
=========

UPARSE_ (OTU Clustering)

.._UPARSE: http://drive5.com/usearch/manual/cluster_otus.html

**Author**

Robert C. Edgar (bob@drive5.com)

**Wrapper Author**

QFAB Bioinformatics (support@qfab.org)
</help>
<tests>
    <test>
      <param name="input" value="seqs_derep.fasta" />
      <param name="radius" value="0.97" />
      <output name="output" file="seqs_otus.fasta" ftype="fasta" lines_diff="10" />
    </test>
</tests>
</tool>