view otu_clustering/cluster_otu.xml @ 1:5d05b34a5fdf draft default tip

Uploaded
author qfab
date Wed, 28 May 2014 21:27:15 -0400
parents 52043d3d71f5
children
line wrap: on
line source

<tool id="cluster_otu" name="Cluster OTU" version="1.0.0">
  <description>Cluster OTUs</description>
  <command>
   usearch -cluster_otus $input -otus $output -otuid $radius 2&gt;1
  </command>

<inputs>
  <param name="input" type="data" format="fasta" label="Input sequence file"
         help="Size annotation is required, see description below!"/>
  <param name="radius" type='float' value='0.97' label='Minimum cluster radius'
         help="default=0.97, corresponds to minimum of 97% sequence identity" />
</inputs>

<outputs>
  <data name='output' format="fasta" />
</outputs>

<help>
===========
Description
===========

Performs OTU clustering using the USEARCH-Tool-Suite_.

.. _USEARCH-Tool-Suite: http://www.drive5.com/usearch/

-----

-----
Input
-----

FASTA file with size annotation in the input line, e.g. >sequenceXXXX;size=1443;

.. class:: warningmark 

Size annotation is required.

----------
Parameters
----------

radius
  (Usearch v7.0.1002 and earlier) specifies the OTU 'radius' as the minimum identity between an OTU member sequence and the representative sequence as a fractional identity 0.0 to 1.0. Default is 0.97, which corresponds to a minimum identity of 97%.

------
Output
------

FASTA output of the representative sequences.

.. class:: warningmark 

If there is a "size=XX" annotation in the identifier line of the output file, please note this is NOT the number of sequences in this cluster. The size annotation is then coming from the input file.

-----

=========
Resources
=========

UPARSE_ (OTU Clustering)

.. _UPARSE: http://drive5.com/usearch/manual/cluster_otus.html

**Author**

Robert C. Edgar (bob@drive5.com)

**Wrapper Author**

QFAB Bioinformatics (support@qfab.org)
</help>
<tests>
    <test>
      <param name="input" value="seqs_derep.fasta" />
      <param name="radius" value="0.97" />
      <output name="output" file="seqs_otus.fasta" ftype="fasta" lines_diff="10" />
    </test>
</tests>
</tool>