Mercurial > repos > rreumerman > snptools

<tool id="snpsplit" name="SNP splitter">

  <description>splits multicharacter entries into separate lines</description>

  <command interpreter="python">snpsplit.py $input $output</command>

  <inputs>

    <param name="input" type="data" format="tabular" label="SNP table" />

  </inputs>

  <outputs>

    <data name="output" format="tabular" label="${tool.name} on ${on_string}" />

  </outputs>

  <help>


**What it does**


SNPsplit prepares tab-delimited SNP files for TRAMS. It checks each line for entries that contain multiple consecutive bases and splits them over several lines.


- **Input**: tab delimited, format: Position Ref Pol
- Position is 1-based genomic coordinate

- Ref is the reference sequence

- Pol is the polymorphism sequence


Ref en Mut sequences consisting of more than one character will be split up into separate lines. Example:


**Input**:


=== === ===

123 CGT ATG

=== === ===


**Output**:


=== = =

123 C A

124 G T

125 T G

=== = =


Bases that are the same in both columns, will be skipped. Example:


**Input**:


=== ===== ===

123 C*G*T AGG

=== ===== ===


**Output**:


=== = =

123 C A

125 T G

=== = =


  </help>

</tool>
author	rreumerman
date	Fri, 05 Apr 2013 05:00:40 -0400
parents
children