Mercurial > repos > timpalpant > java_genomics_toolkit
annotate galaxy-conf/KMeans.xml @ 17:ace7855c1017 draft
Add back tool_data_table_conf.xml.sample since apparently it's still needed.
author | timpalpant |
---|---|
date | Sat, 09 Jun 2012 16:10:42 -0400 |
parents | 81d5b81fb3c2 |
children |
rev | line source |
---|---|
4 | 1 <tool id="KMeans" name="KMeans cluster" version="1.0.0"> |
2 <description>an aligned matrix</description> | |
3 <command interpreter="sh">galaxyToolRunner.sh visualization.KMeans -i $input -k $K -1 $min -2 $max -o $output</command> | |
4 <inputs> | |
5 <param format="tabular" name="input" type="data" label="Aligned matrix" /> | |
6 <param name="K" type="integer" value="10" label="Number of clusters" /> | |
7 <param name="min" type="integer" value="1" label="Minimum column to use for clustering" /> | |
11 | 8 <param name="max" type="integer" value="-1" label="Maximum column to use for clustering (-1 to end)" /> |
4 | 9 </inputs> |
10 <outputs> | |
11 <data format="tabular" name="output" metadata="input" /> | |
12 </outputs> | |
13 <tests> | |
14 </tests> | |
15 | |
16 <help> | |
12
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
17 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
18 .. class:: warningmark |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
19 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
20 This tool requires tabular data in matrix2png format (with column AND row headers). For more information about the required format and usage instructions, see the matrix2png_ website. |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
21 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
22 .. _matrix2png: http://bioinformatics.ubc.ca/matrix2png/dataformat.html |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
23 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
24 .. class:: infomark |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
25 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
26 You can use the "Align values in a matrix" tool to create a matrix, then use this tool to cluster the matrix with k-means. |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
27 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
28 .. class:: infomark |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
29 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
30 **TIP:** You can use the **min** and **max** columns to cluster a large matrix based on a subset of the columns. For example, you could cluster a 4000x4000 matrix on columns 200-300 by setting min = 200 and max = 300. This will greatly increase the efficiency of distance calculations during the k-means EM, and also allows you to cluster based on specific regions, such as promoters or coding sequences. |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
31 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
32 ----- |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
33 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
34 This tool will cluster the rows in an aligned matrix with KMeans_. The implementation builds upon the KMeansPlusPlusClusterer available in commons-math3_. |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
35 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
36 .. _KMeans: http://en.wikipedia.org/wiki/K-means_clustering |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
37 |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
38 .. _commons-math3: http://commons.apache.org/math/ |
81d5b81fb3c2
Added help for all tools in the toolkit. Many bug fixes and a few new nucleosome tools.
timpalpant
parents:
11
diff
changeset
|
39 |
4 | 40 </help> |
41 </tool> |