annotate readme.rst @ 9:66b138a2e74c draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit e6a741fc2fe38e91a310e9c29467c814a2d34cac
author iuc
date Sat, 18 May 2024 19:13:20 +0000
parents ee407fae3ccd
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
5
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
1
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
2 =================
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
3 IMPORTANT NOTE REGARDING SYSTEM CONFIGURATION
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
4 =================
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
5
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
6 All of the Galaxy wrappers contained herein call the respective bbtools' shell wrapper, which calls the underlying java-based tool. Unlike a C-based program, java will grab a pre-determined amount of memory at the very beginning of the execution.
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
7
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
8 Some of the algorithms (e.g. bbnorm) utilise a hash table, and potential collusions can decrease the numeric accuracy of the output. This problem is expected to become more pronounced if the fraction of the memory occupied w.r.t. allocated memory becomes high, i.e. when the available memory is low and/or the input file is big. If the tool generates a warning to stderr, and will be caught by the Galaxy wrapper resulting in a failed job. However, `count min sketch <https://en.wikipedia.org/wiki/Count%E2%80%93min_sketch>` does not run out of memory, this is a gradual effect, and will NOT trigger a fatal error unless the load reaches this critically high level. You can read more about the implications of this at the `BBtools manual <https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbnorm-guide/>`
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
9
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
10 If you are administering a heteregenous computing environment with multiple nodes of very different quantities of physically available RAM, it is recommended to define a global cap on the RAM to be used to avoid introducing run-to-run bias by exporting an environmental variable, by something like:
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
11 export _JAVA_OPTIONS="-Xmx2048m -Xms256m"
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
12
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
13 The tool currently considers the following limits, in the given priority order:
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
14 1) _JAVA_OPTIONS
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
15 2) JAVA_TOOL_OPTIONS
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
16 3) GALAXY_MEMORY_MB
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
17 4) 4 GB
ee407fae3ccd planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bbtools commit 35db9ac8668f3e376886ea09de63c87dce93e1ce
iuc
parents:
diff changeset
18