annotate tools/fastq_filter_by_id/README.rst @ 3:e0041942a12d draft default tip

v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
author peterjc
date Fri, 03 Feb 2017 05:34:18 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
1 Obsolete
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
2 ========
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
3
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
4 This tool is now obsolete, having been replaced by a more general version
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
5 covering the FASTA, FASTQ and SFF sequence formats in a single tool. You
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
6 should only install this tool if you need to support existing workflows
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
7 which used it.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
8
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
9 Galaxy tool to filter FASTQ sequences by ID
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
10 ===========================================
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
11
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
12 This tool is copyright 2010-2017 by Peter Cock, The James Hutton Institute
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
13 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
14 See the licence text below (MIT licence).
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
15
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
16 This tool is a short Python script (using the Galaxy library functions) which
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
17 divides a FASTQ file in two, those sequences with or without an ID present in
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
18 the specified column(s) of a tabular file. Example uses include filtering based
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
19 on search results from a tool like NCBI BLAST before assembly.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
20
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
21 There are just two files to install:
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
22
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
23 * fastq_filter_by_id.py (the Python script)
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
24 * fastq_filter_by_id.xml (the Galaxy tool definition)
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
25
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
26 The suggested location is next to the similarly named fastq_filter.py and
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
27 fastq_filter.xml files which are included with Galaxy, i.e. in the Galaxy
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
28 folder tools/fastq
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
29
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
30 You will also need to modify the tools_conf.xml file to tell Galaxy to offer
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
31 the tool. The suggested location is next to the fastq_filter.xml entry. Simply
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
32 add the line:
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
33
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
34 <tool file="fastq/fastq_filter_by_id.xml" />
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
35
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
36 That's it.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
37
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
38
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
39 History
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
40 =======
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
41
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
42 ======= ======================================================================
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
43 Version Changes
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
44 ------- ----------------------------------------------------------------------
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
45 v0.0.1 - Initial verion (not publicly released)
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
46 v0.0.2 - Allow both, just pos or just neg output files
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
47 - Preserve the FASTQ variant in the XML wrapper
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
48 v0.0.3 - Fixed bug when generating non-matching FASTQ file only
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
49 v0.0.4 - Deprecated, marked as hidden in the XML
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
50 v0.0.5 - Explicit dependency on ``galaxy_sequence_utils``.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
51 - Citation information (Cock et al. 2013).
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
52 - Explicitly record version via ``<version_command>``.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
53 - Use ``format_source``/``metadata_source`` idiom for output format.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
54 - Use standard MIT license (was previously using the MIT/BSD style
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
55 Biopython Licence Agreement).
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
56 ======= ======================================================================
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
57
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
58
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
59 Developers
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
60 ==========
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
61
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
62 This script and other tools for filtering FASTA, FASTQ and SFF files were
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
63 initially developed on the following hg branches:
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
64 http://bitbucket.org/peterjc/galaxy-central/src/tools
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
65 http://bitbucket.org/peterjc/galaxy-central/src/fasta_filter
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
66
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
67 It is now under GitHub https://github.com/peterjc/pico_galaxy/
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
68
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
69 For pushing a release to the test or main "Galaxy Tool Shed", use the following
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
70 Planemo commands (which requires you have set your Tool Shed access details in
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
71 ``~/.planemo.yml`` and that you have access rights on the Tool Shed)::
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
72
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
73 $ planemo shed_update -t testtoolshed --check_diff tools/fastq_filter_by_id/
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
74 ...
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
75
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
76 or::
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
77
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
78 $ planemo shed_update -t toolshed --check_diff tools/fastq_filter_by_id/
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
79 ...
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
80
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
81 To just build and check the tar ball, use::
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
82
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
83 $ planemo shed_upload --tar_only tools/fastq_filter_by_id/
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
84 ...
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
85 $ tar -tzf shed_upload.tar.gz
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
86 tools/fastq_filter_by_id/README.rst
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
87 tools/fastq_filter_by_id/fastq_filter_by_id.py
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
88 tools/fastq_filter_by_id/fastq_filter_by_id.xml
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
89 tools/fastq_filter_by_id/tool_dependencies.xml
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
90 test-data/empty_file.dat
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
91 test-data/sanger-pairs-mixed.fastq
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
92 test-data/sanger-pairs-names.tabular
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
93
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
94
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
95 Licence (MIT)
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
96 =============
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
97
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
98 Permission is hereby granted, free of charge, to any person obtaining a copy
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
99 of this software and associated documentation files (the "Software"), to deal
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
100 in the Software without restriction, including without limitation the rights
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
101 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
102 copies of the Software, and to permit persons to whom the Software is
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
103 furnished to do so, subject to the following conditions:
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
104
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
105 The above copyright notice and this permission notice shall be included in
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
106 all copies or substantial portions of the Software.
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
107
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
108 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
109 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
110 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
111 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
112 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
113 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
e0041942a12d v0.0.5 - galaxy_sequence_utils dependency and other cleanups inc using MIT license
peterjc
parents:
diff changeset
114 THE SOFTWARE.