annotate README.txt @ 0:69e8f12c8b31 draft

"planemo upload"
author bioit_sciensano
date Fri, 11 Mar 2022 15:06:20 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
1 PROGRAM
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
2 =======
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
3
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
4 PhageTerm.py - run as command line in a shell
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
5
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
6
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
7 VERSION
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
8 =======
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
9
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
10 Version 4.0.0
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
11 Compatible with python 3.7
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
12
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
13
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
14 INTRODUCTION
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
15 ============
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
16
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
17 PhageTermVirome software is a tool to determine phage genome termini and genome packaging mode on single phage or multiple contigs at once.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
18 The software uses phage and virome sequencing reads obtained from libraries prepared with DNA fragmented randomly (e.g. Covaris fragmentation,
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
19 and library preparation using Illumina TruSeq). Phage or virome sequencing reads (fastq files) are aligned to the assembled phage genome or assembled
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
20 virome (fasta or multifasta files) in order to calculate two types of coverage values (whole genome coverage and the Starting Position Coverage (SPC)). The starting position coverage is used to perform a detailed termini and packaging mode analysis.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
21
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
22 Mu-type phage analysis : can be done if user suspect the phage genome to be Mu-like type (Only for single phage genome analysis, not possible with multifasta file) :
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
23 User can also provide the host (bacterial) genome sequence. The Mu-type phage analysis will take the reads that does not match the phage
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
24 genome and align them on the bacterial genome using the same mapping function. The analysis to identify Mu-like phages is available only when providing a single phage genome (not possible if user provide a multi-fast file with multiple assembled phage contigs).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
25
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
26
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
27 The previous PhageTerm program (single phage analysis only) is still available at https://sourceforge.net/projects/phageterm/ (for versions <3.0.0)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
28
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
29
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
30 A Galaxy wrapper version is also available for the previous version at https://galaxy.pasteur.fr (only for the first version PhageTerm).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
31 PhageTermVirome is not implemented on Galaxy yet).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
32
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
33 Since version 3.0.0, PhageTerm can work in 2 modes:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
34 - the usual mono machine mode (parallelization on several cores on the same machine).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
35 - a new multi machine mode (advanced users) with parallelization on several machines, using intermediate files for data exchange.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
36
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
37 The default mode is mono machine.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
38 Version 3.0.0 up to version 4.0 work with python 2.7
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
39
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
40 Since version 4.0, PhageTerm (now PhageTermVirome) works with python 3.7
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
41
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
42
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
43 PREREQUISITES
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
44 =============
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
45
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
46
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
47 For version 4.0
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
48
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
49 Unix/Linux
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
50
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
51 - backports
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
52 - backports.functools_lru_cache
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
53 - backports_abc
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
54 - cycler
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
55 - libwebp-base
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
56 - lz4-c
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
57 - matplotlib-base
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
58 - matplotlib
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
59 - numpy
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
60 - openssl
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
61 - pandas
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
62 - patsy
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
63 - pillow
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
64 - pip
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
65 - pyparsing
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
66 - python=3.7
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
67 - python-dateutil
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
68 - python_abi
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
69 - pytz
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
70 - readline
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
71 - reportlab
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
72 - scikit-learn
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
73 - scipy
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
74 - setuptools
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
75 - singledispatch
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
76 - statsmodels
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
77 - tk
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
78 - tornado
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
79
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
80 A conda virtualenv containing python3.7 and all dependencies is provided for convenience so that users
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
81 don't need to install anything else than miniconda or conda. (See below)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
82
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
83
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
84 FOR INPATIENT USERS : INSTALLING PHAGETERMVIROME USING THE CONDA VIRTUALENV (easiest option)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
85 ============================================================================================
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
86
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
87 First install miniconda if you don't have it already (you don't even need to have python 2.7 or python 3.7 installed on your machine for that since
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
88 miniconda contains it): https://docs.conda.io/en/latest/miniconda.html
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
89
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
90 Download and decompress/extract the PhageTermVirome directory available at https://gitlab.pasteur.fr/vlegrand/ptv.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
91
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
92 Then go in the PTV directory, and create the conda environment using the yml file PhageTerm_env_3.yml file for version >=4.0 (python3)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
93
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
94 $ conda env create -f PhageTerm_env_3.yml
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
95
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
96 Then activate the environment so you can launch PhageTermVirome:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
97
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
98 $ conda activate PhageTerm_env_py3
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
99
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
100
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
101 NOTE:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
102
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
103 You can still use the old PhageTerm under python 2.7 (but no multi-fast analysis possible) using the miniconda environment from the PhageTerm_env.yml file for version<4.0 (python2). Using the following commands.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
104
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
105 $ conda env create -f PhageTerm_env.yml
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
106
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
107 $ conda activate PhageTerm_env
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
108
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
109
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
110
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
111 COMMAND LINE USAGE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
112 ==================
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
113
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
114 Basic usage with mandatory options (PhageTermVirome needs at least one read file, but user can provide a second corresponding paired-end read file if available, using the -p option).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
115
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
116 ./PhageTerm.py -f reads.fastq -r phage_sequence(s).fasta
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
117
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
118
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
119 Help:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
120
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
121 ./PhageTerm.py -h
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
122 ./PhageTerm.py --help
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
123
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
124
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
125 After installation, we recommend users to perform a software run test, use any of the following:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
126 -t TEST_VALUE, --test=TEST_VALUE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
127 TEST_VALUE=C5 : Test run for a 5' cohesive end (e.g. Lambda)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
128 TEST_VALUE=C3 : Test run for a 3' cohesive end (e.g. HK97)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
129 TEST_VALUE=DS : Test run for a short Direct Terminal Repeats end (e.g. T7)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
130 TEST_VALUE=DL : Test run for a long Direct Terminal Repeats end (e.g. T5)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
131 TEST_VALUE=H : Test run for a Headful packaging (e.g. P1)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
132 TEST_VALUE=M : Test run for a Mu-like packaging (e.g. Mu)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
133
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
134
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
135 Non-mandatory options
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
136
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
137 [-p reads_paired -c nbr_core_threads --report_title name_to_write_on_report_outputs -s seed_lenght -d surrounding -g host.fasta -l contig_size_limit_multi-fasta -v virome_run_time_estimation]
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
138
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
139
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
140 Additional advanced options (only for multi-machine users)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
141
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
142
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
143 [--mm --dir_cov_mm path_to_coverage_results -c nb_cores --core_id idx_core -p reads_paired -s seed_lenght -d surrounding -l limit_multi-fasta]
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
144 [--mm --dir_cov_mm path_to_coverage_results --dir_seq_mm path_to_sequence_results --DR_path path_to_results --seq_id index_of_sequence --nb_pieces nbr_of_read_chunks -p reads_paired -s seed_lenght -d surrounding -l limit_multi-fasta] [--mm --DR_path path_to_results --dir_seq_mm path_to_sequence_results -p reads_paired -s seed_lenght -d surrounding -l limit_multi-fasta]
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
145
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
146
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
147
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
148
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
149 Detailed ptions:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
150
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
151
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
152 Raw reads file in fastq format:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
153 -f INPUT_FILE, --fastq=INPUT_FILE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
154 Fastq reads
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
155 (NGS sequences from random fragmentation DNA only,
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
156 e.g. Illumina TruSeq)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
157
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
158 Phage genome(s) in fasta format:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
159 -r INPUT_FILE, --ref=INPUT_FILE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
160 Reference phage genome(s) as unique contig in fasta format
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
161
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
162
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
163
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
164 Other options common to both modes:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
165
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
166 Raw reads file in fastq format:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
167 -p INPUT_FILE, --paired=INPUT_FILE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
168 Paired fastq reads
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
169 (NGS sequences from random fragmentation DNA only,
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
170 e.g. Illumina TruSeq)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
171
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
172 Analysis_name to write on output reports:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
173 --report_title USER_REPORT_NAME, --report_title=REPORT_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
174 Manually enter the name you want to have on your report outputs.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
175 Used as prefix for output files.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
176
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
177 Lenght of the seed used for reads in the mapping process:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
178 -s SEED_LENGHT, --seed=SEED_LENGHT
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
179 Manually enter the lenght of the seed used for reads
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
180 in the mapping process (Default: 20).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
181
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
182 Number of nucleotides around the main peak to consider for merging adjacent significant peaks (set to 1 to discover secondary terminus but sites).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
183 -d SUROUNDING_LENGHT, --surrounding=SUROUNDING_LENGHT
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
184 Manually enter the lenght of the surrounding used to
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
185 merge close peaks in the analysis process (Default: 20).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
186
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
187 Host genome in fasta format (option available only for analysis with a single phage genome):
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
188 -g INPUT_FILE, --host=INPUT_FILE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
189 Genome of reference host (bacterial genome) in fasta format
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
190 Warning: increase drastically process time
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
191 This option can be used only when analyzing a single phage genome (not available for virome contigs as multifasta)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
192
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
193 Define phage mean coverage:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
194 -m MEAN_NBR, --mean=MEAN_NBR
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
195 Phage mean coverage to use (Default: 250).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
196
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
197 Define phage mean coverage:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
198 -l LIMIT_FASTA, —limit=LIMIT_FASTA
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
199 Minimum phage fasta length (Default: 500).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
200
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
201
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
202 Options for mono machine (default) mode only
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
203
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
204 Software run test:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
205 -t TEST_VALUE, --test=TEST_VALUE
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
206 TEST_VALUE=C5 : Test run for a 5' cohesive end (e.g. Lambda)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
207 TEST_VALUE=C3 : Test run for a 3' cohesive end (e.g. HK97)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
208 TEST_VALUE=DS : Test run for a short Direct Terminal Repeats end (e.g. T7)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
209 TEST_VALUE=DL : Test run for a long Direct Terminal Repeats end (e.g. T5)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
210 TEST_VALUE=H : Test run for a Headful packaging (e.g. P1)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
211 TEST_VALUE=M : Test run for a Mu-like packaging (e.g. Mu)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
212
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
213 Core processor number to use:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
214 -c CORE_NBR, --core=CORE_NBR
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
215 Number of core processor to use (Default: 1).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
216
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
217
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
218
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
219 Options for multi machine mode only
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
220
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
221 Indicate that PhageTerm should run on several machines:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
222 --mm
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
223
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
224
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
225 Options for step 1 of multi-machine mode (calculating reads coverage) on several machines
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
226
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
227 Directory for coverage results:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
228 --dir_cov_mm=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
229 Directory where to put coverage results.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
230 Note: it is up to the user to delete the files in this directory.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
231
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
232 Total number of cores to use
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
233 -c CORE_NBR, --core=CORE_NBR
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
234 Total number used accross over all machines.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
235
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
236 Index of read chunk to process on current core
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
237 --core_id=IDX
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
238 A number between 0 and CORE_NBR-1
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
239
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
240 Directory for checkpoint files:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
241 --dir_chk=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
242 Directory where phageTerm will put its ceckpoints.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
243 Note: the directory must exist before launching phageTerm.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
244 If the directory already contains a file, phageTerm will start from the results contained in this file.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
245
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
246 --chk_freq=FREQUENCY
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
247 The frequency in minutes at which checkpoints must be created.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
248 Note: default value is 0 which means that no checkpoint is created.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
249
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
250
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
251
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
252 Options for step 2 of multi-machine mode (calculating per sequence statistics from reads coverage results) on several machines
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
253
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
254 Directory for coverage results:
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
255 --dir_cov_mm=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
256 Directory where to put coverage results.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
257 Note: it is up to the user to delete the files in this directory.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
258
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
259 Directory for per sequence results
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
260 --dir_seq_mm=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
261 Directory where to put the information if no match was found for one/several sequences.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
262 Note: it is up to the user to delete the files in this directory.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
263
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
264 Directory for DR results
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
265 --DR_path=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
266 Directory where to put the information necessary to step 3 (final report generation).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
267 This information typically includes names of phage found and per sequence statistics.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
268 Note: it is up to the user to delete the files in this directory.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
269
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
270 Sequence identifier
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
271 --seq_id=IDX
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
272 Index of the sequence to be processed by the current phageTerm process.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
273 Let N be the number of sequences given at the end of step 1.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
274 Then IDX is number between 0 and N-1.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
275
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
276 Number of pieces
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
277 --nb_pieces=NP
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
278 Number of parts in which the reads were divided.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
279 Must be the same value as given via -c at step 1 (CORE_NBR).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
280
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
281
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
282 Options for step 3 of multi-machine mode (final report generation)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
283
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
284 Directory for DR results
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
285 --DR_path=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
286 Directory where to read the information necessary to step 3 (final report generation).
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
287 This information typically includes names of phage found and per sequence statistics.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
288 Note: it is up to the user to delete the files in this directory.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
289
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
290 Directory for per sequence results
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
291 --dir_seq_mm=DIR_PATH/DIR_NAME
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
292 Directory where to get the information if no match was found for one/several sequences.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
293 Note: it is up to the user to delete the files in this directory.
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
294
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
295
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
296
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
297
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
298
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
299
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
300 OUTPUT FILES
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
301 ==========
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
302
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
303 (i) Report (.pdf)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
304
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
305 (ii) Statistical table (.csv)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
306
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
307 (iii) File containingg contains re-organized to stat at the predicted termini (.fasta)
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
308
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
309
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
310 CONTACT
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
311 =======
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
312
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
313 Julian Garneau <julian.garneau@usherbrooke.ca>
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
314 Marc Monot <marc.monot@pasteur.fr>
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
315 David Bikard <david.bikard@pasteur.fr>
69e8f12c8b31 "planemo upload"
bioit_sciensano
parents:
diff changeset
316 VĂ©ronique Legrand <vlegrand@pasteur.fr>