comparison test-data/denovo_map/denovo_map.log @ 0:1b270fc77c81 draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/stacks2 commit b395fa36fa826e26085820ba3a9faacaeddcb460
author iuc
date Mon, 01 Jul 2019 10:59:53 -0400
parents
children bf0e43ab0416
comparison
equal deleted inserted replaced
-1:000000000000 0:1b270fc77c81
1 denovo_map.pl version 2.4 started at 2019-06-18 10:34:45
2 /home/berntm/miniconda3/envs/mulled-v1-2b57e7596f85ebb3b321e6c9681e8fd9250523a80d97945c46ac7743359454e7/bin/denovo_map.pl --samples demultiplexed --popmap denovo_map/popmap_cstacks.tsv -o stacks_outputs --paired
3
4 ustacks
5 ==========
6
7 Sample 1 of 2 'PopA_01'
8 ----------
9 ustacks -t fastq -f demultiplexed/PopA_01.1.fq -o stacks_outputs -i 1 --name PopA_01
10 ustacks parameters selected:
11 Input file: 'demultiplexed/PopA_01.1.fq'
12 Sample ID: 1
13 Min depth of coverage to create a stack (m): 3
14 Repeat removal algorithm: enabled
15 Max distance allowed between stacks (M): 2
16 Max distance allowed to align secondary reads: 4
17 Max number of stacks allowed per de novo locus: 3
18 Deleveraging algorithm: disabled
19 Gapped assembly: enabled
20 Minimum alignment length: 0.8
21 Model type: SNP
22 Alpha significance level for model: 0.05
23
24 Loading RAD-Tags...
25
26 Loaded 66 reads; formed:
27 4 stacks representing 63 primary reads (95.5%)
28 3 secondary stacks representing 3 secondary reads (4.5%)
29
30 Stack coverage: mean=15.75; stdev=7.46; max=27; n_reads=63(95.5%)
31 Removing repetitive stacks: cov > 39 (mean+3*stdev)...
32 Blacklisted 0 stacks.
33 Coverage after repeat removal: mean=15.75; stdev=7.46; max=27; n_reads=63(95.5%)
34
35 Assembling stacks (max. dist. M=2)...
36 Assembled 4 stacks into 3; blacklisted 0 stacks.
37 Coverage after assembling stacks: mean=21.00; stdev=4.24; max=27; n_reads=63(95.5%)
38
39 Merging secondary stacks (max. dist. N=4 from consensus)...
40 Merged 3 out of 3 secondary reads (100.0%), 1 merged with gapped alignments.
41 Coverage after merging secondary stacks: mean=22.00; stdev=4.32; max=28; n_reads=66(100.0%)
42
43 Assembling stacks, allowing for gaps (min. match length 80.0%)...
44 Assembled 3 stacks into 3 stacks.
45 Coverage after gapped assembly: mean=22.00; stdev=4.32; max=28; n_reads=66(100.0%)
46
47 Merging secondary stacks, allowing for gaps (min. match length 80.0%)...
48 Merged 0 out of 0 secondary reads (-nan%).
49 Coverage after merging gapped secondary stacks: mean=22.00; stdev=4.32; max=28; n_reads=66(100.0%)
50
51 Final coverage: mean=22.00; stdev=4.32; max=28; n_reads=66(100.0%)
52 Calling consensus sequences and haplotypes for catalog assembly...
53 Writing tags, SNPs, and alleles files...
54 Refetching read IDs...done.
55 ustacks is done.
56
57 Sample 2 of 2 'PopA_02'
58 ----------
59 ustacks -t fastq -f demultiplexed/PopA_02.1.fq -o stacks_outputs -i 2 --name PopA_02
60 ustacks parameters selected:
61 Input file: 'demultiplexed/PopA_02.1.fq'
62 Sample ID: 2
63 Min depth of coverage to create a stack (m): 3
64 Repeat removal algorithm: enabled
65 Max distance allowed between stacks (M): 2
66 Max distance allowed to align secondary reads: 4
67 Max number of stacks allowed per de novo locus: 3
68 Deleveraging algorithm: disabled
69 Gapped assembly: enabled
70 Minimum alignment length: 0.8
71 Model type: SNP
72 Alpha significance level for model: 0.05
73
74 Loading RAD-Tags...
75
76 Loaded 60 reads; formed:
77 4 stacks representing 55 primary reads (91.7%)
78 5 secondary stacks representing 5 secondary reads (8.3%)
79
80 Stack coverage: mean=13.75; stdev=9.12; max=26; n_reads=55(91.7%)
81 Removing repetitive stacks: cov > 42 (mean+3*stdev)...
82 Blacklisted 0 stacks.
83 Coverage after repeat removal: mean=13.75; stdev=9.12; max=26; n_reads=55(91.7%)
84
85 Assembling stacks (max. dist. M=2)...
86 Assembled 4 stacks into 3; blacklisted 0 stacks.
87 Coverage after assembling stacks: mean=18.33; stdev=6.55; max=26; n_reads=55(91.7%)
88
89 Merging secondary stacks (max. dist. N=4 from consensus)...
90 Merged 5 out of 5 secondary reads (100.0%), 0 merged with gapped alignments.
91 Coverage after merging secondary stacks: mean=20.00; stdev=6.53; max=28; n_reads=60(100.0%)
92
93 Assembling stacks, allowing for gaps (min. match length 80.0%)...
94 Assembled 3 stacks into 3 stacks.
95 Coverage after gapped assembly: mean=20.00; stdev=6.53; max=28; n_reads=60(100.0%)
96
97 Merging secondary stacks, allowing for gaps (min. match length 80.0%)...
98 Merged 0 out of 0 secondary reads (-nan%).
99 Coverage after merging gapped secondary stacks: mean=20.00; stdev=6.53; max=28; n_reads=60(100.0%)
100
101 Final coverage: mean=20.00; stdev=6.53; max=28; n_reads=60(100.0%)
102 Calling consensus sequences and haplotypes for catalog assembly...
103 Writing tags, SNPs, and alleles files...
104 Refetching read IDs...done.
105 ustacks is done.
106
107 Depths of Coverage for Processed Samples:
108 PopA_01: 22.00x
109 PopA_02: 20.00x
110
111 cstacks
112 ==========
113 cstacks -P stacks_outputs -M denovo_map/popmap_cstacks.tsv
114
115 cstacks parameters selected:
116 Loci matched based on sequence identity.
117 Number of mismatches allowed between stacks: 1
118 Gapped alignments: enabled
119 Constructing catalog from 2 samples.
120
121 Initializing new catalog...
122 Parsing stacks_outputs/PopA_01.tags.tsv
123 Parsing stacks_outputs/PopA_01.snps.tsv
124 Parsing stacks_outputs/PopA_01.alleles.tsv
125 3 loci were newly added to the catalog.
126
127 Processing sample stacks_outputs/PopA_02 [2 of 2]
128 Parsing stacks_outputs/PopA_02.tags.tsv
129 Parsing stacks_outputs/PopA_02.snps.tsv
130 Parsing stacks_outputs/PopA_02.alleles.tsv
131 Searching for sequence matches...
132 3 loci in the catalog, 184 kmers in the catalog hash.
133 Searching for gapped alignments...
134 Merging matches into catalog...
135 3 loci were matched to a catalog locus.
136 0 loci were matched to a catalog locus using gapped alignments.
137 0 loci were newly added to the catalog.
138 0 loci matched more than one catalog locus, linking them.
139 0 linked catalog loci were merged into 0 loci.
140
141 Writing catalog in directory 'stacks_outputs/'.
142 Final catalog contains 3 loci.
143 cstacks is done.
144
145 sstacks
146 ==========
147 sstacks -P stacks_outputs -M denovo_map/popmap_cstacks.tsv
148
149 Searching for matches by sequence identity...
150 Parsing stacks_outputs/catalog.tags.tsv
151 Parsing stacks_outputs/catalog.snps.tsv
152 Parsing stacks_outputs/catalog.alleles.tsv
153 Populating kmer dictionary for exact matches...done.
154 Populating kmer dictionary for gapped alignments...done.
155
156 Processing sample 'stacks_outputs/PopA_01' [1 of 2]
157 Parsing stacks_outputs/PopA_01.tags.tsv
158 Parsing stacks_outputs/PopA_01.snps.tsv
159 Parsing stacks_outputs/PopA_01.alleles.tsv
160 Searching for sequence matches...
161 3 sample loci compared against the catalog containing 3 loci.
162 3 matching loci, 0 contained no verified haplotypes.
163 0 loci matched more than one catalog locus and were excluded.
164 0 loci contained SNPs unaccounted for in the catalog and were excluded.
165 4 total haplotypes examined from matching loci, 4 verified.
166 Searching for gapped alignments...
167 Out of 3 query loci, 0 gapped alignments attempted.
168 0 loci matched one catalog locus; 0 total haplotypes examined, 0 verified.
169 0 loci matched no catalog locus;
170 0 loci matched more than one catalog locus and were excluded.
171 0 loci contained SNPs unaccounted for in the catalog and were excluded.
172 0 loci had no verified haplotypes.
173 0 loci had inconsistent alignments to a catalog locus and were excluded.
174 Outputing to file stacks_outputs/PopA_01.matches.tsv
175
176 Processing sample 'stacks_outputs/PopA_02' [2 of 2]
177 Parsing stacks_outputs/PopA_02.tags.tsv
178 Parsing stacks_outputs/PopA_02.snps.tsv
179 Parsing stacks_outputs/PopA_02.alleles.tsv
180 Searching for sequence matches...
181 3 sample loci compared against the catalog containing 3 loci.
182 3 matching loci, 0 contained no verified haplotypes.
183 0 loci matched more than one catalog locus and were excluded.
184 0 loci contained SNPs unaccounted for in the catalog and were excluded.
185 4 total haplotypes examined from matching loci, 4 verified.
186 Searching for gapped alignments...
187 Out of 3 query loci, 0 gapped alignments attempted.
188 0 loci matched one catalog locus; 0 total haplotypes examined, 0 verified.
189 0 loci matched no catalog locus;
190 0 loci matched more than one catalog locus and were excluded.
191 0 loci contained SNPs unaccounted for in the catalog and were excluded.
192 0 loci had no verified haplotypes.
193 0 loci had inconsistent alignments to a catalog locus and were excluded.
194 Outputing to file stacks_outputs/PopA_02.matches.tsv
195
196 sstacks is done.
197
198 tsv2bam
199 ==========
200 tsv2bam -P stacks_outputs -M denovo_map/popmap_cstacks.tsv -R demultiplexed/
201
202 Logging to 'stacks_outputs/tsv2bam.log'.
203 Configuration for this run:
204 Stacks directory: 'stacks_outputs/'
205 Population map: 'denovo_map/popmap_cstacks.tsv'
206 Num. samples: 2
207 Paired-end reads directory: 'demultiplexed/'
208
209 Paired-end reads files found, e.g. 'demultiplexed/PopA_01.2.fq'.
210 Loading the catalog...
211 Processing sample 'PopA_01'...
212 Processing sample 'PopA_02'...
213
214 Sample 'PopA_01': matched 3 sample loci to 3 catalog loci; found a paired-end read for 66 (100.0%) of the assembled forward reads; wrote 132 records.
215 Sample 'PopA_02': matched 3 sample loci to 3 catalog loci; found a paired-end read for 60 (100.0%) of the assembled forward reads; wrote 120 records.
216
217 tsv2bam is done.
218
219 gstacks
220 ==========
221 gstacks -P stacks_outputs -M denovo_map/popmap_cstacks.tsv
222
223 Logging to 'stacks_outputs/gstacks.log'.
224 Locus/sample distributions will be written to 'stacks_outputs/gstacks.log.distribs'.
225
226 Configuration for this run:
227 Input mode: denovo
228 Population map: 'denovo_map/popmap_cstacks.tsv'
229 Input files: 2, e.g. 'stacks_outputs/PopA_01.matches.bam'
230 Output to: 'stacks_outputs/'
231 Model: marukilow (var_alpha: 0.01, gt_alpha: 0.05)
232
233 Reading BAM headers...
234 Processing all loci...
235 20%...
236 50%...
237 100%
238
239 Attempted to assemble and align paired-end reads for 3 loci:
240 0 loci had no or almost no paired-end reads (0.0%);
241 0 loci had paired-end reads that couldn't be assembled into a contig (0.0%);
242 For the remaining 3 loci (100.0%), a paired-end contig was assembled;
243 Average contig size was 204.3 bp;
244 0 paired-end contigs overlapped the forward region (0.0%)
245 Mean overlap: -nanbp; mean size of overlapped loci after merging: -nan;
246 Out of 252 paired-end reads in these loci (mean 84.0 reads per locus),
247 252 were successfuly aligned (100.0%);
248 Mean insert length was -nan, stdev: nan (based on aligned reads in overlapped loci).
249
250 Genotyped 3 loci:
251 effective per-sample coverage: mean=31.0x, stdev=1.0x, min=30.0x, max=32.0x
252 mean number of sites per locus: 194.3
253 a consistent phasing was found for 5 of out 5 (100.0%) diploid loci needing phasing
254
255 gstacks is done.
256
257 populations
258 ==========
259 populations -P stacks_outputs -M denovo_map/popmap_cstacks.tsv
260
261 Logging to 'stacks_outputs/populations.log'.
262 Locus/sample distributions will be written to 'stacks_outputs/populations.log.distribs'.
263 populations parameters selected:
264 Percent samples limit per population: 0
265 Locus Population limit: 1
266 Percent samples overall: 0
267 Minor allele frequency cutoff: 0
268 Maximum observed heterozygosity cutoff: 1
269 Applying Fst correction: none.
270 Pi/Fis kernel smoothing: off
271 Fstats kernel smoothing: off
272 Bootstrap resampling: off
273
274 Parsing population map...
275 The population map contained 2 samples, 1 population(s), 1 group(s).
276 Working on 2 samples.
277 Working on 1 population(s):
278 1: PopA_01, PopA_02
279 Working on 1 group(s) of populations:
280 defaultgrp: 1
281
282 Genotyping markers will be written to 'stacks_outputs/populations.markers.tsv'
283 Raw Genotypes/Haplotypes will be written to 'stacks_outputs/populations.haplotypes.tsv'
284 Population-level summary statistics will be written to 'stacks_outputs/populations.sumstats.tsv'
285 Population-level haplotype summary statistics will be written to 'stacks_outputs/populations.hapstats.tsv'
286
287 Processing data in batches:
288 * load a batch of catalog loci and apply filters
289 * compute SNP- and haplotype-wise per-population statistics
290 * write the above statistics in the output files
291 * export the genotypes/haplotypes in specified format(s)
292 More details in 'stacks_outputs/populations.log.distribs'.
293 Now processing...
294 Batch 1
295
296 Removed 0 loci that did not pass sample/population constraints from 3 loci.
297 Kept 3 loci, composed of 613 sites; 0 of those sites were filtered, 6 variant sites remained.
298 Number of loci with PE contig: 3.00 (100.0%);
299 Mean length of loci: 194.33bp (stderr 0.33);
300 Number of loci with SE/PE overlap: 0.00 (0.0%);
301 Mean length of overlapping loci: -nanbp (stderr -0.00); mean overlap: -nanbp (stderr -0.00);
302 Mean genotyped sites per locus: 194.33bp (stderr 0.33).
303
304 Population summary statistics (more detail in populations.sumstats_summary.tsv):
305 1: 2 samples per locus; pi: 0.61111; all/variant/polymorphic sites: 583/6/6; private alleles: 0
306 Populations is done.
307 denovo_map.pl is done.
308
309 denovo_map.pl completed at 2019-06-18 10:34:45