comparison test-data/summary_mate.txt @ 14:b6aa3b6ba129 draft

planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/bismark commit f57cd875407ce987c4897fc352c5db0eeb8e9efe
author bgruening
date Tue, 30 Jul 2019 10:33:02 -0400
parents
children 0b656f8c5637
comparison
equal deleted inserted replaced
13:f211753166bd 14:b6aa3b6ba129
1 Create a temporary index with the offered files from the user. Utilizing the script: bismark_genome_preparation
2 Generating index with: 'bismark_genome_preparation --bowtie2 /tmp/tmpAHSx4i'
3 Writing bisulfite genomes out into a single MFA (multi FastA) file
4
5 Bisulfite Genome Indexer version v0.22.1 (last modified: 14 April 2019)
6
7 Step I - Prepare genome folders - completed
8
9
10
11 Total number of conversions performed:
12 C->T: 146875
13 G->A: 150504
14
15 Step II - Genome bisulfite conversions - completed
16
17
18 Bismark Genome Preparation - Step III: Launching the Bowtie 2 indexer
19 Please be aware that this process can - depending on genome size - take several hours!
20 Settings:
21 Output files: "BS_CT.*.bt2"
22 Line rate: 6 (line is 64 bytes)
23 Lines per side: 1 (side is 64 bytes)
24 Offset rate: 4 (one in 16)
25 FTable chars: 10
26 Strings: unpacked
27 Max bucket size: default
28 Max bucket size, sqrt multiplier: default
29 Max bucket size, len divisor: 4
30 Difference-cover sample period: 1024
31 Endianness: little
32 Actual local endianness: little
33 Sanity checking: disabled
34 Assertions: disabled
35 Random seed: 0
36 Sizeofs: void*:8, int:4, long:8, size_t:8
37 Input files DNA, FASTA:
38 genome_mfa.CT_conversion.fa
39 Building a SMALL index
40 Reading reference sizes
41 Time reading reference sizes: 00:00:00
42 Calculating joined length
43 Writing header
44 Reserving space for joined string
45 Joining reference sequences
46 Time to join reference sequences: 00:00:00
47 bmax according to bmaxDivN setting: 189039
48 Using parameters --bmax 141780 --dcv 1024
49 Doing ahead-of-time memory usage test
50 Passed! Constructing with these parameters: --bmax 141780 --dcv 1024
51 Constructing suffix-array element generator
52 Building DifferenceCoverSample
53 Building sPrime
54 Building sPrimeOrder
55 V-Sorting samples
56 V-Sorting samples time: 00:00:00
57 Allocating rank array
58 Ranking v-sort output
59 Ranking v-sort output time: 00:00:00
60 Invoking Larsson-Sadakane on ranks
61 Invoking Larsson-Sadakane on ranks time: 00:00:00
62 Sanity-checking and returning
63 Building samples
64 Reserving space for 12 sample suffixes
65 Generating random suffixes
66 QSorting 12 sample offsets, eliminating duplicates
67 QSorting sample offsets, eliminating duplicates time: 00:00:00
68 Multikey QSorting 12 samples
69 (Using difference cover)
70 Multikey QSorting samples time: 00:00:00
71 Calculating bucket sizes
72 Splitting and merging
73 Splitting and merging time: 00:00:00
74 Avg bucket size: 756159 (target: 141779)
75 Converting suffix-array elements to index image
76 Allocating ftab, absorbFtab
77 Entering Ebwt loop
78 Getting block 1 of 1
79 No samples; assembling all-inclusive block
80 Sorting block of length 756159 for bucket 1
81 (Using difference cover)
82 Sorting block time: 00:00:00
83 Returning block of 756160 for bucket 1
84 Exited Ebwt loop
85 fchr[A]: 0
86 fchr[C]: 235897
87 fchr[G]: 235897
88 fchr[T]: 386401
89 fchr[$]: 756159
90 Exiting Ebwt::buildToDisk()
91 Returning from initFromVector
92 Wrote 4446745 bytes to primary EBWT file: BS_CT.1.bt2
93 Wrote 189044 bytes to secondary EBWT file: BS_CT.2.bt2
94 Re-opening _in1 and _in2 as input streams
95 Returning from Ebwt constructor
96 Headers:
97 len: 756159
98 bwtLen: 756160
99 sz: 189040
100 bwtSz: 189040
101 lineRate: 6
102 offRate: 4
103 offMask: 0xfffffff0
104 ftabChars: 10
105 eftabLen: 20
106 eftabSz: 80
107 ftabLen: 1048577
108 ftabSz: 4194308
109 offsLen: 47260
110 offsSz: 189040
111 lineSz: 64
112 sideSz: 64
113 sideBwtSz: 48
114 sideBwtLen: 192
115 numSides: 3939
116 numLines: 3939
117 ebwtTotLen: 252096
118 ebwtTotSz: 252096
119 color: 0
120 reverse: 0
121 Total time for call to driver() for forward index: 00:00:00
122 Reading reference sizes
123 Time reading reference sizes: 00:00:00
124 Calculating joined length
125 Writing header
126 Reserving space for joined string
127 Joining reference sequences
128 Time to join reference sequences: 00:00:00
129 Time to reverse reference sequence: 00:00:00
130 bmax according to bmaxDivN setting: 189039
131 Using parameters --bmax 141780 --dcv 1024
132 Doing ahead-of-time memory usage test
133 Passed! Constructing with these parameters: --bmax 141780 --dcv 1024
134 Constructing suffix-array element generator
135 Building DifferenceCoverSample
136 Building sPrime
137 Building sPrimeOrder
138 V-Sorting samples
139 V-Sorting samples time: 00:00:00
140 Allocating rank array
141 Ranking v-sort output
142 Ranking v-sort output time: 00:00:00
143 Invoking Larsson-Sadakane on ranks
144 Invoking Larsson-Sadakane on ranks time: 00:00:00
145 Sanity-checking and returning
146 Building samples
147 Reserving space for 12 sample suffixes
148 Generating random suffixes
149 QSorting 12 sample offsets, eliminating duplicates
150 QSorting sample offsets, eliminating duplicates time: 00:00:00
151 Multikey QSorting 12 samples
152 (Using difference cover)
153 Multikey QSorting samples time: 00:00:00
154 Calculating bucket sizes
155 Splitting and merging
156 Splitting and merging time: 00:00:00
157 Avg bucket size: 756159 (target: 141779)
158 Converting suffix-array elements to index image
159 Allocating ftab, absorbFtab
160 Entering Ebwt loop
161 Getting block 1 of 1
162 No samples; assembling all-inclusive block
163 Sorting block of length 756159 for bucket 1
164 (Using difference cover)
165 Sorting block time: 00:00:00
166 Returning block of 756160 for bucket 1
167 Exited Ebwt loop
168 fchr[A]: 0
169 fchr[C]: 235897
170 fchr[G]: 235897
171 fchr[T]: 386401
172 fchr[$]: 756159
173 Exiting Ebwt::buildToDisk()
174 Returning from initFromVector
175 Wrote 4446745 bytes to primary EBWT file: BS_CT.rev.1.bt2
176 Wrote 189044 bytes to secondary EBWT file: BS_CT.rev.2.bt2
177 Re-opening _in1 and _in2 as input streams
178 Returning from Ebwt constructor
179 Headers:
180 len: 756159
181 bwtLen: 756160
182 sz: 189040
183 bwtSz: 189040
184 lineRate: 6
185 offRate: 4
186 offMask: 0xfffffff0
187 ftabChars: 10
188 eftabLen: 20
189 eftabSz: 80
190 ftabLen: 1048577
191 ftabSz: 4194308
192 offsLen: 47260
193 offsSz: 189040
194 lineSz: 64
195 sideSz: 64
196 sideBwtSz: 48
197 sideBwtLen: 192
198 numSides: 3939
199 numLines: 3939
200 ebwtTotLen: 252096
201 ebwtTotSz: 252096
202 color: 0
203 reverse: 1
204 Total time for backward call to driver() for mirror index: 00:00:01
205 Settings:
206 Output files: "BS_GA.*.bt2"
207 Line rate: 6 (line is 64 bytes)
208 Lines per side: 1 (side is 64 bytes)
209 Offset rate: 4 (one in 16)
210 FTable chars: 10
211 Strings: unpacked
212 Max bucket size: default
213 Max bucket size, sqrt multiplier: default
214 Max bucket size, len divisor: 4
215 Difference-cover sample period: 1024
216 Endianness: little
217 Actual local endianness: little
218 Sanity checking: disabled
219 Assertions: disabled
220 Random seed: 0
221 Sizeofs: void*:8, int:4, long:8, size_t:8
222 Input files DNA, FASTA:
223 genome_mfa.GA_conversion.fa
224 Building a SMALL index
225 Reading reference sizes
226 Time reading reference sizes: 00:00:00
227 Calculating joined length
228 Writing header
229 Reserving space for joined string
230 Joining reference sequences
231 Time to join reference sequences: 00:00:00
232 bmax according to bmaxDivN setting: 189039
233 Using parameters --bmax 141780 --dcv 1024
234 Doing ahead-of-time memory usage test
235 Passed! Constructing with these parameters: --bmax 141780 --dcv 1024
236 Constructing suffix-array element generator
237 Building DifferenceCoverSample
238 Building sPrime
239 Building sPrimeOrder
240 V-Sorting samples
241 V-Sorting samples time: 00:00:00
242 Allocating rank array
243 Ranking v-sort output
244 Ranking v-sort output time: 00:00:00
245 Invoking Larsson-Sadakane on ranks
246 Invoking Larsson-Sadakane on ranks time: 00:00:00
247 Sanity-checking and returning
248 Building samples
249 Reserving space for 12 sample suffixes
250 Generating random suffixes
251 QSorting 12 sample offsets, eliminating duplicates
252 QSorting sample offsets, eliminating duplicates time: 00:00:00
253 Multikey QSorting 12 samples
254 (Using difference cover)
255 Multikey QSorting samples time: 00:00:00
256 Calculating bucket sizes
257 Splitting and merging
258 Splitting and merging time: 00:00:00
259 Avg bucket size: 756159 (target: 141779)
260 Converting suffix-array elements to index image
261 Allocating ftab, absorbFtab
262 Entering Ebwt loop
263 Getting block 1 of 1
264 No samples; assembling all-inclusive block
265 Sorting block of length 756159 for bucket 1
266 (Using difference cover)
267 Sorting block time: 00:00:00
268 Returning block of 756160 for bucket 1
269 Exited Ebwt loop
270 fchr[A]: 0
271 fchr[C]: 386401
272 fchr[G]: 533276
273 fchr[T]: 533276
274 fchr[$]: 756159
275 Exiting Ebwt::buildToDisk()
276 Returning from initFromVector
277 Wrote 4446745 bytes to primary EBWT file: BS_GA.1.bt2
278 Wrote 189044 bytes to secondary EBWT file: BS_GA.2.bt2
279 Re-opening _in1 and _in2 as input streams
280 Returning from Ebwt constructor
281 Headers:
282 len: 756159
283 bwtLen: 756160
284 sz: 189040
285 bwtSz: 189040
286 lineRate: 6
287 offRate: 4
288 offMask: 0xfffffff0
289 ftabChars: 10
290 eftabLen: 20
291 eftabSz: 80
292 ftabLen: 1048577
293 ftabSz: 4194308
294 offsLen: 47260
295 offsSz: 189040
296 lineSz: 64
297 sideSz: 64
298 sideBwtSz: 48
299 sideBwtLen: 192
300 numSides: 3939
301 numLines: 3939
302 ebwtTotLen: 252096
303 ebwtTotSz: 252096
304 color: 0
305 reverse: 0
306 Total time for call to driver() for forward index: 00:00:00
307 Reading reference sizes
308 Time reading reference sizes: 00:00:00
309 Calculating joined length
310 Writing header
311 Reserving space for joined string
312 Joining reference sequences
313 Time to join reference sequences: 00:00:00
314 Time to reverse reference sequence: 00:00:00
315 bmax according to bmaxDivN setting: 189039
316 Using parameters --bmax 141780 --dcv 1024
317 Doing ahead-of-time memory usage test
318 Passed! Constructing with these parameters: --bmax 141780 --dcv 1024
319 Constructing suffix-array element generator
320 Building DifferenceCoverSample
321 Building sPrime
322 Building sPrimeOrder
323 V-Sorting samples
324 V-Sorting samples time: 00:00:00
325 Allocating rank array
326 Ranking v-sort output
327 Ranking v-sort output time: 00:00:00
328 Invoking Larsson-Sadakane on ranks
329 Invoking Larsson-Sadakane on ranks time: 00:00:00
330 Sanity-checking and returning
331 Building samples
332 Reserving space for 12 sample suffixes
333 Generating random suffixes
334 QSorting 12 sample offsets, eliminating duplicates
335 QSorting sample offsets, eliminating duplicates time: 00:00:00
336 Multikey QSorting 12 samples
337 (Using difference cover)
338 Multikey QSorting samples time: 00:00:00
339 Calculating bucket sizes
340 Splitting and merging
341 Splitting and merging time: 00:00:00
342 Avg bucket size: 756159 (target: 141779)
343 Converting suffix-array elements to index image
344 Allocating ftab, absorbFtab
345 Entering Ebwt loop
346 Getting block 1 of 1
347 No samples; assembling all-inclusive block
348 Sorting block of length 756159 for bucket 1
349 (Using difference cover)
350 Sorting block time: 00:00:00
351 Returning block of 756160 for bucket 1
352 Exited Ebwt loop
353 fchr[A]: 0
354 fchr[C]: 386401
355 fchr[G]: 533276
356 fchr[T]: 533276
357 fchr[$]: 756159
358 Exiting Ebwt::buildToDisk()
359 Returning from initFromVector
360 Wrote 4446745 bytes to primary EBWT file: BS_GA.rev.1.bt2
361 Wrote 189044 bytes to secondary EBWT file: BS_GA.rev.2.bt2
362 Re-opening _in1 and _in2 as input streams
363 Returning from Ebwt constructor
364 Headers:
365 len: 756159
366 bwtLen: 756160
367 sz: 189040
368 bwtSz: 189040
369 lineRate: 6
370 offRate: 4
371 offMask: 0xfffffff0
372 ftabChars: 10
373 eftabLen: 20
374 eftabSz: 80
375 ftabLen: 1048577
376 ftabSz: 4194308
377 offsLen: 47260
378 offsSz: 189040
379 lineSz: 64
380 sideSz: 64
381 sideBwtSz: 48
382 sideBwtLen: 192
383 numSides: 3939
384 numLines: 3939
385 ebwtTotLen: 252096
386 ebwtTotSz: 252096
387 color: 0
388 reverse: 1
389 Total time for backward call to driver() for mirror index: 00:00:01
390 Running bismark with: 'bismark --bam --gzip --temp_dir /tmp/tmp86syD7 -o /tmp/tmp86syD7/results --quiet --fastq -L 20 -D 15 -R 2 --un --ambiguous /tmp/tmpAHSx4i -1 input1_fq_1.fq -2 input1_fq_2.fq -I 0 -X 500'
391 Bowtie 2 seems to be working fine (tested command 'bowtie2 --version' [2.3.5])
392 Output format is BAM (default)
393 Alignments will be written out in BAM format. Samtools found here: '/home/abretaud/miniconda3/envs/mulled-v1-9f2317dbfb405ed6926c55752e5c11678eee3256a6ea680d1c0f912251153030/bin/samtools'
394 Reference genome folder provided is /tmp/tmpAHSx4i/ (absolute path is '/tmp/tmpAHSx4i/)'
395 FastQ format specified
396
397 Input files to be analysed (in current folder '/tmp/tmpFC2FCZ/job_working_directory/000/4/working'):
398 input1_fq_1.fq
399 input1_fq_2.fq
400 Library is assumed to be strand-specific (directional), alignments to strands complementary to the original top or bottom strands will be ignored (i.e. not performed!)
401 Created output directory /tmp/tmp86syD7/results/!
402
403 Output will be written into the directory: /tmp/tmp86syD7/results/
404
405 Using temp directory: /tmp/tmp86syD7
406 Temporary files will be written into the directory: /tmp/tmp86syD7/
407 Setting parallelization to single-threaded (default)
408
409 Summary of all aligner options: -q -L 20 -D 15 -R 2 --score-min L,0,-0.2 --ignore-quals --no-mixed --no-discordant --dovetail --minins 0 --maxins 500 --quiet
410 Current working directory is: /tmp/tmpFC2FCZ/job_working_directory/000/4/working
411
412 Now reading in and storing sequence information of the genome specified in: /tmp/tmpAHSx4i/
413
414 chr chrY_JH584300_random (182347 bp)
415 chr chrY_JH584301_random (259875 bp)
416 chr chrY_JH584302_random (155838 bp)
417 chr chrY_JH584303_random (158099 bp)
418
419 Single-core mode: setting pid to 1
420
421 Paired-end alignments will be performed
422 =======================================
423
424 The provided filenames for paired-end alignments are input1_fq_1.fq and input1_fq_2.fq
425 Input files are in FastQ format
426 Writing a C -> T converted version of the input file input1_fq_1.fq to /tmp/tmp86syD7/input1_fq_1.fq_C_to_T.fastq.gz
427
428 Created C -> T converted version of the FastQ file input1_fq_1.fq (1000 sequences in total)
429
430 Writing a G -> A converted version of the input file input1_fq_2.fq to /tmp/tmp86syD7/input1_fq_2.fq_G_to_A.fastq.gz
431
432 Created G -> A converted version of the FastQ file input1_fq_2.fq (1000 sequences in total)
433
434 Input files are input1_fq_1.fq_C_to_T.fastq.gz and input1_fq_2.fq_G_to_A.fastq.gz (FastQ)
435 Now running 2 instances of Bowtie 2 against the bisulfite genome of /tmp/tmpAHSx4i/ with the specified options: -q -L 20 -D 15 -R 2 --score-min L,0,-0.2 --ignore-quals --no-mixed --no-discordant --dovetail --minins 0 --maxins 500 --quiet
436
437 Now starting a Bowtie 2 paired-end alignment for CTread1GAread2CTgenome (reading in sequences from /tmp/tmp86syD7/input1_fq_1.fq_C_to_T.fastq.gz and /tmp/tmp86syD7/input1_fq_2.fq_G_to_A.fastq.gz, with the options: -q -L 20 -D 15 -R 2 --score-min L,0,-0.2 --ignore-quals --no-mixed --no-discordant --dovetail --minins 0 --maxins 500 --quiet --norc))
438 Found first alignment:
439 1_1/1 77 * 0 0 * * 0 0 TTGTATATATTAGATAAATTAATTTTTTTTGTTTGTATGTTAAATTTTTTAATTAATTTATTAATATTTTGTGAATTTTTAGATA AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEAEEEEEE YT:Z:UP
440 1_1/2 141 * 0 0 * * 0 0 TTATATATATTAAATAAATTAATTTTTTTTATTTATATATTAAATTTTTTAATTAATTTATTAATATTTTATAAATTTTTAAATA AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEAEEEEEE YT:Z:UP
441 Now starting a Bowtie 2 paired-end alignment for CTread1GAread2GAgenome (reading in sequences from /tmp/tmp86syD7/input1_fq_1.fq_C_to_T.fastq.gz and /tmp/tmp86syD7/input1_fq_2.fq_G_to_A.fastq.gz, with the options: -q -L 20 -D 15 -R 2 --score-min L,0,-0.2 --ignore-quals --no-mixed --no-discordant --dovetail --minins 0 --maxins 500 --quiet --nofw))
442 Found first alignment:
443 1_1/1 77 * 0 0 * * 0 0 TTGTATATATTAGATAAATTAATTTTTTTTGTTTGTATGTTAAATTTTTTAATTAATTTATTAATATTTTGTGAATTTTTAGATA AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEAEEEEEE YT:Z:UP
444 1_1/2 141 * 0 0 * * 0 0 TTATATATATTAAATAAATTAATTTTTTTTATTTATATATTAAATTTTTTAATTAATTTATTAATATTTTATAAATTTTTAAATA AAAAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEAEEEEEE YT:Z:UP
445
446 >>> Writing bisulfite mapping results to input1_fq_1_bismark_bt2_pe.bam <<<
447
448 Unmapped sequences will be written to input1_fq_1.fq_unmapped_reads_1.fq.gz and input1_fq_2.fq_unmapped_reads_2.fq.gz
449 Ambiguously mapping sequences will be written to input1_fq_1.fq_ambiguous_reads_1.fq.gz and input1_fq_2.fq_ambiguous_reads_2.fq.gz
450
451 Reading in the sequence files input1_fq_1.fq and input1_fq_2.fq
452 Processed 1000 sequences in total
453
454
455 Successfully deleted the temporary files /tmp/tmp86syD7/input1_fq_1.fq_C_to_T.fastq.gz and /tmp/tmp86syD7/input1_fq_2.fq_G_to_A.fastq.gz
456
457 Final Alignment report
458 ======================
459 Sequence pairs analysed in total: 1000
460 Number of paired-end alignments with a unique best hit: 0
461 Mapping efficiency: 0.0%
462
463 Sequence pairs with no alignments under any condition: 1000
464 Sequence pairs did not map uniquely: 0
465 Sequence pairs which were discarded because genomic sequence could not be extracted: 0
466
467 Number of sequence pairs with unique best (first) alignment came from the bowtie output:
468 CT/GA/CT: 0 ((converted) top strand)
469 GA/CT/CT: 0 (complementary to (converted) top strand)
470 GA/CT/GA: 0 (complementary to (converted) bottom strand)
471 CT/GA/GA: 0 ((converted) bottom strand)
472
473 Number of alignments to (merely theoretical) complementary strands being rejected in total: 0
474
475 Final Cytosine Methylation Report
476 =================================
477 Total number of C's analysed: 0
478
479 Total methylated C's in CpG context: 0
480 Total methylated C's in CHG context: 0
481 Total methylated C's in CHH context: 0
482 Total methylated C's in Unknown context: 0
483
484 Total unmethylated C's in CpG context: 0
485 Total unmethylated C's in CHG context: 0
486 Total unmethylated C's in CHH context: 0
487 Total unmethylated C's in Unknown context: 0
488
489 Can't determine percentage of methylated Cs in CpG context if value was 0
490 Can't determine percentage of methylated Cs in CHG context if value was 0
491 Can't determine percentage of methylated Cs in CHH context if value was 0
492 Can't determine percentage of methylated Cs in unknown context (CN or CHN) if value was 0
493
494
495 Bismark completed in 0d 0h 0m 5s
496
497 ====================
498 Bismark run complete
499 ====================
500