diff test-data/output_file.tabular @ 22:7e570ba56b83 draft

planemo upload for repository https://github.com/monikaheinzl/duplexanalysis_galaxy/tree/master/tools/hd commit b8a2f7b7615b2bcd3b602027af31f4e677da94f6-dirty
author mheinzl
date Wed, 27 Feb 2019 04:50:56 -0500
parents 2e9f7ea7ae93
children
--- a/test-data/output_file.tabular	Fri Dec 14 05:03:24 2018 -0500
+++ b/test-data/output_file.tabular	Wed Feb 27 04:50:56 2019 -0500
@@ -20,66 +20,67 @@
 sum	9	0	0	0	11	0	20	
 
 
-max. family size:	7
+max. family size in sample:	7
 absolute frequency:	1
 relative frequency:	0.05
 
 The hamming distances were calculated by comparing each half of all tags against the tag(s) with the minimum Hamming distance per half.
-It is possible that one tag can have the minimum HD from multiple tags, so the sample size in this calculation differs from the sample size entered by the user.
-actual number of tags with min HD = 171 (sample size by user = 20)
+Since this calculation was repeated, but starting with the second half to find all possible chimeras in the data, the actual number of tags in the plots differs from the sample size entered by the user.
+In addition, both family sizes of one tag will be included in the plots if only tags of reads that can form a DCS were allowed.
 length of one part of the tag = 12
 
 Hamming distance of each half in the tag
 	HD a	HD b'	HD b	HD a'	HD a+b	sum	
-HD=0	146	0	8	4	0	158	
-HD=1	0	2	2	21	11	36	
+HD=0	20	0	8	1	0	29	
+HD=1	0	0	1	19	8	28	
 HD=2	0	0	0	0	1	1	
-HD=5	0	0	4	0	0	4	
-HD=6	0	2	2	0	6	10	
-HD=7	0	16	9	0	21	46	
-HD=8	0	20	0	0	26	46	
-HD=9	0	50	0	0	50	100	
-HD=10	0	30	0	0	30	60	
-HD=11	0	18	0	0	18	36	
-HD=12	0	8	0	0	8	16	
-sum	146	146	25	25	171	513	
+HD=5	0	0	3	0	0	3	
+HD=6	0	0	2	0	3	5	
+HD=7	0	1	6	0	4	11	
+HD=8	0	2	0	0	7	9	
+HD=9	0	1	0	0	1	2	
+HD=10	0	2	0	0	2	4	
+HD=11	0	7	0	0	7	14	
+HD=12	0	7	0	0	7	14	
+sum	20	20	20	20	40	120	
 
 Absolute delta Hamming distances within the tag
 	FS=1	FS=2	FS=3	FS=4	FS=5-10	FS>10	sum	
 diff=0	1	0	0	0	0	0	1	
-diff=1	6	1	2	1	1	0	11	
-diff=4	4	0	0	0	0	0	4	
+diff=1	4	1	1	1	1	0	8	
+diff=4	3	0	0	0	0	0	3	
 diff=5	2	0	0	0	0	0	2	
-diff=6	6	0	0	1	1	0	8	
-diff=7	15	0	1	0	3	0	19	
-diff=8	15	2	0	1	2	0	20	
-diff=9	37	4	1	4	4	0	50	
-diff=10	22	2	1	4	1	0	30	
-diff=11	8	1	1	5	3	0	18	
-diff=12	6	1	0	1	0	0	8	
-sum	122	11	6	17	15	0	171	
+diff=6	3	0	0	1	1	0	5	
+diff=7	2	0	0	0	0	0	2	
+diff=8	1	0	0	0	1	0	2	
+diff=9	1	0	0	0	0	0	1	
+diff=10	2	0	0	0	0	0	2	
+diff=11	4	0	1	1	1	0	7	
+diff=12	5	1	0	1	0	0	7	
+sum	28	2	2	4	4	0	40	
 
 Chimera analysis: relative delta Hamming distances
 	FS=1	FS=2	FS=3	FS=4	FS=5-10	FS>10	sum	
 diff=0.0	1	0	0	0	0	0	1	
-diff=0.7	6	0	0	0	0	0	6	
-diff=0.8	4	0	0	1	1	0	6	
-diff=1.0	111	11	6	16	14	0	158	
-sum	122	11	6	17	15	0	171	
+diff=0.7	5	0	0	0	0	0	5	
+diff=0.8	3	0	0	1	1	0	5	
+diff=1.0	19	2	2	3	3	0	29	
+sum	28	2	2	4	4	0	40	
 
 Chimeras:
 All tags were filtered: only those tags where at least one half is identical with the half of the min. tag are kept.
-So the hamming distance of the non-identical half is compared.
+So the Hamming distance of the non-identical half is shown.
+Be aware that the real number of chimeric tags (where rel. diff = 1) is not shown in the plot because of the above reasons.
+real number of chimeric tags	20	1.0
 Hamming distances of non-zero half
 	FS=1	FS=2	FS=3	FS=4	FS=5-10	FS>10	sum	
-HD=1	6	1	2	1	1	0	11	
-HD=6	2	0	0	0	0	0	2	
-HD=7	15	0	1	0	3	0	19	
-HD=8	15	2	0	1	2	0	20	
-HD=9	37	4	1	4	4	0	50	
-HD=10	22	2	1	4	1	0	30	
-HD=11	8	1	1	5	3	0	18	
-HD=12	6	1	0	1	0	0	8	
-sum	111	11	6	16	14	0	158	
+HD=1	4	1	1	1	1	0	8	
+HD=7	2	0	0	0	0	0	2	
+HD=8	1	0	0	0	1	0	2	
+HD=9	1	0	0	0	0	0	1	
+HD=10	2	0	0	0	0	0	2	
+HD=11	4	0	1	1	1	0	7	
+HD=12	5	1	0	1	0	0	7	
+sum	19	2	2	3	3	0	29
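
Note on the tables above: a minimal Python sketch of how the per-half Hamming distances and the absolute/relative delta values could be derived. This is an illustration, not the tool's actual code. The function names are hypothetical, and the relative delta formula (absolute delta divided by the sum of both half distances) is an assumption inferred from the table values, e.g. halves with HD 0 and 1 giving rel. diff = 1.0. The half length of 12 is taken from "length of one part of the tag = 12".

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def half_distances(tag, other, half_len=12):
    """Hamming distance of the first and second half of two tags (hypothetical helper)."""
    hd_first = hamming(tag[:half_len], other[:half_len])
    hd_second = hamming(tag[half_len:], other[half_len:])
    return hd_first, hd_second


def delta_hd(hd_first, hd_second):
    """Absolute and relative delta of two half distances.

    Assumption: relative delta = |HD1 - HD2| / (HD1 + HD2),
    and 0.0 when both halves are identical (sum of 0).
    """
    abs_delta = abs(hd_first - hd_second)
    total = hd_first + hd_second
    rel_delta = abs_delta / total if total else 0.0
    return abs_delta, rel_delta
```

Under this assumption, a tag pair with one identical half (HD = 0) and one non-identical half always yields a relative delta of 1.0, which is the case counted as chimeric in the output.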