comparison test-data/output_file2.tabular @ 19:2e9f7ea7ae93 draft

planemo upload for repository https://github.com/monikaheinzl/duplexanalysis_galaxy/tree/master/tools/hd commit dfaab79252a858e8df16bbea3607ebf1b6962e5a-dirty
author mheinzl
date Mon, 08 Oct 2018 05:56:04 -0400
18:a8581bf627fd 19:2e9f7ea7ae93
1 Test_data2
2 number of tags per file 20 (from 20) against 20
3
4 Hamming distance separated by family size
5 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum
6 HD=1 2 0 0 0 1 0 3
7 HD=6 0 0 0 1 0 1 2
8 HD=7 2 0 1 1 2 1 7
9 HD=8 1 0 1 0 2 1 5
10 HD=9 1 0 0 0 0 1 2
11 HD=10 1 0 0 0 0 0 1
12 sum 7 0 2 2 5 4 20
13
14 Family size distribution separated by Hamming distance
15 HD=1 HD=2 HD=3 HD=4 HD=5-8 HD>8 sum
16 FS=1 2 0 0 0 3 2 7
17 FS=3 0 0 0 0 2 0 2
18 FS=4 0 0 0 0 2 0 2
19 FS=5 0 0 0 0 1 0 1
20 FS=6 0 0 0 0 1 0 1
21 FS=7 1 0 0 0 0 0 1
22 FS=8 0 0 0 0 1 0 1
23 FS=9 0 0 0 0 1 0 1
24 FS=12 0 0 0 0 2 0 2
25 FS=13 0 0 0 0 1 1 2
26 sum 3 0 0 0 14 3 20
27
28
29 max. family size: 13
30 absolute frequency: 2
31 relative frequency: 0.1
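The three summary values above can be reproduced from the "sum" column of the family size table. The sketch below assumes that "relative frequency" means the count of the largest family size divided by the total number of tags; the variable names are illustrative and not taken from the tool.

```python
from collections import Counter

# Family size -> number of tags, copied from the "sum" column of the
# "Family size distribution separated by Hamming distance" table.
fs_counts = Counter({1: 7, 3: 2, 4: 2, 5: 1, 6: 1, 7: 1, 8: 1, 9: 1, 12: 2, 13: 2})

max_fs = max(fs_counts)                          # largest family size observed
abs_freq = fs_counts[max_fs]                     # tags with that family size
rel_freq = abs_freq / sum(fs_counts.values())    # share of all 20 tags
```

With the counts above this gives a maximum family size of 13, an absolute frequency of 2, and a relative frequency of 2/20 = 0.1, matching the reported values.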
32
33 The Hamming distances were calculated by comparing each half of every tag against the tag(s) with the minimum Hamming distance for that half.
34 One tag can share its minimum HD with several other tags, so the sample size in this calculation differs from the sample size entered by the user.
35 actual number of tags with min HD = 79 (sample size entered by user = 20)
36 length of one part of the tag = 12
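As a rough illustration of the calculation described above, the following Python sketch splits a tag into two halves and, for each half, finds the minimum Hamming distance to the same half of all other tags. The function names (`hamming`, `min_hd_per_half`) and the toy tags are hypothetical and not taken from the tool's source; the actual tool may handle sampling and ties differently.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def min_hd_per_half(tag, others, half_len):
    """For each half of `tag`, return (min HD, tags that reach that minimum)."""
    result = {}
    halves = (("a", slice(0, half_len)), ("b", slice(half_len, 2 * half_len)))
    for name, part in halves:
        dists = [(hamming(tag[part], other[part]), other) for other in others]
        best = min(d for d, _ in dists)
        result[name] = (best, [other for d, other in dists if d == best])
    return result


# Toy tags with a half length of 4 (the data set above uses 12).
tags = ["AAAACCCC", "AAATCCGG", "AAATGGGG"]
res = min_hd_per_half(tags[0], tags[1:], half_len=4)
```

In this toy example, half a of the first tag reaches its minimum HD of 1 with two different tags, while half b reaches its minimum HD of 2 with only one. Ties of this kind are why the number of tags with a minimum HD (79) can exceed the sample size (20).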
37
38 Hamming distance of each half in the tag
39 HD a HD b' HD b HD a' HD a+b sum
40 HD=0 20 0 0 5 0 25
41 HD=1 22 4 4 3 8 41
42 HD=2 9 2 0 9 2 22
43 HD=3 0 0 0 10 0 10
44 HD=4 0 0 2 1 0 3
45 HD=5 0 0 5 0 0 5
46 HD=6 0 5 7 0 3 15
47 HD=7 0 7 10 0 10 27
48 HD=8 0 6 0 0 10 16
49 HD=9 0 7 0 0 17 24
50 HD=10 0 11 0 0 13 24
51 HD=11 0 8 0 0 7 15
52 HD=12 0 1 0 0 5 6
53 HD=13 0 0 0 0 4 4
54 sum 51 51 28 28 79 237
55
56 Absolute delta Hamming distances within the tag
57 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum
58 diff=1 5 0 0 1 5 0 11
59 diff=2 4 0 0 0 0 0 4
60 diff=3 1 0 2 1 1 0 5
61 diff=4 1 0 1 0 2 1 5
62 diff=5 2 0 0 0 4 6 12
63 diff=6 1 0 0 1 1 7 10
64 diff=7 2 0 1 0 0 0 3
65 diff=8 0 0 1 0 1 3 5
66 diff=9 6 0 0 1 3 4 14
67 diff=10 4 0 0 0 3 2 9
68 diff=11 0 0 0 0 0 1 1
69 sum 26 0 5 4 20 24 79
70
71 Chimera analysis: relative delta Hamming distances
72 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum
73 diff=0.1 1 0 0 1 1 0 3
74 diff=0.3 3 0 2 0 0 0 5
75 diff=0.4 1 0 0 1 3 0 5
76 diff=0.5 0 0 1 0 0 1 2
77 diff=0.6 1 0 0 0 3 7 11
78 diff=0.7 1 0 0 0 1 5 7
79 diff=0.8 10 0 0 0 2 9 21
80 diff=1.0 9 0 2 2 10 2 25
81 sum 26 0 5 4 20 24 79
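The two delta tables can be related with a small sketch. It assumes that the relative delta is the absolute difference of the two half distances divided by their sum, rounded to one decimal. The source does not state this definition; it is only consistent with the observation that the diff=1.0 row (25 tags) has the same counts as the sum row of the "Hamming distances of non-zero half" table. The function name `delta_hd` is hypothetical.

```python
def delta_hd(hd_a, hd_b):
    """Return (absolute delta, relative delta) for the HDs of the two tag halves."""
    absolute = abs(hd_a - hd_b)
    total = hd_a + hd_b
    # Assumption: relative delta = |a - b| / (a + b); left undefined if both are 0.
    relative = round(absolute / total, 1) if total else None
    return absolute, relative


examples = [delta_hd(0, 7), delta_hd(1, 9), delta_hd(3, 3)]
```

Under this assumption, a tag with one identical half (HD 0) always has a relative delta of 1.0, whatever the distance of the other half.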
82
83 Chimeras:
84 All tags were filtered: only tags where at least one half is identical to the corresponding half of the minimum-HD tag are kept.
85 The Hamming distance of the non-identical half is then compared.
86 Hamming distances of non-zero half
87 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum
88 HD=1 4 0 0 0 4 0 8
89 HD=2 2 0 0 0 0 0 2
90 HD=6 0 0 0 1 0 2 3
91 HD=7 1 0 1 0 0 0 2
92 HD=8 0 0 1 0 1 0 2
93 HD=9 1 0 0 1 2 0 4
94 HD=10 1 0 0 0 3 0 4
95 sum 9 0 2 2 10 2 25
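The chimera filter described above can be sketched as follows, assuming the per-half Hamming distances are already available as `(hd_a, hd_b)` pairs. The function name and the example pairs are hypothetical and not drawn from the test data.

```python
def non_zero_half_hds(pairs):
    """Keep pairs where at least one half has HD 0; return the other half's HD."""
    return [max(hd_a, hd_b) for hd_a, hd_b in pairs if hd_a == 0 or hd_b == 0]


pairs = [(0, 7), (1, 9), (6, 0), (2, 3)]
kept = non_zero_half_hds(pairs)
```

Here only the first and third pairs pass the filter, and the distances of their non-identical halves (7 and 6) are the values that would be tallied in a table like the one above.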
96
97