comparison test-data/hd_output.tab @ 29:6b15b3b6405c draft
planemo upload for repository https://github.com/monikaheinzl/duplexanalysis_galaxy/tree/master/tools/hd commit 5b3ab8c6467fe3a52e89f5a7d175bd8a0189018a-dirty
author | mheinzl |
---|---|
date | Wed, 24 Jul 2019 05:58:15 -0400 |
parents | 9e384b0741f1 |
children |
28:1fa7342a140d | 29:6b15b3b6405c |
---|---|
1 hd_data.tab | 1 hd_data.tab |
2 number of tags per file 20 (from 20) against 20 | 2 nr of tags 20 |
3 sample size 20 | |
3 | 4 |
4 Hamming distance separated by family size | 5 Hamming distance separated by family size |
5 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum | 6 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum |
6 HD=1 5 1 1 1 1 0 9 | 7 HD=1 5 1 1 1 1 0 9 |
7 HD=6 3 0 0 0 0 0 3 | 8 HD=6 3 0 0 0 0 0 3 |
27 The Hamming distances were calculated by comparing the first halve against all halves and selected the minimum value (HD a). | 28 The Hamming distances were calculated by comparing the first halve against all halves and selected the minimum value (HD a). |
28 For the second half of the tag, we compared them against all tags which resulted in the minimum HD of the previous step and selected the maximum value (HD b'). | 29 For the second half of the tag, we compared them against all tags which resulted in the minimum HD of the previous step and selected the maximum value (HD b'). |
29 Finally, it was possible to calculate the absolute and relative differences between the HDs (absolute and relative delta HD). | 30 Finally, it was possible to calculate the absolute and relative differences between the HDs (absolute and relative delta HD). |
30 These calculations were repeated, but starting with the second half in the first step to find all possible chimeras in the data (HD b and HD For simplicity we used the maximum value between the delta values in the end. | 31 These calculations were repeated, but starting with the second half in the first step to find all possible chimeras in the data (HD b and HD For simplicity we used the maximum value between the delta values in the end. |
31 When only tags that can form DCS were allowed in the analysis, family sizes for the forward and reverse (ab and ba) will be included in the plots. | 32 When only tags that can form DCS were allowed in the analysis, family sizes for the forward and reverse (ab and ba) will be included in the plots. |
32 length of one part of the tag = 12 | 33 |
34 length of one half of the tag 12 | |
33 | 35 |
34 Hamming distance of each half in the tag | 36 Hamming distance of each half in the tag |
35 HD a HD b' HD b HD a' HD a+b sum | 37 HD DCS HD b' HD b HD a' HD a+b', a'+b sum |
36 HD=0 20 0 8 1 0 29 | 38 HD=0 20 0 8 1 0 29 |
37 HD=1 0 0 1 19 8 28 | 39 HD=1 0 0 1 19 8 28 |
38 HD=2 0 0 0 0 1 1 | 40 HD=2 0 0 0 0 1 1 |
39 HD=5 0 0 3 0 0 3 | 41 HD=5 0 0 3 0 0 3 |
40 HD=6 0 0 2 0 3 5 | 42 HD=6 0 0 2 0 3 5 |
44 HD=10 0 2 0 0 2 4 | 46 HD=10 0 2 0 0 2 4 |
45 HD=11 0 7 0 0 7 14 | 47 HD=11 0 7 0 0 7 14 |
46 HD=12 0 7 0 0 7 14 | 48 HD=12 0 7 0 0 7 14 |
47 sum 20 20 20 20 40 120 | 49 sum 20 20 20 20 40 120 |
48 | 50 |
49 Absolute delta Hamming distances within the tag | 51 Absolute delta Hamming distance within the tag |
50 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum | 52 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum |
51 diff=7 1 0 0 0 0 0 1 | 53 diff=7 1 0 0 0 0 0 1 |
52 diff=8 1 0 0 0 1 0 2 | 54 diff=8 1 0 0 0 1 0 2 |
53 diff=9 1 0 0 0 0 0 1 | 55 diff=9 1 0 0 0 0 0 1 |
54 diff=10 2 0 0 0 0 0 2 | 56 diff=10 2 0 0 0 0 0 2 |
55 diff=11 4 0 1 1 1 0 7 | 57 diff=11 4 0 1 1 1 0 7 |
56 diff=12 5 1 0 1 0 0 7 | 58 diff=12 5 1 0 1 0 0 7 |
57 sum 14 1 1 2 2 0 20 | 59 sum 14 1 1 2 2 0 20 |
58 | 60 |
59 Chimera analysis: relative delta Hamming distances | 61 Chimera analysis: relative delta Hamming distance |
60 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum | 62 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum |
61 diff=1.0 14 1 1 2 2 0 20 | 63 diff=1.0 14 1 1 2 2 0 20 |
62 sum 14 1 1 2 2 0 20 | 64 sum 14 1 1 2 2 0 20 |
63 | 65 |
64 Chimeras: | |
65 All tags were filtered: only those tags where at least one half was identical (HD=0) and therefore, had a relative delta of 1 were kept. These tags are considered as chimeric. | 66 All tags were filtered: only those tags where at least one half was identical (HD=0) and therefore, had a relative delta of 1 were kept. These tags are considered as chimeric. |
66 So the Hamming distances of the chimeric tags are shown. | 67 So the Hamming distances of the chimeric tags are shown. |
67 Hamming distances of chimeras | 68 Hamming distance of chimeric families separated after FS |
68 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum | 69 FS=1 FS=2 FS=3 FS=4 FS=5-10 FS>10 sum |
69 HD=7 1 0 0 0 0 0 1 | 70 HD=7 1 0 0 0 0 0 1 |
70 HD=8 1 0 0 0 1 0 2 | 71 HD=8 1 0 0 0 1 0 2 |
71 HD=9 1 0 0 0 0 0 1 | 72 HD=9 1 0 0 0 0 0 1 |
72 HD=10 2 0 0 0 0 0 2 | 73 HD=10 2 0 0 0 0 0 2 |
73 HD=11 4 0 1 1 1 0 7 | 74 HD=11 4 0 1 1 1 0 7 |
74 HD=12 5 1 0 1 0 0 7 | 75 HD=12 5 1 0 1 0 0 7 |
75 sum 14 1 1 2 2 0 20 | 76 sum 14 1 1 2 2 0 20 |
76 | 77 |
78 Hamming distance of chimeric families separated after DCS and single SSCS | |
79 DCS SSCS ab SSCS ba sum | |
80 HD=7.0 0 0 1 1 | |
81 HD=8.0 0 1 1 2 | |
82 HD=9.0 0 1 0 1 | |
83 HD=10.0 0 1 1 2 | |
84 HD=11.0 0 3 4 7 | |
85 HD=12.0 0 2 5 7 | |
86 sum 0 8 12 20 | |
77 | 87 |
88 |
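
The explanatory text embedded in hd_output.tab describes a two-pass procedure: take the minimum Hamming distance of one half of a tag against the other tags (HD a), take the maximum distance of the remaining half among the tags that reached that minimum (HD b'), derive the absolute and relative delta, repeat starting from the second half, and keep the larger delta values. The following is a minimal Python sketch of that description, not the tool's actual code. The names `hamming`, `one_pass`, `chimera_stats` and `half_len` are hypothetical. It also assumes that each half is compared against the corresponding half of every other tag, and that the relative delta is the absolute delta divided by the sum of the two distances, which is consistent with the statement that a tag with one identical half (HD=0) has a relative delta of 1.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def one_pass(first, second, others):
    """One direction of the analysis.

    `others` is a list of (first_half, second_half) pairs of the other tags.
    Returns (min HD of the first half, max HD of the second half among the
    tags that reached that minimum, absolute delta, relative delta).
    """
    dists = [hamming(first, o_first) for o_first, _ in others]
    hd_min = min(dists)
    # Only tags that produced the minimum in the previous step are compared.
    partners = [o for o, d in zip(others, dists) if d == hd_min]
    hd_max = max(hamming(second, o_second) for _, o_second in partners)
    abs_delta = abs(hd_min - hd_max)
    total = hd_min + hd_max
    # Assumption: relative delta = absolute delta / (sum of both distances).
    rel_delta = abs_delta / total if total else 0.0
    return hd_min, hd_max, abs_delta, rel_delta


def chimera_stats(tag, other_tags, half_len):
    """Run both directions and keep the larger delta values."""
    a, b = tag[:half_len], tag[half_len:]
    split = [(t[:half_len], t[half_len:]) for t in other_tags]
    # Pass 1 starts with the first half (HD a, HD b').
    hd_a, hd_b_prime, abs1, rel1 = one_pass(a, b, split)
    # Pass 2 starts with the second half (HD b, HD a').
    swapped = [(s, f) for f, s in split]
    hd_b, hd_a_prime, abs2, rel2 = one_pass(b, a, swapped)
    rel = max(rel1, rel2)
    return {
        "hd_a": hd_a, "hd_b_prime": hd_b_prime,
        "hd_b": hd_b, "hd_a_prime": hd_a_prime,
        "abs_delta": max(abs1, abs2),
        "rel_delta": rel,
        # Tags with a relative delta of 1 are considered chimeric.
        "is_chimera": rel == 1.0,
    }


stats = chimera_stats("AAACCC", ["AAAGGG", "TTTCCC", "AATCCG"], half_len=3)
print(stats["abs_delta"], stats["rel_delta"], stats["is_chimera"])  # 3 1.0 True
```

The sketch needs at least one other tag to compare against. In the test data above the half length is 12, and all 20 tags end up with a relative delta of 1.0.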