annotate README.md @ 8:032e930ef6a1 draft default tip
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/bigwig_outlier_bed commit 7eac2f224d3126002edd2c02d1133b23ac1a4881
author | fubar |
---|---|
date | Wed, 24 Jul 2024 09:19:08 +0000 |
parents | eb17eb8a3658 |

## bigwig peak bed maker

### July 30 2024 for the VGP

This is a Galaxy tool for building some of the [NIH MARBL T2T assembly polishing](https://github.com/marbl/training) tools as Galaxy workflows.

The JBrowse2 2.12.3 update will include a plugin for optional colours to distinguish bed features, shown being tested in the screenshots below.

### Find and mark BigWig peaks to a bed file for display

In the spirit of DeepTools, this tool finds contiguous regions where the bigwig value is either above or below a given quantile, 0.99 and 0.01 for example. These quantile cut points are found and applied separately over each chromosome using some [cunning numpy code](http://gregoryzynda.com/python/numpy/contiguous/interval/2019/11/29/contiguous-regions.html).
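
The idea of marking contiguous runs above or below per-contig cut points can be sketched with the standard library alone. This is not the tool's numpy code: the function names, the strict `>` and `<` comparisons, the linear-interpolation quantile and the `min_window` parameter are all assumptions made for illustration.

```python
from itertools import groupby


def quantile(sorted_values, q):
    """Linear-interpolation quantile of an already sorted, non-empty list."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)


def contiguous_regions(values, q_hi=0.99, q_lo=0.01, min_window=1):
    """Return (start, end, label) runs above q_hi or below q_lo for one contig.

    Coordinates are 0-based and half-open, as in BED.
    """
    ordered = sorted(values)
    cut_hi = quantile(ordered, q_hi)
    cut_lo = quantile(ordered, q_lo)

    def classify(v):
        # Assumption: strict comparisons against the cut points.
        if v > cut_hi:
            return "high"
        if v < cut_lo:
            return "low"
        return None

    regions = []
    start = 0
    for label, run in groupby(values, key=classify):
        length = sum(1 for _ in run)
        # Keep only outlier runs that are at least min_window positions long.
        if label is not None and length >= min_window:
            regions.append((start, start + length, label))
        start += length
    return regions


# Toy per-base signal for a single hypothetical contig.
signal = [5, 5, 5, 5, 5, 5, 9, 9, 9, 5, 5, 5, 0, 0, 5, 5, 5, 5, 5, 5]
print(contiguous_regions(signal, q_hi=0.8, q_lo=0.1, min_window=2))
# prints [(6, 9, 'high'), (12, 14, 'low')]
```

Raising `min_window` to 3 in this toy example drops the two-base low run and keeps only the high one, which is how a larger window requirement thins out the features.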

![image](https://github.com/fubar2/bigwig_peak_bed/assets/6016266/cdee3a2b-ae31-4282-b744-992c15fb49db)

![image](https://github.com/fubar2/bigwig_peak_bed/assets/6016266/59d1564b-0c34-42a3-b437-44332cf1b2f0)

There are big differences between chromosomes 14, 15, 21, 22 and Y in this "all contigs" view (explanations welcome):

![image](https://github.com/fubar2/bigwig_peak_bed/assets/6016266/162bf681-2977-4eb8-8d6f-9dad5b3931f8)

[pybigtools](https://github.com/jackh726/bigtools) is used for the bigwig interface. Optionally, multiple bigwigs can be processed into a single bed. The bed features carry the bigwig name in their label so they can be told apart when viewing.
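
As a rough illustration of how features from several bigwigs could share one bed file, the sketch below puts the bigwig name into the BED name column. The `bed_lines` helper, the `<bigwig name>_<kind>` label scheme and the sample names are hypothetical, not taken from the tool.

```python
def bed_lines(features_by_bigwig):
    """Merge per-bigwig features into sorted, tab-separated BED lines.

    features_by_bigwig maps a bigwig name to (contig, start, end, kind) tuples.
    The BED name column is "<bigwig name>_<kind>", a hypothetical label scheme.
    """
    rows = []
    for bw_name, features in features_by_bigwig.items():
        for contig, start, end, kind in features:
            rows.append((contig, start, end, f"{bw_name}_{kind}"))
    # Plain tuple sort: contig name (lexicographic), then start, then end.
    rows.sort()
    return ["\t".join(str(field) for field in row) for row in rows]


features = {
    "sampleA": [("chr1", 100, 250, "high"), ("chr2", 10, 40, "low")],
    "sampleB": [("chr1", 50, 90, "low")],
}
for line in bed_lines(features):
    print(line)
```

The lexicographic sort places `chr10` before `chr2`. A real implementation might instead keep the contig order reported by the bigwig header.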

### Note on quantiles per chromosome rather than quantiles for the whole bigwig

It is not feasible to hold every contig of a decoded bigwig in RAM to estimate quantiles for the whole file. Sampling across all chromosomes might be possible, and may be better, because per-chromosome cut points unfortunately hide any systematic differences between chromosomes. The actual quantile values from a couple of test bigwigs suggest that there is not much variation between chromosomes, and there is now a tabular report to check them for each input bigwig.
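
The sampling alternative mentioned above is not implemented in the tool as far as this README says. A minimal sketch of what it could look like follows, assuming a hypothetical `pooled_quantiles` helper that draws a fixed fraction of positions from each contig and estimates cut points from the pooled sample.

```python
import random


def pooled_quantiles(contig_values, fraction=0.01, quantiles=(0.01, 0.99), seed=42):
    """Estimate whole-file cut points from a random sample of every contig.

    contig_values is an iterable of (name, values) pairs, so only one decoded
    contig plus the much smaller pooled sample is held in memory at a time.
    """
    rng = random.Random(seed)
    pooled = []
    for _name, values in contig_values:
        if not values:
            continue
        k = max(1, int(len(values) * fraction))
        pooled.extend(rng.sample(values, k))
    pooled.sort()
    # Nearest-rank lookup: coarse, but adequate for a large sample.
    return [pooled[round(q * (len(pooled) - 1))] for q in quantiles]


def toy_contigs():
    """Yield two hypothetical contigs with systematically different depth."""
    yield "chrA", [10] * 5000
    yield "chrB", [30] * 5000


low_cut, high_cut = pooled_quantiles(toy_contigs(), fraction=0.1)
print(low_cut, high_cut)  # prints 10 30
```

With pooled cut points, the whole of `chrA` sits at the low cut and the whole of `chrB` at the high cut, so a systematic difference between contigs stays visible instead of being normalised away per contig.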

### Table reports

The optional table output report gives a crude histogram and counts for the 10 lowest and 10 highest values to help understand what is likely to be informative. In this example, there are 26700 zero values, so a low cutoff quantile is likely to capture a lot of them, although a large window requirement will decrease the overload.

    Descriptive measures
    bigwig test
    contig chr10_PATERNAL
    n 135711693
    mean 12.178164
    std 7.997467
    min 0.000000
    max 365.000000
    qtop 364.00
    qbot noqlo
    First/Last 10 value counts
    Value Count
    0.00 26700
    1.00 82900
    2.00 261400
    3.00 676993
    4.00 1665500
    5.00 3125700
    6.00 5078000
    7.00 7469000
    8.00 10191700
    9.00 12544600
    355.00 100
    356.00 100
    357.00 300
    358.00 100
    360.00 500
    361.00 300
    362.00 200
    363.00 600
    364.00 900
    365.00 700
    Histogram of bigwig values
    chr10_PATERNAL 18.25 | 127,047,593 | **************************************************************************
    chr10_PATERNAL 36.50 | 7,510,000 | ****
    chr10_PATERNAL 54.75 | 818,900 |
    chr10_PATERNAL 73.00 | 117,200 |
    chr10_PATERNAL 91.25 | 51,900 |
    chr10_PATERNAL 109.50 | 44,200 |
    chr10_PATERNAL 127.75 | 21,600 |
    chr10_PATERNAL 146.00 | 17,900 |
    chr10_PATERNAL 164.25 | 16,400 |
    chr10_PATERNAL 182.50 | 18,600 |
    chr10_PATERNAL 200.75 | 5,400 |
    chr10_PATERNAL 219.00 | 6,600 |
    chr10_PATERNAL 237.25 | 6,200 |
    chr10_PATERNAL 255.50 | 3,900 |
    chr10_PATERNAL 273.75 | 4,500 |
    chr10_PATERNAL 292.00 | 7,100 |
    chr10_PATERNAL 310.25 | 3,000 |
    chr10_PATERNAL 328.50 | 2,700 |
    chr10_PATERNAL 346.75 | 3,500 |
    chr10_PATERNAL 365.00 | 4,500 |
    chr10_PATERNAL ------------ |------------ |
    chr10_PATERNAL N= | 135,711,693 |
    chr10_PATERNAL ------------ |------------ |
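
A report of this shape (value counts at both ends of the range, plus an equal-width text histogram labelled by upper bin edge) can be approximated with the standard library. The sketch below is an assumption about the general approach, not the tool's own reporting code. The function names, bar width and toy data are made up for illustration.

```python
from collections import Counter


def value_counts(values, n=10):
    """Return the n smallest and n largest distinct values with their counts."""
    counts = sorted(Counter(values).items())
    if len(counts) <= 2 * n:
        return counts
    return counts[:n] + counts[-n:]


def text_histogram(values, bins=20, width=50):
    """Return lines of 'upper bin edge | count | bar' for equal-width bins."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / bins or 1  # avoid a zero step when all values are equal
    counts = [0] * bins
    for v in values:
        # The maximum value falls into the last bin rather than a new one.
        counts[min(int((v - lo) / step), bins - 1)] += 1
    biggest = max(counts)
    lines = []
    for i, count in enumerate(counts):
        bar = "*" * round(width * count / biggest)
        lines.append(f"{lo + (i + 1) * step:10.2f} | {count:>11,} | {bar}")
    return lines


# Toy depth values standing in for one decoded contig.
depths = [0] * 3 + [4] * 40 + [8] * 5 + [20] * 2
print(value_counts(depths, n=1))  # prints [(0, 3), (20, 2)]
for line in text_histogram(depths, bins=4, width=20):
    print(line)
```

As in the report above, bars are scaled to the fullest bin, so sparsely populated bins can round down to an empty bar even though their count is not zero.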