comparison hicCompartmentsPolarization.xml @ 0:8d764e8acdf5 draft

"planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/hicexplorer commit fa19d3b6a9d0160a13f8d1e4a99f20c4dbe937b2-dirty"
author bgruening
date Mon, 16 Dec 2019 17:26:46 -0500
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:8d764e8acdf5
1 <tool id="hicexplorer_hiccompartmentspolarization" name="@BINARY@" version="@WRAPPER_VERSION@.0">
2 <description>compute pairwise correlations between multiple Hi-C contact matrices</description>
3 <macros>
4 <token name="@BINARY@">hicCompartmentsPolarization</token>
5 <import>macros.xml</import>
6 </macros>
7 <expand macro="requirements" />
8 <command detect_errors="exit_code"><![CDATA[
9
10 #for $counter, $m in enumerate($matrix_h5_cooler_multiple):
11 ln -s '$m' '${counter}_matrix.$m.ext' &&
12 #end for
13 #set $m = ' '.join([ '\'%s_matrix.%s\'' % ($counter, $matrix.ext) for $counter, $matrix in enumerate($matrix_h5_cooler_multiple) ])
14
15 @BINARY@
16
17 --obsexp_matrices $m
18
19 --pca '$pca'
20 ##--quantile #echo int($quantile) #
21
22 $outputMatrix
23 ##--outliers #echo float($outliers) #
24 --outputFileName plot.$image_file_format
25
26 && mv plot.$image_file_format plot
27
28
29
30 ]]>
31 </command>
32 <inputs>
33 <expand macro="matrix_h5_cooler_multiple_macro" />
34 <param name='pca' type="data" format="bedgraph" label="Eigenvector file" help='A PCA vector as a bedgraph file with
35 no header. In case of several matrices with
36 different conditions, ie. control
37 treatment, the PCA of control can be
38 used. Note that only one PCA can be provided.'/>
39 <!-- <param name='quantile' type="integer" label="Quantile" help='The number of to be used quantiles.' value="30"/>
40 <param name='outliers' type="float" label="Outliers" help='The number of outliers to be removed.' value="0.0"/> -->
41 <param name='outputMatrix' type='boolean' optional="true" truevalue='--outputMatrix output_matrix.npz' falsevalue='' label='Return computed matrix' help='Output .npz file includes all the generated matrices' />
42 <param name="image_file_format" type="select" label="Image output format">
43 <option value="png" selected="True">png</option>
44 <option value="svg">svg</option>
45 <option value="pdf">pdf</option>
46 </param>
47 </inputs>
48 <outputs>
49 <data format="png" name="outFileName" from_work_dir="plot" label="${tool.name} [${on_string}]: Plot">
50 <change_format>
51 <when input="image_file_format" value="svg" format="svg" />
52 <when input="image_file_format" value="pdf" format="pdf" />
53 </change_format>
54 </data>
55 <data name='output_matrix' from_work_dir='output_matrix.npz' format='hicexplorer_compartments_matrix' label='Output .npz file includes all the generated matrices'>
56 <filter>outputMatrix == '--outputMatrix output_matrix.npz'</filter>
57 </data>
58 </outputs>
59 <tests>
60 <test>
61 <param name="matrix_h5_cooler_multiple" value="hicTransform/obs_exp_norm.h5"/>
62 <param name="pca" value="hicCompartmentsPolarization/pca1.bedgraph"/>
63 <output name="outFileName" file="hicCompartmentsPolarization/compartmentsPolarizationRatio.png" ftype="png" compare="sim_size"/>
64 </test>
65 </tests>
66 <help><![CDATA[
67
68 Matrix correlation
69 ==================
70
71 **hicCorrelate** is a dedicated Quality Control tool that allows the correlation of multiple Hi-C matrices at once with either a heatmap or scatterplots output.
72
73 Computes pairwise correlations between Hi-C matrices data. The correlation is computed taking the values from each pair of matrices and discarding values that are zero in both matrices. Parameters that strongly affect correlations are bin size of the Hi-C matrices (can be changed using ``hicMergeMatrixBins``) and the considered range. The smaller the bin size of the matrices, the finer the differences you score. The *Range* parameter should be selected at a meaningful genomic scale according to, for example, the mean size of the TADs in the organism you work with or to specific ranges found using ``hicPlotDistVsCounts``.
74
75 _________________
76
77 Usage
78 -----
79
80 It is recommended to use this tool on corrected matrices (``hicCorrectMatrix``) at restriction enzyme resolution (unmerged bins).
81
82 _________________
83
84 Output
85 ------
86
87 **hicCorrelate** outputs correlation plots of multiple Hi-C matrices.
88
89 Below, you can find a correlation example of uncorrected Hi-C matrices obtained from *Drosophila melanogaster* embryos, either wild-type or having one gene knocked-down by RNAi.
90
91 Heatmap
92 _______
93
94 .. image:: $PATH_TO_IMAGES/hicCorrelate_Dmel_heatmap.png
95 :width: 45%
96
97 This example is showing a heatmap that was calculated using the Pearson correlation of corrected Hi-C matrices with a bin size of 6000 bp at a range of 5000 to 200000. The dendrogram indicates which samples are most similar to each other. You can see that the wild-type samples are seperated from the knock-down samples. In that case, Spearman correlation gives very similar results (not shown).
98
99 Scatterplot
100 ___________
101
102 .. image:: $PATH_TO_IMAGES/hicCorrelate_Dmel_scatterplot.png
103 :width: 45%
104
105 Additionally, pairwise scatterplots comparing interactions between each sample can be plotted.
106
107 _________________
108
109 For more information about HiCExplorer please consider our documentation on readthedocs.io_.
110
111 .. _readthedocs.io: http://hicexplorer.readthedocs.io/en/latest/index.html
112 .. _Colormap: https://matplotlib.org/examples/color/colormaps_reference.html
113
114 ]]></help>
115 <expand macro="citations" />
116 </tool>