comparison hicPCA.xml @ 4:38a612a120a8 draft

planemo upload for repository https://github.com/maxplanck-ie/HiCExplorer/tree/master/galaxy/wrapper/ commit eec0a4d5a7c5ba4ec0fbd2ead8280c3d143bb9d8
author iuc
date Fri, 27 Apr 2018 03:31:20 -0400
parents ea50f7bba657
children dbf4afa03b00
3:ea50f7bba657 4:38a612a120a8
1 <tool id="hicexplorer_hicpca" name="@BINARY@" version="@WRAPPER_VERSION@.0"> 1 <tool id="hicexplorer_hicpca" name="@BINARY@" version="@WRAPPER_VERSION@.0">
2 <description>computes the principal components for A / B compartment analysis</description> 2 <description>compute the principal components for A / B compartment analysis</description>
3 <macros> 3 <macros>
4 <token name="@BINARY@">hicPCA</token> 4 <token name="@BINARY@">hicPCA</token>
5 <import>macros.xml</import> 5 <import>macros.xml</import>
6 </macros> 6 </macros>
7 <expand macro="requirements" /> 7 <expand macro="requirements" />
8 <command detect_errors="exit_code"><![CDATA[ 8 <command detect_errors="exit_code"><![CDATA[
9 9
10 hicPCA --matrix '$matrix_h5_cooler' 10 hicPCA --matrix '$matrix_h5_cooler'
11 11
12 --outputFileName pca1.$outputFormat pca2.$outputFormat 12 --outputFileName pca1.$outputFormat pca2.$outputFormat
13 --format $outputFormat 13 --format $outputFormat
14 14
15 && mv pca1.$outputFormat pca1 15 && mv pca1.$outputFormat pca1
16 && mv pca2.$outputFormat pca2 16 && mv pca2.$outputFormat pca2
17 17
18 ]]> 18 ]]>
19 </command> 19 </command>
24 <option value="bedgraph">bedgraph</option> 24 <option value="bedgraph">bedgraph</option>
25 </param> 25 </param>
26 26
27 </inputs> 27 </inputs>
28 <outputs> 28 <outputs>
29 29
30 <data name="pca1" from_work_dir="pca1" format="bigwig"> 30 <data name="pca1" from_work_dir="pca1" format="bigwig" label="${tool.name} on ${matrix_h5_cooler.name} [${on_string}]: PC1">
31 <filter>outputFormat == 'bigwig'</filter> 31 <filter>outputFormat == 'bigwig'</filter>
32 <change_format> 32 <change_format>
33 <when input="outputFormat" value="bedgraph" format="bedgraph" /> 33 <when input="outputFormat" value="bedgraph" format="bedgraph" />
34 </change_format> 34 </change_format>
35 </data> 35 </data>
36 <data name="pca2" from_work_dir="pca2" format="bigwig"> 36 <data name="pca2" from_work_dir="pca2" format="bigwig" label="${tool.name} on ${matrix_h5_cooler.name} [${on_string}]: PC2">
37 <change_format> 37 <change_format>
38 <when input="outputFormat" value="bedgraph" format="bedgraph" /> 38 <when input="outputFormat" value="bedgraph" format="bedgraph" />
39 </change_format> 39 </change_format>
40 </data> 40 </data>
41 41
42 42
43 </outputs> 43 </outputs>
44 <tests> 44 <tests>
45 <test> 45 <test>
46 <param name="matrix_h5_cooler" value="small_test_matrix_50kb_res.h5"/> 46 <param name="matrix_h5_cooler" value="small_test_matrix_50kb_res.h5"/>
47 <param name="outputFormat" value="bedgraph" /> 47 <param name="outputFormat" value="bedgraph" />
57 </test> 57 </test>
58 </tests> 58 </tests>
59 <help><![CDATA[ 59 <help><![CDATA[
60 Principal component analysis 60 Principal component analysis
61 ============================ 61 ============================
62 `hicPCA` computes two eigenvector files based on the input matrix for an A / B compartment analysis.
63 62
64 Input 63 `Lieberman-Aiden et al. (2009)`_ demonstrated that open and closed chromatin domains throughout the genome occupy different spatial compartments in the nucleus, defined as A (active) and B (inactive) compartments.
64
65 **hicPCA** computes two eigenvector files based on the input matrix for an A / B compartment analysis, following the computation steps detailed by `Lieberman-Aiden et al. (2009)`_: the contact matrix is transformed
66 into an observed vs. expected matrix and subsequently into a Pearson correlation matrix, which shows a plaid pattern. The two components of this plaid pattern are called A and B. Applying a PCA to the Pearson correlation matrix gives the eigenvectors,
67 and Lieberman-Aiden et al. showed that the values of the eigenvectors correlate with the distribution of genes and with features of open and closed chromatin. In some cases the first principal component corresponds to the two
68 chromosome arms and the second to the plaid pattern. Therefore, the first two principal components are always returned and both should be inspected.
69
70 _________________
71
72 Usage
65 ----- 73 -----
66 - the matrix to be analysed
67 74
68 Parameters 75 This tool must be used on Hi-C contact matrices with large bins (over 20kb), obtained with ``hicMergeMatrixBins`` and corrected with ``hicCorrectMatrix``. Matrices with too high a resolution (small bins or restriction enzyme resolution) might take several days to run (even with over 100 CPUs) or fail due to memory limitations.
69 __________ 76
70 - Output file format: bigwig or bedgraph 77 _________________
71 78
72 Output 79 Output
73 ------ 80 ------
74 Two files with the first and the second eigenvector. 81 Two files are output by **hicPCA**, one with the first (pca1) and one with the second (pca2) eigenvector, as bigwig or bedgraph. These files can be plotted alongside Hi-C heatmaps, gene density or external datasets such as open chromatin or histone mark enrichment using ``hicPlotTADs`` or ``hicPlotMatrix``.
82
83 For example, below you can find a ``hicPlotMatrix`` plot of the Pearson correlation matrix derived from a contact matrix for chromosome 6 in mouse, computed with ``hicTransform`` (a step of the A/B compartment computation). The optional data track at the bottom shows the first eigenvector for the A/B compartments obtained using **hicPCA**.
84
85 .. image:: $PATH_TO_IMAGES/hicPCA.png
86 :width: 60 %
87
88 _________________
75 89
76 | For more information about HiCExplorer please consider our documentation on readthedocs.io_ 90 | For more information about HiCExplorer please consider our documentation on readthedocs.io_
77 91
78 .. _readthedocs.io: http://hicexplorer.readthedocs.io/en/latest/index.html 92 .. _readthedocs.io: http://hicexplorer.readthedocs.io/en/latest/index.html
93 .. _`Lieberman-Aiden et al. (2009)`: https://dx.doi.org/10.1126%2Fscience.1181369
79 ]]></help> 94 ]]></help>
80 <expand macro="citations" /> 95 <expand macro="citations" />
81 </tool> 96 </tool>
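
The help text added in this revision describes the A/B compartment computation only in words: contact matrix, then observed vs. expected matrix, then Pearson correlation matrix, then PCA. Those steps might be sketched in plain Python roughly as follows. This is an illustration under stated assumptions, not HiCExplorer's code: every function name is hypothetical, the expected counts are assumed to be the mean of each diagonal, the PCA is taken as the leading eigenvector of the covariance of the Pearson matrix, and power iteration stands in for a full eigendecomposition, so only the first eigenvector is returned.

```python
import math


def observed_over_expected(counts):
    """Divide each entry by the mean of its diagonal (assumed 'expected' model)."""
    n = len(counts)
    expected = []
    for d in range(n):
        diagonal = [counts[i][i + d] for i in range(n - d)]
        expected.append(sum(diagonal) / len(diagonal))
    return [
        [counts[i][j] / expected[abs(i - j)] if expected[abs(i - j)] else 0.0
         for j in range(n)]
        for i in range(n)
    ]


def pearson_matrix(matrix):
    """Pearson correlation between every pair of rows (0.0 for constant rows)."""
    centered = []
    for row in matrix:
        mean = sum(row) / len(row)
        centered.append([value - mean for value in row])
    norms = [math.sqrt(sum(v * v for v in row)) for row in centered]
    n = len(matrix)
    corr = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if norms[i] and norms[j]:
                dot = sum(a * b for a, b in zip(centered[i], centered[j]))
                corr[i][j] = dot / (norms[i] * norms[j])
    return corr


def covariance_matrix(matrix):
    """Sample covariance between the columns of a square matrix (n >= 2)."""
    n = len(matrix)
    means = [sum(matrix[i][j] for i in range(n)) / n for j in range(n)]
    centered = [[matrix[i][j] - means[j] for j in range(n)] for i in range(n)]
    return [
        [sum(centered[i][a] * centered[i][b] for i in range(n)) / (n - 1)
         for b in range(n)]
        for a in range(n)
    ]


def first_eigenvector(matrix, iterations=200):
    """Power iteration: approximates the eigenvector of the largest eigenvalue."""
    n = len(matrix)
    vec = [1.0 + i / n for i in range(n)]  # deterministic, non-uniform start
    for _ in range(iterations):
        nxt = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(v * v for v in nxt))
        if norm == 0.0:
            break  # degenerate input: keep the previous vector
        vec = [v / norm for v in nxt]
    return vec


def compartment_eigenvector(counts):
    """Chain the steps: O/E -> Pearson -> covariance -> first eigenvector."""
    pearson = pearson_matrix(observed_over_expected(counts))
    return first_eigenvector(covariance_matrix(pearson))
```

In this sketch the sign of each entry of the returned vector would assign the corresponding bin to one of the two compartments. As the help text notes, the first component can instead reflect the chromosome arms, which is why the tool writes both pca1 and pca2.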