3
|
1 <tool id="chip_seq_ratio_1" name="Chip-Seq Mapper" version="0.1.1">
|
|
2 <stdio>
|
|
3 <exit_code range="1:" level="fatal" description="Error"/>
|
|
4 </stdio>
|
|
5 <description></description>
|
|
6 <requirements>
|
|
7 <requirement type="package">r-base64enc</requirement>
|
|
8 <requirement type="package">r-r2html</requirement>
|
|
9 <requirement type="package">blast</requirement>
|
6
|
10 <!-- <requirement type="package">chip_seq_ration</requirement> -->
|
3
|
11 </requirements>
|
|
12 <command interpreter="python3">
|
|
13 ChipSeqRatioAnalysis.py
|
|
14 --ChipSeq=${ChipFile}
|
|
15 --InputSeq=${InputFile}
|
|
16 --Contigs=${ContigFile}
|
|
17 --output=${OutputFile}
|
|
18 --html=${ReportFile}
|
|
19 --max_cl=${MaxCl}
|
|
20 --bitscore=$bitscore
|
|
21 --nproc=16
|
|
22 </command>
|
|
23
|
|
24 <inputs>
|
9
|
25 <param name="ChipFile" label="Chip reads" type="data" format="fasta" help="Reads in FASTA format"/>
|
|
26 <param name="InputFile" label="Input reads" type="data" format="fasta" help="Reads in FASTA format"/>
|
|
27 <param name="ContigFile" label="Reference - contig sequences" type="data" format="fasta"
|
|
28 help="Contigs from RepeatExplorer clustering (the file "contigs.fasta")"/>
|
|
29 <param name="MaxCl" label="Number of top clusters to be shown in graph" type="integer" value="200"/>
|
|
30 <param name="bitscore" label="Bit score threshold" type="integer" value="50" help="Similarity hits with lower bit score will not be used for ChIP/Input ratio calculation"/>
|
3
|
31 </inputs>
|
|
32 <outputs>
|
7
|
33 <data name="OutputFile" format="tabular"
|
|
34 label="csv table from ChIP-Seq-Mapper on datasets ${InputFile.hid} (Input) ${ChipFile.hid} (ChIP) and ${ContigFile.hid} (reference)"/>
|
|
35
|
|
36 <data name="ReportFile" format="html"
|
|
37 label="HTML report from ChIP-Seq-Mapper on datasets ${InputFile.hid} (Input) ${ChipFile.hid} (ChIP) and ${ContigFile.hid} (reference)"/>
|
3
|
38 </outputs>
|
|
39
|
|
40 <help>
|
|
41 **What it does**
|
|
42
|
9
|
43 The ChIP-seq Mapper evaluates the enrichment of repetitive sequences in sequencing data from chromatin
|
|
44 immunoprecipitation experiments, using repeats identified by RepeatExplorer as the reference. The tool
|
|
45 performs BLASTN similarity search of the read sequences to the reference,
|
|
46 and the reads producing hits that passed the user-specified similarity threshold are assigned to the
|
|
47 repeat clusters. The assignment is made to the cluster that produced the best similarity hit, and every
|
|
48 read is assigned to only a single cluster. Following read mapping, the numbers of reads from the
|
|
49 INPUT and ChIP samples are evaluated, and ChIP/INPUT ratios of the normalized read counts are reported
|
|
50 for individual clusters.
|
|
51 ChIP and INPUT reads should be of uniform lengths of at least 40 nt. The bit score threshold value should be
|
|
52 adjusted based on the length of the analyzed reads (the value equal to the read length is recommended for a start).
|
3
|
53 This method was first used in (`Neumann et al. 2012`__) for
|
9
|
54 identification of repetitive sequences associated with centromeres:
|
3
|
55
|
|
56
|
|
57 `PLoS Genet. Epub 2012 Jun 21. Stretching the rules: monocentric chromosomes with multiple centromere domains. Neumann P, Navrátilová A, Schroeder-Reiter E, Koblížková A, Steinbauerová V, Chocholová E, Novák P, Wanner G, Macas J.`__.
|
|
58
|
|
59 .. __: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002777
|
|
60 .. __: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002777
|
|
61
|
|
62 </help>
|
|
63
|
|
64 </tool>
|
|
65
|
|
66
|