Mercurial > repos > petr-novak > re_utils
view ChipSeqRatioDef.xml @ 9:c2c69c6090f0 draft
Uploaded
author | petr-novak |
---|---|
date | Fri, 31 Jan 2020 06:55:23 -0500 |
parents | 89c5ba120b21 |
children | 5376e1c9adec |
line wrap: on
line source
<tool id="chip_seq_ratio_1" name="Chip-Seq Mapper" version="0.1.1"> <stdio> <exit_code range="1:" level="fatal" description="Error"/> </stdio> <description></description> <requirements> <requirement type="package">r-base64enc</requirement> <requirement type="package">r-r2html</requirement> <requirement type="package">blast</requirement> <!-- <requirement type="package">chip_seq_ration</requirement> --> </requirements> <command interpreter="python3"> ChipSeqRatioAnalysis.py --ChipSeq=${ChipFile} --InputSeq=${InputFile} --Contigs=${ContigFile} --output=${OutputFile} --html=${ReportFile} --max_cl=${MaxCl} --bitscore=$bitscore --nproc=16 </command> <inputs> <param name="ChipFile" label="Chip reads" type="data" format="fasta" help="Reads in FASTA format"/> <param name="InputFile" label="Input reads" type="data" format="fasta" help="Reads in FASTA format"/> <param name="ContigFile" label="Reference - contig sequences" type="data" format="fasta" help="Contigs from RepeatExplorer clustering (the file "contigs.fasta")"/> <param name="MaxCl" label="Number of top clusters to be shown in graph" type="integer" value="200"/> <param name="bitscore" label="Bit score threshold" type="integer" value="50" help="Similarity hits with lower bit score will not be used for ChIP/Input ratio calculation"/> </inputs> <outputs> <data name="OutputFile" format="tabular" label="csv table from ChIP-Seq-Mapper on datasets ${InputFile.hid} (Input) ${ChipFile.hid} (ChIP) and ${ContigFile.hid} (reference)"/> <data name="ReportFile" format="html" label="HTML report from ChIP-Seq-Mapper on datasets ${InputFile.hid} (Input) ${ChipFile.hid} (ChIP) and ${ContigFile.hid} (reference)"/> </outputs> <help> **What it does** The ChIP-seq Mapper evaluates the enrichment of repetitive sequences in sequencing data from chromatin immunoprecipitation experiments, using repeats identified by RepeatExplorer as the reference. The tool performs BLASTN similarity search of the read sequences to the reference, and the reads producing hits that passed the user-specified similarity threshold are assigned to the repeat clusters. The assignment is made to the cluster that produced the best similarity hit, and every read is assigned to only a single cluster. Following read mapping, the numbers of reads from the INPUT and ChIP samples are evaluated, and ChIP/INPUT ratios of the normalized read counts are reported for individual clusters. ChIP and INPUT reads should be of uniform lengths of at least 40 nt. The bit score threshold value should be adjusted based on the length of the analyzed reads (the value equal to the read length is recommended for a start). This method was first used in (`Neumann et al. 2012`__) for identification of repetitive sequences associated with centromeres: `PLoS Genet. Epub 2012 Jun 21. Stretching the rules: monocentric chromosomes with multiple centromere domains. Neumann P, Navrátilová A, Schroeder-Reiter E, Koblížková A, Steinbauerová V, Chocholová E, Novák P, Wanner G, Macas J.`__. .. __: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002777 .. __: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002777 </help> </tool>