annotate rseqc/read_NVC.xml @ 27:5dbd20d3d623

Uploaded
author nilesh
date Thu, 11 Jul 2013 12:28:04 -0400
<tool id="read_NVC" name="Read NVC">
    <description>to check the nucleotide composition bias</description>
    <command interpreter="python"> /home/nilesh/RSeQC-2.3.3/scripts/read_NVC.py -i $input -o output

        #if $nx
            -x
        #end if
    </command>
    <inputs>
        <param name="input" type="data" format="bam,sam" label="input bam/sam file" />
        <param name="nx" type="boolean" label="Include N,X in NVC plot" value="false" />
    </inputs>
    <outputs>
        <data format="xls" name="outputxls" from_work_dir="output.NVC.xls" />
        <data format="r" name="outputr" from_work_dir="output.NVC_plot.r" />
        <data format="pdf" name="outputpdf" from_work_dir="output.NVC_plot.pdf" />
    </outputs>
    <help>
.. image:: https://code.google.com/p/rseqc/logo?cct=1336721062

-----

About RSeQC
+++++++++++

The RSeQC package provides a number of useful modules that can comprehensively evaluate high-throughput sequence data, especially RNA-seq data. “Basic modules” quickly inspect sequence quality, nucleotide composition bias, PCR bias and GC bias, while “RNA-seq specific modules” investigate the sequencing saturation status of both splicing junction detection and expression estimation, the mapped reads clipping profile, mapped reads distribution, coverage uniformity over the gene body, reproducibility, strand specificity and splice junction annotation.

The RSeQC package is licensed under the GNU GPL v3 license.

Inputs
++++++

Input BAM/SAM file
    Alignment file in BAM/SAM format.

Include N,X in NVC plot
    Plots N and X alongside A, T, C, and G.

Output
++++++

This module checks for nucleotide composition bias. Because of random priming, certain patterns are overrepresented at the beginning (5' end) of reads. This bias is easy to examine with an NVC (nucleotide versus cycle) plot, which is generated by overlaying all reads and calculating the nucleotide composition at each read position (that is, each sequencing cycle). Under ideal conditions (the genome is random and RNA-seq reads are randomly sampled from it), we expect A% = C% = G% = T% = 25% at each read position.

1. output.NVC.xls: plain text file; each row is a read position (sequencing cycle) and each column is a nucleotide (A, C, G, T, N, X).
2. output.NVC_plot.r: R script that generates the NVC plot.
3. output.NVC_plot.pdf: the NVC plot.

.. image:: http://dldcc-web.brc.bcm.edu/lilab/liguow/RSeQC/figure/NVC_plot.png

    </help>
</tool>
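
The help text in this wrapper describes the NVC calculation as overlaying all reads and computing the nucleotide composition at each read position. The sketch below illustrates that idea on a toy list of read sequences. It is not the code in `read_NVC.py`: the function name `nucleotide_vs_cycle` and the sample reads are hypothetical, and reading BAM/SAM input and strand handling are deliberately left out.

```python
from collections import Counter


def nucleotide_vs_cycle(reads, alphabet="ACGTNX"):
    """Return one dict per read position mapping each base to its fraction.

    Hypothetical helper for illustration only; it takes plain sequence
    strings rather than alignment records.
    """
    counts = []  # counts[i] tallies the bases seen at position i across all reads
    for read in reads:
        for i, base in enumerate(read.upper()):
            if i == len(counts):
                counts.append(Counter())  # first read long enough to reach position i
            counts[i][base] += 1

    table = []
    for position_counts in counts:
        # Only bases in the alphabet contribute to the denominator.
        total = sum(position_counts[b] for b in alphabet)
        table.append(
            {b: (position_counts[b] / total if total else 0.0) for b in alphabet}
        )
    return table


# Toy data (made up): four reads of length four.
reads = ["ACGT", "ACGA", "TCGT", "GCGT"]
for cycle, fractions in enumerate(nucleotide_vs_cycle(reads)):
    print(cycle, " ".join(f"{b}={fractions[b]:.2f}" for b in "ACGT"))
```

In an unbiased library each fraction for A, C, G and T would sit near 0.25 at every position; a skew confined to the first few positions is the random-priming pattern the help text mentions.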