annotate mmseq/mmseq.xml @ 9:e3b8eefe6586

Incorporate samtools invocation
author mwoodbri
date Thu, 27 Oct 2011 19:00:59 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
9
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
1 <tool id="mmseq" name="MMSEQ">
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
2 <description>Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads</description>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
3 <command interpreter="bash">
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
4 mmseq.sh
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
5 $alignments_sam
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
6 ${alignments_sam.metadata.dbkey}
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
7 ${GALAXY_DATA_INDEX_DIR}
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
8 $transcripts
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
9 $identical_transcripts
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
10 $genes
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
11 </command>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
12 <inputs>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
13 <param name="alignments_sam" type="data" format="sam" metadata_name="dbkey" label="SAM file of reads aligned to reference transcripts">
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
14 <validator type="unspecified_build"/>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
15 <validator type="dataset_metadata_in_file" filename="sam_fa_indices.loc" metadata_name="dbkey" metadata_column="1" message="Sequences are
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
16 not currently available for the specified build." line_startswith="index"/>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
17 </param>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
18 </inputs>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
19 <outputs>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
20 <data format="tabular" name="transcripts" label="MMSEQ: transcript expression estimates"/>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
21 <data format="tabular" name="identical_transcripts" label="MMSEQ: amalgamated transcript expression estimates"/>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
22 <data format="tabular" name="genes" label="MMSEQ: gene expression estimates"/>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
23 </outputs>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
24 <help>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
25
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
26 **About MMSEQ**
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
27
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
28 MMSEQ_ is a novel statistical RNA-seq analysis method for estimating haplotype, isoform and gene specific expression. It deconvolves the mapping of reads to multiple transcripts (isoforms or haplotype-specific isoforms). It can take into account non-uniform read generation and works with paired-end reads. Please cite: Turro, E.; Su, S-Y.; Goncalves, A.; Coin, L.J.M.; Richardson, S. and A., Lewin(2011). Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads. Genome Biology. 12:R13.
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
29
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
30 .. _MMSEQ: http://www.bgx.org.uk/software/mmseq.html
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
31
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
32 --------
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
33
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
34 **Input formats**
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
35
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
36 MMSEQ accepts sorted BAM file as input. The SAM files obtained as a result of Bowtie alignment can be converted to BAM files and sorted using the SAMTools.
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
37
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
38 --------
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
39
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
40 **Outputs**
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
41
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
42 MMSEQ generates three output files with expectation maximization (EM) and Gibbs sampling (GS) expression estimates with associated Monte Carlo standard errors (MCSE) tabulated in each one of them (Turro et al., 2011).
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
43
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
44 The first file provides estimates at the transcript/haplo-isoform level. The second file provides aggregate estimates for sets of transcripts that have been amalgamated due to having identical sequences (and so indistinguishable expression levels). The third file aggregates transcript estimates into genes, thus providing gene level estimates. Homozygous transcripts are aggregated together, while heterozygous transcripts are aggregated separately to produce 'haplo-gene' level estimates.
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
45
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
46 Out of the three outputs, the amalgamated estimates are recommended for use as the individual transcript estimates exhibit high variability and anti-correlation, but the total expression of two identical transcripts can be well estimated.
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
47
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
48
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
49 </help>
e3b8eefe6586 Incorporate samtools invocation
mwoodbri
parents:
diff changeset
50 </tool>