Mercurial > repos > iuc > basil
comparison basil.xml @ 0:e6ef29001647 draft
planemo upload commit b89c8017aeef91f940543a1cc7dadb4a85290865
author | iuc |
---|---|
date | Thu, 30 May 2019 21:14:58 -0400 |
parents | |
children | 77fc7640abc7 |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:e6ef29001647 |
---|---|
1 <tool id="basil" name="basil" version="1.2.0"> | |
2 <description>Breakpoint detection, including large insertions</description> | |
3 <requirements> | |
4 <requirement type="package" version="1.2.0">anise_basil</requirement> | |
5 </requirements> | |
6 <version_command>basil --version 2>&1 | grep 'basil version' | cut -f 3 -d ' '</version_command> | |
7 <command detect_errors="aggressive"><![CDATA[ | |
8 ln -s '$ref' 'ref.fa' && | |
9 ln -s '$bam' 'in.bam' && | |
10 ln -s '$vcf' 'out.vcf' && | |
11 basil | |
12 --input-reference 'ref.fa' | |
13 --input-mapping 'in.bam' | |
14 --out-vcf 'out.vcf' | |
15 --oea-min-support-each-side '$min_oea_each_side' | |
16 ]]></command> | |
17 <inputs> | |
18 <param name="ref" argument="--input-reference" type="data" format="Fasta" label="Reference Sequence File" help="FASTA file with the reference."/> | |
19 <param name="bam" argument="--input-mapping" type="data" format="sam,bam" label="Alignment File" help="SAM/BAM file to use as the input."/> | |
20 <param name="min_oea_each_side" argument="--oea-min-support-each-side" type="integer" value="2" label="Minimum supporting reads, each side" help="Smallest number of OEA (one-end-anchor) reads on each side to support an insertion. In range [1..inf]. This is the minimum number of supporting reads (without mapped partners) on each side of an insertion breakpoint required to not be filtered." /> | |
21 </inputs> | |
22 <outputs> | |
23 <data name="vcf" format="vcf" /> | |
24 </outputs> | |
25 <tests> | |
26 <test> | |
27 <param name="ref" value="ref.fa"/> | |
28 <param name="bam" value="simulated.bam"/> | |
29 <param name="min_oea_each_side" value="2"/> | |
30 <output name="vcf" file="basil.vcf"/> | |
31 </test> | |
32 </tests> | |
33 <help><![CDATA[ | |
34 BASIL is a method to detect breakpoints for structural variants (including insertion breakpoints) from aligned paired HTS reads in BAM format. Use BASIL to analyze BAM files for tentative insertion sites. | |
35 | |
36 Note that BASIL will in general detect all kinds of breakpoints, e.g. for inversions on real-world data. | |
37 | |
38 BASIL VCF fields | |
39 | |
40 A typical line in BASIL might look as follows. | |
41 | |
42 1 5001 site_0 T <INS> . PASS IMPRECISE;SVTYPE=INS GSCORE:CLEFT:CRIGHT:OEALEFT:OEARIGHT 46.4256:10:12:35:32 | |
43 | |
44 The first seven columns are as usually in VCF files (ref name, 1-based position, reference base, abbreviation for long insertion, no assigned quality, passing all filters, imprecise insertion SV). | |
45 | |
46 The eighth column contains the names of the score values given in the ninth column: | |
47 | |
48 GSCORE Geometric mean of the sum of "1 + $score" for all of the following scores. | |
49 CLEFT Number of clipping signatures supporting the site from the left side. | |
50 CRIGHT Number of clipping signatures supporting the site from the right side. | |
51 OEALEFT Number of OEA alignments supporting the site from the left. | |
52 OEARIGHT Number of OEA alignmetns supproting the site from the right. | |
53 | |
54 Generally, one should filter for a minimum support of OEA records on each side, e.g. a value of 10 makes sense for a 30x coverage and showed good results on simulated data. | |
55 | |
56 For a ranking, GSCORE is a suitable measure but we did not develop any statistical model for BASIL matches and it is a mean of pseudocounts only. It carries no statistically precise meaning. | |
57 ]]></help> | |
58 <citations> | |
59 <citation type="doi">10.1093/bioinformatics/btv051</citation> | |
60 </citations> | |
61 </tool> |