annotate strelka2/strelka_config.sample @ 1:f48854499d41

Deleted selected files
author mini
date Thu, 25 Sep 2014 12:02:14 -0400
parents 7a9f20ca4ad5
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
1
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
2 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
3 ; User configuration options for Strelka somatic small-variant caller
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
4 ; workflow:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
5 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
6
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
7 [user]
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
8
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
9 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
10 ; isSkipDepthFilters should be set to 1 to skip depth filtration for
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
11 ; whole exome or other targeted sequencing data
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
12 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
13 isSkipDepthFilters = 1
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
14
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
15 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
16 ; strelka will not accept input reads above this depth (they will be skipped
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
17 ; until the depth drops below this value). Set this value <= 0 to disable
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
18 ; this feature. Using this filter will bound memory usage given extremely high
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
19 ; depth input, but may be problematic in high-depth targeted sequencing
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
20 ; applications.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
21 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
22 maxInputDepth = 10000
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
23
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
24 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
25 ; If the depth filter is not skipped, all variants which occur at a
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
26 ; depth greater than depthFilterMultiple*chromosome mean depth will be
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
27 ; filtered out.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
28 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
29 depthFilterMultiple = 3.0
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
30
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
31 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
32 ; Somatic SNV calls are filtered at sites where greater than this
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
33 ; fraction of basecalls have been removed by the mismatch density
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
34 ; filter in either sample.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
35 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
36 snvMaxFilteredBasecallFrac = 0.4
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
37
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
38 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
39 ; Somatic SNV calls are filtered at sites where greater than this
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
40 ; fraction of overlapping reads contain deletions which span the SNV
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
41 ; call site.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
42 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
43 snvMaxSpanningDeletionFrac = 0.75
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
44
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
45 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
46 ; Somatic indel calls are filtered if they represent an expansion or
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
47 ; contraction of a repeated pattern with a repeat count greater than
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
48 ; indelMaxRefRepeat in the reference (ie. if indelMaxRefRepeat is 8,
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
49 ; then the indel is filtered when it is an expansion/contraction of a
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
50 ; homopolymer longer than 8 bases, a dinucleotide repeat longer than
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
51 ; 16 bases, etc.)
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
52 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
53 indelMaxRefRepeat = 8
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
54
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
55 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
56 ; Somatic indel calls are filtered if greater than this fraction of
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
57 ; basecalls in a window extending 50 bases to each side of an indel's
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
58 ; call position have been removed by the mismatch density filter.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
59 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
60 indelMaxWindowFilteredBasecallFrac = 0.3
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
61
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
62 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
63 ; Somatic indels are filtered if they overlap ’interrupted
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
64 ; homopolymers’ greater than this length. The term 'interrupted
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
65 ; homopolymer' is used to indicate the longest homopolymer which can
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
66 ; be found intersecting or adjacent to the called indel when a single
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
67 ; non-homopolymer base is allowed.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
68 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
69 indelMaxIntHpolLength = 14
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
70
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
71 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
72 ; prior probability of a somatic snv or indel
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
73 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
74 ssnvPrior = 0.000001
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
75 sindelPrior = 0.000001
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
76
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
77 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
78 ; probability of an snv or indel noise allele
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
79 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
80 ; NB: in the calling model a noise allele is shared in tumor and
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
81 ; normal samples, but occurs at any frequency.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
82 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
83 ssnvNoise = 0.0000005
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
84 sindelNoise = 0.000001
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
85
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
86 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
87 ; Fraction of snv noise attributed to strand-bias.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
88 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
89 ; It is not recommended to change this setting. However, if it is
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
90 ; essential to turn the strand bias penalization off, the following is
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
91 ; recommended:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
92 ; Assuming the current value of ssnvNoiseStrandBiasFrac is 0.5,
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
93 ; (1) set ssnvNoiseStrandBiasFrac = 0
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
94 ; (2) divide the current ssnvNoise value by 2
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
95 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
96 ssnvNoiseStrandBiasFrac = 0.5
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
97
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
98 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
99 ; minimum MAPQ score for PE reads at tier1:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
100 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
101 minTier1Mapq = 20
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
102
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
103 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
104 ; minimum MAPQ score for PE and SE reads at tier2:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
105 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
106 minTier2Mapq = 5
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
107
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
108 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
109 ; Somatic quality score (QSS_NT, NT=ref) below which somatic SNVs are
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
110 ; marked as filtered:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
111 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
112 ssnvQuality_LowerBound = 15
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
113
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
114 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
115 ; Somatic quality score (QSI_NT, NT=ref) below which somatic indels
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
116 ; are marked as filtered:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
117 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
118 sindelQuality_LowerBound = 30
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
119
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
120 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
121 ; Optionally write out read alignments which were altered during the
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
122 ; realignment step. At the completion of the workflow run, the
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
123 ; realigned reads can be found in:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
124 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
125 ; ${ANALYSIS_DIR}/realigned/{normal,tumor}.realigned.bam
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
126 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
127 isWriteRealignedBam = 0
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
128
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
129 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
130 ; Jobs are parallelized over segments of the reference genome no larger
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
131 ; than this size:
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
132 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
133 binSize = 25000000
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
134
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
135 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
136 ; Additional arguments passed to strelka.
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
137 ;
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
138 extraStrelkaArguments =
7a9f20ca4ad5 Uploaded
mini
parents:
diff changeset
139