annotate PsiCLASS-1.0.2/samtools-0.1.19/NEWS @ 0:903fc43d6227 draft default tip

Uploaded
author lsong10
date Fri, 26 Mar 2021 16:52:45 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
1 Beta Release 0.1.19 (15 March, 2013)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
3
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
4 Notable changes in samtools and bcftools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
5
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
6 * The latest source code and development moved to github,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
7 http://github.com/samtools/samtools
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
8
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
9 * Many important bugfixes and contributions by many people. Thanks to all!
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
10
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
11 * Performance improvements (multi-threading)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
12
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
13 * Important changes in calling, see
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
14 - samtools mpileup -p
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
15 - bcftools view -m
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
16
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
17 * New annotations useful for filtering (RPB, HWE, QBD, MDV)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
18
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
19 * New tools, bamcheck and plot-bamcheck
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
20
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
21 * New features in samtools tview
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
22
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
23 * And much more..
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
24
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
25 For a detailed list of commits, please see
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
26 http://github.com/samtools/samtools/commits/master
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
27
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
28 (0.1.19: 15 March 2013, commit 96b5f2294ac0054230e88913c4983d548069ea4e)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
29
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
30
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
31 Beta Release 0.1.18 (2 September, 2011)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
33
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
34 Notable changes in samtools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
35
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
36 * Support the new =/X CIGAR operators (by Peter Cock).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
37
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
38 * Allow to subsample BAM while keeping the pairing intact (view -s).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
39
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
40 * Implemented variant distance bias as a new filter (by Petr Danecek).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
41
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
42 * Bugfix: huge memory usage during indexing
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
43
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
44 * Bugfix: use of uninitialized variable in mpileup (rare)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
45
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
46 * Bugfix: wrong BAQ probability (rare)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
47
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
48 Notable changes in bcftools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
49
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
50 * Support indel in the contrast caller.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
51
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
52 * Bugfix: LRT2=nan in rare cases
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
53
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
54 (0.1.18: 2 September 2011, r982:295)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
55
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
56
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
57
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
58 Beta Release 0.1.17 (6 July, 2011)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
60
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
61 With the maturity of `mpileup' and the lack of update in the `pileup' command,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
62 the `pileup' command is now formally dropped. Most of the pileup functionality,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
63 such as outputting mapping quality and read positions, have been added
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
64 `mpileup'.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
65
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
66 Since this release, `bcftools view' is able to perform contrast SNP calling
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
67 (option -T) for discovering de novo and/or somatic mutations between a pair of
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
68 samples or in a family trio. Potential mutations are scored by a log likelihood
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
69 ratio, which is very simple in math, but should be comparable to more
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
70 sophisticated methods. Note that getting the score is only the very first step.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
71 A lot more need to be done to reduce systematical errors due to mapping and
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
72 reference errors and structural variations.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
73
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
74 Other notable changes in samtools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
75
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
76 * Improved sorting order checking during indexing.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
77
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
78 * Improved region parsing. Colons in reference sequence names are parsed
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
79 properly.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
80
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
81 * Fixed an issue where mpileup does not apply BAQ for the first few reads when
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
82 a region is specified.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
83
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
84 * Fixed an issue where `faidx' does not work with FASTA files with long lines.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
85
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
86 * Bugfix: wrong SP genotype information in the BCF output.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
87
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
88 Other notable changes in bcftools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
89
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
90 * Output the ML esitmate of the allele count.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
91
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
92 * Added the HWE plus F<0 filter to varFilter. For multiple samples, it
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
93 effectively filters false heterozygous calls around centromeres.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
94
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
95 * For association mapping, perform both 1-degree and 2-degree test. The
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
96 2-degree test is conservative but more robust to HWE violation.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
97
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
98 (0.1.17: 6 July 2011, r973:277)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
99
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
100
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
101
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
102 Beta Release 0.1.16 (21 April, 2011)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
104
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
105 Notable changes in samtools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
106
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
107 * Support the new SAM/BAM type `B' in the latest SAM spec v1.4.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
108
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
109 * When the output file of `samtools merge' exists, do not overwrite it unless
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
110 a new command-line option `-f' is applied.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
111
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
112 * Bugfix: BED support is not working when the input BED is not sorted.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
113
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
114 * Bugfix: some reads without coordinates but given on the reverse strand are
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
115 lost in merging.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
116
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
117 Notable changes in bcftools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
118
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
119 * Code cleanup: separated max-likelihood inference and Bayesian inference.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
120
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
121 * Test Hardy-Weinberg equilibrium with a likelihood-ratio test.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
122
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
123 * Provided another association test P-value by likelihood-ratio test.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
124
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
125 * Use Brent's method to estimate the site allele frequency when EM converges
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
126 slowly. The resulting ML estimate of allele frequnecy is more accurate.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
127
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
128 * Added the `ldpair' command, which computes r^2 between SNP pairs given in
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
129 an input file.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
130
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
131 Also, the `pileup' command, which has been deprecated by `mpileup' since
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
132 version 0.1.10, will be dropped in the next release. The old `pileup' command
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
133 is substandard and causing a lot of confusion.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
134
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
135 (0.1.16: 21 April 2011, r963:234)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
136
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
137
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
138
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
139 Beta Release 0.1.15 (10 April, 2011)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
141
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
142 Noteable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
143
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
144 * Allow to perform variant calling or to extract information in multiple
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
145 regions specified by a BED file (`samtools mpileup -l', `samtools view -L'
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
146 and `bcftools view -l').
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
147
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
148 * Added the `depth' command to samtools to compute the per-base depth with a
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
149 simpler interface. File `bam2depth.c', which implements this command, is the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
150 recommended example on how to use the mpileup APIs.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
151
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
152 * Estimate genotype frequencies with ML; perform chi^2 based Hardy-Weinberg
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
153 test using this estimate.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
154
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
155 * For `samtools view', when `-R' is specified, drop read groups in the header
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
156 that are not contained in the specified file.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
157
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
158 * For `samtools flagstat', separate QC-pass and QC-fail reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
159
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
160 * Improved the command line help of `samtools mpileup' and `bcftools view'.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
161
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
162 * Use a global variable to control the verbose level of samtools stderr
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
163 output. Nonetheless, it has not been full utilized.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
164
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
165 * Fixed an issue in association test which may report false associations,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
166 possibly due to floating point underflow.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
167
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
168 (0.1.15: 10 April 2011, r949:203)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
169
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
170
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
171
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
172 Beta release 0.1.14 (21 March, 2011)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
174
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
175 This release implements a method for testing associations for case-control
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
176 data. The method does not call genotypes but instead sums over all genotype
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
177 configurations to compute a chi^2 based test statistics. It can be potentially
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
178 applied to comparing a pair of samples (e.g. a tumor-normal pair), but this
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
179 has not been evaluated on real data.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
180
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
181 Another new feature is to make X chromosome variant calls when female and male
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
182 samples are both present. The user needs to provide a file indicating the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
183 ploidy of each sample (see also manual bcftools/bcftools.1).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
184
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
185 Other notable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
186
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
187 * Added `bcftools view -F' to parse BCF files generated by samtools r921 or
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
188 older which encodes PL in a different way.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
189
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
190 * Changed the behavior of `bcftools view -s'. Now when a list of samples is
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
191 provided, the samples in the output will be reordered to match the ordering
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
192 in the sample list. This change is mainly designed for association test.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
193
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
194 * Sped up `bcftools view -v' for target sequencing given thousands of samples.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
195 Also added a new option `view -d' to skip loci where only a few samples are
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
196 covered by reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
197
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
198 * Dropped HWE test. This feature has never been implemented properly. An EM
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
199 should be much better. To be implemented in future.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
200
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
201 * Added the `cat' command to samtools. This command concatenate BAMs with
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
202 identical sequence dictionaries in an efficient way. Modified from bam_cat.c
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
203 written by Chris Saunders.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
204
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
205 * Added `samtools view -1' to write BAMs at a low compression level but twice
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
206 faster to create. The `sort' command generates temporary files at a low
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
207 compression level as well.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
208
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
209 * Added `samtools mpileup -6' to accept "BAM" with Illumina 1.3+ quality
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
210 strings (strictly speaking, such a file is not BAM).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
211
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
212 * Added `samtools mpileup -L' to skip INDEL calling in regions with
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
213 excessively high coverage. Such regions dramatically slow down mpileup.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
214
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
215 * Updated `misc/export2sam.pl', provided by Chris Saunders from Illumina Inc.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
216
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
217 (0.1.14: 21 March 2011, r933:170)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
218
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
219
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
220
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
221 Beta release 0.1.13 (1 March, 2011)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
223
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
224 The most important though largely invisible modification is the change of the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
225 order of genotypes in the PL VCF/BCF tag. This is to conform the upcoming VCF
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
226 spec v4.1. The change means that 0.1.13 is not backward compatible with VCF/BCF
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
227 generated by samtools older than r921 inclusive. VCF/BCF generated by the new
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
228 samtools will contain a line `##fileformat=VCFv4.1' as well as the samtools
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
229 version number.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
230
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
231 Single Individual Haplotyping (SIH) is added as an experimental feature. It
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
232 originally aims to produce haploid consensus from fosmid pool sequencing, but
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
233 also works with short-read data. For short reads, phased blocks are usually too
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
234 short to be useful in many applications, but they can help to rule out part of
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
235 SNPs close to INDELs or between copies of CNVs.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
236
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
237
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
238 Other notable changes in samtools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
239
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
240 * Construct per-sample consensus to reduce the effect of nearby SNPs in INDEL
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
241 calling. This reduces the power but improves specificity.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
242
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
243 * Improved sorting order checking in indexing. Now indexing is the preferred way
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
244 to check if a BAM is sorted.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
245
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
246 * Added a switch `-E' to mpileup and calmd. This option uses an alternative way
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
247 to apply BAQ, which increases sensistivity, especially to MNPs, at the cost of
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
248 a little loss in specificity.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
249
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
250 * Added `mpileup -A' to allow to use reads in anomalous pairs in SNP calling.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
251
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
252 * Added `mpileup -m' to allow fine control of the collection of INDEL candidates.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
253
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
254 * Added `mpileup -S' to compute per-sample strand bias P-value.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
255
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
256 * Added `mpileup -G' to exclude read groups in variant calling.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
257
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
258 * Fixed segfault in indel calling related to unmapped and refskip reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
259
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
260 * Fixed an integer overflow in INDEL calling. This bug produces wrong INDEL
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
261 genotypes for longer short INDELs, typically over 10bp.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
262
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
263 * Fixed a bug in tview on big-endian machines.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
264
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
265 * Fixed a very rare memory issue in bam_md.c
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
266
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
267 * Fixed an out-of-boundary bug in mpileup when the read base is `N'.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
268
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
269 * Fixed a compiling error when the knetfile library is not used. Fixed a
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
270 library compiling error due to the lack of bam_nt16_nt4_table[] table.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
271 Suppress a compiling warning related to the latest zlib.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
272
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
273
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
274 Other notable changes in bcftools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
275
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
276 * Updated the BCF spec.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
277
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
278 * Added the `FQ' VCF INFO field, which gives the phred-scaled probability
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
279 of all samples being the same (identical to the reference or all homozygous
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
280 variants). Option `view -f' has been dropped.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
281
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
282 * Implementated of "vcfutils.pl vcf2fq" to generate a consensus sequence
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
283 similar to "samtools.pl pileup2fq".
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
284
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
285 * Make sure the GT FORMAT field is always the first FORMAT to conform the VCF
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
286 spec. Drop bcf-fix.pl.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
287
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
288 * Output bcftools specific INFO and FORMAT in the VCF header.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
289
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
290 * Added `view -s' to call variants from a subset of samples.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
291
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
292 * Properly convert VCF to BCF with a user provided sequence dictionary. Nonetheless,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
293 custom fields are still unparsed and will be stored as a missing value.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
294
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
295 * Fixed a minor bug in Fisher's exact test; the results are rarely changed.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
296
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
297
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
298 (0.1.13: 1 March 2011, r926:134)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
299
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
300
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
301
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
302 Beta release 0.1.12a (2 December, 2010)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
304
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
305 This is another bug fix release:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
306
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
307 * Fixed a memory violation in mpileup, which causes segfault. Release
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
308 0.1.9 and above are affected.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
309
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
310 * Fixed a memory violation in the indel caller, which does not causes
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
311 segfault, but may potentially affect deletion calls in an unexpected
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
312 way. Release 0.1.10 and above are affected.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
313
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
314 * Fixed a bug in computing r-square in bcftools. Few are using this
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
315 functionality and it only has minor effect.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
316
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
317 * Fixed a memory leak in bam_fetch().
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
318
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
319 * Fixed a bug in writing meta information to the BAM index for the last
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
320 sequence. This bug is invisible to most users, but it is a bug anyway.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
321
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
322 * Fixed a bug in bcftools which causes false "DP4=0,0,0,0" annotations.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
323
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
324 (0.1.12: 2 December 2010, r862)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
325
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
326
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
327
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
328 Beta release 0.1.11 (21 November, 2010)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
330
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
331 This is mainly a bug fix release:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
332
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
333 * Fixed a bug in random retrieval (since 0.1.8). It occurs when reads
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
334 are retrieved from a small region containing no reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
335
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
336 * Fixed a bug in pileup (since 0.1.9). The bug causes an assertion
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
337 failure when the first CIGAR operation is a deletion.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
338
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
339 * Improved fault tolerence in remote access.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
340
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
341 One minor feature has been implemented in bcftools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
342
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
343 * Added a reference-free variant calling mode. In this mode, a site is
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
344 regarded as a variat iff the sample(s) contains two or more alleles;
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
345 the meaning of the QUAL field in the VCF output is changed
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
346 accordingly. Effectively, the reference allele is irrelevant to the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
347 result in the new mode, although the reference sequence has to be
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
348 used in realignment when SAMtools computes genotype likelihoods.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
349
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
350 In addition, since 0.1.10, the `pileup' command has been deprecated by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
351 `mpileup' which is more powerful and more accurate. The `pileup' command
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
352 will not be removed in the next few releases, but new features will not
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
353 be added.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
354
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
355 (0.1.11: 21 November 2010, r851)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
356
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
357
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
358
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
359 Beta Release 0.1.10 (16 November, 2010)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
360 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
361
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
362 This release is featured as the first major improvement to the indel
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
363 caller. The method is similar to the old one implemented in the pileup
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
364 command, but the details are handled more carefully both in theory and
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
365 in practice. As a result, the new indel caller usually gives more
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
366 accurate indel calls, though at the cost of sensitivity. The caller is
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
367 implemented in the mpileup command and is invoked by default. It works
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
368 with multiple samples.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
369
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
370 Other notable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
371
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
372 * With the -r option, the calmd command writes the difference between
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
373 the original base quality and the BAQ capped base quality at the BQ
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
374 tag but does not modify the base quality. Please use -Ar to overwrite
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
375 the original base quality (the 0.1.9 behavior).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
376
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
377 * Allow to set a maximum per-sample read depth to reduce memory. In
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
378 0.1.9, most of memory is wasted for the ultra high read depth in some
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
379 regions (e.g. the chr1 centromere).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
380
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
381 * Optionally write per-sample read depth and per-sample strand bias
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
382 P-value.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
383
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
384 * Compute equal-tail (Bayesian) credible interval of site allele
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
385 frequency at the CI95 VCF annotation.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
386
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
387 * Merged the vcfutils.pl varFilter and filter4vcf for better SNP/indel
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
388 filtering.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
389
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
390 (0.1.10: 16 November 2010, r829)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
391
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
392
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
393
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
394 Beta Release 0.1.9 (27 October, 2010)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
396
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
397 This release is featured as the first major improvement to the samtools'
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
398 SNP caller. It comes with a revised MAQ error model, the support of
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
399 multi-sample SNP calling and the computation of base alignment quality
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
400 (BAQ).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
401
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
402 The revised MAQ error model is based on the original model. It solves an
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
403 issue of miscalling SNPs in repetitive regions. Althought such SNPs can
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
404 usually be filtered at a later step, they mess up unfiltered calls. This
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
405 is a theoretical flaw in the original model. The revised MAQ model
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
406 deprecates the orginal MAQ model and the simplified SOAPsnp model.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
407
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
408 Multi-sample SNP calling is separated in two steps. The first is done by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
409 samtools mpileup and the second by a new program, bcftools, which is
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
410 included in the samtools source code tree. Multi-sample SNP calling also
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
411 works for single sample and has the advantage of enabling more powerful
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
412 filtration. It is likely to deprecate pileup in future once a proper
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
413 indel calling method is implemented.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
414
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
415 BAQ is the Phred-scaled probability of a read base being wrongly
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
416 aligned. Capping base quality by BAQ has been shown to be very effective
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
417 in suppressing false SNPs caused by misalignments around indels or in
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
418 low-complexity regions with acceptable compromise on computation
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
419 time. This strategy is highly recommended and can be used with other SNP
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
420 callers as well.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
421
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
422 In addition to the three major improvements, other notable changes are:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
423
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
424 * Changes to the pileup format. A reference skip (the N CIGAR operator)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
425 is shown as '<' or '>' depending on the strand. Tview is also changed
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
426 accordingly.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
427
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
428 * Accelerated pileup. The plain pileup is about 50% faster.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
429
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
430 * Regional merge. The merge command now accepts a new option to merge
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
431 files in a specified region.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
432
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
433 * Fixed a bug in bgzip and razip which causes source files to be
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
434 deleted even if option -c is applied.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
435
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
436 * In APIs, propogate errors to downstream callers and make samtools
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
437 return non-zero values once errors occur.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
438
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
439 (0.1.9: 27 October 2010, r783)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
440
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
441
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
442
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
443 Beta Release 0.1.8 (11 July, 2010)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
444 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
445
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
446 Notable functional changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
447
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
448 * Added the `reheader' command which replaces a BAM header with a new
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
449 header. This command is much faster than replacing header by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
450 BAM->SAM->BAM conversions.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
451
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
452 * Added the `mpileup' command which computes the pileup of multiple
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
453 alignments.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
454
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
455 * The `index' command now stores the number of mapped and unmapped
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
456 reads in the index file. This information can be retrieved quickly by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
457 the new `idxstats' command.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
458
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
459 * By default, pileup used the SOAPsnp model for SNP calling. This
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
460 avoids the floating overflow in the MAQ model which leads to spurious
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
461 calls in repetitive regions, although these calls will be immediately
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
462 filtered by varFilter.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
463
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
464 * The `tview' command now correctly handles CIGARs like 7I10M and
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
465 10M1P1I10M which cause assertion failure in earlier versions.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
466
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
467 * Tview accepts a region like `=10,000' where `=' stands for the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
468 current sequence name. This saves typing for long sequence names.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
469
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
470 * Added the `-d' option to `pileup' which avoids slow indel calling
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
471 in ultradeep regions by subsampling reads locally.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
472
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
473 * Added the `-R' option to `view' which retrieves alignments in read
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
474 groups listed in the specified file.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
475
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
476 Performance improvements:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
477
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
478 * The BAM->SAM conversion is up to twice faster, depending on the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
479 characteristic of the input.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
480
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
481 * Parsing SAM headers with a lot of reference sequences is now much
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
482 faster.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
483
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
484 * The number of lseek() calls per query is reduced when the query
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
485 region contains no read alignments.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
486
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
487 Bug fixes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
488
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
489 * Fixed an issue in the indel caller that leads to miscall of indels.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
490 Note that this solution may not work well when the sequencing indel
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
491 error rate is higher than the rate of SNPs.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
492
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
493 * Fixed another issue in the indel caller which may lead to incorrect
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
494 genotype.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
495
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
496 * Fixed a bug in `sort' when option `-o' is applied.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
497
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
498 * Fixed a bug in `view -r'.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
499
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
500 APIs and other changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
501
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
502 * Added iterator interfaces to random access and pileup. The callback
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
503 interfaces directly call the iterator interfaces.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
504
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
505 * The BGZF blocks holding the BAM header are indepedent of alignment
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
506 BGZF blocks. Alignment records shorter than 64kB is guaranteed to be
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
507 fully contained in one BGZF block. This change is fully compatible
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
508 with the old version of samtools/picard.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
509
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
510 Changes in other utilities:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
511
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
512 * Updated export2sam.pl by Chris Saunders.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
513
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
514 * Improved the sam2vcf.pl script.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
515
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
516 * Added a Python version of varfilter.py by Aylwyn Scally.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
517
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
518 (0.1.8: 11 July 2010, r613)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
519
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
520
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
521
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
522 Beta Release 0.1.7 (10 November, 2009)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
524
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
525 Notable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
526
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
527 * Improved the indel caller in complex scenariors, in particular for
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
528 long reads. The indel caller is now able to make reasonable indel
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
529 calls from Craig Venter capillary reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
530
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
531 * Rewrote single-end duplicate removal with improved
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
532 performance. Paired-end reads are not touched.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
533
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
534 * Duplicate removal is now library aware. Samtools remove potential
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
535 PCR/optical dupliates inside a library rather than across libraries.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
536
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
537 * SAM header is now fully parsed, although this functionality is not
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
538 used in merging and so on.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
539
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
540 * In samtools merge, optionally take the input file name as RG-ID and
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
541 attach the RG tag to each alignment.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
542
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
543 * Added FTP support in the RAZF library. RAZF-compressed reference
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
544 sequence can be retrieved remotely.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
545
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
546 * Improved network support for Win32.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
547
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
548 * Samtools sort and merge are now stable.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
549
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
550 Changes in other utilities:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
551
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
552 * Implemented sam2vcf.pl that converts the pileup format to the VCF
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
553 format.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
554
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
555 * This release of samtools is known to work with the latest
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
556 Bio-Samtools Perl module.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
557
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
558 (0.1.7: 10 November 2009, r510)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
559
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
560
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
561
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
562 Beta Release 0.1.6 (2 September, 2009)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
563 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
564
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
565 Notable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
566
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
567 * In tview, do not show a blank screen when no reads mapped to the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
568 corresponding region.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
569
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
570 * Implemented native HTTP support in the BGZF library. Samtools is now
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
571 able to directly open a BAM file on HTTP. HTTP proxy is also
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
572 supported via the "http_proxy" environmental variable.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
573
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
574 * Samtools is now compitable with the MinGW (win32) compiler and the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
575 PDCurses library.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
576
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
577 * The calmd (or fillmd) command now calculates the NM tag and replaces
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
578 MD tags if they are wrong.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
579
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
580 * The view command now recognizes and optionally prints FLAG in HEXs or
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
581 strings to make a SAM file more friendly to human eyes. This is a
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
582 samtools-C extension, not implemented in Picard for the time
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
583 being. Please type `samtools view -?' for more information.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
584
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
585 * BAM files now have an end-of-file (EOF) marker to facilitate
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
586 truncation detection. A warning will be given if an on-disk BAM file
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
587 does not have this marker. The warning will be seen on BAM files
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
588 generated by an older version of samtools. It does NO harm.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
589
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
590 * New key bindings in tview: `r' to show read names and `s' to show
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
591 reference skip (N operation) as deletions.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
592
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
593 * Fixed a bug in `samtools merge -n'.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
594
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
595 * Samtools merge now optionally copies the header of a user specified
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
596 SAM file to the resultant BAM output.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
597
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
598 * Samtools pileup/tview works with a CIGAR with the first or the last
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
599 operation is an indel.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
600
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
601 * Fixed a bug in bam_aux_get().
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
602
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
603
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
604 Changes in other utilies:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
605
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
606 * Fixed wrong FLAG in maq2sam.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
607
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
608
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
609 (0.1.6: 2 September 2009, r453)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
610
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
611
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
612
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
613 Beta Release 0.1.5 (7 July, 2009)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
614 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
615
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
616 Notable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
617
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
618 * Support opening a BAM alignment on FTP. Users can now use "tview" to
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
619 view alignments at the NCBI ftp site. Please read manual for more
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
620 information.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
621
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
622 * In library, propagate errors rather than exit or complain assertion
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
623 failure.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
624
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
625 * Simplified the building system and fixed compiling errors caused by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
626 zlib<1.2.2.1.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
627
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
628 * Fixed an issue about lost header information when a SAM is imported
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
629 with "view -t".
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
630
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
631 * Implemented "samtool.pl varFilter" which filters both SNPs and short
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
632 indels. This command replaces "indelFilter".
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
633
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
634 * Implemented "samtools.pl pileup2fq" to generate FASTQ consensus from
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
635 pileup output.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
636
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
637 * In pileup, cap mapping quality at 60. This helps filtering when
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
638 different aligners are in use.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
639
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
640 * In pileup, allow to output variant sites only.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
641
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
642 * Made pileup generate correct calls in repetitive region. At the same
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
643 time, I am considering to implement a simplified model in SOAPsnp,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
644 although this has not happened yet.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
645
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
646 * In view, added '-u' option to output BAM without compression. This
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
647 option is preferred when the output is piped to other commands.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
648
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
649 * In view, added '-l' and '-r' to get the alignments for one library or
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
650 read group. The "@RG" header lines are now partially parsed.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
651
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
652 * Do not include command line utilities to libbam.a.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
653
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
654 * Fixed memory leaks in pileup and bam_view1().
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
655
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
656 * Made faidx more tolerant to empty lines right before or after FASTA >
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
657 lines.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
658
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
659
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
660 Changes in other utilities:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
661
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
662 * Updated novo2sam.pl by Colin Hercus, the key developer of novoalign.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
663
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
664
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
665 This release involves several modifications to the key code base which
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
666 may potentially introduce new bugs even though we have tried to minimize
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
667 this by testing on several examples. Please let us know if you catch
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
668 bugs.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
669
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
670 (0.1.5: 7 July 2009, r373)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
671
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
672
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
673
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
674 Beta Release 0.1.4 (21 May, 2009)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
675 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
676
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
677 Notable changes:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
678
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
679 * Added the 'rmdupse' command: removing duplicates for SE reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
680
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
681 * Fixed a critical bug in the indel caller: clipped alignments are not
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
682 processed correctly.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
683
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
684 * Fixed a bug in the tview: gapped alignment may be incorrectly
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
685 displayed.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
686
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
687 * Unified the interface to BAM and SAM I/O. This is done by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
688 implementing a wrapper on top of the old APIs and therefore old APIs
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
689 are still valid. The new I/O APIs also recognize the @SQ header
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
690 lines.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
691
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
692 * Generate the MD tag.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
693
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
694 * Generate "=" bases. However, the indel caller will not work when "="
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
695 bases are present.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
696
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
697 * Enhanced support of color-read display (by Nils Homer).
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
698
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
699 * Implemented the GNU building system. However, currently the building
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
700 system does not generate libbam.a. We will improve this later. For
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
701 the time being, `make -f Makefile.generic' is preferred.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
702
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
703 * Fixed a minor bug in pileup: the first read in a chromosome may be
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
704 skipped.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
705
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
706 * Fixed bugs in bam_aux.c. These bugs do not affect other components as
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
707 they were not used previously.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
708
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
709 * Output the 'SM' tag from maq2sam.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
710
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
711 (0.1.4: 21 May 2009, r297)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
712
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
713
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
714
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
715 Beta Release 0.1.3 (15 April, 2009)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
716 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
717
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
718 Notable changes in SAMtools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
719
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
720 * SAMtools is more consistent with the specification: a) '*' in the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
721 QUAL field is allowed; b) the field separator is TAB only and SPACE
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
722 is treated as a character in a field; c) empty header is allowed.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
723
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
724 * Implemented GLFv3 support in pileup.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
725
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
726 * Fixed a severe bug in fixmate: strand information is wrongly
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
727 overwritten.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
728
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
729 * Fixed a bug in alignment retrieval: alignments bridging n*16384bp are
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
730 not correctly retrieved sometimes.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
731
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
732 * Fixed a bug in rmdup: segfault if unmapped reads are present.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
733
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
734 * Move indel_filter.pl to samtools.pl and improved the filtering by
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
735 checking the actual number of alignments containing indels. The indel
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
736 pileup line is also changed a little to make this filtration easier.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
737
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
738 * Fixed a minor bug in indexing: the bin number of an unmapped read is
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
739 wrongly calculated.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
740
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
741 * Added `flagstat' command to show statistics on the FLAG field.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
742
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
743 * Improved indel caller by setting the maximum window size in local
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
744 realignment.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
745
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
746 Changes in other utilities:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
747
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
748 * Fixed a bug in maq2sam: a tag name is obsolete.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
749
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
750 * Improvement to wgsim: a) added support for SOLiD read simulation; b)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
751 show the number of substitutions/indels/errors in read name; c)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
752 considerable code clean up.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
753
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
754 * Various converters: improved functionality in general.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
755
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
756 * Updated the example SAM due to the previous bug in fixmate.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
757
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
758 (0.1.3: 15 April 2009, r227)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
759
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
760
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
761
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
762 Beta Release 0.1.2 (28 January, 2008)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
763 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
764
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
765 Notable changes in SAMtools:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
766
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
767 * Implemented a Bayesian indel caller. The new caller generate scores
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
768 and genotype and is potentially more accurate than Maq's indel
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
769 caller. The pileup format is also changed accordingly.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
770
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
771 * Implemented rmdup command: remove potential PCR duplicates. Note that
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
772 this command ONLY works for FR orientation and requires ISIZE is
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
773 correctly set.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
774
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
775 * Added fixmate command: fill in mate coordinates, ISIZE and mate
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
776 related flags from a name-sorted alignment.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
777
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
778 * Fixed a bug in indexing: reads bridging 16x kbp were not retrieved.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
779
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
780 * Allow to select reads shown in the pileup output with a mask.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
781
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
782 * Generate GLFv2 from pileup.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
783
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
784 * Added two more flags for flagging PCR/optical duplicates and for QC
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
785 failure.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
786
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
787 * Fixed a bug in sort command: name sorting for large alignment did not
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
788 work.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
789
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
790 * Allow to completely disable RAZF (using Makefile.lite) as some people
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
791 have problem to compile it.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
792
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
793 * Fixed a bug in import command when there are reads without
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
794 coordinates.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
795
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
796 * Fixed a bug in tview: clipping broke the alignment viewer.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
797
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
798 * Fixed a compiling error when _NO_CURSES is applied.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
799
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
800 * Fixed a bug in merge command.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
801
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
802 Changes in other utilities:
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
803
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
804 * Added wgsim, a paired-end reads simulator. Wgsim was adapted from
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
805 maq's reads simulator. Colin Hercus further improved it to allow
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
806 longer indels.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
807
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
808 * Added wgsim_eval.pl, a script that evaluates the accuracy of
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
809 alignment on reads generated by wgsim.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
810
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
811 * Added soap2sam.pl, a SOAP2->SAM converter. This converter does not
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
812 work properly when multiple hits are output.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
813
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
814 * Added bowtie2sam.pl, a Bowtie->SAM converter. Only the top hit will
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
815 be retained when multiple hits are present.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
816
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
817 * Fixed a bug in export2sam.pl for QC reads.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
818
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
819 * Support RG tag at MAQ->SAM converter.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
820
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
821 * Added novo2sam.pl, a NovoAlign->SAM converter. Multiple hits and
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
822 indel are not properly handled, though.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
823
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
824 * Added zoom2sam.pl, a ZOOM->SAM converter. It only works with the
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
825 default Illumina output.
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
826
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
827 (0.1.2: 28 January 2008; r116)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
828
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
829
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
830
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
831 Beta Release 0.1.1 (22 December, 2008)
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
832 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
833
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
834 The is the first public release of samtools. For more information,
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
835 please check the manual page `samtools.1' and the samtools website
903fc43d6227 Uploaded
lsong10
parents:
diff changeset
836 http://samtools.sourceforge.net