annotate readme.rst @ 1:15be98f10c13 draft

change to readme file regarding the loc files.
author geert-vandeweyer
date Thu, 25 Sep 2014 05:27:45 -0400
parents e8a32d824f39
children
rev   line source
0
e8a32d824f39 Uploaded
geert-vandeweyer
parents:
diff changeset
1 BACKGROUND:
2
3 DC_Genotyper stands for Deep-Coverage Genotyper. It is aimed at detecting low-fraction SNPs (no indels) in high-ploidy (or pooled) samples with very high coverage. It is being developed at the University of Antwerp by Geert Vandeweyer.
4
5 METHOD:
6
7 DC_Genotyper generates background noise distributions on a per-sample basis and uses these distributions to detect non-reference sites. For non-reference sites, allele-specific distributions are used to estimate whether an allele surpasses the background signal (e.g. A_to_G has a different distribution than A_to_C, which is also reflected in Tr/Tv ratios).
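The idea of per-sample, allele-specific noise models can be illustrated with a minimal Python sketch. This is not the DC_Genotyper implementation: the function names, the input layout (a reference base plus a dict of per-base read counts) and the z-score cutoff are all assumptions made for illustration, and a simple z-score stands in for whatever statistical test the tool actually applies.

```python
from collections import defaultdict
from statistics import mean, stdev


def build_noise_models(sites):
    """Summarise background noise per substitution type for one sample.

    sites: iterable of (ref_base, {base: read_count}) tuples (hypothetical layout).
    Returns {(ref, alt): (mean_fraction, sd_fraction)}.
    """
    fractions = defaultdict(list)
    for ref, counts in sites:
        depth = sum(counts.values())
        if depth == 0:
            continue  # no coverage, nothing to learn from this site
        for alt in "ACGT":
            if alt != ref:
                # One distribution per substitution, e.g. A_to_G vs. A_to_C
                fractions[(ref, alt)].append(counts.get(alt, 0) / depth)
    return {
        sub: (mean(values), stdev(values))
        for sub, values in fractions.items()
        if len(values) > 1  # stdev needs at least two observations
    }


def exceeds_background(ref, alt, counts, models, z_cutoff=5.0):
    """Return True if the alt allele fraction stands out from the noise model.

    z_cutoff is an arbitrary illustrative threshold, not a DC_Genotyper default.
    """
    depth = sum(counts.values())
    if depth == 0:
        return False
    mu, sd = models[(ref, alt)]
    fraction = counts.get(alt, 0) / depth
    if sd == 0:
        return fraction > mu  # flat background: any excess counts as signal
    return (fraction - mu) / sd > z_cutoff
```

In this sketch, a site with a 20% G fraction at an A reference position would be flagged when the A_to_G background sits around 1-2%, while a site at 1.7% would not.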
8
9 LIMITATIONS:
10
11 This is a very early version with several limitations. Current limitations are: no support for indels, no plotting of the noise models, and incorrect syntax for multi-allelic sites in the VCF file.
12
13 Any feedback is welcome.
1
15be98f10c13 change to readme file regarding the loc files.
geert-vandeweyer
parents: 0
diff changeset
14
15
16 INSTALLATION:
17
18 After installation, complete the dbsnp.loc and dc_genotyper_indexes.loc files. DCG supports multithreading, but keep in mind that tests have shown that using more than 6-8 threads will lead to I/O bottlenecks.
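When choosing a thread count for a job, the advice above can be applied with a small cap. The following Python sketch is only an illustration: `choose_thread_count` and `MAX_USEFUL_THREADS` are hypothetical names, not part of DC_Genotyper, and the cap of 8 is simply the upper end of the 6-8 range mentioned above.

```python
import os

# Upper end of the 6-8 thread range reported to avoid I/O bottlenecks (assumed cap)
MAX_USEFUL_THREADS = 8


def choose_thread_count(requested=None):
    """Pick a thread count limited by available cores and the I/O cap."""
    available = os.cpu_count() or 1  # cpu_count() may return None
    wanted = available if requested is None else requested
    return max(1, min(wanted, available, MAX_USEFUL_THREADS))
```

The returned value would then be passed to whatever thread setting the tool is configured with.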