Repository 'annovar'
hg clone https://toolshed.g2.bx.psu.edu/repos/saskia-hiltemann/annovar

Changeset 2:565c0e690238 (2013-11-18)
Previous changeset 1:7d9353127f8a (2013-11-05) Next changeset 3:ff5325029a8e (2014-04-10)
Commit message:
Added support for LJB2, COSMIC67, CLINVAR and NCI60. Fixed dgv annotation to use new UCSC table dgvMerged instead.
modified:
README
README~
tool-data/annovar.loc.sample
tools/annovar/annovar.sh
tools/annovar/annovar.xml
b
diff -r 7d9353127f8a -r 565c0e690238 README
--- a/README Tue Nov 05 07:16:32 2013 -0500
+++ b/README Mon Nov 18 10:32:33 2013 -0500
b
b'@@ -33,181 +33,210 @@\n \t\n list of files in my own humandb folder:\n \n-\thg18_ALL.sites.2012_04.txt\n-\thg18_ALL.sites.2012_04.txt.idx\n-\thg18_avsift.txt\n-\thg18_avsift.txt.idx\n-\thg18_CEU.sites.2010_07.txt\n-\thg18_CEU.sites.2010_07.txt.idx\n-\thg18_cg46.txt\n-\thg18_cg46.txt.idx\n-\thg18_cg69.txt\n-\thg18_cg69.txt.idx\n-\thg18_cytoBand.txt\n-\thg18_dgv.txt\n-\thg18_ensGeneMrna.fa\n-\thg18_ensGene.txt\n-\thg18_esp5400_aa.txt\n-\thg18_esp5400_aa.txt.idx\n-\thg18_esp5400_all.txt\n-\thg18_esp5400_all.txt.idx\n-\thg18_esp5400_ea.txt\n-\thg18_esp5400_ea.txt.idx\n-\thg18_esp6500_aa.txt\n-\thg18_esp6500_aa.txt.idx\n-\thg18_esp6500_all.txt\n-\thg18_esp6500_all.txt.idx\n-\thg18_esp6500_ea.txt\n-\thg18_esp6500_ea.txt.idx\n-\thg18_esp6500si_aa.txt\n-\thg18_esp6500si_aa.txt.idx\n-\thg18_esp6500si_all.txt\n-\thg18_esp6500si_all.txt.idx\n-\thg18_esp6500si_ea.txt\n-\thg18_esp6500si_ea.txt.idx\n-\thg18_example_db_generic.txt\n-\thg18_example_db_gff3.txt\n-\thg18_genomicSuperDups.txt\n-\thg18_gerp++gt2.txt\n-\thg18_gerp++gt2.txt.idx\n-\thg18_gwasCatalog.txt\n-\thg18_JPTCHB.sites.2010_07.txt\n-\thg18_JPTCHB.sites.2010_07.txt.idx\n-\thg18_keggMapDesc.txt\n-\thg18_keggPathway.txt\n-\thg18_kgXref.txt\n-\thg18_knownGeneMrna.fa\n-\thg18_knownGene.txt\n-\thg18_ljb_all.txt\n-\thg18_ljb_all.txt.idx\n-\thg18_ljb_lrt.txt\n-\thg18_ljb_lrt.txt.idx\n-\thg18_ljb_mt.txt\n-\thg18_ljb_mt.txt.idx\n-\thg18_ljb_phylop.txt\n-\thg18_ljb_phylop.txt.idx\n-\thg18_ljb_pp2.txt\n-\thg18_ljb_pp2.txt.idx\n-\thg18_ljb_sift.txt\n-\thg18_ljb_sift.txt.idx\n-\thg18_phastConsElements44way.txt\n-\thg18_refGeneMrna.fa\n-\thg18_refGene.txt\n-\thg18_refLink.txt\n-\thg18_snp128NonFlagged.txt\n-\thg18_snp128NonFlagged.txt.idx\n-\thg18_snp128.txt\n-\thg18_snp128.txt.idx\n-\thg18_snp129NonFlagged.txt\n-\thg18_snp129NonFlagged.txt.idx\n-\thg18_snp129.txt\n-\thg18_snp129.txt.idx\n-\thg18_snp130NonFlagged.txt\n-\thg18_snp130NonFlagged.txt.idx\n-\thg18_snp130.txt\n-\thg18_snp130.txt.idx\n-\thg18_snp131NonFlagged.txt\n-\thg18_snp131NonFlagged.txt.idx\n-\thg18_snp131.txt\n-\thg18_snp131.txt.idx\n-\thg18_snp132NonFlagged.txt\n-\thg18_snp132NonFlagged.txt.idx\n-\thg18_snp132.txt\n-\thg18_snp132.txt.idx\n-\thg18_tfbsConsSites.txt\n-\thg18_YRI.sites.2010_07.txt\n-\thg18_YRI.sites.2010_07.txt.idx\n-\thg19_AFR.sites.2012_04.txt\n-\thg19_AFR.sites.2012_04.txt.idx\n-\thg19_ALL.sites.2010_11.txt\n-\thg19_ALL.sites.2010_11.txt.idx\n-\thg19_ALL.sites.2012_02.txt\n-\thg19_ALL.sites.2012_02.txt.idx\n-\thg19_ALL.sites.2012_04.txt\n-\thg19_ALL.sites.2012_04.txt.idx\n-\thg19_AMR.sites.2012_04.txt\n-\thg19_AMR.sites.2012_04.txt.idx\n-\thg19_ASN.sites.2012_04.txt\n-\thg19_ASN.sites.2012_04.txt.idx\n-\thg19_avsift.txt\n-\thg19_avsift.txt.idx\n-\thg19_cg46.txt\n-\thg19_cg46.txt.idx\n-\thg19_cg69.txt\n-\thg19_cg69.txt.idx\n-\thg19_cosmic61.txt\n-\thg19_cosmic61.txt.idx\n-\thg19_cosmic63.txt\n-\thg19_cosmic63.txt.idx\n-\thg19_cosmic64.txt\n-\thg19_cosmic64.txt.idx\n-\thg19_cosmic65.txt\n-\thg19_cosmic65.txt.idx\n-\thg19_cytoBand.txt\n-\thg19_dgv.txt\n-\thg19_ensGeneMrna.fa\n-\thg19_ensGene.txt\n-\thg19_esp5400_aa.txt\n-\thg19_esp5400_aa.txt.idx\n-\thg19_esp5400_all.txt\n-\thg19_esp5400_all.txt.idx\n-\thg19_esp5400_ea.txt\n-\thg19_esp5400_ea.txt.idx\n-\thg19_esp6500_aa.txt\n-\thg19_esp6500_aa.txt.idx\n-\thg19_esp6500_all.txt\n-\thg19_esp6500_all.txt.idx\n-\thg19_esp6500_ea.txt\n-\thg19_esp6500_ea.txt.idx\n-\thg19_esp6500si_aa.txt\n-\thg19_esp6500si_aa.txt.idx\n-\thg19_esp6500si_all.txt\n-\thg19_esp6500si_all.txt.idx\n-\thg19_esp6500si_ea.txt\n-\thg19_esp6500si_ea.txt.idx\n-\thg19_EUR.sites.2012_04.txt\n-\thg19_EUR.sites.2012_04.txt.idx\n-\thg19_genomicSuperDups.txt\n-\thg19_gerp++gt2.txt\n-\thg19_gerp++gt2.txt.idx\n-\thg19_gwasCatalog.txt\n-\thg19_keggMapDesc.txt\n-\thg19_keggPathway.txt\n-\thg19_kgXref.txt\n-\thg19_knownGeneMrna.fa\n-\thg19_knownGene.txt\n-\thg19_ljb_all.txt\n-\thg19_ljb_all.txt.idx\n-\thg19_ljb_lrt.txt\n-\thg19_ljb_lrt.txt.idx\n-\thg19_ljb_mt.txt\n-\thg19_ljb_mt.txt.idx\n-\thg19_ljb_phylop.txt\n-\thg19_ljb_phylop.txt.idx\n-\thg19_ljb_pp2.txt\n-\thg19_ljb_pp2.txt.idx\n-\thg19_ljb_sift.txt\n-\thg19_ljb_sift.txt.idx\n-\thg19_phastConsElements46way.txt\n-\thg19_refGeneMrna.fa\n-\thg19_refGene.txt\n-\thg19_refLink.txt\n-\thg19_snp130NonFlagged.txt\n-\thg19_snp130NonFlagged.txt.idx\n-\thg19_snp130.txt\n-\thg19_snp130.t'..b'+hg18_genomicSuperDups.txt\n+hg18_gerp++gt2.txt\n+hg18_gerp++gt2.txt.idx\n+hg18_gwasCatalog.txt\n+hg18_kgXref.txt\n+hg18_knownGene.txt\n+hg18_knownGeneMrna.fa\n+hg18_ljb2_fathmm.txt\n+hg18_ljb2_fathmm.txt.idx\n+hg18_ljb2_gerp++.txt\n+hg18_ljb2_gerp++.txt.idx\n+hg18_ljb2_ma.txt\n+hg18_ljb2_ma.txt.idx\n+hg18_ljb2_mt.txt\n+hg18_ljb2_mt.txt.idx\n+hg18_ljb2_phylop.txt\n+hg18_ljb2_phylop.txt.idx\n+hg18_ljb2_pp2hdiv.txt\n+hg18_ljb2_pp2hdiv.txt.idx\n+hg18_ljb2_pp2hvar.txt\n+hg18_ljb2_pp2hvar.txt.idx\n+hg18_ljb2_sift.txt\n+hg18_ljb2_sift.txt.idx\n+hg18_ljb2_siphy.txt\n+hg18_ljb2_siphy.txt.idx\n+hg18_phastConsElements44way.txt\n+hg18_refGene.txt\n+hg18_refGeneMrna.fa\n+hg18_refLink.txt\n+hg18_snp128.txt\n+hg18_snp128.txt.idx\n+hg18_snp128NonFlagged.txt\n+hg18_snp128NonFlagged.txt.idx\n+hg18_snp129.txt\n+hg18_snp129.txt.idx\n+hg18_snp129NonFlagged.txt\n+hg18_snp129NonFlagged.txt.idx\n+hg18_snp130.txt\n+hg18_snp130.txt.idx\n+hg18_snp130NonFlagged.txt\n+hg18_snp130NonFlagged.txt.idx\n+hg18_snp131.txt\n+hg18_snp131.txt.idx\n+hg18_snp131NonFlagged.txt\n+hg18_snp131NonFlagged.txt.idx\n+hg18_snp132.txt\n+hg18_snp132.txt.idx\n+hg18_snp132NonFlagged.txt\n+hg18_snp132NonFlagged.txt.idx\n+hg18_tfbsConsSites.txt\n+hg19_AFR.sites.2012_04.txt\n+hg19_AFR.sites.2012_04.txt.idx\n+hg19_ALL.sites.2010_11.txt\n+hg19_ALL.sites.2010_11.txt.idx\n+hg19_ALL.sites.2012_02.txt\n+hg19_ALL.sites.2012_02.txt.idx\n+hg19_ALL.sites.2012_04.txt\n+hg19_ALL.sites.2012_04.txt.idx\n+hg19_AMR.sites.2012_04.txt\n+hg19_AMR.sites.2012_04.txt.idx\n+hg19_ASN.sites.2012_04.txt\n+hg19_ASN.sites.2012_04.txt.idx\n+hg19_EUR.sites.2012_04.txt\n+hg19_EUR.sites.2012_04.txt.idx\n+hg19_avsift.txt\n+hg19_avsift.txt.idx\n+hg19_cg46.txt\n+hg19_cg46.txt.idx\n+hg19_cg69.txt\n+hg19_cg69.txt.idx\n+hg19_clinvar_20131105.txt\n+hg19_clinvar_20131105.txt.idx\n+hg19_cosmic61.txt\n+hg19_cosmic61.txt.idx\n+hg19_cosmic63.txt\n+hg19_cosmic63.txt.idx\n+hg19_cosmic64.txt\n+hg19_cosmic64.txt.idx\n+hg19_cosmic65.txt\n+hg19_cosmic65.txt.idx\n+hg19_cosmic67.txt\n+hg19_cytoBand.txt\n+hg19_dgvMerged.txt\n+hg19_ensGene.txt\n+hg19_ensGeneMrna.fa\n+hg19_esp5400_aa.txt\n+hg19_esp5400_aa.txt.idx\n+hg19_esp5400_all.txt\n+hg19_esp5400_all.txt.idx\n+hg19_esp6500_aa.txt\n+hg19_esp6500_aa.txt.idx\n+hg19_esp6500_all.txt\n+hg19_esp6500_all.txt.idx\n+hg19_esp6500_ea.txt\n+hg19_esp6500_ea.txt.idx\n+hg19_esp6500si_aa.txt\n+hg19_esp6500si_aa.txt.idx\n+hg19_esp6500si_all.txt\n+hg19_esp6500si_all.txt.idx\n+hg19_esp6500si_ea.txt\n+hg19_esp6500si_ea.txt.idx\n+hg19_genomicSuperDups.txt\n+hg19_gerp++gt2.txt\n+hg19_gerp++gt2.txt.idx\n+hg19_gwasCatalog.txt\n+hg19_kgXref.txt\n+hg19_knownGene.txt\n+hg19_knownGeneMrna.fa\n+hg19_ljb2_fathmm.txt\n+hg19_ljb2_fathmm.txt.idx\n+hg19_ljb2_gerp++.txt\n+hg19_ljb2_gerp++.txt.idx\n+hg19_ljb2_ma.txt\n+hg19_ljb2_ma.txt.idx\n+hg19_ljb2_mt.txt\n+hg19_ljb2_phylop.txt\n+hg19_ljb2_phylop.txt.idx\n+hg19_ljb2_pp2hdiv.txt\n+hg19_ljb2_pp2hdiv.txt.idx\n+hg19_ljb2_pp2hvar.txt\n+hg19_ljb2_pp2hvar.txt.idx\n+hg19_ljb2_sift.txt\n+hg19_ljb2_sift.txt.idx\n+hg19_ljb2_siphy.txt\n+hg19_nci60.txt\n+hg19_nci60.txt.idx\n+hg19_phastConsElements46way.txt\n+hg19_refGene.txt\n+hg19_refGeneMrna.fa\n+hg19_refLink.txt\n+hg19_snp130.txt\n+hg19_snp130.txt.idx\n+hg19_snp130NonFlagged.txt\n+hg19_snp130NonFlagged.txt.idx\n+hg19_snp131.txt\n+hg19_snp131NonFlagged.txt\n+hg19_snp131NonFlagged.txt.idx\n+hg19_snp132.txt\n+hg19_snp132.txt.idx\n+hg19_snp132NonFlagged.txt\n+hg19_snp132NonFlagged.txt.idx\n+hg19_snp135.txt\n+hg19_snp135NonFlagged.txt\n+hg19_snp135NonFlagged.txt.idx\n+hg19_snp137.txt\n+hg19_snp137NonFlagged.txt\n+hg19_snp137NonFlagged.txt.idx\n+hg19_tfbsConsSites.txt\n \n+\n+obsolete functional impact database files: (disabled by default)\n+hg18_avsift.txt\n+hg18_avsift.txt.idx\n+hg19_ljb_all.txt\n+hg19_ljb_all.txt.idx\n+hg19_ljb_lrt.txt\n+hg19_ljb_lrt.txt.idx\n+hg19_ljb_mt.txt\n+hg19_ljb_mt.txt.idx\n+hg19_ljb_phylop.txt\n+hg19_ljb_phylop.txt.idx\n+hg19_ljb_pp2.txt\n+hg19_ljb_pp2.txt.idx\n+hg18_ljb_all.txt\n+hg18_ljb_all.txt.idx\n+hg18_ljb_lrt.txt\n+hg18_ljb_lrt.txt.idx\n+hg18_ljb_mt.txt\n+hg18_ljb_mt.txt.idx\n+hg18_ljb_phylop.txt\n+hg18_ljb_phylop.txt.idx\n+hg18_ljb_pp2.txt\n+hg18_ljb_pp2.txt.idx\n'
b
diff -r 7d9353127f8a -r 565c0e690238 README~
--- a/README~ Tue Nov 05 07:16:32 2013 -0500
+++ b/README~ Mon Nov 18 10:32:33 2013 -0500
b
@@ -141,6 +141,8 @@
  hg19_cosmic63.txt.idx
  hg19_cosmic64.txt
  hg19_cosmic64.txt.idx
+ hg19_cosmic65.txt
+ hg19_cosmic65.txt.idx
  hg19_cytoBand.txt
  hg19_dgv.txt
  hg19_ensGeneMrna.fa
b
diff -r 7d9353127f8a -r 565c0e690238 tool-data/annovar.loc.sample
--- a/tool-data/annovar.loc.sample Tue Nov 05 07:16:32 2013 -0500
+++ b/tool-data/annovar.loc.sample Mon Nov 18 10:32:33 2013 -0500
[
@@ -2,5 +2,5 @@
 #
 # <columns>value, dbkey, name, ANNOVAR_scripts, ANNOVAR_humandb</columns>
 
-hg18 hg18 build 36 (hg18) /path/to/annovarscripts /path/to/humandb
-hg19 hg19 build 37 (hg19) /path/to/annovarscripts /path/to/humandb
+#hg18 hg18 hg18 [Human Mar. 2006 (NCBI36/hg18)] /path/to/annovarscripts /path/to/humandb
+#hg19 hg19 hg19 [Human Feb. 2009 (GRCh37/hg19)] /path/to/annovarscripts /path/to/humandb
b
diff -r 7d9353127f8a -r 565c0e690238 tools/annovar/annovar.sh
--- a/tools/annovar/annovar.sh Tue Nov 05 07:16:32 2013 -0500
+++ b/tools/annovar/annovar.sh Mon Nov 18 10:32:33 2013 -0500
[
b'@@ -1,7 +1,12 @@\n #!/bin/bash\n \n test="N"\n+dofilter="N"\n \n+#########################\n+#\t   DEFINE SOME\n+#\t    FUNCTIONS\n+#########################\n \n function usage(){\n \techo "usage: $0 todo"\n@@ -167,7 +172,14 @@\n \n \n \n-set -- `getopt -n$0 -u -a --longoptions="inputfile: buildver: humandb: varfile: VCF: chrcol: startcol: endcol: refcol: obscol: vartypecol: convertcoords: geneanno: verdbsnp: tfbs: mce: cytoband: segdup: dgv: gwas: ver1000g: cg46: cg69: impactscores: esp: gerp: cosmic61: cosmic63: cosmic64: cosmic65: outall: outfilt: outinvalid: scriptsdir: dorunannovar: dofilter: filt_dbsnp: filt1000GALL: filt1000GAFR: filt1000GAMR: filt1000GASN: filt1000GEUR: filtESP6500ALL: filtESP6500EA: filtESP6500AA: filtcg46: filtcg69: dummy:" "h:" "$@"` || usage\n+#################################\n+#\n+#\t   PARSE PARAMETERS\n+#\n+#################################\n+\n+\n+set -- `getopt -n$0 -u -a --longoptions="inputfile: buildver: humandb: varfile: VCF: chrcol: startcol: endcol: refcol: obscol: vartypecol: convertcoords: geneanno: hgvs: verdbsnp: tfbs: mce: cytoband: segdup: dgv: gwas: ver1000g: cg46: cg69: impactscores: newimpactscores: otherinfo: esp: gerp: cosmic61: cosmic63: cosmic64: cosmic65: cosmic67: clinvar: nci60: outall: outfilt: outinvalid: scriptsdir: dorunannovar: dofilter: filt_dbsnp: filt1000GALL: filt1000GAFR: filt1000GAMR: filt1000GASN: filt1000GEUR: filtESP6500ALL: filtESP6500EA: filtESP6500AA: filtcg46: filtcg69: dummy:" "h:" "$@"` || usage\n [ $# -eq 0 ] && usage\n \n \n@@ -176,8 +188,8 @@\n do\n     case "$1" in\n        \t--inputfile)      \t\t\tinfile=$2;shift;;  # inputfile\n-\t\t--buildver)\t\t\t\t\tbuildver=$2;shift;; # hg18 or hg19\n-\t\t--humandb)\t\t\t\t\thumandb=$2;shift;; # location of humandb database\n+\t\t--buildver)\t\t\t\t\tbuildvertmp=$2;shift;; # hg18 or hg19\n+\t\t--humandb)\t\t\t\t\thumandbtmp=$2;shift;; # location of humandb database\n \t \t--varfile)      \t\t\tvarfile=$2;shift;; # Y or N  \n \t\t--VCF)\t\t\t\t\t\tvcf=$2;shift;; #Y or N\n \t\t--chrcol)      \t\t\t\tchrcol=$2;shift;;  # which column has chr \n@@ -188,6 +200,7 @@\n \t\t--vartypecol)      \t\t\tvartypecol=$2;shift;;  # which column has vartype\n \t\t--convertcoords)\t\t\tconvertcoords=$2;shift;;  # Y or N convert coordinate from CG to 1-based?\t\t\n \t\t--geneanno)      \t\t\tgeneanno=$2;shift;; # comma-separated list of strings refSeq, knowngene, ensgene  \n+\t\t--hgvs)\t\t\t\t\t\thgvs=$2;shift;;\n \t\t--verdbsnp)\t\t\t\t\tverdbsnp=$2;shift;; #comma-separated list of dbsnp version to annotate with (e.g. "132,135NonFlagged,137")"\n \t\t--tfbs)      \t\t\t\ttfbs=$2;shift;; \t# Y or N \n \t\t--mce)      \t\t\t\tmce=$2;shift;; \t# Y or N \n@@ -199,13 +212,18 @@\n \t\t--cg46)\t\t\t\t\t\tcg46=$2;shift;;\n \t\t--cg69)\t\t\t\t\t\tcg69=$2;shift;;\t\t\n \t\t--impactscores)      \t\timpactscores=$2;shift;; # Y or N \n-\t\t--scriptsdir)\t      \t\tscriptsdir=$2;shift;; # Y or N \n+\t\t--newimpactscores)      \tnewimpactscores=$2;shift;; # Y or N \n+\t\t--otherinfo)\t\t\t\totherinfo=$2;shift;; \n+\t\t--scriptsdir)\t      \t\tscriptsdirtmp=$2;shift;; # Y or N \n \t\t--esp)      \t\t\t\tesp=$2;shift;; \t# Y or N \n \t\t--gerp)      \t\t\t\tgerp=$2;shift;; \t# Y or N \n \t\t--cosmic61)\t\t\t\t\tcosmic61=$2;shift;;  # Y or N\n \t\t--cosmic63)\t\t\t\t\tcosmic63=$2;shift;;  # Y or N\n \t\t--cosmic64)\t\t\t\t\tcosmic64=$2;shift;;  # Y or N\n \t\t--cosmic65)\t\t\t\t\tcosmic65=$2;shift;;  # Y or N\n+\t\t--cosmic67)\t\t\t\t\tcosmic67=$2;shift;;  # Y or N\n+\t\t--nci60)\t\t\t\t\tnci60=$2;shift;;  # Y or N\n+\t\t--clinvar)\t\t\t\t\tclinvar=$2;shift;;  # Y or N\n \t\t--filt_dbsnp)\t\t\t\tfilt_dbsnp=$2;shift;;\n \t\t--filt1000GALL)\t\t\t\tthreshold_1000g_ALL=$2;shift;; #threshold value\n \t\t--filt1000GAFR)\t\t\t\tthreshold_1000g_AFR=$2;shift;; #threshold value\n@@ -220,8 +238,7 @@\n \t\t--outall)      \t\t\t\toutfile_all=$2;shift;; # file \n \t\t--outfilt)      \t\t\toutfile_filt=$2;shift;; # file\n \t\t--outinvalid)\t\t\t\toutfile_invalid=$2;shift;; #file\n-\t\t--dorunannovar)\t\t\t\tdorunannovar=$2;shift;; \t#Y or N\n-\t\t--dofilter)\t\t\t\t\tdofilter=$2;shift;; #Y or N\t\n+\t\t--dorunannovar)\t\t\t\tdorunannovar=$2;shift;; \t#Y or N\t\t\n        -h)        \tshift;;\n \t   --)        \tshift;break;;\n        -*)        \tusage;;\n@@ -230,6 +247,11 @@\n     shift\n done\n '..b'er -dbtype esp6500si_all annovarinput $humandb 2>&1\n-\t\n-\t\tannovarout=annovarinput.${buildver}_esp6500si_all_dropped\n-\t\tsed -i \'1i\\db\\t\'$esp6500_colheader_ALL\'\\tchromosome\\tstart\\tend\\treference\\talleleSeq"\'"$vcfheader"\'"\' $annovarout \n-\t\tjoinresults originalfile $annovarout 3 4 5 6 7 B.$esp6500_colheader_ALL\n-\n-\n-\t\t# European American\n-\t\t$scriptsdir/annotate_variation.pl --filter --buildver $buildver -dbtype esp6500si_ea annovarinput $humandb 2>&1\n-\t\n-\t\tannovarout=annovarinput.${buildver}_esp6500si_ea_dropped\n-\t\tsed -i \'1i\\db\\t\'$esp6500_colheader_EA\'\\tchromosome\\tstart\\tend\\treference\\talleleSeq"\'"$vcfheader"\'"\' $annovarout \n-\t\tjoinresults originalfile $annovarout 3 4 5 6 7 B.$esp6500_colheader_EA\n-\n-\t\t# African Americans\n-\t\t$scriptsdir/annotate_variation.pl --filter --buildver $buildver -dbtype esp6500si_aa annovarinput $humandb 2>&1\n-\t\n-\t\tannovarout=annovarinput.${buildver}_esp6500si_aa_dropped\n-\t\tsed -i \'1i\\db\\t\'$esp6500_colheader_AA\'\\tchromosome\\tstart\\tend\\treference\\talleleSeq"\'"$vcfheader"\'"\' $annovarout \n-\t\tjoinresults originalfile $annovarout 3 4 5 6 7 B.$esp6500_colheader_AA\n-\tfi\n-\n-\n-\n \t#GERP++\n \tif [ $gerp == "Y" ]\n \tthen\n@@ -1070,6 +1248,39 @@\n \n \tfi\n \n+\tif [[ $cosmic67 == "Y" && $buildver == "hg19" ]]\n+\tthen\n+\t\techo -e "\\nCOSMIC67 Annotation"\n+\t\t$scriptsdir/annotate_variation.pl --filter --buildver $buildver -dbtype cosmic67 annovarinput $humandb 2>&1\n+\t\n+\t\tannovarout="annovarinput.${buildver}_cosmic67_dropped"\n+\t\tsed -i \'1i\\db\\tCOSMIC67\\tchromosome\\tstart\\tend\\treference\\talleleSeq"\'"$vcfheader"\'"\' $annovarout \n+\t\tjoinresults originalfile $annovarout 3 4 5 6 7 B.COSMIC67\n+\n+\tfi\n+\t\n+\tif [[ $clinvar == "Y" && $buildver == "hg19" ]]\n+\tthen\n+\t\techo -e "\\nCLINVAR Annotation"\n+\t\t$scriptsdir/annotate_variation.pl --filter --buildver $buildver -dbtype clinvar_20131105 annovarinput $humandb 2>&1\n+\t\n+\t\tannovarout="annovarinput.${buildver}_clinvar_20131105_dropped"\n+\t\tsed -i \'1i\\db\\tCLINVAR\\tchromosome\\tstart\\tend\\treference\\talleleSeq"\'"$vcfheader"\'"\' $annovarout \n+\t\tjoinresults originalfile $annovarout 3 4 5 6 7 B.CLINVAR\n+\n+\tfi\n+\t\n+\tif [[ $nci60 == "Y" && $buildver == "hg19" ]]\n+\tthen\n+\t\techo -e "\\nNCI60 Annotation"\n+\t\t$scriptsdir/annotate_variation.pl --filter --buildver $buildver -dbtype nci60 annovarinput $humandb 2>&1\n+\t\n+\t\tannovarout="annovarinput.${buildver}_nci60_dropped"\n+\t\tsed -i \'1i\\db\\tNCI60\\tchromosome\\tstart\\tend\\treference\\talleleSeq"\'"$vcfheader"\'"\' $annovarout \n+\t\tjoinresults originalfile $annovarout 3 4 5 6 7 B.NCI60\n+\n+\tfi\n+\t\n \t#cg46\n \tif [[ $cg46 == "Y"  ]]\n \tthen\n@@ -1138,52 +1349,6 @@\n \n \n \n-############################################\n-#\n-#       Filter Annotated Variants \n-#\n-############################################\n-\n-\n-if [[ $dofilter == "Y" ]]\n-then\n-\techo "starting filtering"\n-\tcp originalfile filteredfile\n-\t\n-\t### do the filtering\n-\t# usage: runfilter <column name> <threshold>   (-1=do not filter, 0=filter any value)\n-\t\n-\t#1000genomes\n-\trunfilter filteredfile ${g1000_colheader_ALL} ${threshold_1000g_ALL}\n-\trunfilter filteredfile ${g1000_colheader_AFR} ${threshold_1000g_AFR}\n-\trunfilter filteredfile ${g1000_colheader_AMR} ${threshold_1000g_AMR}\n-\trunfilter filteredfile ${g1000_colheader_ASN} ${threshold_1000g_ASN}\n-\trunfilter filteredfile ${g1000_colheader_EUR} ${threshold_1000g_EUR}\n-\n-\t#esp\n-\trunfilter filteredfile ${esp6500_colheader_ALL} ${threshold_ESP6500_ALL}\n-\trunfilter filteredfile ${esp6500_colheader_EA} ${threshold_ESP6500_EA}\n-\trunfilter filteredfile ${esp6500_colheader_AA} ${threshold_ESP6500_AA}\t\t\n-\t\n-\t#dbsnp\n-\tfor version in $filt_dbsnpstr\n-\tdo\n-\t\tif [ $version == "None" ] \n-\t\tthen\n-\t\t\tbreak\n-\t\tfi \n-\t\trunfilter filteredfile "db$version" "text"  #-42 will filter any non-empty string in that field\n-\n-\tdone\n-\t\n-\t#complete genomics\n-\trunfilter filteredfile ${cg46_colheader} ${threshold_cg46}\n-\trunfilter filteredfile ${cg69_colheader} ${threshold_cg69}\n-\n-\t#move filtered output file to galaxy output file\n-\tcp filteredfile $outfile_filt\n-\t\n-fi\n \n \n \n@@ -1201,3 +1366,4 @@\n \n \n \n+\n'
b
diff -r 7d9353127f8a -r 565c0e690238 tools/annovar/annovar.xml
--- a/tools/annovar/annovar.xml Tue Nov 05 07:16:32 2013 -0500
+++ b/tools/annovar/annovar.xml Mon Nov 18 10:32:33 2013 -0500
b
@@ -7,13 +7,13 @@
 
  <command interpreter="bash">
  annovar.sh
- --impactscores ${impactscores}
  --esp ${esp}
  --gerp ${gerp}
  --cosmic61 ${cosmic61}
  --cosmic63 ${cosmic63}
  --cosmic64 ${cosmic64}
- --cosmic65 ${cosmic65}
+ --cosmic65 ${cosmic65}
+ --cosmic67 ${cosmic67}
  --outall ${annotated}
  --outinvalid ${invalid}
  --dorunannovar ${dorun}
@@ -57,6 +57,10 @@
  --cg46 ${cgfortysix}
  --cg69 ${cgsixtynine}
  --ver1000g ${ver1000g}
+ --hgvs ${hgvs}
+ --otherinfo ${otherinfo}
+ --newimpactscores ${newimpactscores}
+ --clinvar ${clinvar}
 
  </command>
 
@@ -98,7 +102,8 @@
  <option value="knowngene"> UCSC KnownGene </option>
  <option value="ensgene"  > Ensembl </option>
  </param>
-
+ <param name="hgvs" type="boolean" checked="False" truevalue="-hgvs" falsevalue="N" label="Use HGVS nomenclature for RefSeq annotation" help="if checked, cDNA level annotation is compatible with HGVS"/>
+
 
  <!-- region-based annotation -->
  <param name="cytoband" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Cytogenic band Annotation?" help="This option identifies Giemsa-stained chromosomes bands, (e.g. 1q21.1-q23.3)."/>
@@ -154,14 +159,32 @@
 
  <param name="gerp" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="GERP++ Annotation?" help="GERP identifies constrained elements in multiple alignments by quantifying substitution deficits (see http://mendel.stanford.edu/SidowLab/downloads/gerp/ for details) This option annotates those variants having GERP++>2 in human genome, as this threshold is typically regarded as evolutionarily conserved and potentially functional"/>
 
+ <param name="clinvar" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="CLINVAR Annotation? (hg19 only)" help="version 2013-11-05. Annotations include Variant Clinical Significance (unknown, untested, non-pathogenic, probable-non-pathogenic, probable-pathogenic, pathogenic, drug-response, histocompatibility, other) and Variant disease name."/>
+ <param name="nci60" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Annotate with NCI60? (hg19 only)" help="NCI-60 exome allele frequency data"/>
  <param name="cgfortysix" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Complete Genomics 46 Genomes?" help="Diversity Panel; 46 unrelated individuals"/>
  <param name="cgsixtynine" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Complete Genomics 69 Genomes?" help="Diversity Panel, Pedigree, YRI trio and PUR trio"/>
  <param name="cosmic61" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Annotate with COSMIC61? (hg19 only)"/>
  <param name="cosmic63" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Annotate with COSMIC63? (hg19 only)"/>
  <param name="cosmic64" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Annotate with COSMIC64? (hg19 only)"/>
  <param name="cosmic65" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Annotate with COSMIC65? (hg19 only)"/>
+ <param name="cosmic67" type="boolean" checked="False" truevalue="Y" falsevalue="N" label="Annotate with COSMIC67? (hg19 only)"/>
 
-<param name="impactscores" type="select" label="Select functional impact scores annotate with" multiple="true" display="checkboxes" optional="true" help="LJB refers to Liu, Jian, Boerwinkle paper in Human Mutation, pubmed ID 21520341.">
+ <param name="newimpactscores" type="select" label="Select functional impact scores (LJB2)" multiple="true" display="checkboxes" optional="true" help="LJB refers to Liu, Jian, Boerwinkle paper in Human Mutation, pubmed ID 21520341. ">
+ <option value="ljb2_sift"> SIFT score </option>
+ <option value="ljb2_pp2hdiv"> PolyPhen2 HDIV score </option>
+ <option value="ljb2_pp2hvar" > PolyPhen2 HVAR score </option>
+ <option value="ljb2_mt" > MutationTaster score </option>
+ <option value="ljb2_ma" > MutationAssessor score </option>
+ <option value="ljb2_lrt"> LRT score (Likelihood Ratio Test) </option>
+ <option value="ljb2_phylop"> PhyloP score </option>
+ <option value="ljb2_fathmm" > FATHMM score </option>
+ <option value="ljb2_gerp"> GERP++ score </option>
+ <option value="ljb2_siphy"> SiPhy score </option>
+ </param>
+ <param name="otherinfo" type="boolean" checked="False" truevalue="-otherinfo" falsevalue="N" label="Also get predictions where possible?" help="e.g. annotated as -score,damaging- or -score,benign- instead of just score"/>
+
+ <!--  OBSOLETE impact scores, uncomment for backwards compatibility, add argument impactscores to command
+<param name="impactscores" type="select" label="Select functional impact scores annotate with (OBSOLETE)" multiple="true" display="checkboxes" optional="true" help="LJB refers to Liu, Jian, Boerwinkle paper in Human Mutation, pubmed ID 21520341.">
  <option value="avsift"> AV SIFT </option>
  <option value="ljbsift"> LJB SIFT (corresponds to 1-SIFT)</option>
  <option value="pp2"> PolyPhen2 </option>
@@ -169,7 +192,7 @@
  <option value="lrt"> LRT (Likelihood Ratio Test) </option>
  <option value="phylop"> PhyloP </option>
  </param>
-
+ -->
 
  <!-- prefix for output file so you dont have to manually rename history items -->
  <param name="fname" type="text" value="" label="Prefix for your output file" help="Optional"/>
@@ -196,13 +219,23 @@
 
 Input Formats may be one of the following:
 
- VCF file
-
- Complete Genomics varfile
+VCF file
+Complete Genomics varfile
+
+Custom tab-delimited file (specify chromosome, start, end, reference allele, observed allele columns)
 
- Custom tab-delimited file (specify chromosome, start, end, reference allele, observed allele columns)
+Custom tab-delimited CG-derived file (specify chromosome, start, end, reference allele, observed allele, varType columns)
+
+
+**Database Notes**
 
- Custom tab-delimited CG-derived file (specify chromosome, start, end, reference allele, observed allele, varType columns)
+see ANNOVAR website for extensive documentation, a few notes on some of the databases:
+
+**LJB2 Database**
+
+PolyPhen2 HVAR should be used for diagnostics of Mendelian diseases, which requires distinguishing mutations with drastic effects from all the remaining human variation, including abundant mildly deleterious alleles.The authors recommend calling probably damaging if the score is between 0.909 and 1, and possibly damaging if the score is between 0.447 and 0.908, and benign if the score is between 0 and 0.446.
+
+PolyPhen HDIV should be used when evaluating rare alleles at loci potentially involved in complex phenotypes, dense mapping of regions identified by genome-wide association studies, and analysis of natural selection from sequence data. The authors recommend calling probably damaging if the score is between 0.957 and 1, and possibly damaging if the score is between 0.453 and 0.956, and benign is the score is between 0 and 0.452. 
 
  </help>