# HG changeset patch
# User jjohnson
# Date 1578327066 18000
# Node ID b59ae99e47d4ed87df16c60c6e1804b53264360a
"planemo upload for repository https://github.com/galaxyproteomics/tools-galaxyp/tree/master/tools/drep commit b155a1d533b7317ceb0ec642ffe3e986117df539"

diff -r 000000000000 -r b59ae99e47d4 drep_compare.xml
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/drep_compare.xml Mon Jan 06 11:11:06 2020 -0500
@@ -0,0 +1,59 @@
+
+ compare a list of genomes
+
+ macros.xml
+
diff -r 000000000000 -r b59ae99e47d4 drep_dereplicate.xml
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/drep_dereplicate.xml Mon Jan 06 11:11:06 2020 -0500
@@ -0,0 +1,66 @@
+
+ De-replicate a list of genomes
+
+ macros.xml
+
diff -r 000000000000 -r b59ae99e47d4 macros.xml
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/macros.xml Mon Jan 06 11:11:06 2020 -0500
@@ -0,0 +1,388 @@
+
+ 2.3.2
+
+ drep
+
+ 10.1038/ismej.2017.126
+
+ --ignoreGenomeQuality is useful with
+ bacteriophages or eukaryotes or things where checkM
+ scoring does not work. Will only choose genomes based
+ on length and N50.
+
+ (for ANIn/ANImf only; gANI can only do larger method)
+
+ (passed to scipy.cluster.hierarchy.linkage)
+
+Based off of the formula:
+A*Completeness - B*Contamination + C*(Contamination * (strain_heterogeneity/100)) + D*log(N50) + E*log(size)
+A = completeness_weight; B = contamination_weight; C = strain_heterogeneity_weight; D = N50_weight; E = size_weight;
+
+ (for ANIn/ANImf only; gANI can only do larger method)
+
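
The scoring formula quoted in the `macros.xml` help text can be sketched in Python as below. This is an illustrative reading of that formula only, not code taken from dRep or from this changeset. The `Weights` class and `genome_score` function are hypothetical names, and the help text does not state the logarithm base, so base 10 is assumed here and can be swapped through the `log` argument. No default weights are supplied because the patch text shown here does not give any.

```python
import math
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Weights:
    """Hypothetical container for the weights A..E named in the help text."""
    completeness_weight: float          # A
    contamination_weight: float         # B
    strain_heterogeneity_weight: float  # C
    N50_weight: float                   # D
    size_weight: float                  # E


def genome_score(
    completeness: float,
    contamination: float,
    strain_heterogeneity: float,
    n50: float,
    size: float,
    w: Weights,
    log: Callable[[float], float] = math.log10,  # assumption: base not given in the help text
) -> float:
    """Evaluate A*Completeness - B*Contamination
    + C*(Contamination * (strain_heterogeneity/100)) + D*log(N50) + E*log(size)."""
    return (
        w.completeness_weight * completeness
        - w.contamination_weight * contamination
        + w.strain_heterogeneity_weight * (contamination * (strain_heterogeneity / 100))
        + w.N50_weight * log(n50)
        + w.size_weight * log(size)
    )


if __name__ == "__main__":
    # Illustrative weights and genome statistics, chosen only for the example.
    example_weights = Weights(1.0, 5.0, 1.0, 0.5, 0.0)
    print(genome_score(90.0, 2.0, 50.0, 10_000, 1_000_000, example_weights))
```

Passing `log=math.log` evaluates the same expression with natural logarithms, which changes the relative influence of the N50 and size terms but not their sign.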