Previous changeset 10:fe3e8506112c (2021-10-01) Next changeset 12:b7e297c2ede6 (2021-10-22) |
Commit message:
"planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/pangolin commit 593540812c77e419c093a423e3598270a49b5e45" |
modified:
pangolin.xml |
removed:
test-data/2021-04-21/__init__.py test-data/2021-04-21/data/decisionTreeHeaders_v1.joblib test-data/2021-04-21/data/decisionTree_v1.joblib test-data/2021-04-21/data/decision_tree_rules.txt test-data/2021-04-21/data/lineage_recall_report.csv test-data/2021-04-21/data/lineages.downsample.csv test-data/2021-04-21/data/lineages.metadata.csv test-data/2021-04-21/supporting_information/data_prep_description.md test-data/pangolearn.loc test-data/pangolearn.loc.sample |
b |
diff -r fe3e8506112c -r 073de3fb84a0 pangolin.xml --- a/pangolin.xml Fri Oct 01 14:21:05 2021 +0000 +++ b/pangolin.xml Fri Oct 22 10:12:25 2021 +0000 |
[ |
@@ -1,10 +1,14 @@ -<tool id="pangolin" name="Pangolin" version="@TOOL_VERSION@+galaxy0" profile="20.01"> +<tool id="pangolin" name="Pangolin" version="@TOOL_VERSION@+galaxy1" profile="20.01"> <description>Phylogenetic Assignment of Outbreak Lineages</description> <macros> <token name="@TOOL_VERSION@">3.1.14</token> </macros> <requirements> <requirement type="package" version="@TOOL_VERSION@">pangolin</requirement> + <!-- the scorpio requirement is because pangolin got an update to use scorpio 0.3.13 (to detect AY.4.2 better) + while still using the pangolin 3.1.14 code. when pangolin gets updated past 3.1.14 this requirement can + be removed --> + <requirement type="package" version="0.3.13">scorpio</requirement> <requirement type="package" version="0.23.0">csvtk</requirement> </requirements> <command detect_errors="exit_code">< which excludes records without proper dates, removes duplicate sequences (taking the earliest sample of the duplicates), omits some sequences with known issues, filters by length and coverage, and trims the sequences to CDS. - -It also aligns the sequences using `mafft` and builds an ML tree using `iqtree`. A lineages is assigned to each sequence using `pangolin` with the previous data release. - -### Lineage Curation - -The phylogeny is annotated with lineage and then in `FigTree` the lineages are manually curated, drawing together a number of pieces of information including monophyly in the ML phylogeny (generally a bootstrap > 70 is required) and epidemiological data such as country and travel history. Any changes to lineage definitions and new lineages are documented during this process. - -- The lineage may have been defined earlier in the outbreak and with added sequence data, there is less support for that lineage. In these cases the associated epidemiological metadata is examined and the lineage may be refined or even dropped entirely. The lineage number will not be 'recycled', but the members will get reassigned the parent lineage designation. -- The lineage may have very clear epidemiological support and ambiguities or homoplasies in the sequences/ tree could contribute to low bootstrap values. In these cases, if the support is strong, the lineages are called. Recall rates for these lingeages within `pangolin` may be lower however. - - |
b |
diff -r fe3e8506112c -r 073de3fb84a0 test-data/pangolearn.loc --- a/test-data/pangolearn.loc Fri Oct 01 14:21:05 2021 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 |
b |
@@ -1,10 +0,0 @@ -# this is a tab separated file describing the location of pangoLEARN databases used for the -# pangolin SARS-CoV-2 lineage typing tool -# -# the columns are: -# value description format_version path -# -# for example -# 2021-04-14 pangoLEARN data release 2021-04-14 1.0 /tmp/database/pangolearn/pangolearn/2021-04-14 -2021-04-21 pangoLEARN data release 2021-04-21 1.0 ${__HERE__}/2021-04-21 -2021-05-27 pangoLEARN data release 2021-05-27 3.0 ${__HERE__}/2021-05-27 |
b |
diff -r fe3e8506112c -r 073de3fb84a0 test-data/pangolearn.loc.sample --- a/test-data/pangolearn.loc.sample Fri Oct 01 14:21:05 2021 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 |
b |
@@ -1,8 +0,0 @@ -# this is a tab separated file describing the location of pangoLEARN databases used for the -# pangolin SARS-CoV-2 lineage typing tool -# -# the columns are: -# value description format_version path -# -# for example -# 2021-04-14 pangoLEARN data release 2021-04-14 1.0 /tmp/database/pangolearn/pangolearn/2021-04-14 |