Custom sript that performs necessary formatting operations for the SARS-CoV2 Selection Analysis workflow.
If using NCBI as data source the file can be obtained from https://www.ncbi.nlm.nih.gov/projects/genome/sars-cov-2-seqs/ncov-sequences.yaml. If not given the tool will download it automatically.
If using non-NCBI data, the metadata input file must be tabular with the following columns: ID, collection_date, country, state (optional), and locality (optional). Optional columns should still be created even if they are not used.
Dates should be of the format: YYMMDD (example: May 1 2020 = 20200501).