view test-data/README.test_db @ 1:edb671f0661e draft default tip

planemo upload for repository https://github.com/Helmholtz-UFZ/ufz-galaxy-tools/blob/main/tools/genomad/ commit dddf8b42771f66318f3325607bbd7aa33a3716b5
author ufz
date Mon, 07 Oct 2024 11:51:02 +0000
parents 955e33326e20

Creating a smaller reference database: https://github.com/apcamargo/genomad/issues/104#issuecomment-2170949010

- Download the reference db v1.7
- Store it in the directory `genomad_db` in `test-data`
- Run the test and get the marker ids with `awk -v FS="\t" 'NR>1 && $9!="NA" {print $9}' output/sequence_annotate/sequence_genes.tsv | sort -u > markers`
- `join -1 2 -2 1 genomad_db/genomad_db.lookup markers | cut -d" " -f 2 | sort -u -n > sorted_markers`
- `cd genomad_db`
- `mmseqs createsubdb ~/projects/tools-iuc/tools/genomad/test-data/sorted_markers genomad_db genomad_microdb`
- `mv genomad_microdb.index genomad_db.index`
- `mv genomad_microdb.dbtype genomad_db.dbtype`
- `mv genomad_microdb genomad_db`
- Remove any remaining `genomad_microdb*` files
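
The marker extraction and lookup steps above can be tried on toy data before touching the real database. This is a minimal sketch, not the procedure used to build the test database: the file contents, the marker names (`M_A`, `M_B`, `M_C`) and the placeholder column headers are made up for illustration, and the lookup file is assumed to be tab-separated with the id in column 1 and the marker name in column 2. Only the `awk` and `join` pipelines are taken from the list.

```shell
#!/usr/bin/env bash
set -euo pipefail
export LC_ALL=C   # keep sort and join on the same collation order

# Work in a throwaway directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir"

# Toy stand-in for output/sequence_annotate/sequence_genes.tsv:
# one header line, then rows whose 9th column is a marker name or "NA".
{
  printf 'c1\tc2\tc3\tc4\tc5\tc6\tc7\tc8\tmarker\n'
  printf 'g1\t.\t.\t.\t.\t.\t.\t.\tM_B\n'
  printf 'g2\t.\t.\t.\t.\t.\t.\t.\tNA\n'
  printf 'g3\t.\t.\t.\t.\t.\t.\t.\tM_A\n'
  printf 'g4\t.\t.\t.\t.\t.\t.\t.\tM_B\n'
} > sequence_genes.tsv

# Toy stand-in for genomad_db/genomad_db.lookup (assumed: id, name, file number).
printf '0\tM_A\t0\n1\tM_B\t0\n2\tM_C\t0\n' > genomad_db.lookup

# Unique marker names that were hit in the test run (skip header and NA rows).
awk -v FS="\t" 'NR>1 && $9!="NA" {print $9}' sequence_genes.tsv | sort -u > markers

# Map marker names to numeric ids via the lookup file, then sort numerically.
join -1 2 -2 1 genomad_db.lookup markers | cut -d" " -f 2 | sort -u -n > sorted_markers

cat sorted_markers
```

`join` expects both inputs to be sorted on the join field. The toy lookup file is already ordered by name; if the real `genomad_db.lookup` is not, it would need to be sorted on column 2 first.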