This approach screens two proteins against all nucleotide sequence from the
NCBI nt database within hours on our cluster, leading to all organisms with an inter- esting gene structure for further investigation. As usual in Galaxy workflows every parameter, including the proximity distance, can be changed and additional steps can be easily added. For example additional filtering to refine the initial BLAST hits, or inclusion of a third query sequence. |
hg clone https://toolshed.g2.bx.psu.edu/repos/bgruening/find_genes_located_nearby_workflow