Repository revision
12:983bf725dfc2

Repository 'dbbuilder'
hg clone https://toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder

Protein Database Downloader tool metadata
Miscellaneous
dbbuilder
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.3.4
0.3.4
None
True
Version lineage of this tool (guids ordered most recent to oldest)
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.3.4 (this tool)
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.3.3
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.3.2
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.3.1
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.3.0
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.2.1
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.2.0
toolshed.g2.bx.psu.edu/repos/galaxyp/dbbuilder/dbbuilder/0.1.0
dbbuilder
Requirements (dependencies defined in the <requirements> tag set)
name version type
wget 1.20.1 package
python 3.8 package
requests 2.20.1 package
Additional information about this tool
#if $source.from == "uniprot"
            #set $query_fields = [] 
            #if $source.taxon_id
                #silent $query_fields.append('taxonomy_id:' + str($source.taxon_id))
            #else
                #silent $query_fields.append('taxonomy_id:' + str($source.taxon))
            #end if
            #if $source.reviewed:
                #silent $query_fields.append(str($source.reviewed))
            #end if
            #if $source.set:
                #silent $query_fields.append(str($source.set))
            #end if
            #set $query  = '(' + ')+AND+('.join($query_fields) + ')' 
            #set $url = 'https://rest.uniprot.org/uniprotkb/stream?compressed=true&format=fasta' + str($source.include_isoform) + '&query=' + $query
            #set $type = "uniprotkb_stream"
        #elif $source.from == "cRAP"
            ##set $url = "ftp://ftp.thegpm.org/fasta/cRAP/crap.fasta"
            #set $url = "https://raw.githubusercontent.com/pravs3683/cRAP/master/cRAP_protein_database.fasta"
            #set $type = "direct"
        #elif $source.from == "HMP"
            #set $url = 'http://downloads.hmpdacc.org/data/reference_genomes/body_sites/' + str($source.site) + '.pep.fsa'
            #set $type = "direct"
        #elif $source.from == "HOMD"
            #set $url = 'ftp://ftp.homd.org/human_oral_microbial_genomic_sequences/current/' + str($source.annotation)
            #if str($source.annotation).endswith('.tar.gz'):
                #set $type = "tgz" 
            #elif str($source.annotation).endswith('.zip'):
                #set $type = "zip" 
            #end if 
        #elif $source.from == "url"
            #set $url = $source.url
            #set $type = $source.archive_type
        #end if
        #if $type =="uniprotkb_stream"
            python '$__tool_directory__/uniprotkb.py' --url '$url' -o 'tmp.gz' && gzip -dc 'tmp.gz' > '${output_database}' 
        #elif $type =="direct"
            wget -nv '$url' -O '${output_database}' --no-check-certificate
        #elif $type =="zip"
            wget -nv '$url' -O tmp.zip --no-check-certificate && zcat -c tmp.zip > '${output_database}'
        #elif $type =="gzip"
            wget -nv '$url' -O tmp.gz --no-check-certificate && (if `command -v gzcat > /dev/null`;  then gzcat tmp.gz; else zcat tmp.gz ; fi) > '${output_database}'
        #elif $type =="bzip2"
            wget -nv '$url' -O tmp.bz2 --no-check-certificate && bzcat tmp.bz2 > '${output_database}'
        #elif $type =="tgz"
            wget -nv '$url' -O tmp.tar.gz && tar zxfO tmp.tar.gz > '${output_database}'
        #elif $type =="tbz"
            wget -nv '$url' -O tmp.tar.bz && tar jxfO tmp.tar.bz > '${output_database}'
        #end if

    
None
False
Functional tests
name inputs outputs required files
Test-1 source|from: cRAP
name: value
value
Test-2 source|taxon: 83333
source|taxon_id: 2697049
source|from: uniprot
name: value
value
Test-3 source|taxon_id: 2697049
source|reviewed: reviewed:true
source|set:
source|from: uniprot
name: value
value
Test-4 source|taxon_id: 2697049
source|reviewed: reviewed:false
source|set:
source|from: uniprot
name: value
value
Test-5 source|taxon: 83333
source|reviewed: reviewed:true
source|set:
source|include_isoform: True
source|from: uniprot
name: value
value
Test-6 source|taxon: 83333
source|reviewed: reviewed:true
source|set:
source|include_isoform: False
source|from: uniprot
name: value
value
Test-7 source|url: https://raw.githubusercontent.com/pravs3683/cRAP/master/cRAP_protein_database.fasta
source|archive_type: direct
source|from: url
name: value
value