This tool allows to create Sequence Bloom Trees starting from a set of FASTA or FASTQ files. It also allows to control the quality of the input dataset and exclude the files that do not reach a specified quality level.
Input file
The input of this tool is a set of FASTA or FASTQ experiments, additionally to a set of SRA accession numbers. For each of the selected experiments, the minimum abundance for the corresponding Bloom filter is required. Additionally, a quality control procedure could be applied to guarantee that the quality of every experiment always exceed a specified treshold. Otherwise, experiments with low quality level will be discarded.
The k-mer length must also be specified, additionally to the Bloom filter size. This last field is optional and it will be automatically estimated if not provided.
Output
This tool returns a collection containing the Sequence Bloom Tree nodes and a file representing the organization of the tree.
Take a look at the Query tool documentation for a detailed description about how to query a Sequence Bloom Tree.
Notes
This Galaxy tool has been developed by Fabio Cumbo.
Please visit this GithHub_repository for more information about the BloomTree Manager