The AllSome Sequence Bloom Tree Search Engine is a fast query tool that identifies all publicly available sequenced samples that express a transcript of interest.
The input for this tool is a list of (ID, TRANSCRIPT) pairs, one per line, in tab-delimited format:
id0	CCAACCAAAGGGAAAACTTTTTTCCGACTTTGGCCTAAAGGGTTTAACGGCCAAGTCAGAAGGGAAAAAGTTGCGCCA
id1	TTAATGACAGGGCCACATGATGTGAAAAAAAATCAGAAACCGAGTCAACGTGAGAAGATAGTACGTACTACCGCAAAT
...
idn	CAATTAATGATAAATATTTTATAAGGTGCGGAAATAAAGTGAGGAATATCTTTTAAATTCAAGTTCAATTCTGAAAGC
The ID can contain alphanumeric characters as well as spaces, dots, dashes, and round and square brackets. Any other character is removed.
The output of the tool is a collection containing one file per ID. Each file lists the accession numbers of the samples that express the corresponding transcript.
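The input format and the ID rule above can be checked locally before a query is submitted. The following Python sketch is only an illustration and is not part of the tool: the function names `sanitize_id` and `parse_queries` are hypothetical, and it assumes that "alphanumeric" means ASCII letters and digits and that disallowed characters are simply dropped from the ID.

```python
import re

# Assumption: the allowed ID characters are ASCII letters, digits, spaces,
# dots, dashes, and round and square brackets. Everything else is dropped.
_DISALLOWED = re.compile(r"[^A-Za-z0-9 .\-()\[\]]")


def sanitize_id(raw_id):
    """Return raw_id with every disallowed character removed (hypothetical helper)."""
    return _DISALLOWED.sub("", raw_id)


def parse_queries(text):
    """Parse tab-delimited (ID, TRANSCRIPT) lines into a list of tuples.

    Blank lines are skipped. A line that does not contain exactly two
    tab-separated fields raises ValueError.
    """
    queries = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue
        fields = line.split("\t")
        if len(fields) != 2:
            raise ValueError(
                f"line {line_number}: expected 2 tab-separated fields, "
                f"found {len(fields)}"
            )
        seq_id, transcript = fields
        queries.append((sanitize_id(seq_id), transcript.strip()))
    return queries
```

Calling `parse_queries` on the text of an input file returns the (ID, TRANSCRIPT) pairs with sanitized IDs, which makes it easy to spot a line that uses spaces instead of a tab before the query is run.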
Notes
This Galaxy tool has been developed by Fabio Cumbo.
Please visit this GitHub repository for more information about the AllSome Sequence Bloom Tree Search Engine.