WHAT IT DOES
This tool runs filtering on either primary GFF3 file of all domains, i.e. output of Protein Domains Finder tool or already filtered GFF3 file. Domains can be filtered based on:
All the records containing ambiguous domain type (e.g. RH/INT) are filtered out automatically. They do not take place in filtered gff file neither the protein sequence is derived from these potentially chimeric domains. Optimal results (for general usage) should be reached using the default quality filtering parameters which are appropriate to find all types of protein domains. Keep in mind that the results should be critically assessed based on your input data anyhow.
Filtered GFF3 file
Translated protein sequences of the filtered domains regions of original DNA sequence in fasta format
Translated sequences are taken from the best alignment (Best_Hit attribute) within a domain region, however this alignment does not necessarily have to cover the whole region reported as a domain in gff file