view test-data/strict_anno_stats.txt @ 2:9ca209477dfd draft default tip

planemo upload for repository https://github.com/Onnodg/Naturalis_NLOOR/tree/main/NLOOR_scripts/process_annotations_tool commit 4017d38cf327c48a6252e488ba792527dae97a70-dirty
author onnodg
date Mon, 15 Dec 2025 16:43:36 +0000

Starting processing for FASTA
=== PARAMETERS USED ===
uncertain_threshold: 0.95
eval_threshold: 1e-20
use_counts: False
ignore_rank: unknown
ignore_taxonomy: environmental
bitscore_perc_cutoff: 0.0
min_bitscore: 150
ignore_obiclean_type: singleton
ignore_illuminapairend_type: pairend
min_identity: 98
min_coverage: 95
ignore_seqids: 
min_support: 1
=== END PARAMETERS ===
Filtered FASTA written successfully (4682 sequences)
FASTA: total headers: 4725
FASTA: headers kept after filters and min_support=1: 4682
FASTA: removed due to header filters (illumina/obiclean/etc.): 43
FASTA: removed due to low dereplicated count (<1): 0
FASTA: total invalid (header filter + low support): 43
Reading BLAST annotations
BLAST: total hits read: 38539
BLAST: hits kept after quality filters: 10244
BLAST: hits filtered (evalue/coverage/identity/bitscore): 28295
BLAST: hits removed due to invalid taxon: 0
BLAST: hits removed due to ignored seqids: 0
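The log names four quality criteria for BLAST hits (evalue, coverage, identity, bitscore) and the parameter block gives their thresholds. The following is a minimal sketch of such a filter, not the tool's actual code: the `Hit` class, the `passes_quality_filters` function, and the direction and inclusiveness of each comparison are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Hypothetical container for the four fields the log says are filtered on."""
    evalue: float
    identity: float   # percent identity
    coverage: float   # percent query coverage
    bitscore: float


def passes_quality_filters(
    hit: Hit,
    eval_threshold: float = 1e-20,  # from the parameter block
    min_identity: float = 98,       # from the parameter block
    min_coverage: float = 95,       # from the parameter block
    min_bitscore: float = 150,      # from the parameter block
) -> bool:
    # Assumption: a hit is kept only if it meets all four thresholds,
    # with inclusive comparisons.
    return (
        hit.evalue <= eval_threshold
        and hit.identity >= min_identity
        and hit.coverage >= min_coverage
        and hit.bitscore >= min_bitscore
    )


good = Hit(evalue=1e-30, identity=99.0, coverage=100.0, bitscore=200.0)
weak = Hit(evalue=1e-5, identity=99.0, coverage=100.0, bitscore=200.0)
print(passes_quality_filters(good), passes_quality_filters(weak))
```

Under these assumptions the first hit is kept and the second is rejected on its e-value alone.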
ANNOTATION: total FASTA headers considered: 4682
ANNOTATION: reads with BLAST hits: 1635
ANNOTATION: reads without BLAST hits: 3047
ANNOTATION: unique annotated count (from header counts): 89087
ANNOTATION: total unique count (from FASTA): 111525
Header annotations written successfully
=== ANNOTATION STATISTICS ===
percentage_annotated: 34.6031746031746
annotated_sequences: 1635
total_sequences: 4725
percentage_unique_annotated: 79.88074422775162
unique_annotated: 89087
total_unique: 111525
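
The counts in this log are mutually consistent, and the two percentages can be reproduced from them. The sketch below only re-derives the arithmetic from the numbers printed above; the variable names are chosen for illustration and it is not the tool's own code. The reported percentage_annotated matches a denominator of 4725 (total_sequences, all FASTA headers), not the 4682 headers kept after filtering.

```python
# Counts copied from the log above.
total_headers = 4725
kept_headers = 4682
removed_headers = 43
hits_read = 38539
hits_kept = 10244
hits_filtered = 28295
reads_with_hits = 1635
reads_without_hits = 3047
unique_annotated = 89087
total_unique = 111525

# Consistency checks between the reported counts.
assert total_headers - removed_headers == kept_headers
assert hits_read - hits_filtered == hits_kept
assert reads_with_hits + reads_without_hits == kept_headers

# Percentages as reported in the ANNOTATION STATISTICS block.
percentage_annotated = reads_with_hits / total_headers * 100
percentage_unique_annotated = unique_annotated / total_unique * 100

print(f"percentage_annotated: {percentage_annotated:.2f}")
print(f"percentage_unique_annotated: {percentage_unique_annotated:.2f}")
```

Rounded to two decimals this gives 34.60 and 79.88, in agreement with the logged values.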