Starting processing for FASTA: test-data/obiclean_test_input_numbered.fasta
=== PARAMETERS USED ===
input_anno: test-data/obiclean_input_anno_final.tabular
input_unanno: test-data/obiclean_test_input_numbered.fasta
eval_plot: test-data/obiclean_output.png
taxa_output: test-data/obiclean_taxa.txt
circle_data: test-data/lca_singleton_circle.txt
header_anno: test-data/lca_obiclean_singleton_output.xlsx
anno_stats: test-data/obiclean_anno_stats.txt
filtered_fasta: test-data/filtered_fasta_test.fasta
uncertain_threshold: 0.9
eval_threshold: 1e-10
use_counts: False
ignore_rank: 
ignore_taxonomy: alnus,fagus
bitscore_perc_cutoff: 8.0
min_bitscore: 50
ignore_obiclean_type: 
ignore_illuminapairend_type: 
min_identity: 0
min_coverage: 0
ignore_seqids: 
min_support: 2
=== END PARAMETERS ===
Filtered FASTA written to: test-data/filtered_fasta_test.fasta (779 sequences)
FASTA: total headers: 4725
FASTA: headers kept after filters and min_support=2: 779
FASTA: removed due to header filters (illumina/obiclean/etc.): 0
FASTA: removed due to low dereplicated count (<2): 3946
FASTA: total invalid (header filter + low support): 3946
Reading BLAST annotations: test-data/obiclean_input_anno_final.tabular
BLAST: total hits read: 38539
BLAST: hits kept after quality filters: 37751
BLAST: hits filtered (evalue/coverage/identity/bitscore): 788
BLAST: hits removed due to invalid taxon: 447
BLAST: hits removed due to ignored seqids: 0
Note: 2598 BLAST q_ids not in FASTA (showing up to 10): ['M01687:476:000000000-LL5F5:1:1102:8280:1614_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:20052:2016_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:8559:2087_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:22618:2719_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:24990:3037_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:21041:3093_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:13143:3913_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:15381:4073_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:9098:4091_CONS(1)', 'M01687:476:000000000-LL5F5:1:1102:24520:4332_CONS(1)']
ANNOTATION: total FASTA headers considered: 779
ANNOTATION: reads with BLAST hits: 585
ANNOTATION: reads without BLAST hits: 194
ANNOTATION: unique annotated count (from header counts): 95658
ANNOTATION: total unique count (from FASTA): 106100
E-value plot written to: test-data/obiclean_output.png
Taxa summary written to: test-data/obiclean_taxa.txt
Header annotations written to: test-data/lca_obiclean_singleton_output.xlsx
Circle diagram JSON written to: test-data/lca_singleton_circle.txt
=== ANNOTATION STATISTICS ===
percentage_annotated: 12.380952380952381
annotated_sequences: 585
total_sequences: 4725
percentage_unique_annotated: 90.1583411875589
unique_annotated: 95658
total_unique: 106100