comparison cdhit_analysis.py @ 3:c6981ea453ae draft default tip

planemo upload for repository https://github.com/Onnodg/Naturalis_NLOOR/tree/main/NLOOR_scripts/process_clusters_tool commit ef31054ae26e19eff2f1b1f6c7979e39c47c0d5b-dirty
author onnodg
date Fri, 24 Oct 2025 09:38:24 +0000
parents 706b7acdb230
children
comparison
equal deleted inserted replaced
2:706b7acdb230 3:c6981ea453ae
8 Main steps: 8 Main steps:
9 1. Parse cd-hit-est cluster file and (optional) annotation file. 9 1. Parse cd-hit-est cluster file and (optional) annotation file.
10 2. Process each cluster to extract similarity, taxa, and e-value information. 10 2. Process each cluster to extract similarity, taxa, and e-value information.
11 3. Aggregate results across clusters. 11 3. Aggregate results across clusters.
12 4. Generate requested outputs: text summaries, plots, and Excel reports. 12 4. Generate requested outputs: text summaries, plots, and Excel reports.
13
14
15 Note: Uses a non-interactive matplotlib backend (Agg) for compatibility with Galaxy.
16 """ 13 """
17 14
18 import argparse 15 import argparse
19 from collections import Counter, defaultdict 16 from collections import Counter, defaultdict
20 import os 17 import os