Mercurial > repos > onnodg > cdhit_analysis
comparison cdhit_analysis.py @ 3:c6981ea453ae draft default tip
planemo upload for repository https://github.com/Onnodg/Naturalis_NLOOR/tree/main/NLOOR_scripts/process_clusters_tool commit ef31054ae26e19eff2f1b1f6c7979e39c47c0d5b-dirty
| author | onnodg |
|---|---|
| date | Fri, 24 Oct 2025 09:38:24 +0000 |
| parents | 706b7acdb230 |
| children |
comparison
equal
deleted
inserted
replaced
| 2:706b7acdb230 | 3:c6981ea453ae |
|---|---|
| 8 Main steps: | 8 Main steps: |
| 9 1. Parse cd-hit-est cluster file and (optional) annotation file. | 9 1. Parse cd-hit-est cluster file and (optional) annotation file. |
| 10 2. Process each cluster to extract similarity, taxa, and e-value information. | 10 2. Process each cluster to extract similarity, taxa, and e-value information. |
| 11 3. Aggregate results across clusters. | 11 3. Aggregate results across clusters. |
| 12 4. Generate requested outputs: text summaries, plots, and Excel reports. | 12 4. Generate requested outputs: text summaries, plots, and Excel reports. |
| 13 | |
| 14 | |
| 15 Note: Uses a non-interactive matplotlib backend (Agg) for compatibility with Galaxy. | |
| 16 """ | 13 """ |
| 17 | 14 |
| 18 import argparse | 15 import argparse |
| 19 from collections import Counter, defaultdict | 16 from collections import Counter, defaultdict |
| 20 import os | 17 import os |
