Mercurial > repos > onnodg > cdhit_analysis
comparison README.md @ 2:706b7acdb230 draft
planemo upload for repository https://github.com/Onnodg/Naturalis_NLOOR/tree/main/NLOOR_scripts/process_clusters_tool commit c2020ecc91cea0c8cf7439180cf796743c838b4d-dirty
| author | onnodg |
|---|---|
| date | Tue, 21 Oct 2025 07:54:21 +0000 |
| parents | |
| children |
comparison
equal
deleted
inserted
replaced
| 1:ff68835adb2b | 2:706b7acdb230 |
|---|---|
| 1 This script processes cluster output files from cd-hit-est for use in Galaxy. | |
| 2 It extracts cluster information, associates taxa and e-values from annotation files, | |
| 3 performs statistical calculations, and generates text and plot outputs | |
| 4 summarizing similarity and taxonomic distributions. | |
| 5 | |
| 6 | |
| 7 Main steps: | |
| 8 1. Parse cd-hit-est cluster file and (optional) annotation file. | |
| 9 2. Process each cluster to extract similarity, taxa, and e-value information. | |
| 10 3. Aggregate results across clusters. | |
| 11 4. Generate requested outputs: text summaries, plots, and Excel reports. | |
| 12 | |
| 13 | |
| 14 Note: Uses a non-interactive matplotlib backend (Agg) for compatibility with Galaxy. |
