comparison README.md @ 2:706b7acdb230 draft

planemo upload for repository https://github.com/Onnodg/Naturalis_NLOOR/tree/main/NLOOR_scripts/process_clusters_tool commit c2020ecc91cea0c8cf7439180cf796743c838b4d-dirty
author onnodg
date Tue, 21 Oct 2025 07:54:21 +0000
1:ff68835adb2b 2:706b7acdb230
This script processes cluster output files from cd-hit-est for use in Galaxy.
It extracts cluster information, associates taxa and e-values from annotation files,
performs statistical calculations, and generates text and plot outputs
summarizing similarity and taxonomic distributions.

Main steps:
1. Parse the cd-hit-est cluster file and, if provided, the optional annotation file.
2. Process each cluster to extract similarity, taxa, and e-value information.
3. Aggregate results across clusters.
4. Generate the requested outputs: text summaries, plots, and Excel reports.
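
The parsing and aggregation steps above can be sketched as follows. This is a minimal illustration, not the tool's actual code: the names `parse_clstr`, `summarize`, and `MEMBER_RE` are hypothetical, the read names are made up, and the sketch assumes the standard cd-hit-est `.clstr` layout (a `>Cluster N` header followed by member lines, with the representative marked by `*`). Annotation handling (taxa and e-values) is left out.

```python
import re
from statistics import mean

# Hypothetical pattern for one member line of a .clstr file, e.g.
#   "1<TAB>248nt, >read_b... at +/98.00%"   (regular member)
#   "0<TAB>250nt, >read_a... *"             (cluster representative)
MEMBER_RE = re.compile(r"^\d+\s+(\d+)(?:nt|aa), >(.+?)\.\.\. (\*|at .*?([\d.]+)%)$")


def parse_clstr(lines):
    """Group member records by cluster number."""
    clusters = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">Cluster"):
            current = int(line.split()[1])
            clusters[current] = []
            continue
        match = MEMBER_RE.match(line)
        if match is None or current is None:
            continue  # skip lines that do not look like member records
        length, name, marker, pct = match.groups()
        clusters[current].append({
            "name": name,
            "length": int(length),
            # The representative has no similarity value of its own.
            "similarity": None if marker == "*" else float(pct),
        })
    return clusters


def summarize(clusters):
    """Aggregate cluster size and mean similarity of non-representative members."""
    summary = {}
    for cluster_id, members in clusters.items():
        sims = [m["similarity"] for m in members if m["similarity"] is not None]
        summary[cluster_id] = {
            "size": len(members),
            "mean_similarity": mean(sims) if sims else None,
        }
    return summary


# Made-up example input in .clstr layout.
example = """>Cluster 0
0\t250nt, >read_a... *
1\t248nt, >read_b... at +/98.00%
2\t245nt, >read_c... at +/96.00%
>Cluster 1
0\t300nt, >read_d... *
""".splitlines()

clusters = parse_clstr(example)
summary = summarize(clusters)
```

Keeping the representative's similarity as `None` rather than 100 avoids inflating the per-cluster mean; whether the real script makes the same choice is not stated here.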

Note: Uses a non-interactive matplotlib backend (Agg) for compatibility with Galaxy.
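
As a rough illustration of the backend note, the sketch below selects Agg before `pyplot` is imported and renders a plot without any display. The function name `render_histogram` and the similarity values are hypothetical and only serve the example; the script's real plotting code may differ.

```python
import io

import matplotlib

# Select the non-interactive Agg backend before importing pyplot,
# so no display server is needed (as on a Galaxy compute node).
matplotlib.use("Agg")
import matplotlib.pyplot as plt


def render_histogram(similarities):
    """Render a similarity histogram to PNG bytes without opening a window."""
    fig, ax = plt.subplots()
    ax.hist(similarities, bins=10)
    ax.set_xlabel("Similarity (%)")
    ax.set_ylabel("Number of reads")
    buffer = io.BytesIO()
    fig.savefig(buffer, format="png")
    plt.close(fig)  # release the figure's memory
    return buffer.getvalue()


# Made-up similarity values for demonstration.
png_bytes = render_histogram([96.0, 97.5, 98.0, 99.2])
```

Writing to a file path instead of a buffer works the same way: pass the path to `fig.savefig`.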