What it does
CONCOCT (Clustering cONtigs with COverage and ComposiTion) performs unsupervised binning of metagenomic contigs using nucleotide composition (k-mer frequencies) and coverage data across multiple samples. It can bin metagenomic contigs accurately, down to the species level.
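As a rough illustration of what "composition and coverage" means for a single contig, the sketch below builds one feature vector from relative k-mer frequencies and per-sample coverage values. The function names `kmer_frequencies` and `feature_vector` and the choice of k = 4 are assumptions made for this example; this is not CONCOCT's own code, which applies further processing before clustering.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k=4):
    """Return relative frequencies of all 4**k DNA k-mers in `sequence`.

    K-mers containing characters other than A, C, G, T (for example N)
    are ignored. The result is ordered alphabetically by k-mer.
    """
    seq = sequence.upper()
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[km] for km in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[km] / total for km in kmers]


def feature_vector(sequence, coverages, k=4):
    """Concatenate composition (k-mer frequencies) and per-sample coverage."""
    return kmer_frequencies(sequence, k) + list(coverages)


# Hypothetical contig with coverage values in two samples.
vector = feature_vector("ACGTACGTTTGACA", [12.5, 0.8], k=4)
print(len(vector))  # 4**4 composition values plus 2 coverage values
```

Contigs from the same genome tend to share both a similar composition profile and a similar coverage pattern across samples, which is why the two kinds of values are combined into one vector.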
The tool accepts two inputs: a tabular file in which each row corresponds to a contig and each column to a sample (each value is the average coverage of that contig in that sample), and a file containing the contig sequences in FASTA format.
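The two inputs can be checked against each other before running the tool. The following is a minimal sketch, assuming the coverage table is tab-separated with a header row whose first column holds the contig identifier. The helper names and the sample data are hypothetical and are not part of CONCOCT.

```python
import csv
import io


def read_coverage_table(handle):
    """Parse a tab-separated coverage table into (sample_names, {contig: [coverage, ...]}).

    Assumes a header row, with the contig identifier in the first column.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    samples = header[1:]
    coverage = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        coverage[row[0]] = [float(value) for value in row[1:]]
    return samples, coverage


def read_fasta_ids(handle):
    """Return the sequence identifiers (first word after '>') in a FASTA file."""
    ids = []
    for line in handle:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                ids.append(fields[0])
    return ids


def missing_contigs(coverage, fasta_ids):
    """List FASTA contigs that have no row in the coverage table."""
    return [contig for contig in fasta_ids if contig not in coverage]


# Hypothetical example data: two samples, three contigs in the FASTA file.
COVERAGE_TSV = (
    "contig\tsample_1\tsample_2\n"
    "contig_1\t12.5\t0.8\n"
    "contig_2\t3.0\t7.25\n"
)
FASTA = ">contig_1 some description\nACGTACGT\n>contig_2\nGGCCTTAA\n>contig_3\nACGT\n"

samples, coverage = read_coverage_table(io.StringIO(COVERAGE_TSV))
fasta_ids = read_fasta_ids(io.StringIO(FASTA))
print(missing_contigs(coverage, fasta_ids))
```

With real files, the `io.StringIO` objects would be replaced by handles from `open(path, newline="")` for the table and `open(path)` for the FASTA file.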
Three outputs are produced: the cluster assignments for contigs longer than 1000 bp, the PCA-transformed matrix, and the PCA components.
The intended use of the CONCOCT tools is shown in the following image.
More information may be found on the CONCOCT homepage: https://github.com/BinPro/CONCOCT