PICRUSt2 Metagenome prediction (version 2.5.3+galaxy0)

Sequence abundance table (OTUs or ASVs):

The sequence abundances must be read counts, not relative abundances. By default, the tool normalizes the input sequence abundance table by the predicted marker gene copy numbers.

Sequence abundance table type:

Table with predicted gene family copy numbers:

This table is generated by the tool for Hidden state prediction (HSP)

Table of predicted marker gene (16S or other) copy numbers:

This table is generated by the tool for Hidden state prediction (HSP)

Maximum Nearest-sequenced taxon index (NSTI):

Sequences with an NSTI above this cutoff will be excluded.

Generate an output table stratified by sequences:

Skip normalizing sequence abundances by predicted marker gene copy numbers:

What it does

PICRUSt2 (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States) is a tool for predicting functional abundances based only on marker gene sequences.

Read more about the tool: https://github.com/picrust/picrust2/wiki

Metagenome Pipeline

The pipeline reads in a sequence abundance table (the abundances of OTUs or ASVs in BIOM, TSV, or mothur shared file format), the predicted marker gene copy numbers, and the predicted gene family copy numbers (these last two files are output by hsp.py).

Note

Per-sample metagenome functional profiles are generated from the predicted functions of each study sequence. These sequences typically correspond to OTUs or ASVs. By default, the specified sequence abundance table is normalized by the predicted number of marker gene copies before the final files are written. Optionally, the sample metagenome table stratified by contributing ASVs can also be output.

The sequence abundances must be read counts, not relative abundances. The tool first normalizes the input sequence abundance table by the predicted marker gene copy numbers, then determines the predicted functional profile of each sample. If the --strat_out option is used, a table stratified by sequence IDs (i.e. taxonomic contributors) is also produced. In the stratified table, rare ASVs can be collapsed into a single category based on the --min_reads and --min_samples options. Note that the output files are tab-delimited even if the input file was in BIOM format. The normalized sequence abundance table and the per-sample weighted nearest-sequenced taxon index (NSTI) values are also written to the output directory as separate files.
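The normalization and prediction steps described above can be illustrated with a small Python sketch. This is not PICRUSt2's own code: the function names, the toy tables, and the cutoff value are hypothetical. The sketch assumes each table is a plain dictionary and that the predicted abundance of a function in a sample is the sum, over retained sequences, of normalized abundance times predicted gene family copy number.

```python
# Illustrative sketch only; not PICRUSt2's actual implementation.
# All sequence IDs, sample names, and values below are made-up toy data.

def normalize_by_marker_copies(abundances, marker_copies):
    """Divide each sequence's read counts by its predicted marker gene copy number."""
    return {
        seq: {sample: count / marker_copies[seq] for sample, count in samples.items()}
        for seq, samples in abundances.items()
    }

def predict_metagenome(norm_abundances, gene_copies, nsti, max_nsti):
    """Sum (normalized abundance x gene family copy number) over retained sequences."""
    profile = {}
    for seq, samples in norm_abundances.items():
        if nsti[seq] > max_nsti:
            continue  # sequences above the NSTI cutoff are excluded
        for function, copies in gene_copies[seq].items():
            per_sample = profile.setdefault(function, {})
            for sample, abundance in samples.items():
                per_sample[sample] = per_sample.get(sample, 0.0) + abundance * copies
    return profile

# Read counts per sequence and sample (hypothetical).
abundances = {
    "ASV1": {"S1": 10, "S2": 0},
    "ASV2": {"S1": 4, "S2": 8},
    "ASV3": {"S1": 5, "S2": 5},
}
# Predicted marker gene copy numbers, gene family copy numbers, and NSTI values
# (hypothetical stand-ins for the HSP output tables).
marker_copies = {"ASV1": 2, "ASV2": 4, "ASV3": 1}
gene_copies = {
    "ASV1": {"K00001": 1, "K00002": 0},
    "ASV2": {"K00001": 2, "K00002": 3},
    "ASV3": {"K00001": 1, "K00002": 1},
}
nsti = {"ASV1": 0.1, "ASV2": 0.5, "ASV3": 2.5}

norm = normalize_by_marker_copies(abundances, marker_copies)
profile = predict_metagenome(norm, gene_copies, nsti, max_nsti=2.0)
print(profile)
```

In this toy example ASV3 exceeds the cutoff and contributes nothing, so the profile is built from ASV1 and ASV2 only. Skipping the normalization step would correspond to passing the raw counts straight to the second function.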

Input

Table of sequence abundances (BIOM, TSV, or mothur shared file format).

Output

Metagenome predictions:

Predicted per-sample metagenome functional profiles
Normalized sequence abundance table
Weighted nearest-sequenced taxon index (NSTI) values per-sample
When chosen within the tool's parameters: Predicted per-sample metagenome functional profiles, stratified by sequence ids (i.e. taxonomic contributors)
When chosen within the tool's parameters: Predicted per-sample metagenome functional profiles, wide-format stratified by sequence ids (i.e. taxonomic contributors)
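As a rough guide to reading the weighted NSTI output, the sketch below computes an abundance-weighted mean NSTI per sample. It assumes the weighting is a simple weighted average over the sequences present in each sample; the function name and the toy values are hypothetical, and the tool's exact calculation may differ in detail.

```python
# Illustrative sketch only; assumes weighted NSTI is an abundance-weighted mean.
# All sequence IDs, sample names, and values below are made-up toy data.

def weighted_nsti(abundances, nsti):
    """Return, per sample, sum(abundance * NSTI) / sum(abundance)."""
    samples = sorted({s for counts in abundances.values() for s in counts})
    result = {}
    for sample in samples:
        total = sum(counts.get(sample, 0) for counts in abundances.values())
        weighted = sum(
            counts.get(sample, 0) * nsti[seq] for seq, counts in abundances.items()
        )
        # A sample with no reads has no defined weighted NSTI.
        result[sample] = weighted / total if total else float("nan")
    return result

norm_abundances = {
    "ASV1": {"S1": 5.0, "S2": 0.0},
    "ASV2": {"S1": 1.0, "S2": 2.0},
}
nsti = {"ASV1": 0.1, "ASV2": 0.5}

weighted = weighted_nsti(norm_abundances, nsti)
print(weighted)
```

A sample dominated by sequences that are distant from any reference genome gets a higher weighted value, which suggests its predictions deserve more caution.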