Galaxy | Tool Preview

obiuniq (version 1.2.13)
Input format options
Input format options 0

What it does

The obiuniq command is in some way analog to the standard Unix uniq -c command.

Instead of working text line by text line as the standard Unix tool, the processing is done on sequence records.

A sequence record is a complex object composed of an identifier, a set of attributes (key=value), a definition, and the sequence itself.

The obiuniq command groups together sequence records. Then, for each group, a sequence record is printed.

A group is defined by the sequence and optionally by the values of a set of attributes specified with the -c option.

As the identifier, the set of attributes (key=value) and the definition of the sequence records that are grouped together may be different, two options (-m and -i) allow refining how these parts of the records are reported.

* By default, only attributes with identical values within a group of sequence records are kept.

* A count attribute is set to the total number of sequence records for each group.

* For each attribute specified by the -m option, a new attribute whose key is prefixed by merged is created. These new attributes contain the number of times each value occurs within the group of sequence records.


Project links:

OBITools