Galaxy | Tool Preview

MasterVar to gd_snp (version 1.0.0)

Dataset formats

The input dataset is in the MasterVar format provided by the Complete Genomics analysis process (Galaxy considers this to be tabular, but it must have the columns specified for MasterVar). The output dataset is a gd_snp table. (Dataset missing?)


What it does

This converts a Complete Genomics MasterVar file to gd_snp format, so it can be used with the genome diversity tools. It can either start a new dataset or append to an old one. When appending, if any new SNPs appear only in the MasterVar file they can either be skipped or backfilled with "-1" (unknown) for previous individuals/groups in the gd_snp dataset. Positions homozygous for the reference are skipped.


Examples