Galaxy | Tool Preview

GEMINI gene_wise (version 0.20.1+galaxy1)
Only files with version 0.20.1 are accepted.
Genotype filter expressions
Genotype filter expression 0
(--min-filters)
Region Filters
Region Filter 0
Additional constraints on variants
Additional constraints on variants 0
Output - included information
Output - included information 0

What it does

This tool extends the GEMINI inheritance pattern tool in that it lets you search for custom gene-wise inheritance patterns of variants, instead of fixed ones.

See also: the command line tool documentation


Genotype filters

The syntax for specifying a genotype filter (--gt-filter command line option) is the same as for the GEMINI query tool and is described here.

The difference with the gene_wise tool is that it lets you specify multiple such filters and, if you do, every filter can be met by a different variant as long as all of them are in the same gene.

This is useful if your analysis includes several families that you suspect (based on a shared phenotype) to have the same gene affected, but not necessarily through the same variant. In this case, you can formulate one filter per family like, for example:

gt_types.fam1_kid == HET and gt_types.fam1_mom == HOM_REF and gt_types.fam1_dad == HOM_REF

gt_types.fam2_kid == HET

gt_types.fam3_kid == HET

, which would allow you to find a causal gene that's affected by different (dominant) variants in children from three different families. Note that the first filter combines three conditions applied to family 1, which, thus, must be met by the same variant site.

Regular and required filters (--gt-filter vs --gt-filter-required) and the Minimum number of filters

For every single genotype filter you define you can specify whether it should be applied as a regular or as a required filter. The difference is that, if a variant doesn't pass a required filter it is excluded from further analysis. Of the regular filters, a gene and its variants only have to pass a threshold number defined by Minimum number of filters (--min-filters). Imagine, with the above filters you had specified --min-filters as 2, then a gene for which the child in family 3 carries one copy of a variant allele and the child in family 3 carries a copy of a different allele would be reported no matter if any other allele in that gene passes the first filter, etc..


Region filters

They let you restrict your analysis to parts of the genome, which can be useful if you have prior knowledge of the approximate location of the causative gene.

If you specify more then one region filter, they get combined with a logical OR, meaning variants and genes falling in any of the regions are reported.


Additional constraints on variants

These get translated directly into the WHERE clause of an SQL query and, thus, have to be expressed in valid SQL syntax. Of particular interest, here, is the fact that, by default, the gene-wise tool applies the WHERE clause: is_exonic = 1 and impact_severity != 'LOW', which means the tool only considers variants in exons that are not of LOW impact severity (i.e., not silent mutations). While this can be a good and biologically justifiable setting, you can overwrite it if you need.

Note that in SQL syntax tests for equality use a single =, while genotype filters (discussed above) are following Python syntax and use == for the same purpose. Also note that non-numerical values need to be enclosed in single-quotes, e.g. 'LOW', but numerical values must NOT be.