What it does
Assuming you have defined the familial relationships between samples when loading your VCF into GEMINI, you can use this tool to identify candidate genes and variants that explain the inheritance pattern of a phenotype of interest.
Inheritance pattern detection rules
Autosomal recessive
Criteria:
If --lenient is specified, the 2 criteria prefixed with “[default]” are not applied.
If --allow-unaffected is specified, the criterion prefixed with “[affected]” is not enforced.
Autosomal dominant
Criteria:
If --lenient is specified, the criteria prefixed with “[default]” are not enforced.
If --allow-unaffected is specified, the criterion prefixed with “[affected]” is not enforced.
Note that, for autosomal dominant, --lenient allows singleton affecteds to be used to meet the --min-kindreds requirement if they are HET.
If there is incomplete penetrance in the kindred (unaffected obligate carriers), these individuals currently must be coded as having unknown phenotype or as being affected.
X-linked recessive
Criteria:
Note: Pseudo-autosomal regions are not accounted for by the tool.
X-linked dominant
Criteria:
Note: Pseudo-autosomal regions are not accounted for by the tool.
De-novo mutations
Criteria:
The last 3 items, prefixed with [default] can be turned off with --lenient.
If --allow-unaffected is specified, then the criterion prefixed [affected] is not enforced.
X-linked de-novo mutations
Criteria:
Note: Pseudo-autosomal regions are not accounted for by the tool.
Compound heterozygosity
Unlike canonical recessive sites where the same recessive allele is inherited from both parents at the same site in the gene, compound heterozygosity occurs when the individual’s phenotype is caused by two heterozygous recessive alleles at different sites in a particular gene.
To detect compound heterozygosity, the tool looks for two heterozygous variants impacting the same gene at different loci. The complicating factor is that this is a case of recessive inheritance and as such, we must also require that the consequential alleles at each heterozygous site were inherited on different chromosomes (one from each parent). Hence, where possible, the tool will phase by transmission.
Criteria (default):
All affected individuals must be heterozygous at both sites.
No unaffected can be homozygous alterate at either site.
Neither parent of an affected sample can be homozygous reference at both sites.
If any unphased-unaffected is het at both sites, the site will be given lower priority.
No phased-unaffected can be heterozygous at both sites.
Candidates where an affected from the same family does NOT share the same het pair are removed.
Sites are automatically phased by transmission when parents are present in order to remove false positive candidates.
If data from one or both parents are unavailable and the child’s data was not phased prior to loading into GEMINI, all comp_het variant pairs will automatically be given at most priority == 2. If there’s only a single parent and both the parent and the affected are HET at both sites, the candidate will have priority 3.
Criteria (--pattern-only):
Violation of Mendelian laws
The tool can be used to detect the following kinds of non-Mendelian patterns:
Criteria: