Mercurial > repos > miller-lab > genome_diversity
comparison discover_familial_relationships.xml @ 31:a631c2f6d913
Update to Miller Lab devshed revision 3c4110ffacc3
author | Richard Burhans <burhans@bx.psu.edu> |
---|---|
date | Fri, 20 Sep 2013 13:25:27 -0400 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
30:4188853b940b | 31:a631c2f6d913 |
---|---|
1 <tool id="gd_discover_familial_relationships" name="Close relatives" version="1.0.0"> | |
2 <description>: Discover familial relationships</description> | |
3 | |
4 <command interpreter="python"> | |
5 #import json | |
6 #import base64 | |
7 #import zlib | |
8 #set $ind_names = $input.dataset.metadata.individual_names | |
9 #set $ind_colms = $input.dataset.metadata.individual_columns | |
10 #set $ind_dict = dict(zip($ind_names, $ind_colms)) | |
11 #set $ind_json = json.dumps($ind_dict, separators=(',',':')) | |
12 #set $ind_comp = zlib.compress($ind_json, 9) | |
13 #set $ind_arg = base64.b64encode($ind_comp) | |
14 discover_familial_relationships.py '$input' '$input.ext' '$ind_arg' '$pop_input' '$output' | |
15 </command> | |
16 | |
17 <inputs> | |
18 <param name="input" type="data" format="gd_snp,gd_genotype" label="Input dataset" /> | |
19 <param name="pop_input" type="data" format="gd_indivs" label="Individuals dataset" /> | |
20 </inputs> | |
21 | |
22 <outputs> | |
23 <data name="output" format="tabular" /> | |
24 </outputs> | |
25 | |
26 <requirements> | |
27 <requirement type="package" version="0.1">gd_c_tools</requirement> | |
28 </requirements> | |
29 | |
30 <!-- | |
31 <tests> | |
32 </tests> | |
33 --> | |
34 | |
35 <help> | |
36 | |
37 **Dataset formats** | |
38 | |
39 The input datasets are in gd_snp_, gd_genotype_, and gd_indivs_ formats. | |
40 The output dataset is in tabular_ format. | |
41 | |
42 .. _gd_snp: ./static/formatHelp.html#gd_snp | |
43 .. _gd_genotype: ./static/formatHelp.html#gd_genotype | |
44 .. _gd_indivs: ./static/formatHelp.html#gd_indivs | |
45 .. _tabular: ./static/formatHelp.html#tab | |
46 | |
47 ----- | |
48 | |
49 **What it does** | |
50 | |
51 The user specifies a SNP table (either gd_snp or gd_genotype format) and | |
52 a set of individuals. The command runs the KING program (Manichaikul et | |
53 al., 2010) to look for pairs of distinct individuals in the specified | |
54 set that have a close family relationship. Putatively related pairs | |
55 are classified into five categories: | |
56 | |
57 1. duplicate or MZ twin | |
58 2. 1st degree relatives -- siblings (other than identical twins) or parent-child | |
59 3. 2nd degree relatives -- e.g., half-siblings, grandparent-grandchild pair, individual-uncle/aunt pair | |
60 4. 3rd degree relatives -- e.g., first cousins | |
61 5. unrelated | |
62 | |
63 Reference: | |
64 | |
65 Manichaikul A, Mychaleckyj JC, Rich SS, Daly K, Sale M, Chen WM (2010) Robust relationship inference in genome-wide association studies. Bioinformatics 26: 2867-2873. | |
66 </help> | |
67 </tool> |