comparison discover_familial_relationships.xml @ 31:a631c2f6d913

Update to Miller Lab devshed revision 3c4110ffacc3
author Richard Burhans <burhans@bx.psu.edu>
date Fri, 20 Sep 2013 13:25:27 -0400
parents
children
comparison
equal deleted inserted replaced
30:4188853b940b 31:a631c2f6d913
1 <tool id="gd_discover_familial_relationships" name="Close relatives" version="1.0.0">
2 <description>: Discover familial relationships</description>
3
4 <command interpreter="python">
5 #import json
6 #import base64
7 #import zlib
8 #set $ind_names = $input.dataset.metadata.individual_names
9 #set $ind_colms = $input.dataset.metadata.individual_columns
10 #set $ind_dict = dict(zip($ind_names, $ind_colms))
11 #set $ind_json = json.dumps($ind_dict, separators=(',',':'))
12 #set $ind_comp = zlib.compress($ind_json, 9)
13 #set $ind_arg = base64.b64encode($ind_comp)
14 discover_familial_relationships.py '$input' '$input.ext' '$ind_arg' '$pop_input' '$output'
15 </command>
16
17 <inputs>
18 <param name="input" type="data" format="gd_snp,gd_genotype" label="Input dataset" />
19 <param name="pop_input" type="data" format="gd_indivs" label="Individuals dataset" />
20 </inputs>
21
22 <outputs>
23 <data name="output" format="tabular" />
24 </outputs>
25
26 <requirements>
27 <requirement type="package" version="0.1">gd_c_tools</requirement>
28 </requirements>
29
30 <!--
31 <tests>
32 </tests>
33 -->
34
35 <help>
36
37 **Dataset formats**
38
39 The input datasets are in gd_snp_, gd_genotype_, and gd_indivs_ formats.
40 The output dataset is in tabular_ format.
41
42 .. _gd_snp: ./static/formatHelp.html#gd_snp
43 .. _gd_genotype: ./static/formatHelp.html#gd_genotype
44 .. _gd_indivs: ./static/formatHelp.html#gd_indivs
45 .. _tabular: ./static/formatHelp.html#tab
46
47 -----
48
49 **What it does**
50
51 The user specifies a SNP table (either gd_snp or gd_genotype format) and
52 a set of individuals. The command runs the KING program (Manichaikul et
53 al., 2010) to look for pairs of distinct individuals in the specified
54 set that have a close family relationship. Putatively related pairs
55 are classified into five categories:
56
57 1. duplicate or MZ twin
58 2. 1st degree relatives -- siblings (other than identical twins) or parent-child
59 3. 2nd degree relatives -- e.g., half-siblings, grandparent-grandchild pair, individual-uncle/aunt pair
60 4. 3rd degree relatives -- e.g., first cousins
61 5. unrelated
62
63 Reference:
64
65 Manichaikul A, Mychaleckyj JC, Rich SS, Daly K, Sale M, Chen WM (2010) Robust relationship inference in genome-wide association studies. Bioinformatics 26: 2867-2873.
66 </help>
67 </tool>