annotate xenome-1.0.1-r/xenome.1 @ 0:6d87470d68aa draft default tip

Uploaded
author sangok
date Thu, 23 Apr 2020 08:32:34 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
6d87470d68aa Uploaded
sangok
parents:
diff changeset
1 .TH xenome 1 "September 12, 2012" "Xenome User Manual"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
2 .SH NAME
6d87470d68aa Uploaded
sangok
parents:
diff changeset
3 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
4 xenome - a tool for classifying reads from xenograft sources.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
5 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
6 Version 1.0.1
6d87470d68aa Uploaded
sangok
parents:
diff changeset
7 .SH SYNOPSIS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
8 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
9 xenome index -T 8 -P idx -H mouse.fa -G human.fa
6d87470d68aa Uploaded
sangok
parents:
diff changeset
10 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
11 xenome classify -T 8 -P idx \[em]pairs \[em]host-name mouse
6d87470d68aa Uploaded
sangok
parents:
diff changeset
12 \[em]graft-name human -i in_1.fastq -i in_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
13 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
14 xenome help
6d87470d68aa Uploaded
sangok
parents:
diff changeset
15 .SH DESCRIPTION
6d87470d68aa Uploaded
sangok
parents:
diff changeset
16 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
17 Shotgun sequence read data derived from xenograft material contains
6d87470d68aa Uploaded
sangok
parents:
diff changeset
18 a mixture of reads arising from the host and reads arising from the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
19 graft.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
20 Xenome is an application for classifying the read mixture to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
21 separate the two, allowing for more precise analysis to be
6d87470d68aa Uploaded
sangok
parents:
diff changeset
22 performed.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
23 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
24 Xenome uses host and graft reference sequences to characterise the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
25 set of all possible k-mers according to whether they belong to:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
26 .IP \[bu] 2
6d87470d68aa Uploaded
sangok
parents:
diff changeset
27 only the graft (and NOT the host)
6d87470d68aa Uploaded
sangok
parents:
diff changeset
28 .IP \[bu] 2
6d87470d68aa Uploaded
sangok
parents:
diff changeset
29 only the host (and NOT the graft)
6d87470d68aa Uploaded
sangok
parents:
diff changeset
30 .IP \[bu] 2
6d87470d68aa Uploaded
sangok
parents:
diff changeset
31 both references
6d87470d68aa Uploaded
sangok
parents:
diff changeset
32 .IP \[bu] 2
6d87470d68aa Uploaded
sangok
parents:
diff changeset
33 neither reference
6d87470d68aa Uploaded
sangok
parents:
diff changeset
34 .IP \[bu] 2
6d87470d68aa Uploaded
sangok
parents:
diff changeset
35 the subset of the host (or graft) k-mers which is one base
6d87470d68aa Uploaded
sangok
parents:
diff changeset
36 substitution away from being in the graft (or host) - we call these
6d87470d68aa Uploaded
sangok
parents:
diff changeset
37 k-mers \[lq]marginal\[rq]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
38 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
39 Given a read, or read pair, xenome will calculate which of the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
40 above categories its k-mers belong to, and classify it as one of:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
41 graft, host, both, neither, or ambiguous.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
42 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
43 Xenome has two distinct stages, which are embodied in two separate
6d87470d68aa Uploaded
sangok
parents:
diff changeset
44 commands: `index' and `classify'.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
45 Before reads can be classified, an index must be constructed from
6d87470d68aa Uploaded
sangok
parents:
diff changeset
46 the graft and host reference sequences.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
47 The references must be in FASTA format, and may optionally be
6d87470d68aa Uploaded
sangok
parents:
diff changeset
48 compressed (gzip).
6d87470d68aa Uploaded
sangok
parents:
diff changeset
49 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
50 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
51 xenome\ index\ -M\ 24\ -T\ 8\ -P\ idx\ -H\ mouse.fa\ -G\ human.fa
6d87470d68aa Uploaded
sangok
parents:
diff changeset
52 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
53 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
54 A xenome index consists of a number of related files which can be
6d87470d68aa Uploaded
sangok
parents:
diff changeset
55 identified by a user-specified prefix, e.g.\ `idx' in the above
6d87470d68aa Uploaded
sangok
parents:
diff changeset
56 command.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
57 The prefix may contain `/' characters, allowing the index to be in
6d87470d68aa Uploaded
sangok
parents:
diff changeset
58 a sub-directory.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
59 (Any such sub-directory must already exist - xenome will not create
6d87470d68aa Uploaded
sangok
parents:
diff changeset
60 it.)
6d87470d68aa Uploaded
sangok
parents:
diff changeset
61 For example, the set of files comprising an index with prefix `idx'
6d87470d68aa Uploaded
sangok
parents:
diff changeset
62 are:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
63 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
64 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
65 idx-both.header
6d87470d68aa Uploaded
sangok
parents:
diff changeset
66 idx-both.kmers-d0
6d87470d68aa Uploaded
sangok
parents:
diff changeset
67 idx-both.kmers-d1
6d87470d68aa Uploaded
sangok
parents:
diff changeset
68 idx-both.kmers.header
6d87470d68aa Uploaded
sangok
parents:
diff changeset
69 idx-both.kmers.high-bits
6d87470d68aa Uploaded
sangok
parents:
diff changeset
70 idx-both.kmers.low-bits.lwr
6d87470d68aa Uploaded
sangok
parents:
diff changeset
71 idx-both.kmers.low-bits.upr
6d87470d68aa Uploaded
sangok
parents:
diff changeset
72 idx-both.lhs-bits
6d87470d68aa Uploaded
sangok
parents:
diff changeset
73 idx-both.rhs-bits
6d87470d68aa Uploaded
sangok
parents:
diff changeset
74 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
75 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
76 Once an index is available, reads can be classified according to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
77 whether they appear to contain graft or host material.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
78 In the simplest case, Xenome can classify each read from a single
6d87470d68aa Uploaded
sangok
parents:
diff changeset
79 source file individually.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
80 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
81 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
82 xenome\ classify\ -P\ idx\ -i\ in.fastq\
6d87470d68aa Uploaded
sangok
parents:
diff changeset
83 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
84 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
85 This step produces a file for each read category, containing all of
6d87470d68aa Uploaded
sangok
parents:
diff changeset
86 the reads which have been assigned that classification:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
87 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
88 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
89 ambiguous.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
90 both.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
91 graft.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
92 host.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
93 neither.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
94 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
95 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
96 Input files are base-space reads in FASTA or FASTQ format or in a
6d87470d68aa Uploaded
sangok
parents:
diff changeset
97 format with one read per line and in either plain text or
6d87470d68aa Uploaded
sangok
parents:
diff changeset
98 compressed format (gzip).
6d87470d68aa Uploaded
sangok
parents:
diff changeset
99 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
100 The files produced are in the same format as the input file, with
6d87470d68aa Uploaded
sangok
parents:
diff changeset
101 all of the input read data preserved.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
102 i.e.\ if the input reads are in FASTQ format, the reads written to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
103 each of the output files will also be in FASTQ format.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
104 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
105 Multiple input files may be specified, but all inputs in the same
6d87470d68aa Uploaded
sangok
parents:
diff changeset
106 format will be written to the same set of output files.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
107 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
108 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
109 xenome\ classify\ -P\ idx\ -i\ inA.fastq\ -i\ inB.fastq\ -I\ inC.fasta
6d87470d68aa Uploaded
sangok
parents:
diff changeset
110 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
111 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
112 The above will result in the following set of files:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
113 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
114 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
115 ambiguous.fasta
6d87470d68aa Uploaded
sangok
parents:
diff changeset
116 ambiguous.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
117 both.fasta
6d87470d68aa Uploaded
sangok
parents:
diff changeset
118 both.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
119 graft.fasta
6d87470d68aa Uploaded
sangok
parents:
diff changeset
120 graft.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
121 host.fasta
6d87470d68aa Uploaded
sangok
parents:
diff changeset
122 host.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
123 neither.fasta
6d87470d68aa Uploaded
sangok
parents:
diff changeset
124 neither.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
125 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
126 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
127 Each of the FASTQ files contains a mixture of reads from inA.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
128 and inB.fastq.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
129 The FASTA files contain reads from inC.fasta.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
130 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
131 If the combining of input reads from separate files is not desired,
6d87470d68aa Uploaded
sangok
parents:
diff changeset
132 xenome should be run separately for each input.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
133 The output from different runs can be distinguished by prefixing
6d87470d68aa Uploaded
sangok
parents:
diff changeset
134 the filenames with a distinct string.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
135 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
136 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
137 xenome\ classify\ -P\ idx\ -i\ inA.fastq\ --output-filename-prefix\ A
6d87470d68aa Uploaded
sangok
parents:
diff changeset
138 xenome\ classify\ -P\ idx\ -i\ inB.fastq\ --output-filename-prefix\ B
6d87470d68aa Uploaded
sangok
parents:
diff changeset
139 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
140 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
141 Running these two commands yields:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
142 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
143 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
144 A_ambiguous.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
145 A_both.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
146 A_graft.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
147 A_host.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
148 A_neither.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
149 B_ambiguous.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
150 B_both.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
151 B_graft.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
152 B_host.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
153 B_neither.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
154 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
155 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
156 Xenome can also process pairs of reads.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
157 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
158 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
159 xenome\ classify\ -P\ idx\ --pairs\ -i\ in_1.fastq\ -i\ in_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
160 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
161 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
162 This results in a pair of files for each read category.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
163 The two reads of each pair are written to the corresponding `_1'
6d87470d68aa Uploaded
sangok
parents:
diff changeset
164 and `_2' files respectively.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
165 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
166 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
167 ambiguous_1.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
168 ambiguous_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
169 both_1.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
170 both_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
171 graft_1.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
172 graft_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
173 host_1.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
174 host_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
175 neither_1.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
176 neither_2.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
177 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
178 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
179 If desired, more specific names can be used in place of `host' and
6d87470d68aa Uploaded
sangok
parents:
diff changeset
180 `graft'.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
181 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
182 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
183 xenome\ classify\ -P\ idx\ -i\ in.fastq\ --graft-name\ human\ --host-name\ mouse
6d87470d68aa Uploaded
sangok
parents:
diff changeset
184 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
185 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
186 This will cause xenome to produce the following files.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
187 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
188 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
189 ambiguous.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
190 both.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
191 human.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
192 mouse.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
193 neither.fastq
6d87470d68aa Uploaded
sangok
parents:
diff changeset
194 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
195 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
196 In addition to generating sets of output files, the classify
6d87470d68aa Uploaded
sangok
parents:
diff changeset
197 command produces statistics about the number and proportion of
6d87470d68aa Uploaded
sangok
parents:
diff changeset
198 reads assigned to each category.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
199 These are printed to standard out at the end of a run and look as
6d87470d68aa Uploaded
sangok
parents:
diff changeset
200 follows:
6d87470d68aa Uploaded
sangok
parents:
diff changeset
201 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
202 \f[CR]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
203 Statistics
6d87470d68aa Uploaded
sangok
parents:
diff changeset
204 B\ \ \ \ \ \ \ G\ \ \ \ \ \ \ H\ \ \ \ \ \ \ M\ \ \ \ \ \ \ count\ \ \ \ \ percent\ \ \ class
6d87470d68aa Uploaded
sangok
parents:
diff changeset
205 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1900\ \ \ \ \ \ 0.938267\ \ "neither"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
206 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 21\ \ \ \ \ \ \ \ 0.0103703\ "both"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
207 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 28491\ \ \ \ \ 14.0696\ \ \ "definitely\ host"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
208 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 7366\ \ \ \ \ \ 3.63751\ \ \ "probably\ host"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
209 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 91895\ \ \ \ \ 45.38\ \ \ \ \ "definitely\ graft"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
210 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 30059\ \ \ \ \ 14.8439\ \ \ "probably\ graft"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
211 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 282\ \ \ \ \ \ \ 0.139259\ \ "ambiguous"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
212 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 330\ \ \ \ \ \ \ 0.162962\ \ "ambiguous"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
213 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 2878\ \ \ \ \ \ 1.42123\ \ \ "both"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
214 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 254\ \ \ \ \ \ \ 0.125431\ \ "probably\ both"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
215 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 610\ \ \ \ \ \ \ 0.301233\ \ "definitely\ host"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
216 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 5815\ \ \ \ \ \ 2.87159\ \ \ "probably\ host"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
217 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 3843\ \ \ \ \ \ 1.89777\ \ \ "definitely\ graft"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
218 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 27775\ \ \ \ \ 13.716\ \ \ \ "probably\ graft"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
219 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 0\ \ \ \ \ \ \ 99\ \ \ \ \ \ \ \ 0.0488886\ "ambiguous"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
220 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 1\ \ \ \ \ \ \ 883\ \ \ \ \ \ \ 0.436047\ \ "ambiguous"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
221
6d87470d68aa Uploaded
sangok
parents:
diff changeset
222 Summary
6d87470d68aa Uploaded
sangok
parents:
diff changeset
223 count\ \ \ \ \ percent\ \ \ class
6d87470d68aa Uploaded
sangok
parents:
diff changeset
224 153572\ \ \ \ 75.8377\ \ \ "graft"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
225 42282\ \ \ \ \ 20.8799\ \ \ "host"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
226 3153\ \ \ \ \ \ 1.55703\ \ \ "both"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
227 1900\ \ \ \ \ \ 0.938267\ \ "neither"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
228 1594\ \ \ \ \ \ 0.787157\ \ "ambiguous"
6d87470d68aa Uploaded
sangok
parents:
diff changeset
229 \f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
230 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
231 Both tables contain a single heading line, followed by rows of
6d87470d68aa Uploaded
sangok
parents:
diff changeset
232 TAB-separated elements; a format suitable for loading into R or a
6d87470d68aa Uploaded
sangok
parents:
diff changeset
233 spreadsheet.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
234 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
235 Each row represents the number and proportion of reads assigned to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
236 a particular class.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
237 The B, G, H, and M fields represent the presence (1) or absence (0)
6d87470d68aa Uploaded
sangok
parents:
diff changeset
238 of k-mers belonging to the both, graft, host and marginal k-mer
6d87470d68aa Uploaded
sangok
parents:
diff changeset
239 subsets, according to the reference index.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
240 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
241 The Statistics table contains 16 rows; one for each possible
6d87470d68aa Uploaded
sangok
parents:
diff changeset
242 combination of k-mer classes present within a read.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
243 The first row of the above table, indicates that for the given
6d87470d68aa Uploaded
sangok
parents:
diff changeset
244 input, 1,900 reads (or pairs) - 0.938267% of the total reads -
6d87470d68aa Uploaded
sangok
parents:
diff changeset
245 contained no k-mers that belonged to the B, G, H, or M k-mer
6d87470d68aa Uploaded
sangok
parents:
diff changeset
246 subsets, and are accordingly neither host nor graft reads.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
247 Similarly, the fourteenth line states that 27,775 reads (or pairs)
6d87470d68aa Uploaded
sangok
parents:
diff changeset
248 - 13.716% of the total - contained k-mers that belong to the B, G,
6d87470d68aa Uploaded
sangok
parents:
diff changeset
249 M, but not H subsets, and are therefore \[lq]probably graft\[rq]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
250 reads.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
251 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
252 In the Summary table, the B, G, H, and M columns are removed, and
6d87470d68aa Uploaded
sangok
parents:
diff changeset
253 the classes from the Statistics table have been collapsed into the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
254 five shown; the definitely/probably graft/host classes are combined
6d87470d68aa Uploaded
sangok
parents:
diff changeset
255 into just graft/host classes.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
256 Notice that the different read output files, described earlier,
6d87470d68aa Uploaded
sangok
parents:
diff changeset
257 correspond exactly to these classes.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
258 .SH OPTIONS COMMON TO ALL COMMANDS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
259 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
260 The following options can be used with all of the \f[I]xenome\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
261 commands and are therefore not listed separately for each command.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
262 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
263 .B -h, --help
6d87470d68aa Uploaded
sangok
parents:
diff changeset
264 Show a help message.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
265 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
266 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
267 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
268 .B -l \f[I]FILE\f[], --log-file \f[I]FILE\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
269 Place to write progress messages.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
270 Messages are only written if the -v flag is used.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
271 If omitted, messages are written to stderr.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
272 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
273 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
274 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
275 .B -T \f[I]INT\f[], --num-threads \f[I]INT\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
276 The maximum number of \f[I]worker\f[] threads to use.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
277 The actual number of threads used during the algorithms depends on
6d87470d68aa Uploaded
sangok
parents:
diff changeset
278 each implementation.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
279 \f[I]xenome\f[] may use a small number of additional threads for
6d87470d68aa Uploaded
sangok
parents:
diff changeset
280 performing non cpu-bound operations, such as file I/O.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
281 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
282 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
283 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
284 .B --tmp-dir \f[I]DIRECTORY\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
285 A directory to use for temporary files.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
286 This flag may be repeated in order to nominate multiple temporary
6d87470d68aa Uploaded
sangok
parents:
diff changeset
287 directories.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
288 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
289 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
290 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
291 .B -v, --verbose
6d87470d68aa Uploaded
sangok
parents:
diff changeset
292 Show progress messages.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
293 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
294 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
295 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
296 .B -V, --version
6d87470d68aa Uploaded
sangok
parents:
diff changeset
297 Show the software version.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
298 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
299 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
300 .SH COMMANDS AND OPTIONS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
301 .SS xenome index
6d87470d68aa Uploaded
sangok
parents:
diff changeset
302 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
303 xenome index [-k \f[I]INT\f[]] [-M \f[I]INT\f[]] -P \f[I]PREFIX\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
304 -G \f[I]FASTA-filename\f[] -H \f[I]FASTA-filename\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
305 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
306 Build the xenome reference index from the graft and host reference
6d87470d68aa Uploaded
sangok
parents:
diff changeset
307 sequences.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
308 The input files must be in FASTA format.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
309 They may be gzip compressed, in which case the filename suffix must
6d87470d68aa Uploaded
sangok
parents:
diff changeset
310 be \f[I]\&.gz\f[].
6d87470d68aa Uploaded
sangok
parents:
diff changeset
311 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
312 The k-mer size may be specified using the \f[I]-k\f[] flag.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
313 If omitted, xenome defaults to k=25.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
314 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
315 During index construction, xenome maintains a hash table of the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
316 k-mers seen so far.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
317 When this table fills, its contents are written to disk, and the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
318 table is reinitialised.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
319 The more memory xenome can use, the less often it will need to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
320 write to disk, and the faster index construction will run.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
321 By default, xenome will limit itself to 2 GB during index
6d87470d68aa Uploaded
sangok
parents:
diff changeset
322 construction.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
323 The -M, \[em]max-memory flag can be used to explicitly control the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
324 amount of memory available to xenome (in GB).
6d87470d68aa Uploaded
sangok
parents:
diff changeset
325 To improve performance, this should generally be set close to the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
326 amount memory available in the system - having accounted for
6d87470d68aa Uploaded
sangok
parents:
diff changeset
327 operating system and other overhead.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
328 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
329 \f[I]OPTIONS\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
330 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
331 .B -k \f[I]INT\f[], --kmer-size \f[I]INT\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
332 The k-mer size to use for building the graph: in version 1.0.0 this
6d87470d68aa Uploaded
sangok
parents:
diff changeset
333 \f[I]must be an integer strictly less than 63\f[].
6d87470d68aa Uploaded
sangok
parents:
diff changeset
334 If not supplied, the default value of 25 is used.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
335 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
336 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
337 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
338 .B -M \f[I]INT\f[], --max-memory \f[I]INT\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
339 The maximum amount of memory (in GB) of memory to use.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
340 Making more memory available will reduce the number of times xenome
6d87470d68aa Uploaded
sangok
parents:
diff changeset
341 writes intermediate index data to disk.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
342 The default is 2 GB.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
343 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
344 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
345 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
346 .B -P \f[I]PREFIX\f[], --prefix \f[I]PREFIX\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
347 The path prefix for all generated reference index files.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
348 The prefix may contain directory separators (e.g.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
349 `/') in order to have the index files written to another directory.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
350 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
351 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
352 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
353 .B -G \f[I]FILE\f[], --graft \f[I]FILE\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
354 The name of the FASTA file containing the graft reference sequence.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
355 If the filename ends in \f[I]\&.gz\f[] it will be read as a gzip
6d87470d68aa Uploaded
sangok
parents:
diff changeset
356 file.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
357 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
358 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
359 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
360 .B -H \f[I]FILE\f[], --host \f[I]FILE\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
361 The name of the FASTA file containing the host reference sequence.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
362 If the filename ends in \f[I]\&.gz\f[] it will be read as a gzip
6d87470d68aa Uploaded
sangok
parents:
diff changeset
363 file.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
364 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
365 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
366 .SS xenome classify
6d87470d68aa Uploaded
sangok
parents:
diff changeset
367 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
368 xenome classify -P \f[I]PREFIX\f[] {-I \f[I]FASTA-filename\f[] | -i
6d87470d68aa Uploaded
sangok
parents:
diff changeset
369 \f[I]FASTQ-filename\f[] | \[em]line-in \f[I]filename\f[]}+
6d87470d68aa Uploaded
sangok
parents:
diff changeset
370 [\[em]pairs] [-M \f[I]INT\f[]] [\[em]graft-name \f[I]STRING\f[]]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
371 [\[em]host-name \f[I]STRING\f[]] [\[em]output-filename-prefix
6d87470d68aa Uploaded
sangok
parents:
diff changeset
372 \f[I]STRING\f[]] [\[em]dont-write-reads] [\[em]preserve-read-order]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
373 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
374 Classifies input reads according to a pre-computed k-mer index.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
375 The reads are written into separate files, according to their
6d87470d68aa Uploaded
sangok
parents:
diff changeset
376 classification, and a breakdown of the number and proportion of
6d87470d68aa Uploaded
sangok
parents:
diff changeset
377 reads in each class is printed.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
378 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
379 If the total size of the index files is greater than available RAM,
6d87470d68aa Uploaded
sangok
parents:
diff changeset
380 xenome will perform poorly.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
381 To overcome this, the -M, \[em]max-memory flag may be used to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
382 specify the maximum amount of memory (in GB) that xenome may use at
6d87470d68aa Uploaded
sangok
parents:
diff changeset
383 any time.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
384 If this amount is less than the size of the index structures,
6d87470d68aa Uploaded
sangok
parents:
diff changeset
385 xenome will (effectively) partition the index into multiple
6d87470d68aa Uploaded
sangok
parents:
diff changeset
386 subsets, each no larger than the specified maximum memory size, and
6d87470d68aa Uploaded
sangok
parents:
diff changeset
387 classify the reads in multiple passes - with each pass using a
6d87470d68aa Uploaded
sangok
parents:
diff changeset
388 different index subset.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
389 The results from each passes are combined, and the result is
6d87470d68aa Uploaded
sangok
parents:
diff changeset
390 produced as usual.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
391 If run with the -v, \[em]verbose flag, xenome will report the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
392 number of passes it will perform.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
393 Note that runtime will increase with the number of passes
6d87470d68aa Uploaded
sangok
parents:
diff changeset
394 performed; the biggest increase will occur with the step from one
6d87470d68aa Uploaded
sangok
parents:
diff changeset
395 pass to two.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
396 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
397 \f[I]OPTIONS\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
398 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
399 .B -P \f[I]PREFIX\f[], --prefix \f[I]PREFIX\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
400 The path prefix for all reference index files.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
401 The prefix may contain directory separators (e.g.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
402 `/') in order to have the index files written to another directory.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
403 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
404 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
405 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
406 .B -I \f[I]FILE\f[], --fasta-in \f[I]FILE\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
407 Input file in FASTA format.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
408 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
409 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
410 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
411 .B -i \f[I]FILE\f[], --fastq-in \f[I]FILE\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
412 Input file in FASTQ format.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
413 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
414 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
415 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
416 .B \[em]line-in \f[I]FILE\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
417 Input file with one read per line and no other annotation.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
418 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
419 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
420 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
421 .B \[em]pairs
6d87470d68aa Uploaded
sangok
parents:
diff changeset
422 Treat reads from consecutive input files of the same type as pairs.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
423 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
424 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
425 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
426 .B -M \f[I]INT\f[], --max-memory \f[I]INT\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
427 The maximum amount of memory (in GB) to use while classifying
6d87470d68aa Uploaded
sangok
parents:
diff changeset
428 reads.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
429 If not specified, xenome will use as much memory as required to
6d87470d68aa Uploaded
sangok
parents:
diff changeset
430 classify all reads in a single pass.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
431 When the maximum amount of memory is less than the size of the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
432 reference index files, xenome will need to perform multiple passes
6d87470d68aa Uploaded
sangok
parents:
diff changeset
433 over the input data - increasing runtime.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
434 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
435 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
436 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
437 .B \[em]graft-name \f[I]STRING\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
438 The name of the graft reference to appear in filenames and
6d87470d68aa Uploaded
sangok
parents:
diff changeset
439 statistics.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
440 If no explicit name is provided, the string \[lq]graft\[rq] is
6d87470d68aa Uploaded
sangok
parents:
diff changeset
441 used.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
442 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
443 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
444 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
445 .B \[em]host-name \f[I]STRING\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
446 The name of the host reference to appear in filenames and
6d87470d68aa Uploaded
sangok
parents:
diff changeset
447 statistics.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
448 If no explicit name is provided, the string \[lq]host\[rq] is used.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
449 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
450 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
451 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
452 .B \[em]output-filename-prefix \f[I]STRING\f[]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
453 An optional prefix to apply to all output read filenames.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
454 The prefix is separated from the rest of the filename by an
6d87470d68aa Uploaded
sangok
parents:
diff changeset
455 underscore (`_').
6d87470d68aa Uploaded
sangok
parents:
diff changeset
456 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
457 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
458 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
459 .B \[em]dont-write-reads
6d87470d68aa Uploaded
sangok
parents:
diff changeset
460 The reads will not be written to any files after classification,
6d87470d68aa Uploaded
sangok
parents:
diff changeset
461 and none of the usual per-category output files will be created.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
462 The classification statistics will still be printed to standard
6d87470d68aa Uploaded
sangok
parents:
diff changeset
463 out.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
464 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
465 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
466 .TP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
467 .B \[em]preserve-read-order
6d87470d68aa Uploaded
sangok
parents:
diff changeset
468 The relative ordering of reads within each output file will be the
6d87470d68aa Uploaded
sangok
parents:
diff changeset
469 same as that in the input files.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
470 i.e.\ if read \f[I]r1\f[] precedes \f[I]r2\f[] in a single output
6d87470d68aa Uploaded
sangok
parents:
diff changeset
471 file, then \f[I]r1\f[] also precedes \f[I]r2\f[] in the input.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
472 Note: If this flag is specified, the -T/\[em]num-threads flag is
6d87470d68aa Uploaded
sangok
parents:
diff changeset
473 ignored, and xenome will only operate with a single worker thread.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
474 .RS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
475 .RE
6d87470d68aa Uploaded
sangok
parents:
diff changeset
476 .SS xenome help
6d87470d68aa Uploaded
sangok
parents:
diff changeset
477 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
478 xenome help
6d87470d68aa Uploaded
sangok
parents:
diff changeset
479 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
480 Prints a summary of all of the xenome commands.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
481 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
482 \[em]
6d87470d68aa Uploaded
sangok
parents:
diff changeset
483 .SH FUTURE RELEASES
6d87470d68aa Uploaded
sangok
parents:
diff changeset
484 .PP
6d87470d68aa Uploaded
sangok
parents:
diff changeset
485 Bzip support will be introduced.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
486 .SH AUTHORS
6d87470d68aa Uploaded
sangok
parents:
diff changeset
487 Bryan Beresford-Smith, Andrew Bromage, Thomas Conway, Jeremy Wazny.
6d87470d68aa Uploaded
sangok
parents:
diff changeset
488