comparison blockclust.xml @ 4:49e600128a73 draft

Uploaded
author rnateam
date Wed, 09 Jul 2014 08:38:01 -0400
parents 27dde42069e0
children 6721468f2f9f
comparison
equal deleted inserted replaced
3:27dde42069e0 4:49e600128a73
159 processing patterns. We propose a novel way to encode expression profiles 159 processing patterns. We propose a novel way to encode expression profiles
160 in compact discrete structures, which can then be processed using 160 in compact discrete structures, which can then be processed using
161 fast graph-kernel techniques. BlockClust allows both clustering and 161 fast graph-kernel techniques. BlockClust allows both clustering and
162 classification of small non-coding RNAs. 162 classification of small non-coding RNAs.
163 163
164 BlockClust runs in three modes: 164 BlockClust runs in three operating modes:
165
165 1) Pre-processing - converts given mapped reads (BAM) into BED file of tags 166 1) Pre-processing - converts given mapped reads (BAM) into BED file of tags
166 2) Clustering and classification - of given input block groups (from blockbuster tool) as explained in the original paper. 167
168 2) Clustering and classification - of given input blockgroups (output of blockbuster tool) as explained in the original paper.
169
167 3) Post-processing - extracts distribution of clusters searched against Rfam database and plots hierarchical clustering made out of centroids of each BlockClust predicted cluster. 170 3) Post-processing - extracts distribution of clusters searched against Rfam database and plots hierarchical clustering made out of centroids of each BlockClust predicted cluster.
168 171
169 For a thorough analysis of your data, we suggest you to use complete blockclust workflow, which contains all three modes of operation. 172 For a thorough analysis of your data, we suggest you to use complete blockclust workflow, which contains all three modes of operation.
170 173
171 **Inputs** 174 **Inputs**
172 175
173 BlockClust input files are dependent on the mode of operation: 176 BlockClust input files are dependent on the mode of operation:
174 1) Pre-processing mode: 177
175 Binary Sequence Alignment Map (BAM) file 178 1. Pre-processing mode:
176 179 * Binary Sequence Alignment Map (BAM) file
177 2) Clustering and classification: 180
178 A blockgroups file generated by blockbuster tool 181 2. Clustering and classification:
179 Select reference genome 182 * A blockgroups file generated by blockbuster tool
180 183 * Select reference genome
181 3) Post-processing: 184
182 Output of cmsearch, searched clusters generated by BlockClust against Rfam 185 3. Post-processing:
183 BED file containing clusters generated by BlockClust 186 * Output of cmsearch, searched clusters generated by BlockClust against Rfam
184 Pairwise similarities of blockgroups generated by BlockClust 187 * BED file containing clusters generated by BlockClust
185 188 * Pairwise similarities of blockgroups generated by BlockClust
186 **Output** 189
187 1) Pre-processing mode: 190 **Outputs**
188 BED file of tags with expressions 191
189 192 1. Pre-processing mode:
190 2) Clustering and classification: 193 * BED file of tags with expressions
191 Hierarchical clustering plot of all input blockgroups by their similarity 194
192 Pairwise similarities of all input blockgroups 195 2. Clustering and classification:
193 BED file containing predicted clusters 196 * Hierarchical clustering plot of all input blockgroups by their similarity
194 BED file containing prediction of blockgroups by pre-compiled SVM binary classification model. 197 * Pairwise similarities of all input blockgroups
195 198 * BED file containing predicted clusters
196 3) Post-processing: 199 * BED file containing prediction of blockgroups by pre-compiled SVM binary classification model.
197 Distribution of clusters with annotations searched against Rfam database 200
198 hierarchical clustering made out of centroids of each BlockClust predicted cluster 201 3. Post-processing:
202 * Distribution of clusters with annotations searched against Rfam database
203 * Hierarchical clustering made out of centroids of each BlockClust predicted cluster
199 204
200 ------ 205 ------
201 206
202 **References** 207 **References**
203 208