comparison SMART/galaxy/GetDifferentialExpression.xml @ 15:440ceca58672

Uploaded
author m-zytnicki
date Mon, 22 Apr 2013 11:08:07 -0400
parents 769e306b7933
children 94ab73e8a190
14:c79b9ae3f65f 15:440ceca58672
49 -o $outputFileGff 49 -o $outputFileGff
50 50
51 $simple 51 $simple
52 $adjusted 52 $adjusted
53 53
54
55 #if $optionSimplePara.simplePara == 'Yes': 54 #if $optionSimplePara.simplePara == 'Yes':
56 -S $optionSimplePara.paraValue 55 -S $optionSimplePara.paraValue
57 #end if 56 #end if
58 57
59 #if $optionFixedSizeFactor.FSF == 'Yes': 58 #if $optionFixedSizeFactor.FSF == 'Yes':
61 #end if 60 #end if
62 61
63 #if $optionFDR.FDR == 'Yes': 62 #if $optionFDR.FDR == 'Yes':
64 -d $optionFDR.FDRValue 63 -d $optionFDR.FDRValue
65 #end if 64 #end if
66 $plot $outputFilePNG
67 </command> 65 </command>
68 66
69 <inputs> 67 <inputs>
70 <conditional name="formatType"> 68 <conditional name="formatType">
71 <param name="FormatInputFileName1" type="select" label="Input File Format 1"> 69 <param name="FormatInputFileName1" type="select" label="Input File Format 1">
152 <when value="gtf"> 150 <when value="gtf">
153 <param name="inputFileNameRef" format="gtf" type="data" label="Input Ref File"/> 151 <param name="inputFileNameRef" format="gtf" type="data" label="Input Ref File"/>
154 </when> 152 </when>
155 </conditional> 153 </conditional>
156 154
157 <param name="simple" type="boolean" truevalue="-s" falsevalue="" checked="false" label="normalize using the number of reads in each condition"/> 155 <param name="simple" type="boolean" truevalue="-s" falsevalue="" checked="false" label="Normalize using the number of reads in each condition"/>
158 <param name="adjusted" type="boolean" truevalue="-a" falsevalue="" checked="false" label="normalize using the number of reads of 'mean' regions"/> 156 <param name="adjusted" type="boolean" truevalue="-a" falsevalue="" checked="false" label="Normalize using the number of reads of interquartile expression region"/>
159 157
160 <conditional name="optionSimplePara"> 158 <conditional name="optionSimplePara">
161 <param name="simplePara" type="select" label="provide the number of reads" > 159 <param name="simplePara" type="select" label="provide the number of reads" >
162 <option value="Yes">Yes</option> 160 <option value="Yes">Yes</option>
163 <option value="No" selected="true">No</option> 161 <option value="No" selected="true">No</option>
168 <when value="No"> 166 <when value="No">
169 </when> 167 </when>
170 </conditional> 168 </conditional>
171 169
172 <conditional name="optionFixedSizeFactor"> 170 <conditional name="optionFixedSizeFactor">
173 <param name="FSF" type="select" label="give the magnification factor for the normalization using fixed size sliding windows in reference regions (leave empty for no such normalization)"> 171 <param name="FSF" type="select" label="Give the magnification factor for the normalization using fixed size sliding windows in reference regions (leave empty for no such normalization)">
174 <option value="Yes">Yes</option> 172 <option value="Yes">Yes</option>
175 <option value="No" selected="true">No</option> 173 <option value="No" selected="true">No</option>
176 </param> 174 </param>
177 <when value="Yes"> 175 <when value="Yes">
178 <param name="FSFValue" type="integer" value="0" /> 176 <param name="FSFValue" type="integer" value="0" />
191 </when> 189 </when>
192 <when value="No"> 190 <when value="No">
193 </when> 191 </when>
194 </conditional> 192 </conditional>
195 193
196 <param name="plot" type="boolean" truevalue="-p" falsevalue="" checked="false" label="plot option" help="plot cloud plot"/>
197
198 </inputs> 194 </inputs>
199 195
200 <outputs> 196 <outputs>
201 <data name="outputFileGff" format="gff3" label="[GetDifferentialExpression]out file"/> 197 <data name="outputFileGff" format="gff3" label="[GetDifferentialExpression]out file"/>
202 <data name="outputFilePNG" format="PNG" label="[GetDifferentialExpression]PNG file">
203 <filter>plot</filter>
204 </data>
205 </outputs> 198 </outputs>
206 199
207 <help> 200 <help>
208 example: python GetDifferentialExpression.py -i input1 -f gff3 -j input2 -g gff3 -k ref -l gff3 -o output.gff3 201 This tool compares two sets of data and finds the differential expression. One very important component of the tool is the reference set: to use the tool, you need the two input sets of data and the reference set. The reference set is a set of genomic coordinates. For each reference interval, the tool counts the number of features in each sample and computes the differential expression. It then outputs the direction of the regulation (up or down, with respect to the first input set) and a *p*-value from a Fisher exact test.
202
203 The reference set may seem like a nuisance. Why not compute the differential expression without it? The answer is: the differential expression of what? The tool cannot guess. You might want to compare the expression of genes, of small RNAs, of transposable elements, of anything... So the reference set can be a list of genes, in which case you compute the differential expression of genes. But you can also compute many other things.
204
205 Suppose that you cluster the data of your two input samples (you can do it with the *clusterize* and the *mergeTranscriptLists* tools). You now have a list of all the regions which are transcribed in at least one of the input samples. This can be your reference set. This reference set is interesting since you can detect the differential expression of data which is outside any annotation.
206
207 Suppose now that you cluster the two input samples using a sliding window (you can do it with the *clusterizeBySlidingWindows* and the *mergeSlidingWindowsClusters* tools). You can then select all the regions of a given size which contain at least one read in one of the two input samples (do it with *selectByTag* and the tag **nbElements**). Again, this can be another interesting reference set.
208
209 In most cases, the sizes of the two input samples will be different, so you should probably normalize the data, which is an available option. The normalization is rather crude: it scales the number of data up in the less populated sample and down in the more populated sample, so that both reach the average number of data.
209 </help> 210 </help>
210 </tool> 211 </tool>
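
The new help text describes two computations: scaling both samples to the average sample size, and a Fisher exact test per reference interval. The Python sketch below illustrates that description only; it is not the code of `GetDifferentialExpression.py`. The function names, the layout of the 2x2 table (reads inside the interval versus all other reads, per sample), and the reading of "up" as "more reads in the second sample than in the first" are all assumptions made for illustration.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denominator = comb(row1 + row2, col1)
    # Hypergeometric probability of each table sharing the observed margins.
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    probabilities = {
        x: comb(row1, x) * comb(row2, col1 - x) / denominator
        for x in range(lowest, highest + 1)
    }
    observed = probabilities[a]
    # Sum the tables that are at most as likely as the observed one.
    total = sum(p for p in probabilities.values() if p <= observed * (1 + 1e-7))
    return min(1.0, total)


def differential_expression(count1, count2, total1, total2, normalize=True):
    """Return (direction, p_value) for one reference interval.

    count1, count2: reads overlapping the interval in each sample.
    total1, total2: total reads in each sample.
    Assumption: "up" means the second sample has more reads than the first.
    """
    if normalize:
        # Crude normalization: rescale both samples to the average size.
        average = (total1 + total2) / 2
        count1 = round(count1 * average / total1)
        count2 = round(count2 * average / total2)
        total1 = total2 = round(average)
    if count2 > count1:
        direction = "up"
    elif count2 < count1:
        direction = "down"
    else:
        direction = "equal"
    # Assumed table: reads inside the interval vs. all other reads, per sample.
    p_value = fisher_exact_two_sided(
        count1, total1 - count1, count2, total2 - count2
    )
    return direction, p_value


if __name__ == "__main__":
    # Same proportion of reads in both samples: no difference after scaling.
    print(differential_expression(10, 30, 1000, 3000))
    # Equal sample sizes, ten times more reads in the second sample.
    print(differential_expression(5, 50, 1000, 1000, normalize=False))
```

Rounding the rescaled counts keeps the table integral, which the exact test requires. It also shows why the help text calls this normalization crude: scaling a small sample up inflates its counts without adding any real information.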