comparison train_test_split.xml @ 5:ce2fd1edbc6e draft

"planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/sklearn commit 2afb24f3c81d625312186750a714d702363012b5"
author bgruening
date Fri, 02 Oct 2020 08:59:31 +0000
parents 0985b0dd6f1a
children 5da2217cd788
comparison
equal deleted inserted replaced
4:145208b3579d 5:ce2fd1edbc6e
266 <output name="out_test" file="train_test_split_test03.tabular" ftype="tabular"/> 266 <output name="out_test" file="train_test_split_test03.tabular" ftype="tabular"/>
267 </test> 267 </test>
268 </tests> 268 </tests>
269 <help><![CDATA[ 269 <help><![CDATA[
270 **What it does** 270 **What it does**
271 This tool implements splitter function and classes from `sklearn.model_selection` module to split contents (rows) of a table into two subsets for training and test, respectively . The simple train test split mode not only supports shuffle split and stratified shuffle split natively carried by the `train_test_split` function, but also gets extended to do group shuffle. The cross-validation splitter mode supports more diverse splitting strategies. Each tool run outputs one split, train and test. To get different splitting sets, for example, nested CV, multiple tool runs are needed with different `nth_split`. 271
272 This tool implements splitter function and classes from `sklearn.model_selection` module to split contents (rows) of a table into
273 two subsets for training and test, respectively . The simple train test split mode not only supports shuffle split and stratified
274 shuffle split natively carried by the `train_test_split` function, but also gets extended to do group shuffle.
275 The cross-validation splitter mode supports more diverse splitting strategies. Each tool run outputs one split, train and test.
276 To get different splitting sets, for example, nested CV, multiple tool runs are needed with different `nth_split`.
277 Example: 6-fold CV. Set `n_splits` to 6. Run the tool 6 times with the same parameters, but set `nth_split` according to the number of the run (1-6).
272 278
273 - Train Test Split mode 279 - Train Test Split mode
274 - direct split, no shuffle 280 - direct split, no shuffle
275 - shuffle split 281 - shuffle split
276 - stratified shuffle split 282 - stratified shuffle split