Repository revision
16:a2e3cf2f9f40

Repository 'sklearn_train_test_split'
hg clone https://toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split

Split Dataset tool metadata
Miscellaneous
into training and test subsets
sklearn_train_test_split
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.11.0
1.0.11.0
echo "1.0.11.0"
True
Version lineage of this tool (guids ordered most recent to oldest)
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.11.0 (this tool)
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.10.0
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.8.4
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.8.3
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.8.2
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.8.1
toolshed.g2.bx.psu.edu/repos/bgruening/sklearn_train_test_split/sklearn_train_test_split/1.0.7.12
sklearn_train_test_split
Requirements (dependencies defined in the <requirements> tag set)
name version type
python 3.9 package
galaxy-ml 0.10.0 package
Additional information about this tool
python '$__tool_directory__/train_test_split.py'
            --inputs '$inputs'
            --infile_array '$infile_array'
            #if $mode_selection.selected_mode == 'train_test_split' and $mode_selection.options.shuffle_selection.shuffle not in ['None', 'simple']
                --infile_labels '$mode_selection.options.shuffle_selection.labels'
            #end if
            #if $mode_selection.selected_mode == 'cv_splitter' and $mode_selection.cv_selector.selected_cv in ['StratifiedKFold', 'RepeatedStratifiedKFold', 'StratifiedShuffleSplit', 'OrderedKFold', 'RepeatedOrderedKFold']
                --infile_labels '$mode_selection.cv_selector.target_input.labels'
            #end if
            #if $mode_selection.selected_mode == 'cv_splitter' and $mode_selection.cv_selector.selected_cv in ['GroupKFold', 'GroupShuffleSplit', 'LeaveOneGroupOut', 'LeavePGroupsOut']
                --infile_groups '$mode_selection.cv_selector.groups_selector.infile_g'
            #end if
            --outfile_train '$out_train'
            --outfile_test '$out_test'
    
    
None
False
Functional tests
name inputs outputs required files
Test-1 infile_array: regression_X.tabular
header0: True
mode_selection|options|random_state: 123
mode_selection|options|shuffle_selection|shuffle: simple
mode_selection|selected_mode: train_test_split
name: value
name: value
regression_X.tabular
value
Test-2 infile_array: regression_X.tabular
header0: True
mode_selection|cv_selector|n_splits: 2
mode_selection|cv_selector|test_size: 0.25
mode_selection|cv_selector|random_state: 123
mode_selection|cv_selector|selected_cv: ShuffleSplit
mode_selection|selected_mode: cv_splitter
name: value
name: value
regression_X.tabular
value
Test-3 infile_array: imblearn_X.tabular
header0: False
mode_selection|options|test_size: 0.2
mode_selection|options|random_state: 123
mode_selection|options|shuffle_selection|labels: imblearn_y.tabular
mode_selection|options|shuffle_selection|header1: False
mode_selection|options|shuffle_selection|col: 1
mode_selection|options|shuffle_selection|shuffle: stratified
mode_selection|selected_mode: train_test_split
name: value
name: value
imblearn_X.tabular
imblearn_y.tabular
value
Test-4 infile_array: imblearn_X.tabular
header0: False
mode_selection|cv_selector|n_splits: 1
mode_selection|cv_selector|test_size: 0.2
mode_selection|cv_selector|random_state: 123
mode_selection|cv_selector|target_input|labels: imblearn_y.tabular
mode_selection|cv_selector|target_input|header1: False
mode_selection|cv_selector|target_input|col: 1
mode_selection|cv_selector|selected_cv: StratifiedShuffleSplit
mode_selection|selected_mode: cv_splitter
name: value
name: value
imblearn_X.tabular
imblearn_y.tabular
value
Test-5 infile_array: regression_X.tabular
header0: True
mode_selection|cv_selector|n_splits: 5
mode_selection|cv_selector|shuffle: True
mode_selection|cv_selector|random_state: 123
mode_selection|cv_selector|target_input|labels: regression_y.tabular
mode_selection|cv_selector|target_input|header1: True
mode_selection|cv_selector|target_input|col: 1
mode_selection|cv_selector|selected_cv: OrderedKFold
mode_selection|selected_mode: cv_splitter
name: value
name: value
regression_X.tabular
regression_y.tabular
value