Author:
Manuel J. A. Eugster, Torsten Hothorn, Friedrich Leisch
Abstract
Benchmark experiments are the method of choice for comparing learning algorithms empirically. For a collection of data sets, the empirical performance distributions of a set of learning algorithms are estimated, compared, and ordered. Usually this is done for each data set separately. The present manuscript extends this single-data-set approach to a joint analysis of the complete collection, the so-called problem domain. This makes it possible to decide which algorithms to deploy in a specific application, or to compare newly developed algorithms with well-known algorithms on established problem domains. Specialized visualization methods allow for easy exploration of large amounts of benchmark data. Furthermore, we take the design of the benchmark experiment into account and use mixed-effects models to provide a formal statistical analysis. Two domain-based benchmark experiments demonstrate our methods: the UCI domain, as a well-known domain for use when developing a new algorithm; and the Grasshopper domain, as a domain where we want to find the best learning algorithm for a prediction component in an enterprise application software system.
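A minimal sketch, not the authors' implementation, of the domain-based analysis the abstract describes: performance estimates of several algorithms across all data sets of a domain are modeled jointly, with the algorithm as a fixed effect and the data set as a random effect. All names here (the columns dataset, algorithm, score and the example values) are hypothetical; the mixed-effects fit uses statsmodels.

import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical benchmark results: one misclassification estimate per
# (data set, algorithm, replication) triple, pooled over the whole domain.
results = pd.DataFrame({
    "dataset":   ["iris", "iris", "sonar", "sonar", "glass", "glass"] * 2,
    "algorithm": ["svm", "rf"] * 6,
    "score":     [0.04, 0.05, 0.18, 0.15, 0.32, 0.24,
                  0.05, 0.06, 0.20, 0.16, 0.30, 0.25],
})

# Fixed effect: algorithm; random intercept: data set.
# This analyzes the complete collection jointly instead of
# treating each data set separately.
model = smf.mixedlm("score ~ algorithm", data=results,
                    groups=results["dataset"])
fit = model.fit()
print(fit.summary())

In this setup the fixed-effect coefficient for algorithm estimates a domain-wide performance difference, while the random intercept absorbs how difficult each individual data set is.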
Publisher
Austrian Statistical Society
Subject
Applied Mathematics; Statistics, Probability and Uncertainty; Statistics and Probability
Cited by
8 articles.