Affiliation:
1. Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI, USA
Abstract
Machine learning (ML) techniques have been used effectively to learn the intricate relationships between variables that play a significant role in engine design. However, this approach poses two challenges: (1) identifying ML regression models that can capture the trends of a response variable, given a non-parametric training set of relatively small size, with acceptable accuracy and response time, and (2) identifying the size of the dataset, relative to the number of input parameters, to be used for training and validating the ML models. The literature does not contain enough information to reach a consensus on the sample size that would yield an acceptable goodness-of-fit for an engine design problem. This is evident from the widely varying sizes of the training/validation datasets used within the engine research community, as elaborated in the following sections. The objective of this paper is to provide insight into the sample size required by different ML models to achieve an acceptable fit between the model and the data in three types of engine design/optimization problems: (1) conventional diesel combustion (CDC) engine performance over a wide range of speed and load, (2) cold-start operation of a direct-injected spark-ignition (DISI) engine, and (3) high-load performance of a dual-fuel reactivity-controlled compression ignition (RCCI) engine.
Subject
Mechanical Engineering, Ocean Engineering, Aerospace Engineering, Automotive Engineering