Affiliation:
1. SR University, Warangal, India
2. Kalinga Institute of Industrial Technology, Bhubaneswar, India
3. Cardiff Metropolitan University, Cardiff, UK
4. Pandit Deendayal Energy University, Gandhinagar, India
Abstract
In this research, we introduce two new machine learning regression methods: the Ensemble Average and the Pipelined Model. These methods aim to enhance traditional regression analysis for predictive tasks and have undergone thorough evaluation across three datasets, Kaggle House Price, Boston House Price, and California Housing, using various performance metrics. The results consistently show that our models outperform existing methods in terms of accuracy and reliability across all three datasets. The Pipelined Model, in particular, is notable for its ability to combine predictions from multiple models, leading to higher accuracy and impressive scalability. This scalability allows for their application in diverse fields like technology, finance, and healthcare. Furthermore, these models can be adapted for real-time and streaming data analysis, making them valuable for applications such as fraud detection, stock market prediction, and IoT sensor data analysis. Enhancements to the models also make them suitable for big data applications, ensuring their relevance for large datasets and distributed computing environments. It is important to acknowledge some limitations of our models, including potential data biases, specific assumptions, increased complexity, and challenges related to interpretability when using them in practical scenarios. Nevertheless, these innovations advance predictive modeling, and our comprehensive evaluation underscores their potential to provide increased accuracy and reliability across a wide range of applications. The results indicate that the proposed models outperform existing models in terms of accuracy and robustness for all three datasets. The source code can be found at
https://huggingface.co/DebajyotyBanik/Ensemble-Pipelined-Regression/tree/main
Funder
Government of Gujarat, India
Publisher
Association for Computing Machinery (ACM)
Reference25 articles.
1. Devansh Arpit Huan Wang Yingbo Zhou and Caiming Xiong. 2022. Ensemble of averages: Improving model selection and boosting performance in domain generalization. Advances in Neural Information Processing Systems 35 (2022) 8265–8277.
2. K. C. Arum F. I. Ugwuowo H. E. Oranye T. O. Alakija T. E. Ugah and O. C. Asogwa. 2023. Combating outliers and multicollinearity in linear regression model using robust Kibria-Lukman mixed with principal component estimator simulation and computation. Scientific African (2023) e01566.
3. Ali Bager Monica Roman Meshal Algelidh and Bahr Mohammed. 2017. Addressing multicollinearity in regression models: A ridge regression application. Journal of Social and Economic Statistics 6 1 (July 2017) 30–45. https://ideas.repec.org/a/aes/jsesro/v6y2017i1p30-45.html
4. Usage of Ensemble Regression Technique for Product Price Prediction
5. Mitigating the multicollinearity problem and its machine learning approach: A review;Chan Jireh Yi-Le;Mathematics,2022
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献