Handling Imbalanced Data With Weighted Logistic Regression and Propensity Score Matching methods-Reference-Cited by-同舟云学术

Handling Imbalanced Data With Weighted Logistic Regression and Propensity Score Matching methods

Published:2024-01-07 Issue:1 Volume:35 Page:1-37
ISSN:1063-8016
Container-title:Journal of Database Management
language:ng
Short-container-title:

Author:

Agrawal Lavlin¹^ORCID,Mulgund Pavankumar²^ORCID,Sharman Raj³^ORCID

Affiliation:

1. North Carolina Agricultural and Technical State University, USA

2. University of Memphis, USA

3. University at Buffalo, USA

Abstract

The adoption of empirical methods for secondary data analysis has witnessed a significant surge in IS research. However, the secondary data is often incomplete, skewed, and imbalanced at best. Consequently, there is a growing recognition of the importance of empirical techniques and methodological decisions made to navigate through such issues. However, there is not enough methodological guidance, especially in the form of a worked case study that demonstrates the challenges of imbalanced datasets and offers prescriptive on how to deal with them. Using data on P2P money transfer services, this article presents a running example by analyzing the same dataset using several different methods. It then compares the outcomes of these choices and explicates the rationale behind some decisions such as inclusion and categorization of variables, parameter setting, and model selection. Finally, the article discusses certain regressions models such as weighted logistic regression and propensity matching, and when they should be used.

Publisher

IGI Global

Reference136 articles.

1. Matching on the Estimated Propensity Score

2. Impact of meta-analytic decisions on the conclusions drawn on the business value of information technology

3. FinTech, Lending and Payment Innovation: A Review

4. The theory of planned behavior

5. Sentiment Analysis of Emirati Dialect