Unleashing class imbalance problem in loan dataset through a novel oversampling approach based on FCM

dc.contributor.authorAkter, Subrina
dc.date.accessioned2025-01-04T08:21:51Z
dc.date.issued2023-12
dc.descriptionVol.-1, Issue-1, December 2023, pp. 61-82
dc.description.abstractCommercial companies highly depend on loan approval models trained by machine learning and statistical methods to predict loan status. However, imbalanced datasets present a key challenge in this sector. Addressing this issue, this paper proposes a new oversampling method based on Fuzzy C means clustering. This clustering algorithm assigns the instances to several groups by assigning a flexible degree of membership. K-Nearest Neighbor and the Decision Tree served as the basic classifiers in an extensive test on the Kaggle loan dataset. Three distinct imbalanced ratios—2.2, 4.2, and 8.45—were used in the experiment. The effectiveness of the recommended strategy was compared with SMOTE and WBOT using 5-fold CV. The outcomes showed that the proposed method outperformed both SMOTE and WBOT, obtaining higher average F-measure and G-mean values across the machine learning algorithms. These findings show how the suggested method may correct class imbalance while also enhancing prediction accuracy in the context of loan acceptance.
dc.description.sponsorshipDepartment of Computer Science and Engineering International Islamic University Chittagong
dc.identifier.issn3005-5873
dc.identifier.urihttp://dspace.iiuc.ac.bd/handle/123456789/8479
dc.language.isoen
dc.publisherCRP, International Islamic University Chittagong
dc.subjectDecision tree
dc.subjectF-measure
dc.subjectG-mean
dc.subjectImbalanced dataset
dc.subjectKNN
dc.subjectLoan approval
dc.subjectMachine learning
dc.titleUnleashing class imbalance problem in loan dataset through a novel oversampling approach based on FCM
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Article 5.pdf
Size:
17.45 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: