0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A novel clustering-based over-sampling technique for imbalanced data sets
نویسندگان :
Behzad Mirzaei
1
Hossein Nezamabadi-pour
2
Javad Mahmoodi
3
1- دانشگاه شهید باهنر کرمان
2- دانشگاه شهید باهنر کرمان
3- دانشگاه شهید باهنر کرمان
کلمات کلیدی :
Imbalanced data،Clustering،K-means algorithm،Over-sampling،Preprocessing methods
چکیده :
One of the most challenging problems in machine learning is the classification of imbalanced data. This problem arises when the samples of data are distributed unevenly among the classes, such that compared to one class (the majority or negative class), the other class (the minority or positive class) has far fewer samples. The classical classifiers are inappropriate to classify data sets of this nature. To address these classifiers' shortcoming in class imbalance situations, we present a novel clustering-based over-sampling technique in this paper. First, the k-means clustering algorithm is used to cluster the minority class samples. Then, sparse clusters including fewer samples are chosen. Finally, we use the nearest neighbor of each cluster center to create synthetic samples for the minority class. Also, to select clusters based on probabilities, we apply the roulette wheel selection operator during over-sampling. The C4.5 decision tree classifier is utilized in our experiments, and the F-measure criterion is considered to evaluate methods. According to the results, our method outperforms six other methods over fifteen imbalanced data sets.
لیست مقالات
لیست مقالات بایگانی شده
One-Way Edge Modes Induced by Synthetic Magnetic Field in Time-Varying LC Circuit
Sadeq Bahmani - Amir Nader Askarpour
Fusion of Multi-Level CNN With LBP Features For Facial Emotion Recognition
Ehsan Bahmanabady - Maryam Imani - Hassan Ghassemian
Design, Prototyping and Performance Analysis of a Barometric-Based Soft Force Sensor
Mohammad Reza SheykhAzimi - Mohammad Reza Nayeri - Mehdi Tale Masouleh - Ahmad Kalhor
Multinomial Emoji Prediction Using Deep Bidirectional Transformers and Topic Modeling
Zahra Ebrahimian - Ramin Toosi - Mohammad Ali Akhaee
Optimal Control of Rectangular Singular Systems
Masoud Shafiee
A Novel Interpretation of Coding in Time-Modulated Arrays
Mehdi Gholami - Mohammad Neshat
A Novel Approach to Cheating Prevention in Demand Side Management Algorithms
Farahnaz Haftbaradaran - Ali Akhtari - Massoud Reza Hashemi - Zahra Baharlouei
Swin Wavelet Super Resolution
Zahra Moammeri - Ahmad Mahmoudi-Aznaveh
Fast and Low Power Modified Carry Look-Ahead Adder
Sanaz Salem - Amir hossein Owji
Mountain Gazelle Optimized PID Controller for a MIMO System with External Disturbance
Siavash Shirali - Hamoun Maleki - Hadi Delavari
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0