0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A novel clustering-based over-sampling technique for imbalanced data sets
نویسندگان :
Behzad Mirzaei
1
Hossein Nezamabadi-pour
2
Javad Mahmoodi
3
1- دانشگاه شهید باهنر کرمان
2- دانشگاه شهید باهنر کرمان
3- دانشگاه شهید باهنر کرمان
کلمات کلیدی :
Imbalanced data،Clustering،K-means algorithm،Over-sampling،Preprocessing methods
چکیده :
One of the most challenging problems in machine learning is the classification of imbalanced data. This problem arises when the samples of data are distributed unevenly among the classes, such that compared to one class (the majority or negative class), the other class (the minority or positive class) has far fewer samples. The classical classifiers are inappropriate to classify data sets of this nature. To address these classifiers' shortcoming in class imbalance situations, we present a novel clustering-based over-sampling technique in this paper. First, the k-means clustering algorithm is used to cluster the minority class samples. Then, sparse clusters including fewer samples are chosen. Finally, we use the nearest neighbor of each cluster center to create synthetic samples for the minority class. Also, to select clusters based on probabilities, we apply the roulette wheel selection operator during over-sampling. The C4.5 decision tree classifier is utilized in our experiments, and the F-measure criterion is considered to evaluate methods. According to the results, our method outperforms six other methods over fifteen imbalanced data sets.
لیست مقالات
لیست مقالات بایگانی شده
Design of a High-Efficiency Balanced Power Amplifier with 68% Fractional Bandwidth
Fatemeh Mohabati - Marzieh Chegini - Mahmoud Kamarei
Jacobian matrix calculation in scattering from dielectric objects using semi-explicit MoM
Fatemeh Mandegari - Leila Ahmadi - Amir Ahmad Shishegar
کنترل حرارت مبتنی بر روش LQG در پیل سوختی غشاء پلیمری
احمدرضا ولی - محمدعلی علیرضاپوری - محمدمهدی برزگری
A Comprehensive Analysis of a Digital Control Strategy for Photovoltaic-Based Single-Phase Grid-Tied Inverter Systems
Soheil Hasani Sangani - Mohamad Reza Moslemnejad - Mojtaba Saeedi - Alireza Jalalitalab - Reza Beiranvand
Iranian stock market fluctuations: from social news to forecasting models
Maryam Sharifinia - Farzaneh Ghayour Baghbani
Design of a Controllable and State-observable MEMS Nonlinear Resonator Based on the Awl-shaped Serpentine Spring
Ehsan Ranjbar - Amirabolfazl Suratgar
Partitioning-based Graph Signal Denoising via Heat Kernel Smoothing
Mohammadreza Fattahi - Hamid Saeedi-Sourck - Vahid Abootalebi
Secure Control System Using Iterative Secret Sharing
Younes Esmaeili - Mohammad Haeri - Saeed Adelipour
Surface roughness classification in dynamic touch using EEG signals
Ali Amini - Karim Faez - Mahmood Amiri
An Investigation of Hardware Implementation of Multi-Valued Logic Using Different Nanodevices
Abdolah Amirany - Kian Jafari - Mohammad Hossein Moaiyeri
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.3.2