0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A novel clustering-based over-sampling technique for imbalanced data sets
نویسندگان :
Behzad Mirzaei
1
Hossein Nezamabadi-pour
2
Javad Mahmoodi
3
1- دانشگاه شهید باهنر کرمان
2- دانشگاه شهید باهنر کرمان
3- دانشگاه شهید باهنر کرمان
کلمات کلیدی :
Imbalanced data،Clustering،K-means algorithm،Over-sampling،Preprocessing methods
چکیده :
One of the most challenging problems in machine learning is the classification of imbalanced data. This problem arises when the samples of data are distributed unevenly among the classes, such that compared to one class (the majority or negative class), the other class (the minority or positive class) has far fewer samples. The classical classifiers are inappropriate to classify data sets of this nature. To address these classifiers' shortcoming in class imbalance situations, we present a novel clustering-based over-sampling technique in this paper. First, the k-means clustering algorithm is used to cluster the minority class samples. Then, sparse clusters including fewer samples are chosen. Finally, we use the nearest neighbor of each cluster center to create synthetic samples for the minority class. Also, to select clusters based on probabilities, we apply the roulette wheel selection operator during over-sampling. The C4.5 decision tree classifier is utilized in our experiments, and the F-measure criterion is considered to evaluate methods. According to the results, our method outperforms six other methods over fifteen imbalanced data sets.
لیست مقالات
لیست مقالات بایگانی شده
Partitioning-based Graph Signal Denoising via Heat Kernel Smoothing
Mohammadreza Fattahi - Hamid Saeedi-Sourck - Vahid Abootalebi
تجزیه و تحلیل عملکرد سیستم ناوبری اینرسیایی با استفاده از الگوریتم GAME
نرجس احمدیان - بیژن ذاکری گتابی
User Management in Cell-Free Massive MIMO Systems with Limited Fronthaul Capacity
Siminfar Samakoush Galougah - Hamed Masoumi - Mohammad Javad Emadi
Ultra-wideband RCS Reduction Using Checkerboard Configuration of Bed of Nails
Sadegh Sarjoughian - Mohsen Maddahali - Ahmad Bakhtafrouz
بهبود عملکرد یک ( LOC ) Lab – On –Chipپیشرفته مبتنی بر فناوری MEMSبه کمک تقویت میدان الکتریکی ساختار
شیوا عظیمی نام - فهیمه مروی - کیان جعفری
An Uncertain Optimal Factorization of Cooperative Manipulators for Robust Optimal Control Schemes
Neda Nasiri - Ahmad Fakharian - Mohammad Bagher Menhaj
An Enhanced Chaotic System Based Color Image Encryption using DNA Encoding
Mobin Vaziri - Mohammad Mehdi Rahimifar - Hadi Jahanirad
Application of Artificial Neural Network on Diagnosing Location and Extent of Disk Space Variations in Transformer Windings Using Frequency Response Analysis
Reza Behkam - Hossein Karami - Mahdi Salay Naderi - Gevork Gharehpetian
Study of Performance Characteristics of a Line-Start Synchronous Reluctance Motor Over its Synchronization Region
Ali Jamali-Fard - Mojtaba Mirsalim
تشخیص حالت عادی و غیرعادی شبکه برق هوشمند با استفاده از شبکه عصبی مصنوعی
محمد گنج خانی - علی عباسپورطهرانی فرد - سجاد فتاحیان دهکردی - محمد غلامی
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0