0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A novel clustering-based over-sampling technique for imbalanced data sets
نویسندگان :
Behzad Mirzaei
1
Hossein Nezamabadi-pour
2
Javad Mahmoodi
3
1- دانشگاه شهید باهنر کرمان
2- دانشگاه شهید باهنر کرمان
3- دانشگاه شهید باهنر کرمان
کلمات کلیدی :
Imbalanced data،Clustering،K-means algorithm،Over-sampling،Preprocessing methods
چکیده :
One of the most challenging problems in machine learning is the classification of imbalanced data. This problem arises when the samples of data are distributed unevenly among the classes, such that compared to one class (the majority or negative class), the other class (the minority or positive class) has far fewer samples. The classical classifiers are inappropriate to classify data sets of this nature. To address these classifiers' shortcoming in class imbalance situations, we present a novel clustering-based over-sampling technique in this paper. First, the k-means clustering algorithm is used to cluster the minority class samples. Then, sparse clusters including fewer samples are chosen. Finally, we use the nearest neighbor of each cluster center to create synthetic samples for the minority class. Also, to select clusters based on probabilities, we apply the roulette wheel selection operator during over-sampling. The C4.5 decision tree classifier is utilized in our experiments, and the F-measure criterion is considered to evaluate methods. According to the results, our method outperforms six other methods over fifteen imbalanced data sets.
لیست مقالات
لیست مقالات بایگانی شده
Stability Analysis for the Non-linear Model Predictive Control of a Flexible Joint Manipulator with Dynamics Uncertainties
Mohamadreza Satvati - Hossein Karimpour - Keivan Torabi - Mohammad Motaharifar
Significant Methods to Improve Control of Quadrotors, Hexarotors and Octorotors
Peyman Amiri - Nima Sina - Mohammad Danesh
DWT-Based Epileptic Seizure Detection Using Fuzzy Logic Model with Entropy and Table Lookup Scheme
Alireza Mohammadi - Arvin Esfandyari - Ali Doustmohammadi - Amir Abolfazl Suratgar - Masoud Shafiee
Transmission Dynamics and Optimal Control Strategy to Mitigate the Spread of Novel Coronavirus: The Case of Iran
Reza Shadi - Ahmad Fakharian - Hamid Khaloozadeh
Design and Determing Two Separate Rotor Axial Flux Permanent Magnet Motor Load and Efficinecy
Siamak Omrani - Ahmad Darabi
Designing of Multilayer Planar Spiral Air-Core Inductor for Power Electronic Applications
Mohammad Khakroei - Mohsen Mostafaei - Mansour Arefian - Afshin Rezaei-Zare - Majid Najafi Zarmehri
تشخیص و تفکیک برخط خطای مدار باز کلید در اینورترهای تک فاز PWM
مهدی اره پناهی - علی اکبر سلیمی
Design and Modeling of Graphene Based Electro-absorption Modulator Integrated with Hybrid Plasmonic Waveguides
Hadi Soofi - Shima Karkon Bagheri - Hamid Vahed
Si/SiO2/Ag optical sensor
Alireza Karimpour - Mehrdad Naemi Dehkharghani - Faramarz Hossein-babaei
Angular Stable Multiband Miniaturized Flexible Frequency Selective Surface
Mozhgun Moazzamnia - Javad Nourinia - Changiz Ghobadi - Keyhan Hosseini - Mohsen Karamirad - Baman Mohammadi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4