32nd International Conference on Electrical Engineering
A novel clustering-based over-sampling technique for imbalanced data sets
Authors:
Behzad Mirzaei (1)
Hossein Nezamabadi-pour (2)
Javad Mahmoodi (3)
1- Shahid Bahonar University of Kerman
2- Shahid Bahonar University of Kerman
3- Shahid Bahonar University of Kerman
Keywords:
Imbalanced data, Clustering, K-means algorithm, Over-sampling, Preprocessing methods
Abstract:
One of the most challenging problems in machine learning is the classification of imbalanced data. This problem arises when samples are distributed unevenly among the classes, so that one class (the minority or positive class) has far fewer samples than the other (the majority or negative class). Classical classifiers are ill-suited to data sets of this nature. To address this shortcoming, we present a novel clustering-based over-sampling technique. First, the k-means clustering algorithm is used to cluster the minority class samples. Then, sparse clusters, those containing fewer samples, are chosen. Finally, we use the nearest neighbor of each cluster center to create synthetic samples for the minority class. To select clusters probabilistically, we apply the roulette wheel selection operator during over-sampling. The C4.5 decision tree classifier is used in our experiments, and the F-measure serves as the evaluation criterion. According to the results, our method outperforms six other methods on fifteen imbalanced data sets.
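The abstract gives the pipeline only in outline, so the following Python sketch is one possible reading of it, not the authors' implementation. It makes three assumptions the abstract does not confirm: the roulette-wheel weight of a cluster is taken as the inverse of its size, the "nearest neighbor" of a center is taken to be the closest minority sample in that cluster, and a synthetic sample is a random interpolation between the two. The function names `kmeans` and `oversample_minority` are hypothetical.

```python
import math
import random


def nearest_center(p, centers):
    """Index of the center closest to point p (Euclidean distance)."""
    return min(range(len(centers)), key=lambda c: math.dist(p, centers[c]))


def kmeans(points, k, rng, iters=50):
    """Plain Lloyd's k-means on a list of tuples; returns (centers, labels)."""
    centers = [tuple(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        labels = [nearest_center(p, centers) for p in points]
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                updated.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                updated.append(centers[c])  # an empty cluster keeps its old center
        if updated == centers:
            break
        centers = updated
    labels = [nearest_center(p, centers) for p in points]
    return centers, labels


def oversample_minority(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples, favoring sparse clusters."""
    rng = random.Random(seed)
    centers, labels = kmeans(minority, k, rng)
    clusters = [[p for p, lab in zip(minority, labels) if lab == c] for c in range(k)]
    # ASSUMPTION: roulette-wheel slice proportional to 1 / cluster size,
    # so clusters with fewer samples are selected more often.
    weights = [1.0 / len(m) if m else 0.0 for m in clusters]
    synthetic = []
    for _ in range(n_new):
        c = rng.choices(range(k), weights=weights)[0]
        center = centers[c]
        # ASSUMPTION: "nearest neighbor" = closest real sample in the cluster.
        neighbor = min(clusters[c], key=lambda p: math.dist(p, center))
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(center, neighbor)))
    return synthetic


# Toy minority class: a dense group of four points and a sparse group of two.
minority = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.1, 0.1), (5.0, 5.0), (5.2, 5.1)]
new_points = oversample_minority(minority, n_new=4, k=2)
print(new_points)
```

The synthetic points would then be appended to the minority class before training a classifier. Other weightings or neighbor definitions are equally compatible with the abstract's wording.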