ACADSTAFF UGM

CREATION
Title : Radius-SMOTE: A New Oversampling Technique of Minority Samples Based on Radius Distance for Learning From Imbalanced Data
Author :

GEDE ANGGA PRADIPTA (1) Prof. Drs. Retantyo Wardoyo, M.Sc., Ph.D. (2) Aina Musdholifah, S.Kom., M.Kom. Ph.D (3) Dr. dr. I Nyoman Hariyasa Sanjaya, SpOG(K), MARS (4)

Date : 14 2021
Keyword : Imbalanced learning, oversampling, SMOTE, radius distance, initial selection. Imbalanced learning, oversampling, SMOTE, radius distance, initial selection.
Abstract : Imbalanced learning problems are a challenge faced by classifiers when data samples have an unbalanced distribution in each class. Furthermore, the synthetic oversampling method (SMOTE) is a preprocessing technique widely used to synthesize new data and balance the different numbers of samples in each class. One of the SMOTE method’s expansions is based on the initial selection approach, which determines the best candidates to be oversampled in the data before the process of synthetic example generation starts. However, SMOTE and most of the existing oversampling methods based on initial selection still found overlapping data on the final result. This issue makes it difficult for any classifiers to determine the decision boundary of each class. Therefore, this research proposes a new oversampling technique called Radius-SMOTE, which emphasizes the initial selection approach by creating synthetic data based on a safe radius distance. Furthermore, new synthetic data are prevented from overlapping in the opposite class with the safe radius distance. The Radius-SMOTE was evaluated extensively with thirteen artificial imbalanced datasets from the KEEL repository. The experimental results show that the proposed method is able to achieve the best results on 5 datasets, namely yeast-1-4-5-8_vs_7, ecoli- 0-1-3-7_vs_2-6, Umbilical cord, Pima, and Haberman dataset in term of various assessment metrics. Besides that, the computational cost for our proposed method is also relatively low, with an average time of 0.5 to 1 second on the 13 tested datasets.
Group of Knowledge : Ilmu Komputer
Original Language : English
Level : Internasional
Status :
Published
Document
No Title Document Type Action
1 Paper IEEE ACCESS RW.pdf
Document Type : [PAK] Full Dokumen
[PAK] Full Dokumen View
2 Similarity Radius_SMOTE__A_New_Oversampling_Technique_of_Mino.pdf
Document Type : [PAK] Cek Similarity
[PAK] Cek Similarity View
3 Bukti Korespondensi IEEE Access.pdf
Document Type : [PAK] Bukti Korespondensi Penulis
[PAK] Bukti Korespondensi Penulis View
4 Surat Pernyataan Paper melibatkan mahasiswa- Gede Angga Pradipta- IEEE Access signed RW.pdf
Document Type : Dokumen Pendukung Karya Ilmiah (Hibah, Publikasi, Penelitian, Pengabdian)
Dokumen Pendukung Karya Ilmiah (Hibah, Publikasi, Penelitian, Pengabdian) View