Analisis Perbandingan VGG-16 dan ResNet50 untuk Klasifikasi Multilabel Gambar Kerbau Toraja: Pendekatan Deep Learning
Abstract
This study aims to compare the performance of two Convolutional Neural Networks (CNN) models, namely VGG-16 and ResNet50, in the task of multilabel classification of buffalo images. The Dataset used consists of 2009 buffalo images labeled with five categories: human, motorcycle, truck, wild animal, and buffalo. The CNN models were trained using 30 epochs and evaluated using loss, accuracy, Precision, recall, and f1-score metrics. The experimental results show that VGG-16 consistently outperforms ResNet50 by achieving the highest accuracy of 0.95 in the training set and 0.94 in the validation set, and f1-score of 0.94 in the training set and 0.92 in the validation set. These findings indicate that a deeper and more structured CNN architecture, such as VGG-16, provides better results in classifying buffalo images with complex label variations.
Downloads
References
Abdel-Khalek, S. (2023). Quantum neural network-based multilabel image classification in high-resolution unmanned aerial vehicle imagery. Soft Computing, 27(18), 13027–13038. doi: 10.1007/s00500-021-06460-3
Berrahal, M. (2021). Augmented binary multi-labeled CNN for practical facial attribute classification. Indonesian Journal of Electrical Engineering and Computer Science, 23(2), 973–979. doi: 10.11591/ijeecs.v23.i2.pp973-979
Bi, Z. (2021). Improved VGG model-based efficient traffic sign recognition for safe driving in 5G scenarios. International Journal of Machine Learning and Cybernetics, 12(11), 3069–3080. doi: 10.1007/s13042-020-01185-5
Deepak, S. (2019). Brain tumor classification using deep CNN features via transfer learning. Computers in Biology and Medicine, 111. doi: 10.1016/j.compbiomed.2019.103345
Dickel, H., Podolskiy, V., & Gerndt, M. (2019). Evaluation of autoscaling metrics for (stateful) IoT gateways. 2019 IEEE 12th Conference ….
Han, S. (2022). The multilabel fault diagnosis model of bearing based on integrated convolutional neural network and gated recurrent unit. International Journal of Intelligent Computing and Cybernetics, 15(3), 401–413. doi: 10.1108/IJICC-08-2021-0153
He, K. (2020). Mask R-CNN. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(2), 386–397. doi: 10.1109/TPAMI.2018.2844175
Jamil, M., Warsito, B., Wibowo, A., & Kiswanto, K. (2023). Diabetes Mellitus Early Detection Simulation using The K-Nearest Neighbors Algorithm with Cloud-Based Runtime (COLAB). ILKOM Jurnal Ilmiah, 15(2), 215–221. Retrieved from https://jurnal.fikom.umi.ac.id/index.php/ILKOM/article/view/1510
Kaur, T. (2019). Automated brain image classification based on VGG-16 and transfer learning. In Proceedings - 2019 International Conference on Information Technology, ICIT 2019 (pp. 94–98). doi: 10.1109/ICIT48102.2019.00023
Kim, H. G. (2019). Improvement diagnostic accuracy of sinusitis recognition in paranasal sinus X-ray using multiple Deep Learning models. Quantitative Imaging in Medicine and Surgery, 9(6), 942–951. doi: 10.21037/qims.2019.05.15
Lin, Z. (2019). A unified matrix-based convolutional neural network for fine-grained image classification of wheat leaf diseases. IEEE Access, 7, 11570–11590. doi: 10.1109/ACCESS.2019.2891739
Liu, Z. (2019). The applications of radiomics in Precision diagnosis and treatment of oncology: Opportunities and challenges. In Theranostics (Vol. 9, Issue 5, pp. 1303–1322). doi: 10.7150/thno.30309
Polsinelli, M. (2020). A light CNN for detecting COVID-19 from CT scans of the chest. Pattern Recognition Letters, 140, 95–100. doi: 10.1016/j.patrec.2020.10.001
Reddy, A. S. B. (2019). Transfer learning with RESNET-50 for malaria cell-image classification. In Proceedings of the 2019 IEEE International Conference on Communication and Signal Processing, ICCSP 2019 (pp. 945–949). doi: 10.1109/ICCSP.2019.8697909
Sengupta, S. (2020). A review of Deep Learning with special emphasis on architectures, applications and recent trends. Knowledge-Based Systems, 194. doi: 10.1016/j.knosys.2020.105596
Sharif, O. (2022). M-BAD: A Multilabel Dataset for Detecting Aggressive Texts and Their Targets. In CONSTRAINT 2022 - 2nd Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situation, Proceedings of the Workshop (pp. 75–85). Retrieved from https://api.elsevier.com/content/abstract/scopus_id/85137428853
Simarmata, A. M., Putra, A. Z., & Husein, A. M. (2024). Penerapan Metode Computer Vision Dalam Klasifikasi Buah Jeruk Menggunakan Teknik Image Pre-Processing. Jurnal Data Science Indonesia, 3(2), 110–116.
Song, Z. (2021). Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely occurring RNA modifications. Nature Communications, 12(1). doi: 10.1038/s41467-021-24313-3
Sun, Y. (2020). Automatically Designing CNN Architectures Using the Genetic Algorithm for Image Classification. IEEE Transactions on Cybernetics, 50(9), 3840–3854. doi: 10.1109/TCYB.2020.2983860
Technology, E. (n.d.). Strengthening Student’s Competencies as the Z Generation and Future Change Agents: Learning from Extension Science and Communication of Innovation Course (KPM121C).
Wu, Z. (2019). Wider or Deeper: Revisiting the ResNet Model for Visual Recognition. Pattern Recognition, 90, 119–133. doi: 10.1016/j.patcog.2019.01.006
Yildiz, C. (2021). An improved residual-based convolutional neural network for very short-term wind power forecasting. Energy Conversion and Management, 228. doi: 10.1016/j.enconman.2020.113731
Younis, A. (2022). Brain Tumor Analysis Using Deep Learning and VGG-16 Ensembling Learning Approaches. Applied Sciences (Switzerland), 12(14). doi: 10.3390/app12147282
Copyright (c) 2024 Tri Anita Resky Ramadhani, Abdul Rachman Manga (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.