Deep-shallow framework-Based Hybrid Feature extractor for Classification of Ethiopian Coffee Bean Varieties using Image Processing

Deep-shallow framework-Based Hybrid Feature extractor for Classification of Ethiopian Coffee Bean Varieties using Image Processing

Kene, Yirga

URI: http://hdl.handle.net/123456789/7483

Date: 2024-04-11

Abstract:

Ethiopia is the birthplace of coffee, where it is naturally gifted and a major source of foreign income. In Ethiopia, coffees are cultivated in different regions but the dominant regions are Harar, Jimma, Limu, Sidama, and Wellega. The different varieties of coffees are circulated domestically and export to international market based on their specific origin manually. However, maintaining and assuring the quality of coffee based on their specific origin manually is a worthful task; because it is highly biased, subjective and prone to error. As a result, automation of these manual coffee bean varieties identification and classification based on their growing region is required. The main objective of this study is designing and developing an automatic Ethiopian coffee bean varieties identification based on their growing regions using image processing and machinelearning techniques. To achieve the objective of this study, images of coffee beans are collected by taking different coffee samples of Harar, Jimma, Limu, Sidama and Wellega from Ethiopian Commodity exchange (ECX), Addis Ababa. Following the collection of the dataset, the developed system incorporates various processing phases such as image resizing, image filtering, image contrast enhancement, noise removal, grayscale conversion for preprocessing and a combined approach of thresholding and K-means segmentation technique for grayscale and RGB image respectively. After, preprocessing and segmentation of images, we have used an end-to-end CNN model using SoftMax and RBF kernel function of SVM for classification of Ethiopian coffee bean varieties. We compared the proposed end-to-end CNN to VGG16 and ResNet50, and the proposed endto- end CNN had the highest accuracy of 89.99% than VGG16 (87.75%) and ResNet50 (81.25%). Even if, the proposed end-to-end CNN model achieved 89.99% accuracy but the performance of the model is affected by similarity of color features and we investigated to merge the color feature with texture feature by applying local feature descriptor using HOG. We evaluated the performance of HOG feature extraction, CNN feature extraction, and a hybrid feature vector (HOG-CNN) by using a multi-class SVM classifier, in which 74.17%, 85.83% and 97.5% accuracy achieved respectively. Finally, the deep-shallow based feature (CNN-HOG) are registered with highest accuracy of 97.5 % in this study. Keywords: CNN, HOG, HOG-CNN, PCA, SVM, SoftMax, K-means, Thresholding

Show full item record