Improving Semi-Supervised Classification using Clustering

Arora, J. and Tushir, M. and Kashyap, R. (2020) Improving Semi-Supervised Classification using Clustering. EAI Endorsed Transactions on Scalable Information Systems, 7 (25): e3. ISSN 2032-9407

[thumbnail of eai.29-7-2019.159793.pdf]
eai.29-7-2019.159793.pdf - Published Version
Available under License Creative Commons Attribution No Derivatives.

Download (1MB) | Preview


Supervised classification techniques, broadly depend on the availability of labeled data. However, collecting this labeled data is always a tedious and costly process. To reduce these efforts and improve the performance of classification process, this paper proposes a new framework, which combines a most basic classification technique with the semi-supervised process of clustering. Semi-supervised clustering algorithms, aim to increase the accuracy of clustering process by effectively exploring available supervision from a limited amount of labeled data and help to label the unlabeled data. In our paper, a semi-supervised clustering is integrated with naive bayes classification technique which helps to better train the classifier. To evaluate the performance of the proposed technique, we conduct experiments on several real world benchmark datasets. The experimental results show that the proposed approach surpasses the competing approaches in both accuracy and efficiency.

Item Type: Article
Uncontrolled Keywords: Semi-Supervised Clustering, Naive Bayes Classification, Probability, Fuzzy C- means
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
QA75 Electronic computers. Computer science
Depositing User: EAI Editor II.
Date Deposited: 08 Oct 2020 13:52
Last Modified: 08 Oct 2020 13:52

Actions (login required)

View Item
View Item