sis 21(30): e1

Research Article

Cross-domain sentiment classification initiated with Polarity Detection Task

Download1104 downloads
  • @ARTICLE{10.4108/eai.26-5-2020.165965,
        author={Nancy Kansal and Lipika Goel and Sonam Gupta},
        title={Cross-domain sentiment classification initiated with Polarity Detection Task},
        journal={EAI Endorsed Transactions on Scalable Information Systems},
        volume={8},
        number={30},
        publisher={EAI},
        journal_a={SIS},
        year={2020},
        month={8},
        keywords={Machine Learning, Sentiment Analysis, Polarity Detection Task (PDT), Cross-Domain Sentiment Analysis},
        doi={10.4108/eai.26-5-2020.165965}
    }
    
  • Nancy Kansal
    Lipika Goel
    Sonam Gupta
    Year: 2020
    Cross-domain sentiment classification initiated with Polarity Detection Task
    SIS
    EAI
    DOI: 10.4108/eai.26-5-2020.165965
Nancy Kansal1,*, Lipika Goel1, Sonam Gupta1
  • 1: Ajay Kumar Garg Engineering College, Ghaziabad, Uttar Pradesh, India
*Contact email: Er.nancy001@gmail.com

Abstract

INTRODUCTION: The requirement of the labeled dataset in the source domain makes the Cross Domain Sentiment Classification (CDSC) task complicate in the situation when the dataset is labeled manually. OBJECTIVES: To overcome the dependency of CDSC tasks on manual labeling of the dataset by proposing a polarity detection task. METHODS: We have proposed the CDSC-PDT method that is the polarity Detection Task (PDT) followed by the CDSC task. The proposed PDT task extracts the polarity of reviews from the source domain using the contextual and relevancy information of words in documents and this automatic labeled dataset is further used to train classifiers to make the further classification. RESULTS: Proposed method is comparable to the traditional learning method giving the highest precision 85.7%. CONCLUSION: The proposed method does not need to manually label the documents in either of the domain (source or target), hence it overcomes the human intervention and is also time saving and cheap process, unlike traditional CDSC tasks.