Algorithm for Persian Text Sentiment Analysis in Correspondences on an E- Learning Social Website

Authors

  • Anahid Rais Rohani Karaj Branch, Islamic Azad University
  • A’zam Bastanfard Karaj Branch, Islamic Azad University

DOI:

https://doi.org/10.24200/jrset.vol4iss01pp11-15

Abstract

By 2000, sentiment analysis had been only studied based on speech and changes in facial expressions. Since then, studies have been focused on text. Concerning Persian text mining, studies have been conducted on the methods for extracting properties for classification and examination of opinions on social websites with an aim to determine text polarity. The present research was aimed to prepare and implement an algorithm for Persian text sentiment analysis based on the following six basic emotional states: happiness, sadness, fear, anger, surprise, and disgust. In this research, sentiment analysis was carried out using the unsupervised lexical method. Lexicons are divided into four categories, namely the emotional, boosters, negation, and stop lists. The algorithm was written in six different ways using different properties. In the first method, the algorithm was capable of identifying an emotional word in a sentence. The sentiment of the sentence was determined based on the given emotional word. However, it should be noted that the text itself is also important for sentiment analysis because in addition to the emotional words, other factors (such as boosters and negating factors) are also present in the sentence and affect the text sentiment. Hence, the algorithm was enhanced in the subsequent methods to detect the boosters and negating words. Results of running the algorithm using different methods indicated that the algorithm accuracy increased with an increase in the number properties involved. In the sixth method, an algorithm capable of identifying emotional, boosters and negative words was applied to two data samples including sentences written by typical users and sentences written by university students on an electronic learning social website. The accuracy of the algorithm with 100 data samples from typical users and 100 data samples from university students was 80% and 84%, respectively.  

References

Diman Ghazi a, Diana Inkpen a, Stan Szpakowicz , “Prior and contextual emotion of words in sentential context”, Computer Speech and Language, 2014, vol. 28 , pp. 76–92

Emma Haddia, Xiaohui Liua, Yong Shib,”The Role of Text Pre-processing in Sentiment Analysis”, Procedia Computer Science, 2013, vol. 17, pp. 26 – 32

Masashi Hadanoa, Kazutaka Shimadaa, Tsutomu Endoa,” Aspect identification of sentiment sentences using a clustering Algorithm”, Procedia - Social and Behavioral Sciences, 2011, vol. 27, pp. 22 – 31

Fiorella Carla Dotti,” Overcoming Problems in Automated Appraisal Recognition: the Attitude System in Inscribed Appraisa”, Procedia - Social and Behavioral Sciences, 2013, vol. 95, pp. 442 – 446

Alexandra Balahur, Marco Turchi,” Comparative experiments using supervised learning and machine translation for multilingual sentiment analysis”, Computer Speech and Language, 2014,vol. 28 , pp. 56–75

Carmen Banea a, Rada Mihalcea a, Janyce Wiebe,” Sense-level subjectivity in a multilingual setting”, Computer Speech and Language, 2014, vol. 28, pp. 7–19

Tomáˇs Brychcín, Miloslav Konopík,” Semantic spaces for improving language modeling”, Computer Speech and Language, 2014, vol. 28, pp. 192 – 209.

Liang-Chih Yu a, Chung-Hsien Wu b, Fong-Lin Jang,”Psychiatric document retrieval using a discourse-aware model”, Artificial Intelligence, 2009, vol. 173, pp. 817–829

Mohamad Hardyman Barawi, Yet Yong Seng,” Evaluation of resource creations accuracy by using sentiment Analysis”, Procedia - Social and Behavioral Sciences, 2013, vol. 97 , pp.522 – 527

Dinko Lambova, Sebastião Paisa, Gãel Diasa,”Merged Agreement Algorithms for Domain Independent Sentiment Analysis”, Procedia - Social and Behavioral Sciences, 2011, vol. 27, pp. 248 – 257.

Published

2019-09-13

Issue

Section

Articles