Effect of Sentence Length in Sentiment Analysis Using Support Vector Machine and Convolutional Neural Network Method

https://doi.org/10.22146/ijccs.61627

Agung Pambudi(1*), Suprapto Suprapto(2)

(1) Master Program in Computer Science, FMIPA UGM, Yogyakarta
(2) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(*) Corresponding Author

Abstract


Based on Article 10 paragraph 1 of Law No. 14 of 2005, a teacher must have four competencies: pedagogical, personality, social, and professional. ICT training at Sunan Kalijaga State Islamic University involves instructors as educators who must have such competencies. An instructor's performance is assessed through students' learning evaluation system by giving comments to the instructions. These comments contain positive and negative sentiments that can be reviewed by conducting sentiment analysis. Research related to sentiment analysis in recent years has been widely done, but researchers rarely pay attention to the effect of sentence length from the dataset on the method's performance. This study tried to analyze sentiment related to sentence length effect on ICT training student comments using Support Vector Machine and Convolutional Neural Network methods.

This study concluded that the sentence length on the dataset would affect the SVM and CNN methods' performance when combined with Word2vec. While the SVM+TFIDF method performance is not affected by sentence length, this method has the fastest process time than other methods. The CNN+Word2vec method produced the best performance in this study with a value of 0.94% accuracy, 0.95% precision, 0.96% recall, and 0.95% f1-score.


Keywords


Sentiment analysis; SVM; CNN

Full Text:

PDF


References

[1]      B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1-2, p. 1– 135, 2008.

[2]      S.B. Kotsiantis, "Supervised Machine Learning: A Review of Classification Techniques," Informatica, vol. 31, no. 3, p. 249-268, 2007 [Online]. Available: http://www.informatica.si/index.php/informatica/article/view/148/140. [Accessed: 04-Oct-2020].

[3]      Q. T. Ain, M. Ali, A. Riaz, A. Noureen, M. Kamran, B. Hayat, and A. Rehman, "Sentiment Analysis Using Deep Learning Techniques: A Review," International Journal of Advanced Computer Science and Applications (IJACSA), vol. 8, no. 6, p. 424-433, 2017.

[4]      X. Ouyang, P. Zhou, C. H. Li, L. Liu, "Sentiment Analysis Using Convolutional Neural Network," IEEE (International Conference on Computer and Information Technology), 2015.

[5]      L. B. Ilmawan, "Aplikasi Mobile Untuk Analisis Sentimen Pada Google Play," thesis, Jurusan Ilmu Komputer FMIPA UGM, 2014.

[6]      O. Kharisman, "Analisis Sentimen Pada Review Konsumen Maskapai Penerbangan Menggunakan Kombinasi Lexicon Berbasis Sentiwordnet dan Supervised Model," thesis, Jurusan Ilmu Komputer FMIPA UGM, 2017.

[7]      Y. Kim,"Convolutional Neural Networks for Sentence Classification," In Proc. of EMNLP conference, p. 1746-1751, 2014 [Online]. Available: https://arxiv.org/abs/1408.5882. [Accessed: 04-Oct-2020].

[8]      A. Razi, "Klasifikasi Artikel Berita Berbahasa Indonesia Menggunakan Convolutional Neural Network," thesis, Jurusan Ilmu Komputer FMIPA UGM, 2017.

[9]      E. Haddi, X. Liu, and Y. Shi, "The Role of Text Pre-processing in Sentiment Analysis," Procedia Computer Science, vol. 17, p. 26–32, 2013 [Online]. Available: https://www.sciencedirect.com/science/article/pii/S1877050913001385. [Accessed: 04-Oct-2020].

[10]    I. Hemalatha, P.G. Varma, and A. Govardhan, "Preprocessing the Informal Text for Efficient Sentiment Analysis," International Journal of Emerging Trends & Technology in Computer Science (IJETTCS), vol. 1, July – August, 2012.

[11]    H. Siqueira and F. Barros, "A Feature Extraction Process for Sentiment Analysis of Opinions on Services," Proceedings of International Workshop on Web and Text Intelligence, 2010 [Online]. Available: http://sites.labic.icmc.usp.br/wti2012/IIIWTI_camera_ready/74769.pdf. [Accessed: 05-Oct-2020].

[12]    N. Wyse, R. Dubes, and A. K. Jain, "A critical evaluation of intrinsic dimensionality, Pattern Recognition in Practice", Morgan Kaufmann Publisher, Inc., p. 415-425, 1980.

[13]    V. N. Vapnik, "The Nature of Statistical Learning Theory", Springer-Verlag, New York, 1995.

[14]    C. M. Bishop, "Pattern Recognition and Machine Learning," Springer, New York, 2006.

[15]    A. Severyn and A. Moschitti, "UNITN: Training deep convolutional neural network for Twitter sentiment classification," In Proc. of the 9th SemEval workshop, p. 464-469, 2015.

[16]    A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," In Advances in neural information processing systems, p. 1097-1105, 2012 [Online]. Available: https://arxiv.org/abs/1606.07006. [Accessed: 08-Oct-2020].

[17]    X. Yang, C. Macdonald, and I. Ounis, “Using Word Embeddings in Twitter Election Classification,” Information Retrieval Journal, vol. 21, p. 183–207, 2018 [Online]. Available: https://arxiv.org/abs/1606.07006. [Accessed: 08-Oct-2020].



DOI: https://doi.org/10.22146/ijccs.61627

Article Metrics

Abstract views : 569 | views : 506

Refbacks

  • There are currently no refbacks.




Copyright (c) 2021 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



Copyright of :
IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
ISSN 1978-1520 (print); ISSN 2460-7258 (online)
is a scientific journal the results of Computing
and Cybernetics Systems
A publication of IndoCEISS.
Gedung S1 Ruang 416 FMIPA UGM, Sekip Utara, Yogyakarta 55281
Fax: +62274 555133
email:ijccs.mipa@ugm.ac.id | http://jurnal.ugm.ac.id/ijccs



View My Stats1
View My Stats2