The Effect of Text Summarization in Essay Scoring (Case Study: Teach on E-Learning)

Sensa Gudya Sauma Syahra(1*), Yunita Sari(2), Yohanes Suyanto(3)

(1) Master Program of Computer Science, FMIPA UGM, Yogyakarta
(2) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(3) Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta
(*) Corresponding Author


The development of automated essay scoring (AES) in the neural network (NN) approach has eliminated feature engineering. However, feature engineering is still needed, moreover, data with labels in the form of rubric scores, which are complementary to AES holistic scores, are still rarely found. In general, data without labels/scores is found more. However, unsupervised AES research has not progressed with the more common use of publicly labeled data. Based on the case studies adopted in the research, automatic text summarization (ATS) was used as a feature engineering model of AES and readability index as the definition of rubric values for data without labels.

This research focuses on developing AES by implementing ATS results on SOM and HDBSCAN. The data used in this research are 403 documents of TEACH ON E-learning essays. Data is represented in the form of a combination of word vectors and a readability index. Based on the tests and measurements carried out, it was concluded that AES with ATS implementation had no good potential for the assessment of TEACH ON essays in increasing the silhouette score. The model produces the best silhouette score of 0.727286113 with original essay data.


AES; ATS; readability index; unsupervised

Full Text:




Ke, Z. & Ng, V., 2019, Automated essay scoring: A survey of the state of the art, IJCAI International Joint Conference on Artificial Intelligence, 2019-August, 6300–6308.

Kumar, V. & Boulanger, D., 2020, Explainable Automated Essay Scoring: Deep Learning Really Has Pedagogical Value, Frontiers in Education, 5, October, 1–22.

Zhou, S., Jeong, H. & Green, P.A., 2017, How consistent are the best-known readability equations in estimating the readability of design standards?, IEEE Transactions on Professional Communication, 60, 1, 97–111.

Beinborn, L., Zesch, T. & Gurevych, I., 2014, Readability for foreign language learning, ITL - International Journal of Applied Linguistics, 165, 2, 136–162.

Zhao, S., Zhang, Y., Xiong, X., Botelho, A. & Heffernan, N., 2017, A memory-Augmented neural model for automated grading, L@S 2017 - Proceedings of the 4th (2017) ACM Conference on Learning at Scale, 189–192.

West-Smith, P., Butler, S. & Mayfield, E., 2018, Trustworthy Automated Essay Scoring without Explicit Construct Validity, 95–102.

Nemati, M. & Azizi, M., 2013, Readability index of essays as an alternative to the scoring procedure in L2 academic writing, 4, Winter, 2–10.

Le, Q. & Mikolov, T., 2014, Distributed representations of sentences and documents, 31st International Conference on Machine Learning, ICML 2014, 4, 2931–2939.

Prabowo, D.A., Fhadli, M., Najib, M.A., Fauzi, H.A. & Cholissodin, I., 2016, TF-IDF-Enhanced Genetic Algorithm Untuk Extractive Automatic Text Summarization, Jurnal Teknologi Informasi dan Ilmu Komputer, 3, 3, 208.

Mihalcea, R. & Tarau, P., 2004, TextRank : Bringing Order into Text, Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 404–411.

Astudillo, C.A. & Oommen, B.J., 2014, Topology-oriented self-organizing maps: A survey, Pattern Analysis and Applications, 17, 2, 223–248.

Asan, U., Soyer, A. & Serdarasan, S., 2012, Computational Intelligence Systems in Industrial Engineering, C. Kahraman, ed., Atlantis Press, Paris.,.

Campello, R.J.G.B., Moulavi, D. & Sander, J., 2013, Density-based clustering based on hierarchical density estimates, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7819 LNAI, PART 2, 160–172.


Article Metrics

Abstract views : 1359 | views : 1279


  • There are currently no refbacks.

Copyright (c) 2022 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Copyright of :
IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
ISSN 1978-1520 (print); ISSN 2460-7258 (online)
is a scientific journal the results of Computing
and Cybernetics Systems
A publication of IndoCEISS.
Gedung S1 Ruang 416 FMIPA UGM, Sekip Utara, Yogyakarta 55281
Fax: +62274 555133 |

View My Stats1
View My Stats2