Development of Synonym Sets for the English WordNet Using the Commutative Method and Agglomerative Clustering

Munirsyah Munirsyah(1*), Moch. Arif Bijaksana(2), Widi Astuti(3)

(1) Telkom University
(2) Telkom University
(3) Telkom University
(*) Corresponding Author

Abstract


WordNet is a collection of words organized by the meanings they express, and the synonym set (synset) is a central component in its construction. Building synsets requires synonym data and relies on the commutative property of synonymy between words. In this study, an English thesaurus serves as the reference source for synonym data. Broadly speaking, WordNet differs from a dictionary in that the meaning of each word is related to other words, and establishing that two words are equivalent requires a commutative check. The commutative method performs this check and produces candidate synonym sets. These candidates cannot be used directly as synsets: the words must first be grouped, and the grouping yields the final synonym sets. Clustering is one way to group words, and this study uses agglomerative clustering. Agglomerative clustering relies on a threshold value that serves as the stopping condition for the iterative process and therefore determines the number of iterations. The study tests threshold values from 0.1 to 1 to find the one that produces the best synonym sets. The results are evaluated with the F-measure.
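The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration, not the authors' implementation: the toy thesaurus and gold synsets are invented, Jaccard overlap stands in for the similarity measure, clusters are merged by set union, merging stops once the best similarity falls below the threshold, and the F-measure counts a predicted synset as correct only when it matches a gold synset exactly.

```python
from itertools import combinations

# Hypothetical toy thesaurus: word -> synonyms listed for that word.
THESAURUS = {
    "big": {"large", "huge"},
    "large": {"big"},
    "huge": {"big"},
    "quick": {"fast"},
    "fast": {"quick", "firm"},
    "firm": {"solid"},
}

# Hypothetical gold-standard synsets used only for evaluation.
GOLD = {frozenset({"big", "large", "huge"}), frozenset({"quick", "fast"})}


def commutative_candidates(thesaurus):
    """Keep a synonym only when the relation holds in both directions."""
    candidates = set()
    for word, synonyms in thesaurus.items():
        mutual = {s for s in synonyms if word in thesaurus.get(s, set())}
        if mutual:
            candidates.add(frozenset(mutual | {word}))
    return candidates


def jaccard(a, b):
    """Assumed similarity: shared words divided by all words in both sets."""
    return len(a & b) / len(a | b)


def agglomerate(candidates, threshold):
    """Merge the most similar pair until no pair reaches the threshold."""
    clusters = sorted(candidates, key=sorted)  # deterministic starting order
    while len(clusters) > 1:
        a, b = max(combinations(clusters, 2), key=lambda p: jaccard(*p))
        if jaccard(a, b) < threshold:
            break  # stopping condition controlled by the threshold
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return set(clusters)


def f_measure(predicted, gold):
    """Harmonic mean of precision and recall over exact synset matches."""
    matched = len(predicted & gold)
    if matched == 0:
        return 0.0
    precision = matched / len(predicted)
    recall = matched / len(gold)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    candidates = commutative_candidates(THESAURUS)
    for threshold in (0.1, 0.5, 1.0):
        synsets = agglomerate(candidates, threshold)
        print(threshold, len(synsets), round(f_measure(synsets, GOLD), 3))
```

On this toy data, thresholds 0.1 and 0.5 merge the three overlapping candidates around "big" into one synset, while a threshold of 1.0 leaves all four candidates unmerged and lowers the F-measure. This mirrors the kind of threshold sweep the study describes, without claiming to reproduce its results.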

Keywords


WordNet; Synonym Set; Agglomerative Clustering



DOI: https://doi.org/10.32736/sisfokom.v9i2.855


Creative Commons License
Jurnal Sisfokom (Sistem Informasi dan Komputer), ISSN 2301-7988 and e-ISSN 2581-0588, is published by Lembaga Penelitian dan Pengabdian Masyarakat (LPPM) ISB Atma Luhur under a Creative Commons Attribution-ShareAlike 4.0 International License.