Development Synonym Set for the English Wordnet Using the Method of Comutative and Agglomerative Clustering
DOI:
https://doi.org/10.32736/sisfokom.v9i2.855Keywords:
Wordnet, Synonym Set, Agglomerative ClusteringAbstract
Wordnet is a collection of words that interpret or present a meaning, in its development Wordnet has an important part, the Synonym Set or Synset. In making Synonym sets, synonyms are needed and the commutative nature of words is needed. To get word synonyms, the English language thesaurus becomes the reference data for taking synonym data. Broadly speaking, the difference between Wordnet and the dictionary is that the meaning of the word is related to other words, to determine the equation requires a commutative process. The process is made easy by using commutative methods that will produce a candidate synonym set. Candidates for the synonym set cannot be used for word syntax, the grouping process of words which produces the Synonym set as the final result must be carried out. The process of grouping words can one of them use clustering techniques, in this study will use Agglomerative Clustering techniques. In the process of agglomerative clustering techniques there is a threshold value to determine the number of repetitions or as a condition to stop the iteration process. The clustering process in this study will use a threshold value of 0.1 to 1 to test the best threshold value to produce the best Synonym set and calculate its accuracy value. Accuracy calculation and evaluation will use the F-measure method to find the best results.References
G. A. Miller, "Introduction to WordNet: An on-line lexical database," International journal of lexicography, vol. 3, pp. 235-244, 1990.
M. I. Pribadi, "Pendeteksian Relasi Antar Makna Pada Wordnet Bahasa Indonesia," Universitas Komputer Indonesia, 2017.
F. R. d. D. P. D. Zamzami, "Apliasi wordne indonesia berdasarkan kamus thesaurus bahasa indonesia berdasarkan kamus thesaurus bahasa indonesia menggunakan algoritma rule based text parsing," Seminar Informatika Aplikatif Polinema, 2016.
M. A. B. d. K. M. Lhaksamana, "Pembangunan Synonym Set untuk WordNet Bahasa Indonesia dengan Menggunakan Metode Komutatif,," Indonesia Journal on Computing (Indo-JC), vol. 4, pp. 147-156, 2019.
G. A. Pradnyana, "Perancangan dan Implementasi Automated Document Integration dengan Menggunakan Algoritma Complete Linkage Agglomerative Hierarchical Clustering,” Jurnal Ilmu Komputer, vol. 5, no. 2, 2012.," Jurnal Ilmu Komputer, vol. 5, 2012.
L. D. Anggaraini, "Analisis Pembangunan Word Sense pada WordNet Bahasa Indonesia Menggunakan Metode Hierarchical Clustering," Telkom University, Bandung, 2019.
D. P. Nasional, Kamus besar bahasa Indonesia, 2008.
w. a. c. l. suprafti, "KONSTRUKSI TESAURUS NASKAH KUNO DENGAN PENDEKATAN LITERARY DAN USER WARRANT," jurnla ilmu perpustakaan, vol. 7, pp. 221-230, 2018.
A. Fadliana, "Penerapan metode Agglomerative Hierarchical Clustering untuk klasifikasi Kabupaten/Kota di Provinsi Jawa Timur berdasarkan kualitas pelayanan keluarga berencana," Universitas Islam Negeri Maulana Malik Ibrahim, 2015.
P. Etzioni, "Adaptive Web Sites: Conceptual Cluster Mining," IJCAI, p. 6, 1997.
M. A. B. I. P. P. Ananda, "Pembangunan Synsets untuk WordNet Bahasa Indonesia dengan Metode Komutatif," eProceedings of Engineering, vol. 5, 2018.
D. J. Restina, "Pembangunan Synonym Set untuk WordNet Bahasa Indonesia dengan Menggunakan Metode Komutati," Indo-JC, vol. 4, no. 2, 2019.
a. sabrina, "KLASIFIKASI ARTIKEL ONLINE TENTANG GEMPA DI INDONESIA MENGGUNAKAN MULTINOMIAL NA{"I}VE BAYES," publikasi tugas akhir s-1 PSTI FT-UNRAM, 2020.
D. M. Powers, "Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation," 2011.
R. a. I. P. P. A. Cahyani, "Analisis Sentimen terhadap Ulasan Hotel menggunakan Boosting Weighted Extreme Learning Machine," Jurnal Pengembangan Teknologi Indormasi dan Ilmu Komputer, p. 2548, 2019.
B. Sasirekha K, "Agglomerative hierarchical clustering algorithm-a," International Journal of Scientific and Research Publications, vol. 83, p. 83, 2013.
D. Mullner, "Modern hierarchical, agglomerative clustering algorithms," 2011.
A. Saputra, "Building synsets for Indonesian Wordnet with monolingual lexical resources," International Conference on Asian Language Processing, pp. 297-300, 2010.
rahayu, "Analisis Dan Implementasi Algoritma Agglomerative Hierarchical Clustering Untuk Deteksi Komunitas Pada Media Sosial Facebook," eProceedings of Engineering, vol. 5, 2018.
S. T. a. F. M. Z. Cristina Bosco, "Somewhere between Valency Frames and Synsets. Comparing Latin Vallex and Latin WordNet," in Proceesings of the Second Italian Conference on Computational Linguitics, trento, Accademia University Press, 2015.
Downloads
Additional Files
Published
Issue
Section
License
The copyright of the article that accepted for publication shall be assigned to Jurnal Sisfokom (Sistem Informasi dan Komputer) and LPPM ISB Atma Luhur as the publisher of the journal. Copyright includes the right to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms, and any other similar reproductions, as well as translations.
Jurnal Sisfokom (Sistem Informasi dan Komputer), LPPM ISB Atma Luhur, and the Editors make every effort to ensure that no wrong or misleading data, opinions or statements be published in the journal. In any way, the contents of the articles and advertisements published in Jurnal Sisfokom (Sistem Informasi dan Komputer) are the sole and exclusive responsibility of their respective authors.
Jurnal Sisfokom (Sistem Informasi dan Komputer) has full publishing rights to the published articles. Authors are allowed to distribute articles that have been published by sharing the link or DOI of the article. Authors are allowed to use their articles for legal purposes deemed necessary without the written permission of the journal with the initial publication notification from the Jurnal Sisfokom (Sistem Informasi dan Komputer).
The Copyright Transfer Form can be downloaded [Copyright Transfer Form Jurnal Sisfokom (Sistem Informasi dan Komputer).
This agreement is to be signed by at least one of the authors who have obtained the assent of the co-author(s). After submission of this agreement signed by the corresponding author, changes of authorship or in the order of the authors listed will not be accepted. The copyright form should be signed originally, and send it to the Editorial in the form of scanned document to sisfokom@atmaluhur.ac.id.