Title | : | Word Embeddings Evaluation on Indonesian Translation of AI-Quran and Hadiths |
Author | : |
MUHAMMAD ZIDNY N (1) Yunita Sari, S.Kom., M.Sc., Ph.D. (2) Dr. Yohanes Suyanto, M.I.Kom. (3) |
Date | : | 0 2021 |
Keyword | : | Word vector,CBOW,Binary Relevance,Multilayer Perceptron Word vector,CBOW,Binary Relevance,Multilayer Perceptron |
Abstract | : | Word vectors are an important part of machine learning. Word vectors are a numerical representation of text data. One of the methods that can be used to convert text into numerics is word embeddings. The word embeddings algorithm that researchers often use is Continuous Bag of Word, Skip-Gram, and FastText. This paper will discuss the transformation of textual data from Islamic knowledge domain documents into numerical forms using these three algorithms, then evaluate the word vector results using intrinsic and extrinsic evaluation techniques. We conduct intrinsic evaluations by determining the words to be evaluated, then checking for the existence of synonyms, antonyms, related words, and derived words from the nearest set of words based on vector values. We also tried to use vector words to solve word analogy problems. The best word vector in extrinsic evaluation is the result of the CBOW algorithm which is integrated with Binary Relevance and Multilayer Perceptron, with an accuracy value of 77.56% and a hamming loss value of 8.14%. |
Group of Knowledge | : | |
Level | : | Nasional |
Status | : |
Published
|
No | Title | Action |
---|---|---|
1 |
Naf’an_2021_IOP_Conf__Ser___Mater__Sci__Eng__1077_012025.pdf
Document Type : [PAK] Full Dokumen
|
View |
2 |
Turnitin-Word Embeddings Evaluation on Indonesian Translation of AI-Quran and Hadiths.pdf
Document Type : Cek Similarity
|
View |
3 |
_2021_IOP_Conf__Ser___Mater__Sci__Eng__1077_011001.pdf
Document Type : Dokumen Pendukung Karya Ilmiah (Hibah, Publikasi, Penelitian, Pengabdian)
|
View |