Efficient Estimation of Nepali Word Representations in Vector Space
DOI:
https://doi.org/10.3126/jiee.v3i1.34327Keywords:
CBOW, NCE loss, One Hot Encoding, Skip-gram, TF-IDF, Word2VecAbstract
Word representation is a means of representing a word as mathematical entities that can be read, reasoned and manipulated by computational models. The representation is required for input to any new modern data models and in many cases, the accuracy of a model depends on it. In this paper, we analyze various methods of calculating vector space for Nepali words and postulate a word to vector model based on the Skip-gram model with NCE loss capturing syntactic and semantic word relationships.
This is an attempt to implement a paper by Mikolov on Nepali words.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Upon acceptance of an article, the copyright for the published works remains in the JIEE, Thapathali Campus and the authors.