Cross-Lingual Short-Text Semantic Similarity for Kannada–English Language Pair
Document Type
Article
Publication Title
Computers
Abstract
Analyzing the semantic similarity of cross-lingual texts is a crucial part of natural language processing (NLP). The computation of semantic similarity is essential for a variety of tasks such as evaluating machine translation systems, quality checking human translation, information retrieval, plagiarism checks, etc. In this paper, we propose a method for measuring the semantic similarity of Kannada–English sentence pairs that uses embedding space alignment, lexical decomposition, word order, and a convolutional neural network. The proposed method achieves a maximum correlation of 83% with human annotations. Experiments on semantic matching and retrieval tasks resulted in promising results in terms of precision and recall.
DOI
10.3390/computers13090236
Publication Date
9-1-2024
Recommended Citation
Muralikrishna, S. N.; Holla, Raghurama; Harivinod, N.; and Ganiga, Raghavendra, "Cross-Lingual Short-Text Semantic Similarity for Kannada–English Language Pair" (2024). Open Access archive. 10073.
https://impressions.manipal.edu/open-access-archive/10073