arXiv Research
Abstract: We explore the impact of leveraging the relatedness of languages that belong to the same family in NLP models using multilingual fine-tuning. We hypothesize and validate that multilingual fine-tuning of pre-trained language models can yield better performance on downstream…
Abstract: Phoneme recognition is a largely unsolved problem in NLP, especially for low-resource languages like Urdu. The systems that try to extract the phonemes from audio speech require hand-labeled phonetic transcriptions. This requires expert linguists to annotate speech data with its relevan…
Abstract: Finding similarities between two inter-language news articles is a challenging problem in Natural Language Processing (NLP). It is difficult to find similar news articles in a language other than the user's native language, so there is a need for a machine-learning-based automatic system to find the similarity between two inter-language news articles…
Abstract: There are several online newspapers in Urdu, but it is difficult for users to find the content they are looking for because most of these newspapers contain irrelevant data, and most users do not get what they want to retrieve. Our proposed framework will help to predict…
Abstract: …propose the novel task of detecting propaganda techniques in code-switched text. To support this task, we create a corpus of 1,030 texts code-switching between English and Roman Urdu, annotated with 20 propaganda techniques, which we make publicly available. We perform a number of experiments contrasting different experimental setups, and we find that it is…
Abstract: This study provides Urdu poetry generated using different deep-learning techniques and algorithms. The data was collected through the Rekhta website, containing 1341 text files with several couplets. The data on poetry was not from any specific genre or poet. Instead, it was a collection of mixed…
Abstract: With the advent of Deep Learning based Artificial Neural Network models, Natural Language Processing (NLP) has witnessed significant improvements in textual data processing in terms of its efficiency and accuracy. However, the research is mostly restricted to high-resource languages such as English, and low-resource languages still suffer from a lack of avai…
Abstract: This paper introduces UQA, a novel dataset for question answering and text comprehension in Urdu, a low-resource language with over 70 million native speakers. UQA is generated by translating the Stanford Question Answering Dataset (SQuAD2.0), a large-scale English QA dataset, using a technique called EATS (Enclose to Anchor, Translate, Seek), which preserve…
Abstract: …processing research, by transitioning from language- and task-specific model pipelines to a single model adapted on a variety of tasks. However, the majority of existing multilingual NLP benchmarks for LLMs provide evaluation data in only a few languages with little linguistic diversity. In addition, these benchmarks lack quality assessment against the respective st…
Abstract: Empathy plays a pivotal role in fostering prosocial behavior, often triggered by the sharing of personal experiences through narratives. However, modeling empathy using NLP approaches remains challenging due to its deep interconnection with human interaction dynamics. Previous approaches, which involve fine-tuning language models (LMs) on human-annotated emp…
Abstract: In this paper, we compare general-purpose models, GPT-4-Turbo and Llama-3-8b, with special-purpose models--XLM-Roberta-large, mT5-large, and Llama-3-8b--that have been fine-tuned on specific tasks. We focus on seven classification and seven generation tasks to evaluate the performance of these models on the Urdu language.…
Abstract: Large language models (LLMs) have garnered significant interest in natural language processing (NLP), particularly their remarkable performance in various downstream tasks in resource-rich languages. Recent studies have highlighted the limitations of LLMs in low-resource languages, primarily focusing on binary classification tasks and giving minimal attentio…