UrduLM: A Resource-Efficient MonolingualUrduLanguage Model
… Low-resourceLLMmethodologies require addressing data scarcity, tokenization inefficiency…for this work is to promoteAIaccessibility forUrdu-speaking communities, who are currently …
Monitoring AI enhancements in Urdu
Google Scholar papers focused on Urdu AI, LLMs, and datasets
… Low-resourceLLMmethodologies require addressing data scarcity, tokenization inefficiency…for this work is to promoteAIaccessibility forUrdu-speaking communities, who are currently …
… howartificialintelligence, neural machine translators (NMT) and large language models (LLM…The paper reviews the accuracy of NMT translating classical Arabic texts intoUrduand the …
…Urdu, a low-resource language with limited datasets and NLP tools. To deal with this gap, we employed the Urdu Human and AI … LLM text detection. Their results showed that humans … /think.
… We solve the issue of dataset rigor by using a primary dataset of 64,000Urdunews articles… at summarizingUrdu. Consequently, this research represents a notable contribution toUrdu…
… which may occasionally have syntactical English andUrdutexts due to Pakistan’s code…language RomanUrdu, a custom dataset was used to handleNaturalLanguageProcessingand …
… sentiment analysis an intersection ofnaturallanguageprocessing(NLP) and machine learning(… in Sindhi andUrdu, thereby contributing to both multilingualNLPresearch and electoral …
… XLM-R is adopted due to its strong cross-lingual performance and support forUrduin itsvocabulary [10, 21], while mBERT is included as a widely used baseline for multilingualNLP[13]…
… Low resource Perso-Arabic languages such as Persian andUrdulack sufficient Grapheme-…We address this by exploiting phonological similarities between Arabic, Persian, andUrduto …
… in RomanUrdu, RomanUrduis commonly used script forUrdu… showed its ability in RomanUrdutext. The research presents … discoveries by utilizing hybridNLPmethods for improving …
…NLPtechniques to identify depressive cues in social media posts, focusing on both standardUrduand code-mixed RomanUrdu, … , we converted NastaliqUrduinto RomanUrduand …
… of Arabicscript languages within theNLPcommunity. … , which, despite its long tradition ofNLPresearch, continues to face … , used in languages such as Persian,Urdu, Pashto, and others, …
… between French andUrduthrough a componential and … between French andUrdulexicalcorrespondences despite … analysis can informNaturalLanguageProcessing(NLP) — …