GitHub
The project combines the capabilities of a Text-to-Speech (TTS) model and a fine-tuned Language Model (LLM) to create a seamless virtual human assistant. The TTS model converts text responses from the LLM into natural-sounding speech inUrdu, providing an engaging and interactive experience for users.
Validation Rationale:
The post specifically discusses a project using a Text-to-Speech (TTS) model and a Language Model (LLM) tailored for Urdu, aligning with Urdu AI development.
Visit GitHub
GitHub
An advanced Retrieval-Augmented Generation (RAG) Voice Chatbot designed for seamless human-like conversations inUrduandEnglish. Built with Google GenerativeAI, Streamlit, and LangChain, this chatbot offers robust functionality, including: Voice-based Q&A: SupportsUrduandEnglish voice inputs for real-time, context-aware conversations.
Validation Rationale:
The content explicitly mentions a specific Urdu-based RAG Voice Chatbot developed using AI technologies, directly aligning with the specified categories.
Visit GitHub
GitHub
TheUrdu-Instruct dataset is a high-quality multilingual synthetic corpus containing 51,686 training and 1,084 test examples. It was generated using GPT-4o under a modified self-instruction framework to improve instruction-following, reasoning, and bilingual understanding inUrdu. This dataset is part of the Alif-1.-8B-Instruct project and was created by the Traversaal.AIResearch Team. It ...
Validation Rationale:
The content clearly refers to the Urdu-Instruct dataset, which is a specific example of Urdu AI development.
Visit GitHub
GitHub
A Streamlit-basedUrduVoice Chatbot: This project demonstrates an interactiveUrduvoice chatbot built using Streamlit and integrated with Google's speech recognition and text-to-speech services, as well as Gemini's GenerativeAImodel. Users can record their voice inUrdu, receive responses inUrdu,andinteract with anAI-powered chatbot.
Validation Rationale:
The post clearly discusses an Urdu-based AI chatbot project using specific technologies and services, aligning with Urdu AI development.
Visit GitHub
GitHub
TheUrdu-Instruct dataset is a high-quality multilingual synthetic corpus containing 51,686 training and 1,084 test examples. It was generated using GPT-4o under a modified self-instruction framework to improve instruction-following, reasoning, and bilingual understanding inUrdu. This dataset is part of the Alif-1.-8B-Instruct project and was created by the Traversaal.AIResearch Team. It ...
Validation Rationale:
The snippet discusses the Urdu-Instruct dataset, which is a multilingual synthetic corpus and part of a specific Urdu AI project, indicating a focus on Urdu AI development.
Visit GitHub