Ritam Digital English

Sarvam AI Launches OpenHathi: Groundbreaking Hindi Language Model

Sarvam AI, a relatively new player in the AI field, recently introduced OpenHathi-Hi-v0.1, the inaugural model in their OpenHathi series.

By Editor Ritam English
Dec 14, 2023, 10:49 am IST

Sarvam AI, a relatively new player in the AI field, recently introduced OpenHathi-Hi-v0.1, the first model in its OpenHathi series. This Hindi large language model (LLM) is built on Meta AI’s Llama2-7B. Sarvam AI claims that OpenHathi-Hi-v0.1 performs on par with GPT-3.5 on Indic languages.

The model extends Llama2-7B’s tokenizer to a vocabulary of 48,000 tokens. Training proceeds in two phases: first, an embedding alignment phase that aligns the randomly initialized Hindi embeddings; second, a bilingual language modeling phase in which the model is trained to attend cross-lingually across tokens.
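As an illustration of what extending a tokenizer involves, here is a minimal Python sketch. It is not Sarvam AI’s code: the toy vocabulary, the tiny embedding width, and the helper names `extend_vocab` and `extend_embeddings` are all assumptions made for this example. It shows existing token IDs and embedding rows left unchanged while new tokens are appended with randomly initialized rows, which is the starting state that an embedding alignment phase would then train.

```python
import random

def extend_vocab(base_vocab, new_tokens):
    """Append tokens not already present, keeping existing IDs stable."""
    merged = list(base_vocab)
    seen = set(base_vocab)
    for tok in new_tokens:
        if tok not in seen:
            merged.append(tok)
            seen.add(tok)
    return merged

def extend_embeddings(embeddings, new_size, dim, seed=0, scale=0.02):
    """Keep existing rows; add randomly initialized rows up to new_size."""
    rng = random.Random(seed)
    extended = [row[:] for row in embeddings]
    while len(extended) < new_size:
        extended.append([rng.gauss(0.0, scale) for _ in range(dim)])
    return extended

# Toy data (hypothetical): a four-token base vocabulary and three candidates,
# one of which already exists in the base vocabulary.
base_vocab = ["<unk>", "the", "model", "▁क"]
hindi_tokens = ["▁क", "भाषा", "हिंदी"]
vocab = extend_vocab(base_vocab, hindi_tokens)

base_emb = [[0.1 * i] * 4 for i in range(len(base_vocab))]
emb = extend_embeddings(base_emb, len(vocab), dim=4)

print(len(base_vocab), "->", len(vocab), "tokens")
```

Appending rather than re-indexing matters here: the pretrained rows keep their meaning, so only the new rows start from noise.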

Sarvam AI emphasizes that the model’s usefulness goes beyond standard Natural Language Generation (NLG) tasks, saying it has also evaluated performance in real-world applications. The company collaborated with KissanAI to fine-tune the base model on conversational data collected from interactions between a GPT-powered bot and farmers in various languages.

“We show that our model works as well as, if not better than GPT-3.5 on various Hindi tasks while maintaining its English performance,” the company said in a post on X (formerly Twitter).

To improve Llama-2’s Hindi capabilities, Sarvam AI first made tokenization of Hindi text more efficient. The aim was to speed up both training and inference by lowering the fertility score, the average number of tokens a word is split into. To do this, the company trained a SentencePiece tokenizer on a subset of the Sangraha corpus and merged it with the Llama2 tokenizer, producing a new tokenizer with a 48,000-token vocabulary.
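The fertility score described above can be computed in a few lines. The Python sketch below is illustrative only: the two toy tokenizers, the sample text, and the function names are assumptions for this example and do not reproduce the Llama2 or OpenHathi tokenizers. It compares a character-level fallback with a tokenizer whose vocabulary contains whole Hindi words.

```python
def fertility(texts, tokenize):
    """Average number of tokens per whitespace-separated word."""
    total_words = 0
    total_tokens = 0
    for text in texts:
        for word in text.split():
            total_words += 1
            total_tokens += len(tokenize(word))
    if total_words == 0:
        raise ValueError("no words to score")
    return total_tokens / total_words

def char_tokenize(word):
    # Stand-in for a tokenizer with no Hindi vocabulary:
    # one token per Unicode code point.
    return list(word)

def make_vocab_tokenizer(vocab):
    # Greedy longest-match against vocab, falling back to single characters.
    def tokenize(word):
        tokens, i = [], 0
        while i < len(word):
            for j in range(len(word), i, -1):
                if word[i:j] in vocab or j == i + 1:
                    tokens.append(word[i:j])
                    i = j
                    break
        return tokens
    return tokenize

texts = ["हिंदी भाषा"]  # hypothetical sample text
base_score = fertility(texts, char_tokenize)
extended_score = fertility(texts, make_vocab_tokenizer({"हिंदी", "भाषा"}))

print("character-level fertility:", base_score)
print("extended-vocabulary fertility:", extended_score)
```

A lower score means fewer tokens for the same text, so more Hindi fits in a fixed context window and each training or inference step covers more words.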

Founded in July 2023 by Vivek Raghavan and Pratyush Kumar, Sarvam AI has quickly attracted attention and support. In a funding round earlier this month, the startup raised $41 million, led by Lightspeed Ventures with participation from Peak XV Partners and Khosla Ventures. The size of the round reflects investor confidence in Sarvam AI’s potential.

Tags: AI, Artificial Intelligence, Sarvam AI, OpenHathi, Hindi large language model, Llama2-7B, Indic languages, Llama2, Vivek Raghavan, Pratyush Kumar

© Ritam Digital Media Foundation.