Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
ghananlpcommunity 's Collections
Parallel Domain Translation Dataset
MT Accuracy Evaluation
Domain Speech Datasets
Ghana Adolescent Conversations Datasets
UNICEF ASR Datasets
Ghana Navigation Corpus
SoTA LLM datasets
SoTA MT datasets
SoTA ASR datasets
Ghana TTS

SoTA LLM datasets

updated Apr 1

These are our best datasets for Training Large Language Models

Upvote
-

  • ghananlpcommunity/pristine-twi

    Viewer • Updated Mar 22 • 999k • 161
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs