Shaip


High-Quality TTS Datasets

Enhance your ASR, NLP, and speech synthesis projects with diverse multilingual TTS datasets

Speech Datasets

  • Burmese Dataset: General Conversation, TTS
  • Canadian French Dataset: TTS
  • Chinese Simplified Dataset: TTS
  • Chinese Traditional Dataset: TTS
  • Chittagonian Dataset: General Conversation, TTS
  • Danish Dataset: Call-Center, General Conversation, Podcast, Scripted Monologue, TTS
  • Dari Dataset: General Conversation, TTS
  • Dogri Dataset: General Conversation, TTS
  • Gojri Dataset: General Conversation, TTS
  • Hindi Dataset: Call-Center, General Conversation, Podcast, Scripted Monologue, TTS
  • Kashmiri Dataset: General Conversation, TTS
  • Nagamese Dataset: General Conversation, TTS
  • Sinhalese Dataset: General Conversation, TTS

Comprehensive Speech Data Solutions: Fast, Flexible, and Best-in-Class Quality

Comprehensive Voice Data Solutions

End-to-end service: Complete service with expert domain knowledge and fast delivery.

Flexible: Choose custom, semi-custom, or off-the-shelf voice datasets with flexible ownership.

Domain Expert: Hire a specialized domain expert for fast, high-quality AI datasets.

Quality: Get quality checks from industry experts.

Licensing: Get a license tailored to your needs.

Ethical Data: We ensure contributors are informed and consent to data use.

Contact Us

(US): (866) 473-5655

marketing@shaip.com
vendorcolab@shaip.com
career@shaip.com

© 2026 Shaip. All rights reserved.