Shaip
  • What We Do Best
        • AI Training Data

          Text Annotation TextUnlock critical information found deep within unstructured text.

          Speech Annotation SpeechBuild multi-lingual conversational AI with high-quality speech datasets.

          Image Annotation ImageAssign keywords to a digital image making it recognizable by machines.

          Video Annotation VideoAnnotate keypoints to moving objects making them recognizable by machines.

        • AI Data Services

          Data CollectionData CollectionCreate, collect & curate audio, images, text, and video from across the globe.

          Data TranscriptionData TranscriptionAI-driven, cloud-based transcription that supports 150+ languages.

          Data AnnotationData Annotation & LabelingAccurately annotate training data to make AI & ML think faster & smarter.

          Data De-IdentificationData De-identificationEnsure compliance with credentialed & certified domain experts.

        • Data Catalog & Licensing

          Medical DatasetsMedical DatasetsGold standard, high-quality, de-identified healthcare data.View Data Catalog

          Speech DatasetsSpeech DatasetsSource, transcribed & annotated speech data in over 50 languages.View Data Catalog

          Open DatasetsOpen DatasetsPublicly available datasets to train your AI/ML models.View Data Catalog

        • View all
  • ShaipCloud™ Platform
  • Solutions
  • Resources
        • Resources

          Case Study - ShaipCase Study

          One Pager - ShaipOne Pager

          Buyer's Guide - ShaipBuyer’s Guide

          Sample Datasets - ShaipSample Dataset

          Blogs - ShaipBlog

        • Recent Blogs

          • Ai In HealthcareHow the IoT and AI in Healthcare Are Poised to Transform the Industry
          • Ai Training DataThe Only Guide On AI Training Data You Will need in 2021
          • Healthcare AiHow Shaip Helps Teams Build Healthcare AI Solutions
  • Company
        • About Us - ShaipAbout

          Leadership Team - ShaipLeadership

          Social Impact - ShaipSocial Impact

        • EventsEvents & Webinar

          Security & Compliance - ShaipSecurity & Compliance

          Press Room - ShaipPress Room

        • Shaip PartnersPartners

          Careers - ShaipCareers

          Contact Us - ShaipContact

Contact Us
Shaip
Menu
  • What We Do Best
    • Training Data
      • Text
      • Speech
      • Image
      • Video
    • AI Data Services
      • Data Collection
      • Data Transcription
      • Data Annotation & Labeling
      • Data De-Identification
    • Data Catalog & Licensing
      • Medical Datasets
      • Speech Datasets
      • Open Datasets
  • ShaipCloud™ Platform
  • Solutions
  • Resources
    • Case Study
    • One Pager
    • Buyer’s Guide
    • Sample Datasets
    • Blog
  • Company
    • About
    • Leadership
    • Social Impact
    • Events & Webinar
    • Security and Compliance
    • Press Room
    • Partners
    • Careers
    • Contact

High Quality Curated Data to Train Your AI Model

Download our sample datasets for your Machine Learning Models

DatasetsFileUse CaseDescriptionDownload
Physician Dictation
Physician Dictation Audio Files
Audio Files
HealthcareAn hour of audio, dictated by physicians describing patients’ clinical condition & plan of care in the hospital/clinical setting.Download
Physician Dictation
Verbatim Transcribed Text Files
Verbatim Transcribed Text Files
HealthcareA set of transcribed documents corresponding to the dictation audio dataset. Verbatim transcription, as required to train speech recognition acoustic & vocabulary models.Download
Physician Clinical Notes
Physician Dictation Notes
Dictation Notes
HealthcareA set of clinical documents as dictated by the physician describing patients’ clinical condition.Download
Physician Clinical Notes
Physician Dictation Notes
De-identified Dictation Notes
HealthcareA set of formatted clinical documents as dictated by the physicians to train medical AI models.Download
Human-Bot Conversations
Canadian French
Canadian French
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Human-Bot Conversations
Australian English
Australian English
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Human-Bot Conversations
Uk English
UK English
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Danish
Danish
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Hindi
Hindi
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Telugu
Telugu
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Indonesian
Indonesian
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Hebrew
Hebrew
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Malay
Malay
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Afrikaans
Afrikaans
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Arabic
Arabic
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Irish
Irish
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Scottish
Scottish
Conversational AIAn hour of audio conversation & transcribed json filesDownload
Conversations Datasets
Welsh
Welsh
Conversational AIAn hour of audio conversation & transcribed json filesDownload

Shaip
Information
  • What We Do Best
  • ShaipCloud™ Platform
  • Solutions
  • Resources
  • Company
Contact Us
Address

US Office

12806 Townepark Way Louisville, KY 40243-2311

India Office

B-605, Wall Street-2, Opp. Orient Club, Ellisbridge, Ahmedabad, Gujarat 380006

Contact Us

Phone (US): (866) 473-5655
Phone (RoW): (91) 80684-71130
Email: info@shaip.com

Follow Us
LinkedIn Icon
Twitter Icon
Facebook Icon
Instagram Icon

© 2018 – 2021 Shaip | All Rights Reserved

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies. Read More
Cookie settingsAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled

Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.

Non-necessary

Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.

SAVE & ACCEPT

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?

  • By registering, you confirm that you agree to the storing and processing of your personal data by Shaip as described in the Privacy Statement

Where should we send your training data?