High-Quality Off-the-Shelf Datasets to Train Your AI Model
Get professional, scalable, and reliable sample datasets to train ML models for chatbot, conversational AI, and healthcare applications.
Datasets | File | Use Case | Description | Download |
---|---|---|---|---|
Physician Dictation | Audio Files | Healthcare | An hour of audio, dictated by physicians describing patients’ clinical condition & plan of care in the hospital/clinical setting. | Download |
Physician Dictation | Verbatim Transcribed Text Files | Healthcare | A set of transcribed documents corresponding to the dictation audio dataset. Verbatim transcription, as required to train speech recognition acoustic & vocabulary models. | Download |
Physician Clinical Notes | Dictation Notes | Healthcare | A set of clinical documents as dictated by the physician describing patients’ clinical condition. | Download |
Physician Clinical Notes | De-identified Dictation Notes | Healthcare | A set of formatted clinical documents as dictated by the physicians to train medical AI models. | Download |
Human-Bot Conversations | Australian English | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Human-Bot Conversations | UK English | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Danish | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Hindi | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Telugu | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Indonesian | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Hebrew | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Malay | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Afrikaans | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Arabic | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Irish | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Scottish | Conversational AI | An hour of audio conversation & transcribed json files | Download |
Conversations Datasets | Welsh | Conversational AI | An hour of audio conversation & transcribed json files | Download |
We license all types of data: text, audio, video, and image. The sample datasets above include Human-Bot Conversations, Chatbot Training Dataset, Conversational AI Datasets, Physician Dictation Dataset, Physician Clinical Notes, Medical Conversation Dataset, Medical Transcription Dataset, and Doctor-Patient Conversational Dataset, among others.
Can’t find what you are looking for? New off-the-shelf datasets are being collected across all data types: text, audio, image, and video. Contact us today.