High-Quality Curated Data to Train Your AI Model

Download a sample to see the kind of data we can deliver

Download our sample datasets for your machine learning models

Dataset | File | Use Case | Description
--- | --- | --- | ---
Physician Dictation | Physician Dictation Audio Files | Healthcare | One hour of audio dictated by physicians describing patients' clinical condition and plan of care in a hospital or clinical setting.
Physician Dictation | Verbatim Transcribed Text Files | Healthcare | A set of transcribed documents corresponding to the dictation audio dataset. The transcription is verbatim, as required to train speech recognition acoustic and vocabulary models.
Physician Clinical Notes | Physician Dictation Notes (de-identified) | Healthcare | A set of clinical documents, as dictated by physicians, describing patients' clinical condition.
Physician Clinical Notes | Physician Dictation Notes (de-identified) | Healthcare | A set of formatted clinical documents, as dictated by physicians, for training medical AI models.
Human-Bot Conversations | Canadian French | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Human-Bot Conversations | Australian English | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Human-Bot Conversations | UK English | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Danish | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Hindi | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Telugu | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Indonesian | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Hebrew | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Malay | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Afrikaans | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Arabic | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Irish | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Scottish | Conversational AI | 1 hour of audio conversation and transcribed JSON files
Conversations Datasets | Welsh | Conversational AI | 1 hour of audio conversation and transcribed JSON files

Still have questions about shAIp Data Services?

Contact Us