Overview
Title (Language)
English Language Dataset
Dataset Types
Call Center, Music, Medical
Country
All
Description
Unscripted, synthetic telephonic conversations between an agent and a customer are available with an approximate duration of 5 to 15 minutes, along with singing audio collections with transcription, and medical data resources.
Use Case
ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
| Dataset Type | Sampling Rate | Speakers | Channel | Total Hours | Total Number of Speakers |
|---|---|---|---|---|---|
| Call Center | 16 kHz | 2 Speakers | Mono | 3,109:24:52 | On Request |
| Medical | 16 kHz | 2 Speakers | Mono | - | 2,141 |
| Music | 48 kHz | Single Speaker | Mono | 23:43:20 | 43 |
Featured Clients
Empowering teams to build world-leading AI products.
Can’t find what you are looking for?
New off-the-shelf datasets are being collected across all data types
Contact us now to let go of your audio/speech training data collection worries