Overview
Title (Language)
Indian English Language Dataset
Dataset Types
Call Center, General Conversation, Utterance
Country
India
Description
This dataset includes unscripted synthetic agent–customer telephone conversations (5–15 minutes each), natural human-to-human telephone conversations (15–60 minutes each), and utterance-level data.
Use Cases
ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Dataset Details
| Dataset Type | Sampling Rate | Speakers per Recording | Channel | Total Duration (hh:mm:ss) | Total Number of Speakers |
|---|---|---|---|---|---|
| Call Center | 8 kHz | 2 Speakers | Mono | 666:56:30 | 1,900 |
| Call Center | 16 kHz | 2 Speakers | Mono | 215:56:10 | 6,802 |
| General Conversation | 16 kHz | 2 Speakers | Mono | 46:28:54 | 214 |
| Utterance | 32 kHz | Single Speaker | Mono | 201:00:00 | 61 |
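
The durations in the table are given as hh:mm:ss, so the combined size of the four subsets is not obvious at a glance. The following is a minimal sketch of that arithmetic, assuming the figures above are exact; the names `DURATIONS`, `to_seconds`, and `to_hms` are illustrative and not part of any published tooling for this dataset.

```python
# Total duration per subset, copied from the table above (hh:mm:ss).
# The dictionary keys are illustrative labels, not official subset names.
DURATIONS = {
    "Call Center (8 kHz)": "666:56:30",
    "Call Center (16 kHz)": "215:56:10",
    "General Conversation (16 kHz)": "46:28:54",
    "Utterance (32 kHz)": "201:00:00",
}


def to_seconds(hms: str) -> int:
    """Convert an 'hh:mm:ss' string (hours may exceed 99) to seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def to_hms(total_seconds: int) -> str:
    """Convert a number of seconds back to an 'hh:mm:ss' string."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours}:{minutes:02d}:{seconds:02d}"


total_seconds = sum(to_seconds(value) for value in DURATIONS.values())
print(to_hms(total_seconds))  # 1130:21:34
```

Under these assumptions the four subsets add up to roughly 1,130 hours of audio.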