Kannada Dataset
ಕನ್ನಡ ಡೇಟಾಸೆಟ್
Overview
Title: Kannada Language Dataset
Dataset Type: Call-Center
Description: Unscripted, synthetic telephone conversations between an “agent” and a “customer”. Approximate audio duration (range): 5-15 minutes.
Use Case: ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total Hours: 60
Sample Rate: –
Audio Channel: –
Recording Platform: Desktop
Audio Format: .wav
Transcription Format: .json
WER (%): 5
Data Set Demographics
Country: India
Language: Kannada
Gender: –
Number of Speakers: –
Age: 18-50
Overview
Title: Kannada Language Dataset
Dataset Type: General Conversation
Description: Unscripted, synthetic telephone conversations between an “agent” and a “customer”. Approximate audio duration (range): 5-15 minutes.
Use Case: ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total Hours: 100
Sample Rate: –
Audio Channel: –
Recording Platform: Desktop
Audio Format: .wav
Transcription Format: .json
WER (%): 5
Data Set Demographics
Country: India
Language: Kannada
Gender: –
Number of Speakers: –
Age: 18-50
Overview
Title: Kannada Language Dataset
Dataset Type: Media Audio
Description: Licensable public-domain audio/video files, such as interviews and podcasts, with 1 to 5 speakers. Approximate audio duration (range): 15-60 minutes.
Use Case: ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total Hours: 40
Sample Rate: –
Audio Channel: –
Recording Platform: Web Sourcing
Audio Format: .wav
Transcription Format: .json
WER (%): 5
Data Set Demographics
Country: India
Language: Kannada
Gender: –
Number of Speakers: –
Age: 18-50
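The listings above each quote a WER of 5% without saying how it was measured. As a rough illustration only, the sketch below computes word error rate the conventional way: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the whitespace tokenization is an assumption; the provider's actual scoring and text normalization are not described in the source.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage, assuming whitespace-separated words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return 100.0 * prev[-1] / len(ref)


# One substituted word out of four reference words.
print(word_error_rate("a b c d", "a b x d"))
```

Under this definition, a 5% WER corresponds to roughly one word-level error per twenty reference words.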