UK English Dataset

Overview

Title

UK English Language Dataset

Dataset Type

Wake Word

Description

Keyphrases collection of data

  • 200 speakers
  • 4 unique keyphrases per speaker
  • 25-30 repeated keyphrases recordings per unique keyphrase
  • 25-30 audio files per unique keyphrase
  • 120 total recorded utterances per speaker

Use Case

ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling

Data Set Details

Total hours

200

Sample Rate

16 kHz

Audio Channel

1 channel

Recording Platform

Mobile App

Audio Format

.wav

Transcription Format

.json

WER (%)

5

Data Set Demographics

Country

UK

Language

UK English

Gender

Female 50%, Male 50%, Unknown 10%

Number of Speakers

Age

18-50

Featured Clients

Empowering teams to build world-leading AI products.

Shaip Contact Us

Can’t find what you are looking for?

New off-the-shelf datasets are being collected across all data types

Contact us now to let go of your audio/speech training data collection worries

  • By registering, I agree with Shaip Privacy Policy and Terms of Service and provide my consent to receive B2B marketing communication from Shaip.