Irish Dataset

Tacar Sonraí Iris

Overview

Title

Irish Language Dataset

Dataset Type

General Conversation

Description

Unscripted telephonic conversation between two people. Approx. Audio Duration (Range) – 15-60 minutes.

Use Case

ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling

Data Set Details

Total hours

192

Sample Rate

8 kHz

Audio Channel

Dual

Recording Platform

Desktop

Audio Format

.wav

Transcription Format

.json

WER (%)

5

Data Set Demographics

Country

Ireland

Language

Irish

Gender

Female 213, Male 153, Unknown 0

Number of Speakers

366

Age

18-50

Featured Clients

Empowering teams to build world-leading AI products.

Shaip contact us

Can’t find what you are looking for?

New off-the-shelf datasets are being collected across all data types

Contact us now to let go of your audio/speech training data collection worries

  • By registering, I agree with Shaip Privacy Policy and Terms of Service and provide my consent to receive B2B marketing communication from Shaip.
  • This field is for validation purposes and should be left unchanged.