High Quality Curated Data to Train Your AI Model

Download to check the kind of data we can deliver.

Physician Dictation Audio & Transcribed Reports

Physician Dictation Audio Files

A set of 16 hours of audio, dictated by physicians describing patients’ clinical condition and plan of care based on physician-patient encounters in the hospital/clinical setting.

Download

Verbatim Transcribed Text Files

A set of transcribed documents corresponding to the dictation audio dataset. Transcription has been done verbatim, as required to train speech recognition acoustic and vocabulary models.

Download

Human-Bot Conversations
(Audio and Transcribed JSON)

Canadian French Transcribed Files

5 hours of Canadian French language human-bot audio conversation and transcribed json files

Download

Australian English Transcribed Files

5 hours of Australian English language human-bot audio conversation and transcribed json files

Download

UK English Transcribed Files

5 hours of UK English language human-bot audio conversation and transcribed json
files

Download

Conversations to Train your AI model
(Audio and JSON)

Danish
Audio & Transcribed
Files

A set of 5 hours of Danish language Audio & Transcribed files.

Download

Hindi
Audio & Transcribed
Files

A set of 5 hours of Hindi language
Audio & Transcribed files.

Download

Telugu
Audio & Transcribed
Files

A set of 5 hours of Telugu language Audio & Transcribed files.

Download

Indonesian
Audio & Transcribed
Files

A set of 5 hours of Indonesian language Audio & Transcribed files.

Download

Hebrew
Audio & Transcribed
Files

A set of 5 hours of Hebrew language Audio & Transcribed files.

Download

Malay
Audio & Transcribed
Files

A set of 5 hours of Malay language
Audio & Transcribed files.

Download

Afrikaans
Audio & Transcribed
Files

A set of 5 hours of Afrikaans language Audio & Transcribed files.

Download

Arabic
Audio & Transcribed
Files

A set of 5 hours of Arabic language Audio & Transcribed files.

Download