Seoul National University Medical Research Center, Medical Big Data Research Center

Building a Korean conversational speech database in the emergency medical domain

김선희, 이주영, 최서경, 지승훈, 강지민, 김종인, 김도희, 김보령, 조은기, 김호정, 장정민, 김준형, 구본혁, 박형민, 정민화

ABSTRACT

This paper describes a method of building Korean conversational speech data in the emergency medical domain and proposes an annotation method for the collected data in order to improve speech recognition performance. To suggest future research directions, baseline speech recognition experiments were conducted using a portion of the collected and annotated data. All speech was recorded at 16-bit resolution with a 16 kHz sampling rate. A total of 166 conversations were collected, amounting to 8 hours and 35 minutes. Several types of information, including orthography, pronunciation, dialect, noise, and medical information, were manually annotated using Praat. The baseline speech recognition experiments were used to illustrate problems related to speech recognition in the emergency medical domain. The Korean conversational speech data presented in this paper are first-stage data in the emergency medical domain and are expected to be used as training data for developing conversational systems for emergency medical applications.

DOI

10.13064/KSSS.2020.12.4.081
Type

JOURNAL
Journal/Book

Phonetics and Speech Sciences
Publication date

2020/12/31
Keywords

conversational speech; speech data; speech recognition; annotation; emergency medical domain
External link

https://doi.org/10.13064/KSSS.2020.12.4.081
