http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
다음색 감정 음성합성 응용을 위한 감정 SSML 처리기
유세희,조희,이주현,홍기형,Ryu, Se-Hui,Cho, Hee,Lee, Ju-Hyun,Hong, Ki-Hyung 한국음향학회 2021 韓國音響學會誌 Vol.40 No.5
In this paper, we designed and developed an Emotional Speech Synthesis Markup Language (SSML) processor. Multi-speaker emotional speech synthesis technology that can express multiple voice colors and emotional expressions have been developed, and we designed Emotional SSML by extending SSML for multiple voice colors and emotional expressions. The Emotional SSML processor has a graphic user interface and consists of following four components. First, a multi-speaker emotional text editor that can easily mark specific voice colors and emotions on desired positions. Second, an Emotional SSML document generator that creates an Emotional SSML document automatically from the result of the multi-speaker emotional text editor. Third, an Emotional SSML parser that parses the Emotional SSML document. Last, a sequencer to control a multi-speaker and emotional Text-to-Speech (TTS) engine based on the result of the Emotional SSML parser. Based on SSML which is a programming language and platform independent open standard, the Emotional SSML processor can easily integrate with various speech synthesis engines and facilitates the development of multi-speaker emotional text-to-speech applications.
법과학적 활용을 위한 삼성 스마트폰 음성 녹음 파일의 메타데이터 구조 및 속성 비교 분석 연구
안서영,유세희,김경화,홍기형,Ahn, Seo-Yeong,Ryu, Se-Hui,Kim, Kyung-Wha,Hong, Ki-Hyung 한국음성학회 2022 말소리와 음성과학 Vol.14 No.3
Due to the popularization of smartphones, most of the recorded speech files submitted as evidence of recent crimes are produced by smartphones, and the integrity (forgery) of the submitted speech files based on smartphones is emerging as a major issue in the investigation and trial process. Samsung smartphones with the highest domestic market share are distributed with built-in speech recording applications that can record calls and voice, and can edit recorded speech. Unlike editing through third-party speech (audio) applications, editing by their own builtin speech applications has a high similarity to the original file in metadata structures and attributes, so more precise analysis techniques need to prove integrity. In this study, we constructed a speech file metadata database for speech files (original files) recorded by 34 Samsung smartphones and edited speech files edited by their built-in speech recording applications. We analyzed by comparing the metadata structures and attributes of the original files to their edited ones. As a result, we found significant metadata differences between the original speech files and the edited ones.
비대면 홈 트레이닝 서비스의 사용자 경험 향상을 위한 어플리케이션 프로토타입 개선 연구
박가현(Ga Hyun, Park),박상우(Sang Woo, Park),정희정(Hee Jung, Chung),유세희(Se Hui, Ryu),강효진(Hyo-Jin, Kang) 한국HCI학회 2022 한국HCI학회 학술대회 Vol.2022 No.2
본 연구에서는 비대면 서비스의 확대에 따라 기존 홈 트레이닝 서비스의 개선을 위한 어플리케이션 프로토타입을 연구하였다. 맥락적 조사의 과정에 따라 사용자의 서비스 이용 행태를 분석하고 그에 따른 니즈를 도출해내면서 사용자 경험을 중심으로 하여 비대면 상황 속에서도 효과적으로 이용할 수 있는 홈 트레이닝 서비스의 개선된 프로토타입 디자인을 제안한다. 본 연구를 통하여 새로워진 환경에서도 자신의 상황에 맞게 홈 트레이닝 서비스를 이용할 수 있는 경험을 제공하는 것에 기여할 수 있다.