Inseon Jang, Yong-Ju Lee, Dae Young Jang, Kyeongok Kang. The Korean Institute of Broadcast and Media Engineers, 2008, Proceedings of the Korean Society of Broadcast Engineers Conference, Vol.2008 No.-
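When listening to audio through headphones or earphones, the sound image is often localized inside the head, a phenomenon known as inside-head localization (IHL). When the sound image forms around or inside the head, the sense of space and depth is diminished, which reduces the realism of the sound and increases listening fatigue. Sound externalization is the technique of removing this effect so that, during headphone or earphone listening, the sound image is localized outside the head (out-of-head localization; OHL). Based on the experimental finding that externalization is possible when a room impulse response is generated in conjunction with directional cues, existing sound externalization methods have built their externalization filters from generic HRTFs (head-related transfer functions). This paper proposes a sound source externalization method that uses an externalization filter modeled on multichannel room impulse responses recorded with a spherical microphone. Experiments and analysis of the results demonstrate the algorithm's superior externalization performance for frontal sound sources and confirm how well the sound image of the original signal is preserved after the externalization algorithm is applied.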
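As a rough illustration of filtering a signal with room impulse responses for headphone playback, the sketch below convolves one dry signal with a left-ear and a right-ear impulse response. It is not the externalization filter modeled in the paper: the function names `convolve` and `render_binaural` and the toy impulse responses are assumptions made for this example.

```python
from typing import List, Tuple


def convolve(signal: List[float], impulse_response: List[float]) -> List[float]:
    """Direct-form linear convolution of a signal with an impulse response."""
    if not signal or not impulse_response:
        return []
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def render_binaural(
    signal: List[float], ir_left: List[float], ir_right: List[float]
) -> Tuple[List[float], List[float]]:
    """Filter one dry signal with a pair of (hypothetical) per-ear impulse responses."""
    return convolve(signal, ir_left), convolve(signal, ir_right)


# Toy example: the right ear receives a delayed, attenuated copy.
left, right = render_binaural([1.0, 2.0], [1.0, 0.5], [0.0, 0.5])
```

Direct convolution keeps the sketch dependency-free; for impulse responses of realistic length, FFT-based convolution would normally be used instead.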
Inseon Jang, ChungHyun Ahn, Jeongil Seo, Younseon Jang. The Korean Institute of Broadcast and Media Engineers, 2017, Journal of Broadcast Engineering, Vol.22 No.5
In this paper, we propose a DNN based speech detection system using acoustic characteristics and context information of media audio. The speech detection for discriminating between speech and non-speech included in the media audio is a necessary preprocessing technique for effective speech processing. However, since the media audio signal includes various types of sound sources, it has been difficult to achieve high performance with the conventional signal processing techniques. The proposed method improves the speech detection performance by separating the harmonic and percussive components of the media audio and constructing the DNN input vector reflecting the acoustic characteristics and context information of the media audio. In order to verify the performance of the proposed system, a data set for speech detection was made using more than 20 hours of drama, and an 8-hour Hollywood movie data set, which was publicly available, was further acquired and used for experiments. In the experiment, it is shown that the proposed system provides better performance than the conventional method through the cross validation for two data sets.
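One common way to give a frame-level classifier context information is to concatenate each feature frame with its neighbours before feeding it to the network. The sketch below shows that idea only; the abstract does not specify how the input vector is built, so the function name `stack_context`, the symmetric window, and the edge-repetition padding are all assumptions.

```python
from typing import List


def stack_context(frames: List[List[float]], context: int) -> List[List[float]]:
    """Concatenate each frame with `context` neighbours on both sides.

    Frames beyond the start or end of the sequence are replaced by the
    nearest edge frame, so every output vector has the same length.
    """
    if not frames:
        return []
    last = len(frames) - 1
    stacked = []
    for t in range(len(frames)):
        vec: List[float] = []
        for offset in range(-context, context + 1):
            idx = min(max(t + offset, 0), last)  # clamp to valid frame range
            vec.extend(frames[idx])
        stacked.append(vec)
    return stacked


# Three one-dimensional feature frames, one frame of context on each side.
features = [[1.0], [2.0], [3.0]]
stacked = stack_context(features, context=1)
```

In a setup like the one described, the per-frame features could be computed separately from the harmonic and percussive components and joined before stacking; that arrangement is likewise an assumption here.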
Inseon Jang, Seungkwon Beack, Jeongil Seo, Dae-young Jang. The Korean Institute of Broadcast and Media Engineers, 2006, Journal of Broadcast Engineering, Vol.11 No.2
Low-bitrate multichannel audio coding technology needs to be developed to meet growing consumer demand for multichannel audio content and services. To meet this requirement, MPEG has standardized MPEG Surround. In this paper, we describe the status of MPEG Surround standardization and analyze the techniques adopted in the current MPEG Surround.
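A Survey and Analysis of Audio Description Acceptance for Improving Media Access Rights of the Visually Impaired
Inseon Jang, ChungHyun Ahn, Jeongil Seo, Eun Ha Lee, Wan Sic Kang. The Korean Institute of Broadcast and Media Engineers, 2017, Journal of Broadcast Engineering, Vol.22 No.2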
For people with physical or sensory limitations, broadcasting is the main means of acquiring information and enjoying leisure. Recent changes in the media environment, such as the convergence of broadcasting and communication, the digital and mobile transition of broadcasting, and more active media usage by users, have made broadcasting harder for the disabled to access, and as a result the information gap between the disabled and the non-disabled is widening. A notice on broadcasting rights for the disabled was enacted following the amendment of the Broadcasting Law in July 2011, and the web accessibility guideline became more effective with the amendment of the National Informatization Act in 2013, so a legal basis for the media access rights of the disabled has been established. However, media services for them are still lacking in both quantity and quality. In this study, we describe the present status of the audio description service for the visually impaired and analyze the results of a questionnaire survey of 100 visually impaired people on their usage of, satisfaction with, and requested improvements to the audio description service.
Inseon Jang, ChungHyun Ahn, Younseon Jang. The Korean Institute of Broadcast and Media Engineers, 2014, Journal of Broadcast Engineering, Vol.19 No.3
This paper addresses the problem of non-dialog section detection for DVS authoring, the goal of which is to find meaningful sections of broadcast audio where audio description can be inserted. Because broadcast audio contains a variety of sounds, the method first discriminates between speech and non-speech for each audio frame. The proposed method jointly exploits the inter-channel structure and the speech source characteristics of stereo broadcast audio. Rule-based post-processing is then applied to detect non-dialog sections whose length is appropriate for audio description. The proposed method provides more accurate detection than the conventional method. Experimental results on real broadcast content show the qualitative superiority of the proposed method.
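The post-processing step can be pictured as a scan over per-frame speech/non-speech decisions that keeps only non-speech runs long enough to hold a description. The sketch below uses a single minimum-duration rule; the paper's actual rules are not given in the abstract, and the name `find_non_dialog_sections` and its parameters are assumptions.

```python
from typing import List, Sequence, Tuple


def find_non_dialog_sections(
    is_speech: Sequence[bool], frame_sec: float, min_sec: float
) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds of non-speech runs lasting at least min_sec."""
    sections: List[Tuple[float, float]] = []
    start = None
    # A trailing True acts as a sentinel so a run reaching the end is closed.
    for i, speech in enumerate(list(is_speech) + [True]):
        if not speech and start is None:
            start = i  # a non-speech run begins
        elif speech and start is not None:
            if (i - start) * frame_sec >= min_sec:
                sections.append((start * frame_sec, i * frame_sec))
            start = None
    return sections


# One-second frames: only the three-frame gap meets the two-second minimum.
labels = [True, False, False, False, True, False, True]
sections = find_non_dialog_sections(labels, frame_sec=1.0, min_sec=2.0)
```

A fuller rule set might also merge runs separated by very short speech bursts before applying the length threshold.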
Inseon Jang, Jeongil Seo, Seungkwon Beack, Kyeongok Kang. The Korean Institute of Broadcast and Media Engineers, 2005, Journal of Broadcast Engineering, Vol.10 No.3
Low-bitrate multichannel audio coding technology is being standardized in response to growing consumer demand for multichannel audio content. In this paper, we propose sound source location cue coding (SSLCC), which compresses multichannel audio to very low bitrates suitable for narrow-bandwidth transmission environments. To improve on the compression capability of conventional binaural cue coding (BCC), SSLCC adopts virtual source location information (VSLI) as a spatial cue parameter, together with a symmetric uniform quantizer and a Huffman coder. Objective and subjective assessment results show that SSLCC provides a lower bitrate and better audio quality than the conventional BCC method.
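A symmetric uniform quantizer maps a value to the nearest multiple of a fixed step, with the same number of levels on either side of zero. The sketch below shows the general technique only; the step size, index range, and the names `quantize` and `dequantize` are assumptions, not the parameters used for VSLI in SSLCC.

```python
def quantize(value: float, step: float, max_index: int) -> int:
    """Map a value to a signed index in [-max_index, max_index] using a uniform step."""
    index = int(round(value / step))
    return max(-max_index, min(max_index, index))  # clip to the symmetric range


def dequantize(index: int, step: float) -> float:
    """Reconstruct the quantized value from its index."""
    return index * step


# A hypothetical angle cue of 7.0 degrees with a 3-degree step and 11 levels.
idx = quantize(7.0, step=3.0, max_index=5)
approx = dequantize(idx, step=3.0)
```

Because the indices cluster around zero for many signals, an entropy coder such as a Huffman coder can then assign shorter codes to the most frequent indices.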