RISS 검색 - 학위논문 상세보기

국문 초록 (Abstract)

인터넷 발전으로 다른 사람들의 의견과 경험이 포한된 온라인 후기를 손쉽게 접할 수 있게 되었다. 후기는 제품과 서비스의 실질적 구매에 영향을 미치기 때문에 기업의 중요한 마케팅 전략으로 활용될 수 있다.
이에 언어에 포함된 의견과 태도와 같은 주관적인 정보를 탐지하는 감성분석 연구가 이루어졌다. 초기에는 감성 강도를 나타낸 감성사전을 통해 이루어졌으나 최근에는 딥러닝 기반 감성분석 연구가 증가하고 있다.
감성분석과 함께 말뭉치를 학습해 다음에 나올 단어 혹은 문장을 예측하는 언어모델에서도 딥러닝을 이용하고 있다. 한국어 문장을 이용한 언어모델 실험에서는 다양한 말뭉치를 활용하였지만, 감성분석은 형태소 단위 말뭉치만이 수행되었다.
본 논문에서는 다양한 한국어 말뭉치를 이용해 문장의 감성을 긍정 또는 부정으로 판별하는 감성분석을 시도한다. 기존 감성분석에 사용한 감성사전을 구축하지 않고 문장을 구성하는 말뭉치로만 감성을 예측한다. 선행연구로 형태소 단위 모형과 함께 한글의 가장 작은 단위인 음소와 어절 단위 모형을 구축해 각 입력단위 별 모형성능을 비교분석 한다.
음소 단위 모형은 LSTM, Vanilla RNN, GRU와 같은 RNN layer를 3개 포함하는 모형을 구성하였다. 형태소 및 어절 단위 모형은 Word2vec을 이용해 말뭉치를 분산표현 한 후 LSTM layer를 2개 또는 3개 포함하는 모형을 구축하였다. 성능평가로는 정확도 및 loss를 측정하였고 각 모형별 학습시간을 비교하였다.
실험결과로는 음소 단위 모형은 LSTM 성능이 가장 뛰어났으며 형태소 및 어절 단위 모형에서는 LSTM layer가 모형성능에 영향을 미치지 않은 것을 확인하였다. 또한, 학습시간은 가장 작은 단위로 분할하여 문장길이가 길어진 음소 단위 모형이 가장 크고 어절 단위 모형이 가장 작았다.
한국어 문장을 다양한 말뭉치로 학습한 감성분석 실험을 통해 영어에 비해 연구가 부족한 한글 기반 언어처리 및 음성인식 분야에 활용될 것으로 기대한다.

번역하기

인터넷 발전으로 다른 사람들의 의견과 경험이 포한된 온라인 후기를 손쉽게 접할 수 있게 되었다. 후기는 제품과 서비스의 실질적 구매에 영향을 미치기 때문에 기업의 중요한 마케팅 전략...

다국어 초록 (Multilingual Abstract)

Internet development makes access easy to the online review which contained opinions and experiences of others. Reviews can be utilized as an important marketing strategy for the company due to their actual influence on the product and service.
Sentiment analysis research was accomplished to sensor the subjective information like opinion and attitude in the words. Research was made by sentiment dictionary in the beginning, but glowing sentiment analysis research recently based on deep learning.
Language model, together with sentiment analysis, which can expect next words or sentences by studying corpus is using deep learning. In language model experiment using Korean sentences, various corpus is applied but sentiment analysis with morpheme only.
This study implement sentiment analysis to distinguish negative or positive using various Korean corpus. It forecast sentiment not by sentiment dictionary construction used current sentiment but by corpus only composed sentences. By construction for morpheme-level model and phoneme-level model which is the smallest unit in Korean and word-level model, analyze model performance by input unit.
Phoneme-level model was constructed 3 RNN layers like LSTM, Vanilla RNN, GRU. After morpheme and word-level model expressed dispersion and constructed the models which had 2 or 3 layers included. Measured accuracy and loss for performance evaluation and compared learning hours per model.
By experiment result, phoneme-level model was standing high above the others and confirmed not effect LSTM layer for model performance in morpheme and word-level model. Learning hours is the largest in phoneme-level model which the sentence becomes longer by divided in the smallest unit and the smallest in word-level model.
Through sentiment analysis experiment for the Korean sentences by the various corpus, expect implemented in language processing and speech recognition which have lack of research compared to English.

번역하기

목차 (Table of Contents)

제1장 서론 1
제2장 관련연구 4
2.1 감성분석 연구 4
2.2 신경망을 이용한 감성분석 6
2.3 Word2vec 10

제1장 서론 1
제2장 관련연구 4
2.1 감성분석 연구 4
2.2 신경망을 이용한 감성분석 6
2.3 Word2vec 10
2.4 RNN(Recurrent Neural Network) 12
2.4.1 Vanilla Recurrent Neural Network 12
2.4.2 LSTM(Long short-term memory) 14
2.4.3 GRU(Gated recurrent unit) 16
제3장 RNN을 이용한 한국어 감성분석 19
3.1 음소 단위 감성분석 모형 19
3.1.1 음소 단위 감성분석 모형의 전처리 20
3.1.2 학습데이터를 이용한 음소 단위 감성분석 모형학습 21
3.2 형태소 단위 감성분석 모형 22
3.3 어절 단위 감성분석 모형 24
제4장 실험 방법 26
4.1 데이터 수집 및 전처리 26
4.2 실험 종류 30
제5장 실험 결과 33
5.1 형태소 및 어절 단위 모형의 Word2vec 결과 33
5.2 정확도 및 loss 34
5.2.1 음소 단위 모형의 정확도 및 loss 34
5.2.2 형태소 단위 모형의 정확도 및 loss 36
5.2.3 어절 단위 모형의 정확도 및 loss 39
5.3 학습시간 41
제6장 결론 및 향후 연구계획 43
참 고 문 헌 46
Abstract 53

상세검색

RISS 보유자료

상세검색

해외전자자료

RNN을 이용한 한국어 감성분석 : 온라인 영화 후기를 중심으로

부가정보

분석정보

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료