RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      KCI등재

      온톨로지 지식 기반 특성치를 활용한 Bidirectional LSTM-CRF 모델의 시퀀스 태깅 성능 향상에 관한 연구

      한글로보기

      https://www.riss.kr/link?id=A105233038

      • 0

        상세조회
      • 0

        다운로드
      서지정보 열기
      • 내보내기
      • 내책장담기
      • 공유하기
      • 오류접수

      부가정보

      다국어 초록 (Multilingual Abstract)

      This paper proposes a methodology applying sequence tagging methodology to improve the performance of NER(Named Entity Recognition) used in QA system. In order to retrieve the correct answers stored in the database, it is necessary to switch the user...

      This paper proposes a methodology applying sequence tagging methodology to improve the performance of NER(Named Entity Recognition) used in QA system. In order to retrieve the correct answers stored in the database, it is necessary to switch the user’s query into a language of the database such as SQL(Structured Query Language). Then, the computer can recognize the language of the user. This is the process of identifying the class or data name contained in the database. The method of retrieving the words contained in the query in the existing database and recognizing the object based on the index does not identify the homophone and the word phrases because it does not consider the context of the user’s query. If there are multiple search results, all of them are returned as a result, so there can be many interpretations on the query and the time complexity for the calculation becomes large. In addition, in the existing QA system, chunking has a higher probability of selecting the longest phrase among the sentence component phrases retrieved after obtaining a subset of all cases that can come from the user query. This chunking process has a problem that the algorithm is complicated, but it is not always correct according to the query. To overcome these, this study aims to solve this problem by reflecting the contextual meaning of the query using neural network-based methodology and to identify the problem of the neural network based methodology. Context-sensitive tagging combined with CRF and Bidirectional LSTM can be used to supplement the LSTM-based model, which can limit long-term memory problems, but can be biased toward recent input because of the data nature of word embedding. Therefore, we have solved the disadvantages of the neural network model by introducing the latest technology Bidirectional LSTM-CRF model in Sequence Tagging field. We used reasoning that reflects context by using ontology-based characteristic values for untrained words. In case that untrained words come in, we store the object name recognition tag information obtained from the ontology knowledge base in the Lucene index DB as the ontology knowledge based feature. and In this paper, we propose a neural network model based on ontology knowledge based feature.
      Experiments were conducted on the ontology knowledge base of music domain and the performance was evaluated. In order to accurately evaluate the performance of the L-Bidirectional LSTM-CRF proposed in this study, we experimented with converting the words included in the learned query into untrained words in order to test whether the words were included in the database but correctly identified the untrained words. Through experimentation, it was confirmed that tag recognition based on the context of the homophone and chunking is possible. In addition, we could apply the features that match the data type by using the CRF property which can use user defined function as a variable. As a result, it was possible to recognize objects considering the context and can recognize the untrained words without re-training the L-Bidirectional LSTM-CRF model, and it is confirmed that the performance of the object recognition as a whole is improved. Therefore, the proposed L-Bidirectional LSTM-CRF methodology can be applied to the ontology knowledge base of various fields to solve the object name recognition problem. However, the proposed L-Bidirectional LSTM-CRF method performed better than the conventional LSTM-CRF method, but lacked the ability to process homophones compared to the conventional Bidirectional LSTM-CRF. This is due to the fact that the hypothesis of this study inserts a feature that can infer a pattern of unknown words based on the entity name pattern of trained queries. This is because the weight increases linearly with the inclusion in the training data. In future research, it is necessary to experiment with various methods of applying weights so that homophones can be found well while ma

      더보기

      참고문헌 (Reference)

      1 ZHANG, Xiang., "Text understanding from scratch"

      2 GRAVES, Alex., "Speech recognition with deep recurrent neural networks" IEEE 6645-6649, 2013

      3 LAMPLE, Guillaume, "Neural Architectures for Named Entity Recognition" 2016

      4 HOCHREITER, Sepp., "Long short-term memory" 9 (9): 1735-1780, 1997

      5 GRAVES, Alex., "Framewise phoneme classification with bidirectional LSTM and other neural network architectures" 18 (18): 602-610, 2005

      6 ELMAN, Jeffrey L., "Finding structure in time" 14 (14): 179-211, 1990

      7 LING, Wang, "Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation" 1520-1530, 2015

      8 ZHOU, Jie., "End-to-end learning of semantic role labeling using recurrent neural networks" 1127-1137, 2015

      9 MIKOLOV, Tomas, "Efficient estimation of word representations in vector space"

      10 LAFFERTY, John., "Conditional random fields: Probabilistic models for segmenting and labeling sequence data" 2001

      1 ZHANG, Xiang., "Text understanding from scratch"

      2 GRAVES, Alex., "Speech recognition with deep recurrent neural networks" IEEE 6645-6649, 2013

      3 LAMPLE, Guillaume, "Neural Architectures for Named Entity Recognition" 2016

      4 HOCHREITER, Sepp., "Long short-term memory" 9 (9): 1735-1780, 1997

      5 GRAVES, Alex., "Framewise phoneme classification with bidirectional LSTM and other neural network architectures" 18 (18): 602-610, 2005

      6 ELMAN, Jeffrey L., "Finding structure in time" 14 (14): 179-211, 1990

      7 LING, Wang, "Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation" 1520-1530, 2015

      8 ZHOU, Jie., "End-to-end learning of semantic role labeling using recurrent neural networks" 1127-1137, 2015

      9 MIKOLOV, Tomas, "Efficient estimation of word representations in vector space"

      10 LAFFERTY, John., "Conditional random fields: Probabilistic models for segmenting and labeling sequence data" 2001

      11 KIM, Yoon, "Character-aware neural language models" 2015

      12 HUANG, Zhiheng., "Bidirectional LSTM-CRF Models for Sequence Tagging;Named entity recognition with bidirectional LSTM-CNNs"

      더보기

      동일학술지(권/호) 다른 논문

      분석정보

      View

      상세정보조회

      0

      Usage

      원문다운로드

      0

      대출신청

      0

      복사신청

      0

      EDDS신청

      0

      동일 주제 내 활용도 TOP

      더보기

      주제

      연도별 연구동향

      연도별 활용동향

      연관논문

      연구자 네트워크맵

      공동연구자 (7)

      유사연구자 (20) 활용도상위20명

      인용정보 인용지수 설명보기

      학술지 이력

      학술지 이력
      연월일 이력구분 이력상세 등재구분
      2027 평가예정 재인증평가 신청대상 (재인증)
      2021-01-01 평가 등재학술지 유지 (재인증) KCI등재
      2018-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2015-03-25 학회명변경 영문명 : 미등록 -> Korea Intelligent Information Systems Society KCI등재
      2015-03-17 학술지명변경 외국어명 : 미등록 -> Journal of Intelligence and Information Systems KCI등재
      2015-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2011-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2009-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2008-02-11 학술지명변경 한글명 : 한국지능정보시스템학회 논문지 -> 지능정보연구 KCI등재
      2007-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2004-01-01 평가 등재학술지 선정 (등재후보2차) KCI등재
      2003-01-01 평가 등재후보 1차 PASS (등재후보1차) KCI등재후보
      2001-07-01 평가 등재후보학술지 선정 (신규평가) KCI등재후보
      더보기

      학술지 인용정보

      학술지 인용정보
      기준연도 WOS-KCI 통합IF(2년) KCIF(2년) KCIF(3년)
      2016 1.51 1.51 1.99
      KCIF(4년) KCIF(5년) 중심성지수(3년) 즉시성지수
      1.78 1.54 2.674 0.38
      더보기

      이 자료와 함께 이용한 RISS 자료

      나만을 위한 추천자료

      해외이동버튼