RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      KCI등재

      Hierarchical Attention Network를 이용한 복합 장애 발생 예측 시스템 개발

      한글로보기

      https://www.riss.kr/link?id=A107198934

      • 0

        상세조회
      • 0

        다운로드
      서지정보 열기
      • 내보내기
      • 내책장담기
      • 공유하기
      • 오류접수

      부가정보

      다국어 초록 (Multilingual Abstract)

      The data center is a physical environment facility for accommodating computer systems and related components, and is an essential foundation technology for next-generation core industries such as big data, smart factories, wearables, and smart homes. ...

      The data center is a physical environment facility for accommodating computer systems and related components, and is an essential foundation technology for next-generation core industries such as big data, smart factories, wearables, and smart homes. In particular, with the growth of cloud computing, the proportional expansion of the data center infrastructure is inevitable. Monitoring the health of these data center facilities is a way to maintain and manage the system and prevent failure. If a failure occurs in some elements of the facility, it may affect not only the relevant equipment but also other connected equipment, and may cause enormous damage. In particular, IT facilities are irregular due to interdependence and it is difficult to know the cause.
      In the previous study predicting failure in data center, failure was predicted by looking at a single server as a single state without assuming that the devices were mixed. Therefore, in this study, data center failures were classified into failures occurring inside the server (Outage A) and failures occurring outside the server (Outage B), and focused on analyzing complex failures occurring within the server. Server external failures include power, cooling, user errors, etc. Since such failures can be prevented in the early stages of data center facility construction, various solutions are being developed. On the other hand, the cause of the failure occurring in the server is difficult to determine, and adequate prevention has not yet been achieved. In particular, this is the reason why server failures do not occur singularly, cause other server failures, or receive something that causes failures from other servers. In other words, while the existing studies assumed that it was a single server that did not affect the servers and analyzed the failure, in this study, the failure occurred on the assumption that it had an effect between servers.
      In order to define the complex failure situation in the data center, failure history data for each equipment existing in the data center was used. There are four major failures considered in this study: Network Node Down, Server Down, Windows Activation Services Down, and Database Management System Service Down. The failures that occur for each device are sorted in chronological order, and when a failure occurs in a specific equipment, if a failure occurs in a specific equipment within 5 minutes from the time of occurrence, it is defined that the failure occurs simultaneously. After configuring the sequence for the devices that have failed at the same time, 5 devices that frequently occur simultaneously within the configured sequence were selected, and the case where the selected devices failed at the same time was confirmed through visualization.
      Since the server resource information collected for failure analysis is in units of time series and has flow, we used Long Short-term Memory (LSTM), a deep learning algorithm that can predict the next state through the previous state. In addition, unlike a single server, the Hierarchical Attention Network deep learning model structure was used in consideration of the fact that the level of multiple failures for each server is different. This algorithm is a method of increasing the prediction accuracy by giving weight to the server as the impact on the failure increases. The study began with defining the type of failure and selecting the analysis target. In the first experiment, the same collected data was assumed as a single server state and a multiple server state, and compared and analyzed. The second experiment improved the prediction accuracy in the case of a complex server by optimizing each server threshold.
      In the first experiment, which assumed each of a single server and multiple servers, in the case of a single server, it was predicted that three of the five servers did not have a failure even though the actual failure occurred. However, assuming multiple servers

      더보기

      참고문헌 (Reference)

      1 최승호, "컴퓨팅 자원의 가용성을 보장하기 위한 기계 학습 기반의 실시간 장애 예측 프레임워크 연구" 대한전자공학회 56 (56): 63-76, 2019

      2 김휘경, "기계 학습을 활용한 마이닝 장비의 장애 예측" 한국IT정책경영학회 8 (8): 297-302, 2016

      3 Lawrence, A, "Uptime Institute data shows outages are common, costly, and preventable" Uptime Institute

      4 Glorot, X, "Understanding the difficulty of training deep feedforward neural networks" 9 : 249-256, 2010

      5 Zhang, R., "Time series prediction and anomaly detection of light curve using lstm neural network" 1061 (1061): 2018

      6 류창주, "PC 및 서버 상태관리를 위한 모니터링 시스템 개발에 관한 연구" 한국정보통신학회 20 (20): 1741-1746, 2016

      7 Chen. Y, "Outage Prediction and Diagnosis for Cloud Service Systems" 2659-2665, 2019

      8 Gers, F. A, "Neural Nets WIRN Vietri-01" Springer 193-200, 2002

      9 Ko, K. Y., "Monitoring of Wafer Dicing State by Using Back Propagation Algorithm" 6 (6): 486-491, 2000

      10 Hochreiter, S., "Long short-term memory" 9 (9): 1735-1780, 1997

      1 최승호, "컴퓨팅 자원의 가용성을 보장하기 위한 기계 학습 기반의 실시간 장애 예측 프레임워크 연구" 대한전자공학회 56 (56): 63-76, 2019

      2 김휘경, "기계 학습을 활용한 마이닝 장비의 장애 예측" 한국IT정책경영학회 8 (8): 297-302, 2016

      3 Lawrence, A, "Uptime Institute data shows outages are common, costly, and preventable" Uptime Institute

      4 Glorot, X, "Understanding the difficulty of training deep feedforward neural networks" 9 : 249-256, 2010

      5 Zhang, R., "Time series prediction and anomaly detection of light curve using lstm neural network" 1061 (1061): 2018

      6 류창주, "PC 및 서버 상태관리를 위한 모니터링 시스템 개발에 관한 연구" 한국정보통신학회 20 (20): 1741-1746, 2016

      7 Chen. Y, "Outage Prediction and Diagnosis for Cloud Service Systems" 2659-2665, 2019

      8 Gers, F. A, "Neural Nets WIRN Vietri-01" Springer 193-200, 2002

      9 Ko, K. Y., "Monitoring of Wafer Dicing State by Using Back Propagation Algorithm" 6 (6): 486-491, 2000

      10 Hochreiter, S., "Long short-term memory" 9 (9): 1735-1780, 1997

      11 문영식, "IoT 기반 실시간 냉장컨테이너 상태 모니터링 시스템" 한국정보통신학회 19 (19): 629-635, 2015

      12 Yang, "Hierarchical attention networks for document classification" 1480-1489, 2016

      13 Zichao, Y, "Hierarchical Attention Networks for Document Classification" 1480-1489, 2016

      14 Hua, Y., "Deep learning with long short-term memory for time series prediction" 57 (57): 114-119, 2019

      15 Dukic, V, "Beyond the mega-data center:networking multi-data center regions" 765-781, 2020

      16 Kingma, D. P, "Adam: A method for stochastic optimization" 2015

      17 Kim Y. S., "A Research on the Prediction of Computer System Fault based on Big Data" 5-, 2015

      18 Kim Y. S., "A Failure Prediction of Computer System using Deep Learning" 18-, 2016

      더보기

      동일학술지(권/호) 다른 논문

      동일학술지 더보기

      더보기

      분석정보

      View

      상세정보조회

      0

      Usage

      원문다운로드

      0

      대출신청

      0

      복사신청

      0

      EDDS신청

      0

      동일 주제 내 활용도 TOP

      더보기

      주제

      연도별 연구동향

      연도별 활용동향

      연관논문

      연구자 네트워크맵

      공동연구자 (7)

      유사연구자 (20) 활용도상위20명

      인용정보 인용지수 설명보기

      학술지 이력

      학술지 이력
      연월일 이력구분 이력상세 등재구분
      2027 평가예정 재인증평가 신청대상 (재인증)
      2021-01-01 평가 등재학술지 유지 (재인증) KCI등재
      2018-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2015-03-25 학회명변경 영문명 : 미등록 -> Korea Intelligent Information Systems Society KCI등재
      2015-03-17 학술지명변경 외국어명 : 미등록 -> Journal of Intelligence and Information Systems KCI등재
      2015-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2011-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2009-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2008-02-11 학술지명변경 한글명 : 한국지능정보시스템학회 논문지 -> 지능정보연구 KCI등재
      2007-01-01 평가 등재학술지 유지 (등재유지) KCI등재
      2004-01-01 평가 등재학술지 선정 (등재후보2차) KCI등재
      2003-01-01 평가 등재후보 1차 PASS (등재후보1차) KCI등재후보
      2001-07-01 평가 등재후보학술지 선정 (신규평가) KCI등재후보
      더보기

      학술지 인용정보

      학술지 인용정보
      기준연도 WOS-KCI 통합IF(2년) KCIF(2년) KCIF(3년)
      2016 1.51 1.51 1.99
      KCIF(4년) KCIF(5년) 중심성지수(3년) 즉시성지수
      1.78 1.54 2.674 0.38
      더보기

      이 자료와 함께 이용한 RISS 자료

      나만을 위한 추천자료

      해외이동버튼