RISS 검색 - 학위논문

내보내기
내책장담기
한글로보기

정확도순

내림차순

내림차순

10개씩 출력

1
시계열 빅데이터 분류 및 클러스터링 기법과 응용

오규협 경희대학교 대학원 2017 국내박사

RANK : 247631
디지털 기기의 발전으로 다양한 정보가 생성되고 있으며 매해 생성되는 데이터는 증가하고 있다. 다양한 종류로 생성되는 모든 데이터는 분석을 위해 수집 및 저장하고 이러한 데이터들을 ‘빅데이터’라고 한다. 빅데이터의 발전으로 시계열 데이터(Time-Series) 또한 다양하게 생성 및 수집되고 있다. 시계열이란 일정간격으로 배치된 데이터의 수열을 의미하며 일정간격으로 수집되는 데이터를 시계열 데이터라고 한다. 하지만 데이터 수집 기간에 늘어남에 따라 크기가 증가하고 데이터의 종류에 따라 특징이 다른 특성으로 인해 특징을 추출하기는 어렵다. 본 논문에서는 시계열 데이터를 분석하기 위해서 분류 및 클러스터링 기법과 지리적 클러스터링 기법을 제시하였다. 시계열 데이터의 분류를 위하여 CNN 기반 모형과 1-NN DTW 기반 모형을 제안하였고 제안된 모형을 서울시 대중교통 데이터와 공공데이터를 사용하여 검증하였다. 시계열 데이터의 클러스터링을 위하여 대칭형 쿨백 라이블러와 K-메도이드를 활용한 기법을 제시하였다. 제시된 기법을 서울시 대중교통 데이터를 사용하여 분석하였다. 마지막으로 이동거리와 이동량을 기반으로 한 지리적 클러스터링 기법을 제안하였고 서울시 대중교통 데이터로 분석하였다. With the development of digital devices, a variety of information is being generated and the amount of data generated each year is increasing. All data generated in various categories are collected and stored for analysis, and these data are called 'big data'. With the development of big data, time-series data are also variously generated and collected. Time series refers to a sequence of data arranged at regular intervals, and data collected at regular intervals is called time series data. However, as the data collection period increases then the size increases and it is difficult to extract the feature due to the different characteristics depending on the type of data. In this paper, classification and clustering techniques and geographic clustering techniques are presented to analyze time series data. In order to classify time series data, a CNN-based model and a 1-NN DTW-based model were proposed and the proposed model was verified using public transportation data and public data in Seoul. In order to clustering the time series data, a scheme using symmetric Kullback Libler and K-medoids was proposed. The proposed technique was analyzed using public transportation data of Seoul. Finally, we propose a geographic clustering method based on travel distance and travel distance and analyze it with Seoul public transport data.

내보내기
내책장담기
한글로보기

정확도순

내림차순

내림차순

10개씩 출력

맨처음 페이지로 1 맨끝 페이지로

상세검색

RISS 보유자료

상세검색

해외전자자료

연관 검색어 추천