RISS 검색 - 국내학술지논문 상세보기

국문 초록 (Abstract)

공간자료는 필연적으로 공간적 이질성을 내포하고 있다. 공간적 이질성의 존재는 연구 지역이 동일 모집단을 가진다는 가정을 만족하기 어렵게 하며, 이는 국지적으로 다른 기계학습 모형의 생성 필요성과 연결된다. 서울시 지하철 승하차 인원에 있어서도 공간적 이질성이 존재하며, 공간적으로 유사한 지역의 구성은 기계학습 모형의 예측 정확도 향상에 기여할 수 있다. 본 연구는 공간 제약 위계적 군집 방법으로 생성된 군집별 기계학습 모형의 추정이 기계학습 예측 정확도 향상에 미치는 영향을 탐색한다. 구체적으로 군집의 수를 변화시키며, 군집 수에 따른 예측 정확도를 각 군집별 생성된 모형과 전체 모형 간 평균 제곱근 오차를 이용하여 비교하였다. 본 연구에서 사용한 기계학습 기법은 선형회귀모형, 랜덤포레스트, 그래디언트부스팅이며, 반응변수는 지하철 승하차 인원을 사용하였다. 그 결과 대부분의 기법에서 군집의 수가 증가할수록 군집 모형의 예측 정확도가 전체 모형보다 높은 경우가 증가하는 경향을 보였다. 그러나 군집의 수가 일정 수 이상인 경우 그 경향성이 일정하게 유지되어, 예측 정확도 향상을 위해서 적절한 군집 수의 추정이 요구되는 것을 확인하였다. 본 연구는 공간 제약 위계적 군집 분석과 기계학습 기반 지하철 승하차 인원 예측 모형을 결합하여 공간적 군집 기반 모형 추정이 예측 정확도에 미치는 영향을 탐색하였다는 점에서, 공간 기계학습 발전의 기초 연구로 활용될 것으로 기대한다.

번역하기

공간자료는 필연적으로 공간적 이질성을 내포하고 있다. 공간적 이질성의 존재는 연구 지역이 동일 모집단을 가진다는 가정을 만족하기 어렵게 하며, 이는 국지적으로 다른 기계학습 모형...

다국어 초록 (Multilingual Abstract)

Spatial data inherently contains spatial heterogeneity. The existence of spatial heterogeneity often violates the assumption that sub-study areas have the same population, necessitating the creation of local machine learning models. Subway ridership also exhibits spatial heterogeneity, and creating local models based on the composition of spatially similar regions could improve prediction accuracies. This study explores the impact of spatially constrained hierarchical clustering on the prediction accuracy of machine learning models. Specifically, the prediction accuracies are compared between models created for each cluster and the entire model by varying the number of clusters. Subway ridership is utilized as the response variable, and the machine learning techniques employed are linear regression, random forests, and gradient boosting. The results indicate that as the number of clusters increases, the prediction accuracy of the cluster model tends to exceed that of the entire model across most techniques. However, when the number of clusters exceeds a certain threshold, this tendency remains constant, highlighting the necessity of estimating the appropriate number of clusters to improve prediction accuracy. This study is expected to provide a research base for enhancing spatial machine learning by investigating the impact spatially constrained hierarchical cluster-based machine learning models.

번역하기

상세검색

RISS 보유자료

상세검색

해외전자자료

군집 수에 따른 공간 제약 위계적 군집 기반 기계학습 예측 정확도 변화 = Varying Prediction Accuracies of Spatially Constrained Hierarchical Clustering-Based Machine Learning with Different Numbers of Clusters

부가정보

동일학술지(권/호) 다른 논문

분석정보

연관 공개강의(KOCW)

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료