RISS 검색 - 학위논문 상세보기

국문 초록 (Abstract)

데이터 마이닝은 대용량의 데이터에 숨겨진 의미있고 유용한 패턴과 상관관계를 추출하여 의사결정에 활용하는 작업이다. 그 중에서도 고객 트랜잭션의 데이터베이스에서 아이템(item) 사이에 존재하는 연관규칙을 찾는 것은 중요한 일이 되었다. Apriori 알고리즘 이후 연관규칙을 찾기 위해 대용량의 데이터베이스로부터 압축된 의미있는 정보를 저장하기 위한 데이터 구조와 알고리즘들이 많이 제안되어 왔다. 연관규칙을 발견하기 위한 기존의 연구들은 모든 규칙을 찾아내지만, 사람이 분석하기에 너무 많은 규칙이 생성되기 때문에 규칙을 분석하기 위한 일 또한 많은 과정을 거쳐야 한다.
본 논문에서는 빈발 패턴 네트워크(Frequent Pattern Network)라 부르는 자료 구조를 제안하고 이를 활용하였다. 네트워크는 정점과 간선으로 구성되며 정점은 아이템을 표현하고, 간선은 두 아이템 집합을 표현한다. 아이템의 빈도수를 이용하여 빈발 패턴 네트워크를 구성하고, 아이템 사이의 유사도를 측정하여 클러스터 내의 아이템과는 유사도가 높고, 다른 클러스터의 아이템과는 유사도가 낮도록 클러스터를 생성한다. 클러스터를 이용해 연관규칙을 생성하고 실험을 통해 Apriori와 FP Growth 알고리즘과의 성능 비교를 하였다. 실험을 통해 빈발 패턴 네트워크에서 신뢰도 유사도를 이용하는 것이 클러스터의 정확성을 높여줌을 볼 수 있었다. 그리고 전통적인 방법과 비교를 통해 빈발 패턴 네트워크를 이용하는 것이 최소지지도에 유연성을 가짐을 알 수 있었다.

번역하기

데이터 마이닝은 대용량의 데이터에 숨겨진 의미있고 유용한 패턴과 상관관계를 추출하여 의사결정에 활용하는 작업이다. 그 중에서도 고객 트랜잭션의 데이터베이스에서 아이템(item) 사이...

다국어 초록 (Multilingual Abstract)

Data mining is defined as the process of discovering meaningful and useful pattern in large volumes of data. In particular, finding associations rules between items in a database of customer transactions has become an important thing. Some data structures and algorithms had been proposed for storing meaningful information compressed from an original database to find frequent itemsets since Apriori algorithm. Though existing method find all association rules, we must have a lot of process to analyze association rules because there are too many rules. In this paper, we propose a new data structure, called a Frequent Pattern Network (FPN), which represents items as vertices and 2-itemsets as edges of the network. In order to utilize FPN, We constitute FPN using item's frequency. And then we use a clustering method to group the vertices on the network into clusters so that the intracluster similarity can be maximized and the intercluster similarity can be minimized. We generate association rules based on clusters.
Our experiments showed accuracy of clustering items on the network using confidence, correlation and edge weight similarity methods. And We generated association rules using clusters and compare traditional and our method. From the results, the confidence similarity had a strong influence than others on the frequent pattern network. And FPN had a flexibility to minimum support value.

번역하기

목차 (Table of Contents)

제1장 서론 = 1
제2장 연관규칙 마이닝 = 3
2.1 연관규칙 = 3
2.2 연관규칙 마이닝 알고리즘 = 3
제3장 빈발 패턴 네트워크 = 6

제1장 서론 = 1
제2장 연관규칙 마이닝 = 3
2.1 연관규칙 = 3
2.2 연관규칙 마이닝 알고리즘 = 3
제3장 빈발 패턴 네트워크 = 6
3.1 빈발 패턴 네트워크의 구성 = 7
3.2 연결된 경로 = 8
3.3 중첩수 = 9
3.4 경로지지도 = 10
제4장 클러스터링을 통한 연관규칙 발견 = 12
4.1 클러스터링 알고리즘 = 12
4.2 연관규칙 발견 = 17
제5장 실험 및 결과 = 18
5.1 실험 환경 및 데이터 집합 = 18
5.2 실험 평가 방법 = 19
5.3 실험 결과 및 평가 = 20
제6장 결론 및 향후 연구 = 27
참고문헌 = 29

상세검색

RISS 보유자료

상세검색

해외전자자료

빈발 패턴 네트워크에서 아이템 클러스터링을 이용한 연관규칙 발견

부가정보

분석정보

연관 공개강의(KOCW)

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료