Star Schema is widely used for OLAP(On-Line Analytical Processing) systems to represent multidimensional data in relational database management systems which can be used to analyze for an enterprise's decision making. The star schema consists of a fac...
Star Schema is widely used for OLAP(On-Line Analytical Processing) systems to represent multidimensional data in relational database management systems which can be used to analyze for an enterprise's decision making. The star schema consists of a fact table and a set of dimension tables surrounding the fact table. Many researchers have investigated the methodologies for designing the Star Schema from the source data in relational databases. Meanwhile, XML is emerging as a new standard representing data and documents in the Internet. We also need to analyze a huge amount of data represented by XML files, thus we present an algorithm, XML2Star, for designing Star Schema from XML source data. The XML2Star extracts the components of star schema, which are measurements (i.e., facts), dimensions, and dimension hierarchies, from DTDs in XML documents. Based on th star schema, this algorithm also builds star schema instances (i.e., multidimensional cube) from XML documents. The star schema instances will be used for users' multidimensional analyses by using OLAP tools. Therefore, the proposed algorithm can help user deriving star schema from XML source data in the Internet easily and analyzing the schema over various dimensions effectively.