Reading Notes: The Data Warehouse Toolkit 2nd
2010-06-18 15:39
375 查看
The key architectural requirement for the data staging area is that it is off-limits to business users and does not provide query and presentation services.
It is acceptable to create a normalized database to support the staging processes; however, this is not the end goal. The normalized structures must be off-limits to user queries because they defeat understandability and performance. As soon as a database supports query and presentation services, it must be considered part of the data warehouse presentation area. By default, normalized databases are excluded from the presentation area, which should be strictly dimensionally structured.
A row in a fact table corresponds to a measurement. A measurement is a row in a fact table. All the measurements in a fact table must be at the same grain.
The most useful facts in a fact table are numeric and additive.
Dimension tables are the entry points into the fact table. Robust dimension attributes deliver robust analytic slicing and dicing capabilities. The dimensions implement the user interface to the data warehouse.
A careful grain statement determines the primary dimensionality of the fact table. It is then often possible to add more dimensions to the basic grain of the fact table, where these additional dimensions naturally take on only one value under each combination of the primary dimensions. If the additional dimension violates the grain by causing additional fact rows to be generated, then the grain statement must be revised to accommodate this dimension.
You must avoid null keys in the fact table. A proper design includes a row in the corresponding dimension table to identify that the dimension is not applicable to the measurement.
It is acceptable to create a normalized database to support the staging processes; however, this is not the end goal. The normalized structures must be off-limits to user queries because they defeat understandability and performance. As soon as a database supports query and presentation services, it must be considered part of the data warehouse presentation area. By default, normalized databases are excluded from the presentation area, which should be strictly dimensionally structured.
A row in a fact table corresponds to a measurement. A measurement is a row in a fact table. All the measurements in a fact table must be at the same grain.
The most useful facts in a fact table are numeric and additive.
Dimension tables are the entry points into the fact table. Robust dimension attributes deliver robust analytic slicing and dicing capabilities. The dimensions implement the user interface to the data warehouse.
A careful grain statement determines the primary dimensionality of the fact table. It is then often possible to add more dimensions to the basic grain of the fact table, where these additional dimensions naturally take on only one value under each combination of the primary dimensions. If the additional dimension violates the grain by causing additional fact rows to be generated, then the grain statement must be revised to accommodate this dimension.
You must avoid null keys in the fact table. A proper design includes a row in the corresponding dimension table to identify that the dimension is not applicable to the measurement.
相关文章推荐
- The Data WarehouseETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delive
- Self Learning Note <The Data Warehouse ETL Toolkit> - Chapter 3 Extracting
- The Data Warehouse ETL Toolkit学习笔记-架构(数据流主线―数据访问)
- The Microsoft Data Warehouse Toolkit
- The Data Warehouse ETL Toolkit学习笔记-架构(规划与设计主线)
- The Data Warehouse ETL Toolkit学习笔记-需求
- 数据仓库工具箱:The Data Warehouse Toolkit
- The Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence T
- 数据仓库工具箱:The Data Warehouse Toolkit
- The Data Warehouse ETL Toolkit学习笔记-架构(数据流主线―数据管理)
- The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Second Edition)
- The Data Warehouse ETL Toolkit学习笔记-架构(数据流主线―数据访问)
- The Multiresolution Toolkit: Progressive Access for Regular Gridded Data
- SAP BW as an Enterprise Data Warehouse and How SAP HANA Changes the Game
- Building the Data Warehouse (3rd Edition)
- The Multiresolution Toolkit: Progressive Access for Regular Gridded Data
- (vtk——The Visualization Toolkit)Visulizing Image & Volume Data(图像及三维体数据的可视化)
- Building the Unstructured Data Warehouse: Architecture, Analysis, and Design
- 《Outlier Analysis 2nd Edition》- 1.2 The Data Model is Everything
- Building the Unstructured Data Warehouse: Architecture, Analysis, and Design