47,95 €*
Versandkostenfrei per Post / DHL
Lieferzeit 2-3 Wochen
As the most influential thought leaders in the data warehousing and business intelligence industry, the Kimball Group has developed pioneering techniques that serve as industry standards for DW/BI system design, development, and management. With this new edition of their bestseller, veteran experts from the Kimball Group get you up to speed with using the new Business Intelligence release of SQL Server: SQL Server 2008 R2. Covering the complete suite of data warehousing and BI tools that are part of SQL Server 2008 R2, the authors follow the full project lifecycle, including design, development, deployment, and maintenance.
In addition to a substantial update to the content of the previous edition as well as a look at the new features and functionality of SQL Server 2008 R2 such as PowerPivot and Master Data Services, this new version includes detailed examples that illustrate how to best apply the techniques described in the book. The authors share their own trial-and-error experiences of building a DW/BI system with the Microsoft tools so that you can benefit from their challenges and successes. You'll also discover how using the Kimball Lifecycle to build your DW/BI system encourages you to zero in on four primary principles: focus on the business, build an information infrastructure, deliver in meaningfulincrements, and deliver the entire solution. With these principles in hand, you'll be well on your way to building a successful DW/BI system that supports business intelligence requirements common to most organizations.
No matter your role in the process of working on a DW/BI project, The Microsoft Data Warehouse Toolkit, Second Edition shows you how to:
Focus your efforts on the best opportunities
Select, install, and configure the appropriate components of SQL Server 2008 R2 and other Microsoft products
Design the dimensional model and create the database structures for the relational and Analysis Services databases
Build an ETL System in Integration Services to populate the DW/BI databases
Build the BI applications and data mining models to deliver value to your organization
Manage, secure, and grow the DW/BI system for long-term success
With SQL Server 2008 R2 and the Microsoft Business Intelligence Toolset
As the most influential thought leaders in the data warehousing and business intelligence industry, the Kimball Group has developed pioneering techniques that serve as industry standards for DW/BI system design, development, and management. With this new edition of their bestseller, veteran experts from the Kimball Group get you up to speed with using the new Business Intelligence release of SQL Server: SQL Server 2008 R2. Covering the complete suite of data warehousing and BI tools that are part of SQL Server 2008 R2, the authors follow the full project lifecycle, including design, development, deployment, and maintenance.
In addition to a substantial update to the content of the previous edition as well as a look at the new features and functionality of SQL Server 2008 R2 such as PowerPivot and Master Data Services, this new version includes detailed examples that illustrate how to best apply the techniques described in the book. The authors share their own trial-and-error experiences of building a DW/BI system with the Microsoft tools so that you can benefit from their challenges and successes. You'll also discover how using the Kimball Lifecycle to build your DW/BI system encourages you to zero in on four primary principles: focus on the business, build an information infrastructure, deliver in meaningfulincrements, and deliver the entire solution. With these principles in hand, you'll be well on your way to building a successful DW/BI system that supports business intelligence requirements common to most organizations.
No matter your role in the process of working on a DW/BI project, The Microsoft Data Warehouse Toolkit, Second Edition shows you how to:
Focus your efforts on the best opportunities
Select, install, and configure the appropriate components of SQL Server 2008 R2 and other Microsoft products
Design the dimensional model and create the database structures for the relational and Analysis Services databases
Build an ETL System in Integration Services to populate the DW/BI databases
Build the BI applications and data mining models to deliver value to your organization
Manage, secure, and grow the DW/BI system for long-term success
With SQL Server 2008 R2 and the Microsoft Business Intelligence Toolset
Warren Thornthwaite has been building decision support and data warehousing systems since 1980 and is a member of the Kimball Group.
Ralph Kimball, PhD, is known worldwide as an innovator, writer, educator, speaker, and consultant in the field of data warehousing. He is the founder of the Kimball Group ([...] which provides data warehouse consulting and education.
Foreword xxvii
Introduction xxix
Part 1 Requirements, Realities, and Architecture 1
Chapter 1 Defining Business Requirements 3
The Most Important Determinant of Long-Term Success 5
Adventure Works Cycles Introduction 6
Uncovering Business Value 6
Obtaining Sponsorship 7
Defining Enterprise-Level Business Requirements 8
Prioritizing the Business Requirements 22
Revisiting the Project Planning 25
Gathering Project-Level Requirements 26
Summary 28
Chapter 2 Designing the Business Process Dimensional Model 29
Dimensional Modeling Concepts and Terminology 30
Facts 31
Dimensions 33
Bringing Facts and Dimensions Together 34
The Bus Matrix, Conformed Dimensions, and Drill Across 36
Additional Design Concepts and Techniques 38
Surrogate Keys 38
Slowly Changing Dimensions 39
Dates 42
Degenerate Dimensions 43
Snowflaking 43
Many-to-Many or Multivalued Dimensions 44
Hierarchies 47
Aggregate Dimensions 49
Junk Dimensions 51
The Three Fact Table Types 52
Aggregates 53
The Dimensional Modeling Process 54
Preparation 55
Data Profiling and Research 60
Building Dimensional Models 63
Developing the Detailed Dimensional Model 66
Testing and Refining the Model 68
Reviewing and Validating the Model 68
Case Study: The Adventure Works Cycles Orders Dimensional Model 69
The Orders Fact Table 69
The Dimensions 69
Identifying Dimension Attributes and Facts for the Orders Business Process 72
The Final Draft of the Initial Orders Model 74
Detailed Orders Dimensional Model Development 75
Final Dimensional Model 77
Summary 77
Chapter 3 The Toolset 79
The Microsoft DW/BI Toolset 80
Why Use the Microsoft Toolset? 82
Architecture of a Microsoft DW/BI System 83
Why Analysis Services? 84
Why a Relational Store? 86
ETL Is Not Optional 86
The Role of Master Data Services 88
Delivering BI Applications 88
Overview of the Microsoft Tools 89
Which Products Do You Need? 90
SQL Server Development and Management Tools 92
Summary 97
Chapter 4 System Setup 99
System Sizing Considerations 100
Calculating Data Volumes 101
Determining Usage Complexity 102
Estimating Simultaneous Users 104
Assessing System Availability Requirements 105
How Big Will It Be? 105
System Configuration Considerations 105
Memory 106
Monolithic or Distributed? 106
Storage System Considerations 110
Processors 113
Setting Up for High Availability 114
Software Installation and Configuration 115
Development Environment Software Requirements 116
Test and Production Software Requirements 120
Operating Systems 122
SQL Server Relational Database Setup 122
Analysis Services Setup 126
Integration Services Setup 129
Reporting Services Setup 130
Summary 131
Part 2 Building and Populating the Databases 133
Chapter 5 Creating the Relational Data Warehouse 135
Getting Started 136
Complete the Physical Design 137
Surrogate Keys 138
String Columns 138
To Null, or Not to Null? 140
Housekeeping Columns 140
Table and Column Extended Properties 142
Define Storage and Create Constraints and Supporting Objects 142
Create Files and Filegroups 142
Data Compression 144
Entity and Referential Integrity Constraints 145
Initial Indexing and Database Statistics 147
Aggregate Tables 150
Create Table Views 151
Insert an Unknown Member Row 152
Example CREATE TABLE Statement 152
Partitioned Tables 153
Finishing Up 163
Staging Tables 163
Metadata Setup 163
Summary 164
Chapter 6 Master Data Management 165
Managing Master Reference Data 166
Incomplete Attributes 167
Data Integration 168
Systems Integration 170
Master Data Management Systems and the Data Warehouse 171
Introducing SQL Server Master Data Services 171
Model Definition Features 172
Data Management Features 174
User Interface: Exploring and Managing the Master Data 174
Importing and Updating Data 176
Exporting Data 177
Full Versioning of All Attributes 179
Creating a Simple Application 179
Summary 186
Chapter 7 Designing and Developing the ETL System 187
Round Up the Requirements 188
Develop the ETL Plan 191
Introducing SQL Server Integration Services 192
Control Flow and Data Flow 194
SSIS Package Architecture 197
The Major Subsystems of ETL 198
Extracting Data 199
Subsystem 1: Data Profiling 199
Subsystem 2: Change Data Capture System 200
Subsystem 3: Extract System 202
Cleaning and Conforming Data 206
Subsystem 4: Data Cleaning System 206
Subsystem 5: Error Event Schema 214
Subsystem 6: Audit Dimension Assembler 215
Subsystem 7: Deduplication System 216
Subsystem 8: Conforming System 217
Delivering Data for Presentation 218
Subsystem 9: Slowly Changing Dimension Manager 218
Subsystem 10: Surrogate Key Generator 223
Subsystem 11: Hierarchy Manager 223
Subsystem 12: Special Dimensions Manager 224
Subsystem 13: Fact Table Builders 225
Subsystem 14: Surrogate Key Pipeline 229
Subsystem 15: Multi-Valued Dimension Bridge Table Builder 235
Subsystem 16: Late Arriving Data Handler 235
Subsystem 17: Dimension Manager 238
Subsystem 18: Fact Provider System 238
Subsystem 19: Aggregate Builder 239
Subsystem 20: OLAP Cube Builder 239
Subsystem 21: Data Propagation Manager 240
Managing the ETL Environment 240
Summary 243
Chapter 8 The Core Analysis Services OLAP Database 245
Overview of Analysis Services OLAP 247
Why Use Analysis Services? 247
Why Not Analysis Services? 249
Designing the OLAP Structure 250
Planning 251
Getting Started 253
Create a Project and a Data Source View 255
Dimension Designs 257
Creating and Editing Dimensions 261
Creating and Editing the Cube 274
Physical Design Considerations 291
Understanding Storage Modes 293
Developing the Partitioning Plan 294
Designing Performance Aggregations 296
Planning for Deployment 298
Processing the Full Cube 299
Developing the Incremental Processing Plan 299
Summary 304
Chapter 9 Design Requirements for Real-Time BI 305
Real-Time Triage 306
What Does Real-Time Mean? 306
Who Needs Real Time? 307
Real-Time Tradeoffs 308
Scenarios and Solutions 311
Executing Reports in Real Time 313
Serving Reports from a Cache 313
Creating an ODS with Mirrors and Snapshots 314
Creating an ODS with Replication 314
Building a BizTalk Application 315
Building a Real-Time Relational Partition 315
Querying Real-Time Data in the Relational Database 317
Using Analysis Services to Query Real-Time Data 318
Summary 319
Part 3 Developing the BI Applications 321
Chapter 10 Building BI Applications in Reporting Services 323
A Brief Overview of BI Applications 324
Types of BI Applications 325
The Value of Business Intelligence Applications 326
A High-Level Architecture for Reporting 328
Reviewing Business Requirements for Reporting 328
Examining the Reporting Services Architecture 330
Using Reporting Services as a Standard Reporting Tool 332
Reporting Services Assessment 339
The Reporting System Design and Development Process 340
Reporting System Design 341
Reporting System Development 348
Building and Delivering Reports 351
Planning and Preparation 351
Creating Reports 354
Reporting Operations 368
Ad Hoc Reporting Options 369
The Report Model 370
Shared Datasets 371
Report Parts 371
Summary 372
Chapter 11 PowerPivot and Excel 375
Using Excel for Analysis and Reporting 376
The PowerPivot Architecture: Excel on Steroids 378
Creating and Using PowerPivot Databases 380
Getting Started 381
PowerPivot Table Design 381
Creating Analytics with PowerPivot 385
Observations and Guidelines on PowerPivot for Excel 392
PowerPivot for SharePoint 394
The PowerPivot SharePoint User Experience 394
Server-Level Resources 397
PowerPivot Monitoring and Management 397
PowerPivot's Role in a Managed DW/BI Environment 400
Summary 401
Chapter 12 The BI Portal and SharePoint 403
The BI Portal 404
Planning the BI Portal 405
Impact on Design 406
Business Process Categories 407
Additional Functions 408
Building the BI Portal 409
Using SharePoint as the BI Portal 411
Architecture and Concepts 412
Setting Up SharePoint 417
Summary 426
Chapter 13 Incorporating Data Mining 429
Defining Data Mining 430
Basic Data Mining Terminology 432
Business Uses of Data Mining 433
Roles and Responsibilities 440
SQL Server Data Mining Architecture Overview 440
The Data Mining Design Environment 442
Build, Deploy, and Process 442
Accessing the Mining Models 443
Integration Services and Data Mining 443
...Erscheinungsjahr: | 2011 |
---|---|
Genre: | Importe, Informatik |
Rubrik: | Naturwissenschaften & Technik |
Medium: | Taschenbuch |
Inhalt: | 704 S. |
ISBN-13: | 9780470640388 |
ISBN-10: | 0470640383 |
Sprache: | Englisch |
Herstellernummer: | 14564038000 |
Einband: | Kartoniert / Broschiert |
Autor: |
Mundy, Joy
Thornthwaite, Warren |
Orchester: | Kimball, Ralph |
Auflage: | 2nd edition |
Hersteller: |
Wiley
John Wiley & Sons |
Verantwortliche Person für die EU: | Wiley-VCH GmbH, Boschstr. 12, D-69469 Weinheim, product-safety@wiley.com |
Maße: | 235 x 189 x 40 mm |
Von/Mit: | Joy Mundy (u. a.) |
Erscheinungsdatum: | 08.03.2011 |
Gewicht: | 1,025 kg |
Warren Thornthwaite has been building decision support and data warehousing systems since 1980 and is a member of the Kimball Group.
Ralph Kimball, PhD, is known worldwide as an innovator, writer, educator, speaker, and consultant in the field of data warehousing. He is the founder of the Kimball Group ([...] which provides data warehouse consulting and education.
Foreword xxvii
Introduction xxix
Part 1 Requirements, Realities, and Architecture 1
Chapter 1 Defining Business Requirements 3
The Most Important Determinant of Long-Term Success 5
Adventure Works Cycles Introduction 6
Uncovering Business Value 6
Obtaining Sponsorship 7
Defining Enterprise-Level Business Requirements 8
Prioritizing the Business Requirements 22
Revisiting the Project Planning 25
Gathering Project-Level Requirements 26
Summary 28
Chapter 2 Designing the Business Process Dimensional Model 29
Dimensional Modeling Concepts and Terminology 30
Facts 31
Dimensions 33
Bringing Facts and Dimensions Together 34
The Bus Matrix, Conformed Dimensions, and Drill Across 36
Additional Design Concepts and Techniques 38
Surrogate Keys 38
Slowly Changing Dimensions 39
Dates 42
Degenerate Dimensions 43
Snowflaking 43
Many-to-Many or Multivalued Dimensions 44
Hierarchies 47
Aggregate Dimensions 49
Junk Dimensions 51
The Three Fact Table Types 52
Aggregates 53
The Dimensional Modeling Process 54
Preparation 55
Data Profiling and Research 60
Building Dimensional Models 63
Developing the Detailed Dimensional Model 66
Testing and Refining the Model 68
Reviewing and Validating the Model 68
Case Study: The Adventure Works Cycles Orders Dimensional Model 69
The Orders Fact Table 69
The Dimensions 69
Identifying Dimension Attributes and Facts for the Orders Business Process 72
The Final Draft of the Initial Orders Model 74
Detailed Orders Dimensional Model Development 75
Final Dimensional Model 77
Summary 77
Chapter 3 The Toolset 79
The Microsoft DW/BI Toolset 80
Why Use the Microsoft Toolset? 82
Architecture of a Microsoft DW/BI System 83
Why Analysis Services? 84
Why a Relational Store? 86
ETL Is Not Optional 86
The Role of Master Data Services 88
Delivering BI Applications 88
Overview of the Microsoft Tools 89
Which Products Do You Need? 90
SQL Server Development and Management Tools 92
Summary 97
Chapter 4 System Setup 99
System Sizing Considerations 100
Calculating Data Volumes 101
Determining Usage Complexity 102
Estimating Simultaneous Users 104
Assessing System Availability Requirements 105
How Big Will It Be? 105
System Configuration Considerations 105
Memory 106
Monolithic or Distributed? 106
Storage System Considerations 110
Processors 113
Setting Up for High Availability 114
Software Installation and Configuration 115
Development Environment Software Requirements 116
Test and Production Software Requirements 120
Operating Systems 122
SQL Server Relational Database Setup 122
Analysis Services Setup 126
Integration Services Setup 129
Reporting Services Setup 130
Summary 131
Part 2 Building and Populating the Databases 133
Chapter 5 Creating the Relational Data Warehouse 135
Getting Started 136
Complete the Physical Design 137
Surrogate Keys 138
String Columns 138
To Null, or Not to Null? 140
Housekeeping Columns 140
Table and Column Extended Properties 142
Define Storage and Create Constraints and Supporting Objects 142
Create Files and Filegroups 142
Data Compression 144
Entity and Referential Integrity Constraints 145
Initial Indexing and Database Statistics 147
Aggregate Tables 150
Create Table Views 151
Insert an Unknown Member Row 152
Example CREATE TABLE Statement 152
Partitioned Tables 153
Finishing Up 163
Staging Tables 163
Metadata Setup 163
Summary 164
Chapter 6 Master Data Management 165
Managing Master Reference Data 166
Incomplete Attributes 167
Data Integration 168
Systems Integration 170
Master Data Management Systems and the Data Warehouse 171
Introducing SQL Server Master Data Services 171
Model Definition Features 172
Data Management Features 174
User Interface: Exploring and Managing the Master Data 174
Importing and Updating Data 176
Exporting Data 177
Full Versioning of All Attributes 179
Creating a Simple Application 179
Summary 186
Chapter 7 Designing and Developing the ETL System 187
Round Up the Requirements 188
Develop the ETL Plan 191
Introducing SQL Server Integration Services 192
Control Flow and Data Flow 194
SSIS Package Architecture 197
The Major Subsystems of ETL 198
Extracting Data 199
Subsystem 1: Data Profiling 199
Subsystem 2: Change Data Capture System 200
Subsystem 3: Extract System 202
Cleaning and Conforming Data 206
Subsystem 4: Data Cleaning System 206
Subsystem 5: Error Event Schema 214
Subsystem 6: Audit Dimension Assembler 215
Subsystem 7: Deduplication System 216
Subsystem 8: Conforming System 217
Delivering Data for Presentation 218
Subsystem 9: Slowly Changing Dimension Manager 218
Subsystem 10: Surrogate Key Generator 223
Subsystem 11: Hierarchy Manager 223
Subsystem 12: Special Dimensions Manager 224
Subsystem 13: Fact Table Builders 225
Subsystem 14: Surrogate Key Pipeline 229
Subsystem 15: Multi-Valued Dimension Bridge Table Builder 235
Subsystem 16: Late Arriving Data Handler 235
Subsystem 17: Dimension Manager 238
Subsystem 18: Fact Provider System 238
Subsystem 19: Aggregate Builder 239
Subsystem 20: OLAP Cube Builder 239
Subsystem 21: Data Propagation Manager 240
Managing the ETL Environment 240
Summary 243
Chapter 8 The Core Analysis Services OLAP Database 245
Overview of Analysis Services OLAP 247
Why Use Analysis Services? 247
Why Not Analysis Services? 249
Designing the OLAP Structure 250
Planning 251
Getting Started 253
Create a Project and a Data Source View 255
Dimension Designs 257
Creating and Editing Dimensions 261
Creating and Editing the Cube 274
Physical Design Considerations 291
Understanding Storage Modes 293
Developing the Partitioning Plan 294
Designing Performance Aggregations 296
Planning for Deployment 298
Processing the Full Cube 299
Developing the Incremental Processing Plan 299
Summary 304
Chapter 9 Design Requirements for Real-Time BI 305
Real-Time Triage 306
What Does Real-Time Mean? 306
Who Needs Real Time? 307
Real-Time Tradeoffs 308
Scenarios and Solutions 311
Executing Reports in Real Time 313
Serving Reports from a Cache 313
Creating an ODS with Mirrors and Snapshots 314
Creating an ODS with Replication 314
Building a BizTalk Application 315
Building a Real-Time Relational Partition 315
Querying Real-Time Data in the Relational Database 317
Using Analysis Services to Query Real-Time Data 318
Summary 319
Part 3 Developing the BI Applications 321
Chapter 10 Building BI Applications in Reporting Services 323
A Brief Overview of BI Applications 324
Types of BI Applications 325
The Value of Business Intelligence Applications 326
A High-Level Architecture for Reporting 328
Reviewing Business Requirements for Reporting 328
Examining the Reporting Services Architecture 330
Using Reporting Services as a Standard Reporting Tool 332
Reporting Services Assessment 339
The Reporting System Design and Development Process 340
Reporting System Design 341
Reporting System Development 348
Building and Delivering Reports 351
Planning and Preparation 351
Creating Reports 354
Reporting Operations 368
Ad Hoc Reporting Options 369
The Report Model 370
Shared Datasets 371
Report Parts 371
Summary 372
Chapter 11 PowerPivot and Excel 375
Using Excel for Analysis and Reporting 376
The PowerPivot Architecture: Excel on Steroids 378
Creating and Using PowerPivot Databases 380
Getting Started 381
PowerPivot Table Design 381
Creating Analytics with PowerPivot 385
Observations and Guidelines on PowerPivot for Excel 392
PowerPivot for SharePoint 394
The PowerPivot SharePoint User Experience 394
Server-Level Resources 397
PowerPivot Monitoring and Management 397
PowerPivot's Role in a Managed DW/BI Environment 400
Summary 401
Chapter 12 The BI Portal and SharePoint 403
The BI Portal 404
Planning the BI Portal 405
Impact on Design 406
Business Process Categories 407
Additional Functions 408
Building the BI Portal 409
Using SharePoint as the BI Portal 411
Architecture and Concepts 412
Setting Up SharePoint 417
Summary 426
Chapter 13 Incorporating Data Mining 429
Defining Data Mining 430
Basic Data Mining Terminology 432
Business Uses of Data Mining 433
Roles and Responsibilities 440
SQL Server Data Mining Architecture Overview 440
The Data Mining Design Environment 442
Build, Deploy, and Process 442
Accessing the Mining Models 443
Integration Services and Data Mining 443
...Erscheinungsjahr: | 2011 |
---|---|
Genre: | Importe, Informatik |
Rubrik: | Naturwissenschaften & Technik |
Medium: | Taschenbuch |
Inhalt: | 704 S. |
ISBN-13: | 9780470640388 |
ISBN-10: | 0470640383 |
Sprache: | Englisch |
Herstellernummer: | 14564038000 |
Einband: | Kartoniert / Broschiert |
Autor: |
Mundy, Joy
Thornthwaite, Warren |
Orchester: | Kimball, Ralph |
Auflage: | 2nd edition |
Hersteller: |
Wiley
John Wiley & Sons |
Verantwortliche Person für die EU: | Wiley-VCH GmbH, Boschstr. 12, D-69469 Weinheim, product-safety@wiley.com |
Maße: | 235 x 189 x 40 mm |
Von/Mit: | Joy Mundy (u. a.) |
Erscheinungsdatum: | 08.03.2011 |
Gewicht: | 1,025 kg |