Zum Hauptinhalt springen Zur Suche springen Zur Hauptnavigation springen
Dekorationsartikel gehören nicht zum Leistungsumfang.
The Microsoft Data Warehouse Toolkit
Taschenbuch von Joy Mundy (u. a.)
Sprache: Englisch

47,95 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

Lieferzeit 2-3 Wochen

Produkt Anzahl: Gib den gewünschten Wert ein oder benutze die Schaltflächen um die Anzahl zu erhöhen oder zu reduzieren.
Kategorien:
Beschreibung
Learn to use SQL Server 2008 R2 to build a successful data warehouse

As the most influential thought leaders in the data warehousing and business intelligence industry, the Kimball Group has developed pioneering techniques that serve as industry standards for DW/BI system design, development, and management. With this new edition of their bestseller, veteran experts from the Kimball Group get you up to speed with using the new Business Intelligence release of SQL Server: SQL Server 2008 R2. Covering the complete suite of data warehousing and BI tools that are part of SQL Server 2008 R2, the authors follow the full project lifecycle, including design, development, deployment, and maintenance.

In addition to a substantial update to the content of the previous edition as well as a look at the new features and functionality of SQL Server 2008 R2 such as PowerPivot and Master Data Services, this new version includes detailed examples that illustrate how to best apply the techniques described in the book. The authors share their own trial-and-error experiences of building a DW/BI system with the Microsoft tools so that you can benefit from their challenges and successes. You'll also discover how using the Kimball Lifecycle to build your DW/BI system encourages you to zero in on four primary principles: focus on the business, build an information infrastructure, deliver in meaningfulincrements, and deliver the entire solution. With these principles in hand, you'll be well on your way to building a successful DW/BI system that supports business intelligence requirements common to most organizations.

No matter your role in the process of working on a DW/BI project, The Microsoft Data Warehouse Toolkit, Second Edition shows you how to:

  • Focus your efforts on the best opportunities

  • Select, install, and configure the appropriate components of SQL Server 2008 R2 and other Microsoft products

  • Design the dimensional model and create the database structures for the relational and Analysis Services databases

  • Build an ETL System in Integration Services to populate the DW/BI databases

  • Build the BI applications and data mining models to deliver value to your organization

  • Manage, secure, and grow the DW/BI system for long-term success

With SQL Server 2008 R2 and the Microsoft Business Intelligence Toolset

Learn to use SQL Server 2008 R2 to build a successful data warehouse

As the most influential thought leaders in the data warehousing and business intelligence industry, the Kimball Group has developed pioneering techniques that serve as industry standards for DW/BI system design, development, and management. With this new edition of their bestseller, veteran experts from the Kimball Group get you up to speed with using the new Business Intelligence release of SQL Server: SQL Server 2008 R2. Covering the complete suite of data warehousing and BI tools that are part of SQL Server 2008 R2, the authors follow the full project lifecycle, including design, development, deployment, and maintenance.

In addition to a substantial update to the content of the previous edition as well as a look at the new features and functionality of SQL Server 2008 R2 such as PowerPivot and Master Data Services, this new version includes detailed examples that illustrate how to best apply the techniques described in the book. The authors share their own trial-and-error experiences of building a DW/BI system with the Microsoft tools so that you can benefit from their challenges and successes. You'll also discover how using the Kimball Lifecycle to build your DW/BI system encourages you to zero in on four primary principles: focus on the business, build an information infrastructure, deliver in meaningfulincrements, and deliver the entire solution. With these principles in hand, you'll be well on your way to building a successful DW/BI system that supports business intelligence requirements common to most organizations.

No matter your role in the process of working on a DW/BI project, The Microsoft Data Warehouse Toolkit, Second Edition shows you how to:

  • Focus your efforts on the best opportunities

  • Select, install, and configure the appropriate components of SQL Server 2008 R2 and other Microsoft products

  • Design the dimensional model and create the database structures for the relational and Analysis Services databases

  • Build an ETL System in Integration Services to populate the DW/BI databases

  • Build the BI applications and data mining models to deliver value to your organization

  • Manage, secure, and grow the DW/BI system for long-term success

With SQL Server 2008 R2 and the Microsoft Business Intelligence Toolset

Über den Autor
Joy Mundy is a member of the Kimball Group and has been focusing on data warehousing and business intelligence since the early 1990s.

Warren Thornthwaite has been building decision support and data warehousing systems since 1980 and is a member of the Kimball Group.

Ralph Kimball, PhD, is known worldwide as an innovator, writer, educator, speaker, and consultant in the field of data warehousing. He is the founder of the Kimball Group ([...] which provides data warehouse consulting and education.

Inhaltsverzeichnis

Foreword xxvii

Introduction xxix

Part 1 Requirements, Realities, and Architecture 1

Chapter 1 Defining Business Requirements 3

The Most Important Determinant of Long-Term Success 5

Adventure Works Cycles Introduction 6

Uncovering Business Value 6

Obtaining Sponsorship 7

Defining Enterprise-Level Business Requirements 8

Prioritizing the Business Requirements 22

Revisiting the Project Planning 25

Gathering Project-Level Requirements 26

Summary 28

Chapter 2 Designing the Business Process Dimensional Model 29

Dimensional Modeling Concepts and Terminology 30

Facts 31

Dimensions 33

Bringing Facts and Dimensions Together 34

The Bus Matrix, Conformed Dimensions, and Drill Across 36

Additional Design Concepts and Techniques 38

Surrogate Keys 38

Slowly Changing Dimensions 39

Dates 42

Degenerate Dimensions 43

Snowflaking 43

Many-to-Many or Multivalued Dimensions 44

Hierarchies 47

Aggregate Dimensions 49

Junk Dimensions 51

The Three Fact Table Types 52

Aggregates 53

The Dimensional Modeling Process 54

Preparation 55

Data Profiling and Research 60

Building Dimensional Models 63

Developing the Detailed Dimensional Model 66

Testing and Refining the Model 68

Reviewing and Validating the Model 68

Case Study: The Adventure Works Cycles Orders Dimensional Model 69

The Orders Fact Table 69

The Dimensions 69

Identifying Dimension Attributes and Facts for the Orders Business Process 72

The Final Draft of the Initial Orders Model 74

Detailed Orders Dimensional Model Development 75

Final Dimensional Model 77

Summary 77

Chapter 3 The Toolset 79

The Microsoft DW/BI Toolset 80

Why Use the Microsoft Toolset? 82

Architecture of a Microsoft DW/BI System 83

Why Analysis Services? 84

Why a Relational Store? 86

ETL Is Not Optional 86

The Role of Master Data Services 88

Delivering BI Applications 88

Overview of the Microsoft Tools 89

Which Products Do You Need? 90

SQL Server Development and Management Tools 92

Summary 97

Chapter 4 System Setup 99

System Sizing Considerations 100

Calculating Data Volumes 101

Determining Usage Complexity 102

Estimating Simultaneous Users 104

Assessing System Availability Requirements 105

How Big Will It Be? 105

System Configuration Considerations 105

Memory 106

Monolithic or Distributed? 106

Storage System Considerations 110

Processors 113

Setting Up for High Availability 114

Software Installation and Configuration 115

Development Environment Software Requirements 116

Test and Production Software Requirements 120

Operating Systems 122

SQL Server Relational Database Setup 122

Analysis Services Setup 126

Integration Services Setup 129

Reporting Services Setup 130

Summary 131

Part 2 Building and Populating the Databases 133

Chapter 5 Creating the Relational Data Warehouse 135

Getting Started 136

Complete the Physical Design 137

Surrogate Keys 138

String Columns 138

To Null, or Not to Null? 140

Housekeeping Columns 140

Table and Column Extended Properties 142

Define Storage and Create Constraints and Supporting Objects 142

Create Files and Filegroups 142

Data Compression 144

Entity and Referential Integrity Constraints 145

Initial Indexing and Database Statistics 147

Aggregate Tables 150

Create Table Views 151

Insert an Unknown Member Row 152

Example CREATE TABLE Statement 152

Partitioned Tables 153

Finishing Up 163

Staging Tables 163

Metadata Setup 163

Summary 164

Chapter 6 Master Data Management 165

Managing Master Reference Data 166

Incomplete Attributes 167

Data Integration 168

Systems Integration 170

Master Data Management Systems and the Data Warehouse 171

Introducing SQL Server Master Data Services 171

Model Definition Features 172

Data Management Features 174

User Interface: Exploring and Managing the Master Data 174

Importing and Updating Data 176

Exporting Data 177

Full Versioning of All Attributes 179

Creating a Simple Application 179

Summary 186

Chapter 7 Designing and Developing the ETL System 187

Round Up the Requirements 188

Develop the ETL Plan 191

Introducing SQL Server Integration Services 192

Control Flow and Data Flow 194

SSIS Package Architecture 197

The Major Subsystems of ETL 198

Extracting Data 199

Subsystem 1: Data Profiling 199

Subsystem 2: Change Data Capture System 200

Subsystem 3: Extract System 202

Cleaning and Conforming Data 206

Subsystem 4: Data Cleaning System 206

Subsystem 5: Error Event Schema 214

Subsystem 6: Audit Dimension Assembler 215

Subsystem 7: Deduplication System 216

Subsystem 8: Conforming System 217

Delivering Data for Presentation 218

Subsystem 9: Slowly Changing Dimension Manager 218

Subsystem 10: Surrogate Key Generator 223

Subsystem 11: Hierarchy Manager 223

Subsystem 12: Special Dimensions Manager 224

Subsystem 13: Fact Table Builders 225

Subsystem 14: Surrogate Key Pipeline 229

Subsystem 15: Multi-Valued Dimension Bridge Table Builder 235

Subsystem 16: Late Arriving Data Handler 235

Subsystem 17: Dimension Manager 238

Subsystem 18: Fact Provider System 238

Subsystem 19: Aggregate Builder 239

Subsystem 20: OLAP Cube Builder 239

Subsystem 21: Data Propagation Manager 240

Managing the ETL Environment 240

Summary 243

Chapter 8 The Core Analysis Services OLAP Database 245

Overview of Analysis Services OLAP 247

Why Use Analysis Services? 247

Why Not Analysis Services? 249

Designing the OLAP Structure 250

Planning 251

Getting Started 253

Create a Project and a Data Source View 255

Dimension Designs 257

Creating and Editing Dimensions 261

Creating and Editing the Cube 274

Physical Design Considerations 291

Understanding Storage Modes 293

Developing the Partitioning Plan 294

Designing Performance Aggregations 296

Planning for Deployment 298

Processing the Full Cube 299

Developing the Incremental Processing Plan 299

Summary 304

Chapter 9 Design Requirements for Real-Time BI 305

Real-Time Triage 306

What Does Real-Time Mean? 306

Who Needs Real Time? 307

Real-Time Tradeoffs 308

Scenarios and Solutions 311

Executing Reports in Real Time 313

Serving Reports from a Cache 313

Creating an ODS with Mirrors and Snapshots 314

Creating an ODS with Replication 314

Building a BizTalk Application 315

Building a Real-Time Relational Partition 315

Querying Real-Time Data in the Relational Database 317

Using Analysis Services to Query Real-Time Data 318

Summary 319

Part 3 Developing the BI Applications 321

Chapter 10 Building BI Applications in Reporting Services 323

A Brief Overview of BI Applications 324

Types of BI Applications 325

The Value of Business Intelligence Applications 326

A High-Level Architecture for Reporting 328

Reviewing Business Requirements for Reporting 328

Examining the Reporting Services Architecture 330

Using Reporting Services as a Standard Reporting Tool 332

Reporting Services Assessment 339

The Reporting System Design and Development Process 340

Reporting System Design 341

Reporting System Development 348

Building and Delivering Reports 351

Planning and Preparation 351

Creating Reports 354

Reporting Operations 368

Ad Hoc Reporting Options 369

The Report Model 370

Shared Datasets 371

Report Parts 371

Summary 372

Chapter 11 PowerPivot and Excel 375

Using Excel for Analysis and Reporting 376

The PowerPivot Architecture: Excel on Steroids 378

Creating and Using PowerPivot Databases 380

Getting Started 381

PowerPivot Table Design 381

Creating Analytics with PowerPivot 385

Observations and Guidelines on PowerPivot for Excel 392

PowerPivot for SharePoint 394

The PowerPivot SharePoint User Experience 394

Server-Level Resources 397

PowerPivot Monitoring and Management 397

PowerPivot's Role in a Managed DW/BI Environment 400

Summary 401

Chapter 12 The BI Portal and SharePoint 403

The BI Portal 404

Planning the BI Portal 405

Impact on Design 406

Business Process Categories 407

Additional Functions 408

Building the BI Portal 409

Using SharePoint as the BI Portal 411

Architecture and Concepts 412

Setting Up SharePoint 417

Summary 426

Chapter 13 Incorporating Data Mining 429

Defining Data Mining 430

Basic Data Mining Terminology 432

Business Uses of Data Mining 433

Roles and Responsibilities 440

SQL Server Data Mining Architecture Overview 440

The Data Mining Design Environment 442

Build, Deploy, and Process 442

Accessing the Mining Models 443

Integration Services and Data Mining 443

...
Details
Erscheinungsjahr: 2011
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: 704 S.
ISBN-13: 9780470640388
ISBN-10: 0470640383
Sprache: Englisch
Herstellernummer: 14564038000
Einband: Kartoniert / Broschiert
Autor: Mundy, Joy
Thornthwaite, Warren
Orchester: Kimball, Ralph
Auflage: 2nd edition
Hersteller: Wiley
John Wiley & Sons
Verantwortliche Person für die EU: Wiley-VCH GmbH, Boschstr. 12, D-69469 Weinheim, product-safety@wiley.com
Maße: 235 x 189 x 40 mm
Von/Mit: Joy Mundy (u. a.)
Erscheinungsdatum: 08.03.2011
Gewicht: 1,025 kg
Artikel-ID: 107224267
Über den Autor
Joy Mundy is a member of the Kimball Group and has been focusing on data warehousing and business intelligence since the early 1990s.

Warren Thornthwaite has been building decision support and data warehousing systems since 1980 and is a member of the Kimball Group.

Ralph Kimball, PhD, is known worldwide as an innovator, writer, educator, speaker, and consultant in the field of data warehousing. He is the founder of the Kimball Group ([...] which provides data warehouse consulting and education.

Inhaltsverzeichnis

Foreword xxvii

Introduction xxix

Part 1 Requirements, Realities, and Architecture 1

Chapter 1 Defining Business Requirements 3

The Most Important Determinant of Long-Term Success 5

Adventure Works Cycles Introduction 6

Uncovering Business Value 6

Obtaining Sponsorship 7

Defining Enterprise-Level Business Requirements 8

Prioritizing the Business Requirements 22

Revisiting the Project Planning 25

Gathering Project-Level Requirements 26

Summary 28

Chapter 2 Designing the Business Process Dimensional Model 29

Dimensional Modeling Concepts and Terminology 30

Facts 31

Dimensions 33

Bringing Facts and Dimensions Together 34

The Bus Matrix, Conformed Dimensions, and Drill Across 36

Additional Design Concepts and Techniques 38

Surrogate Keys 38

Slowly Changing Dimensions 39

Dates 42

Degenerate Dimensions 43

Snowflaking 43

Many-to-Many or Multivalued Dimensions 44

Hierarchies 47

Aggregate Dimensions 49

Junk Dimensions 51

The Three Fact Table Types 52

Aggregates 53

The Dimensional Modeling Process 54

Preparation 55

Data Profiling and Research 60

Building Dimensional Models 63

Developing the Detailed Dimensional Model 66

Testing and Refining the Model 68

Reviewing and Validating the Model 68

Case Study: The Adventure Works Cycles Orders Dimensional Model 69

The Orders Fact Table 69

The Dimensions 69

Identifying Dimension Attributes and Facts for the Orders Business Process 72

The Final Draft of the Initial Orders Model 74

Detailed Orders Dimensional Model Development 75

Final Dimensional Model 77

Summary 77

Chapter 3 The Toolset 79

The Microsoft DW/BI Toolset 80

Why Use the Microsoft Toolset? 82

Architecture of a Microsoft DW/BI System 83

Why Analysis Services? 84

Why a Relational Store? 86

ETL Is Not Optional 86

The Role of Master Data Services 88

Delivering BI Applications 88

Overview of the Microsoft Tools 89

Which Products Do You Need? 90

SQL Server Development and Management Tools 92

Summary 97

Chapter 4 System Setup 99

System Sizing Considerations 100

Calculating Data Volumes 101

Determining Usage Complexity 102

Estimating Simultaneous Users 104

Assessing System Availability Requirements 105

How Big Will It Be? 105

System Configuration Considerations 105

Memory 106

Monolithic or Distributed? 106

Storage System Considerations 110

Processors 113

Setting Up for High Availability 114

Software Installation and Configuration 115

Development Environment Software Requirements 116

Test and Production Software Requirements 120

Operating Systems 122

SQL Server Relational Database Setup 122

Analysis Services Setup 126

Integration Services Setup 129

Reporting Services Setup 130

Summary 131

Part 2 Building and Populating the Databases 133

Chapter 5 Creating the Relational Data Warehouse 135

Getting Started 136

Complete the Physical Design 137

Surrogate Keys 138

String Columns 138

To Null, or Not to Null? 140

Housekeeping Columns 140

Table and Column Extended Properties 142

Define Storage and Create Constraints and Supporting Objects 142

Create Files and Filegroups 142

Data Compression 144

Entity and Referential Integrity Constraints 145

Initial Indexing and Database Statistics 147

Aggregate Tables 150

Create Table Views 151

Insert an Unknown Member Row 152

Example CREATE TABLE Statement 152

Partitioned Tables 153

Finishing Up 163

Staging Tables 163

Metadata Setup 163

Summary 164

Chapter 6 Master Data Management 165

Managing Master Reference Data 166

Incomplete Attributes 167

Data Integration 168

Systems Integration 170

Master Data Management Systems and the Data Warehouse 171

Introducing SQL Server Master Data Services 171

Model Definition Features 172

Data Management Features 174

User Interface: Exploring and Managing the Master Data 174

Importing and Updating Data 176

Exporting Data 177

Full Versioning of All Attributes 179

Creating a Simple Application 179

Summary 186

Chapter 7 Designing and Developing the ETL System 187

Round Up the Requirements 188

Develop the ETL Plan 191

Introducing SQL Server Integration Services 192

Control Flow and Data Flow 194

SSIS Package Architecture 197

The Major Subsystems of ETL 198

Extracting Data 199

Subsystem 1: Data Profiling 199

Subsystem 2: Change Data Capture System 200

Subsystem 3: Extract System 202

Cleaning and Conforming Data 206

Subsystem 4: Data Cleaning System 206

Subsystem 5: Error Event Schema 214

Subsystem 6: Audit Dimension Assembler 215

Subsystem 7: Deduplication System 216

Subsystem 8: Conforming System 217

Delivering Data for Presentation 218

Subsystem 9: Slowly Changing Dimension Manager 218

Subsystem 10: Surrogate Key Generator 223

Subsystem 11: Hierarchy Manager 223

Subsystem 12: Special Dimensions Manager 224

Subsystem 13: Fact Table Builders 225

Subsystem 14: Surrogate Key Pipeline 229

Subsystem 15: Multi-Valued Dimension Bridge Table Builder 235

Subsystem 16: Late Arriving Data Handler 235

Subsystem 17: Dimension Manager 238

Subsystem 18: Fact Provider System 238

Subsystem 19: Aggregate Builder 239

Subsystem 20: OLAP Cube Builder 239

Subsystem 21: Data Propagation Manager 240

Managing the ETL Environment 240

Summary 243

Chapter 8 The Core Analysis Services OLAP Database 245

Overview of Analysis Services OLAP 247

Why Use Analysis Services? 247

Why Not Analysis Services? 249

Designing the OLAP Structure 250

Planning 251

Getting Started 253

Create a Project and a Data Source View 255

Dimension Designs 257

Creating and Editing Dimensions 261

Creating and Editing the Cube 274

Physical Design Considerations 291

Understanding Storage Modes 293

Developing the Partitioning Plan 294

Designing Performance Aggregations 296

Planning for Deployment 298

Processing the Full Cube 299

Developing the Incremental Processing Plan 299

Summary 304

Chapter 9 Design Requirements for Real-Time BI 305

Real-Time Triage 306

What Does Real-Time Mean? 306

Who Needs Real Time? 307

Real-Time Tradeoffs 308

Scenarios and Solutions 311

Executing Reports in Real Time 313

Serving Reports from a Cache 313

Creating an ODS with Mirrors and Snapshots 314

Creating an ODS with Replication 314

Building a BizTalk Application 315

Building a Real-Time Relational Partition 315

Querying Real-Time Data in the Relational Database 317

Using Analysis Services to Query Real-Time Data 318

Summary 319

Part 3 Developing the BI Applications 321

Chapter 10 Building BI Applications in Reporting Services 323

A Brief Overview of BI Applications 324

Types of BI Applications 325

The Value of Business Intelligence Applications 326

A High-Level Architecture for Reporting 328

Reviewing Business Requirements for Reporting 328

Examining the Reporting Services Architecture 330

Using Reporting Services as a Standard Reporting Tool 332

Reporting Services Assessment 339

The Reporting System Design and Development Process 340

Reporting System Design 341

Reporting System Development 348

Building and Delivering Reports 351

Planning and Preparation 351

Creating Reports 354

Reporting Operations 368

Ad Hoc Reporting Options 369

The Report Model 370

Shared Datasets 371

Report Parts 371

Summary 372

Chapter 11 PowerPivot and Excel 375

Using Excel for Analysis and Reporting 376

The PowerPivot Architecture: Excel on Steroids 378

Creating and Using PowerPivot Databases 380

Getting Started 381

PowerPivot Table Design 381

Creating Analytics with PowerPivot 385

Observations and Guidelines on PowerPivot for Excel 392

PowerPivot for SharePoint 394

The PowerPivot SharePoint User Experience 394

Server-Level Resources 397

PowerPivot Monitoring and Management 397

PowerPivot's Role in a Managed DW/BI Environment 400

Summary 401

Chapter 12 The BI Portal and SharePoint 403

The BI Portal 404

Planning the BI Portal 405

Impact on Design 406

Business Process Categories 407

Additional Functions 408

Building the BI Portal 409

Using SharePoint as the BI Portal 411

Architecture and Concepts 412

Setting Up SharePoint 417

Summary 426

Chapter 13 Incorporating Data Mining 429

Defining Data Mining 430

Basic Data Mining Terminology 432

Business Uses of Data Mining 433

Roles and Responsibilities 440

SQL Server Data Mining Architecture Overview 440

The Data Mining Design Environment 442

Build, Deploy, and Process 442

Accessing the Mining Models 443

Integration Services and Data Mining 443

...
Details
Erscheinungsjahr: 2011
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: 704 S.
ISBN-13: 9780470640388
ISBN-10: 0470640383
Sprache: Englisch
Herstellernummer: 14564038000
Einband: Kartoniert / Broschiert
Autor: Mundy, Joy
Thornthwaite, Warren
Orchester: Kimball, Ralph
Auflage: 2nd edition
Hersteller: Wiley
John Wiley & Sons
Verantwortliche Person für die EU: Wiley-VCH GmbH, Boschstr. 12, D-69469 Weinheim, product-safety@wiley.com
Maße: 235 x 189 x 40 mm
Von/Mit: Joy Mundy (u. a.)
Erscheinungsdatum: 08.03.2011
Gewicht: 1,025 kg
Artikel-ID: 107224267
Sicherheitshinweis

Ähnliche Produkte

Ähnliche Produkte