|
A Pragmatic Programmers Introduction to Data Integration Studio: Hands on Workshop |
|
If you are looking for quick guide, with step by step instructions, on how to do the standard data warehousng tasks in SAS Data Integration Studio, then read this paper. It even has screenshots!
Gregory S. Nelson, SUGI 31 2006 - Paper 111-31
Abstract
ETL is the process of moving data from a source system (such as operational systems or a table in a
database) into a structure that supports analytics and reporting (target). This workshop will guide
participants through a structured, hands-on exercise designed to give
them a broad overview of what things we can accomplish with Data
Integration Studio. Here we will prepare data for use by extracting
data from an external file, creating transformations that enrich our
data, combining it with other data for completeness and finally loading
the data into tables that are part of a star schema. The goal of this
workshop will be to get users comfortable with the tool and demonstrate
its capability.
A Pragmatic Programmers Introduction to Data Integration Studio: Hands on Workshop
|
|
|
Super Size It!!! Maximize the Performance of Your ETL Processes |
|
If you are looking to apply some tricks to boost the speed of your Data Integration processes, then you gotta read this paper.
Rausch, Nancy Wills, Nancy J, SAS Forum 2007 - Paper 108-2007
Abstract
The success of every business activity—from supplier management to
customer service—is dependent upon how well an organization manages its
critical data. This paper discusses practical recommendations for
optimizing the performance of your Extract, Transform, and Load (ETL)
data management processes. The paper presents best practices in ETL
process development and includes performance, tuning, and capacity
information. You will learn to analyze and debug ETL flows, gain
efficiency quickly, optimize your system environment for performance,
and customize with advanced performance techniques.
Super Size It!!! Maximize the Performance of Your ETL Processes
|
|
|
Speed It Up – Active Warehousing with SAS® Data Integration: From Batch to Real-Time |
|
This is a great paper that explains approaches to use DI Studio to manage Change Data Capture (CDC), Solwly Changing Dimensions (SCD) and integrating with Web Services.
Rausch, Nancy Hunley, Eric J. Mehler, Gary, SAS Forum 2007 - Paper 100-2007:
Abstract
The global economy has fueled a need to "speed it up" to fulfill the
demand for current, reliable data for decision support. The SAS Data
Integration Server, with its active warehousing capabilities, can help
you meet this need.
This paper presents methods and best practices for using real-time techniques that are available in SAS Data
Integration Studio:
- change data capture (CDC)
- message queues technology
- service-oriented-architecture (SOA) technologies.
Speed It Up – Active Warehousing with SAS® Data Integration: From Batch to Real-Time
|
|
|
Base SAS vs. SAS Data Integration Studio |
|
This is a great paper that compares SAS Data Integration Studio against the requirements Kimball outlines all ETL environment should have.
Danny Grasse and Greg Nelson, SUGI 31, Paper 099-31
Abstract
Every data warehouse, data mart and data hut needs it. Every good business
intelligence interface depends on it. It has been the hallmark of what SAS
programmers have done for over 30 years –beat data into submission (a.k.a. ETL -
extract, transform and load data). Now all grown up and ready to take on the
world, SAS software’s ability to get at just about any data source, massage,
cajole, beat, cleanse and transform information and then make it sing and dance
in your tool of preference makes SAS the ideal choice for modern
decision-support applications.
So which tools, technologies and/or approaches can we use for ETL and which
make sense for any given
application? We will explore the world of ETL from
the perspective of the tasks that we have to accomplish
and compare using
Base SAS tools versus Data Integration Studio (or as it is now known – SAS
Data
Integration Studio). To that end, we will highlight what a good ETL
system should be able to do by taking a lesson from Ralph Kimball and his book
and articles that outline the 38 subsystems for ETL. We will touch
on several
key tasks found in ETL and show you how to accomplish these using both Base SAS
and SAS Data Integration Studio. In addition, we will summarize the major
capabilities of each approach as a quick
reference for management.
Base SAS vs. SAS Data Integration Studio:
|
|
|
Best Practices for Working with CDISC Metadata in the SAS Data Integration Server |
|
Althougth this paper focusses on using DI Serverin a clincial nevironment, it has some great recomended practises we could all use.
Michael Kilhullen, PharmaSUG 2007, Paper sa03
Abstract
Over the past few years, SAS has demonstrated how the SAS® Metadata Server can be used to implement and manage CDISC metadata and facilitate a metadata driven approach to standardizing clinical data. In this paper, we examine best practices for using SAS® Data Integration Studio to execute and manage key CDISC concepts such as controlled terminology, value level metadata, normalization of data, importing and exporting XML documents, and producing the CRT-DDS.
Within the context of these topics, we will also examine considerations for setting up and managing study metadata, writing efficient transformation processes, leveraging metadata to answer key business questions, and effective use of change management.
Best Practices for Working with CDISC Metadata in the SAS® Data Integration
|
|
|