Welcome to IS417
Data Warehousing and Business Analytics (DWBA)
About
Data
warehousing has recently gained
a considerable momentum as a fundamental paradigm for driving daily business
analytics operations.
This course is
designed to provide an introduction to fundamental issues and novel
techniques of data warehouse. Issues covered include
data warehouse planning;
business modeling, design, and implementation. In particular, the role of data
warehouse in supporting business
intelligence (BI) and effective
decision making is also reviewed and explored.
This course is taught with an emphasis to balance theories about data
warehouse techniques and applications to real world BI problems.
The participants will explore applications and have unique opportunity
for hands-on exposure with business analytics for data
warehousing using the advanced
software packages from leading industrial partners.
Staff & Office Hours
|
Instructor |
Professor Jialie SHEN
(Jerry) |
|
Tel |
6828-0982 (o) |
|
Email |
|
|
Consultation |
Every Friday 1:00pm ~ 2:00pm Rm 80-5025 |
|
TA |
Leonard NG Wee Siong Email:
leonard.ng.2008@sis.smu.edu.sg Consultation: Every Friday,
14:00 ~ 15:00, Week 2 ~ 13, SIS GSR 3-2 (Rm 3015) |
Teaching Strategies
- Lectures/Invited Talks: the main way to
introduce concepts, show examples
- Lab/Demo: work on projects or examples.
Instructor is available in labs.
- Consultations: weekly
one-hour sessions to provide personalized advice to students.
The lecutres will serve as the main method of delivering the course
contents to the students. Weekly laboratories,
assignments and projects give the
students the hands-on experience on various subjects of the coruse. The
consultation
will give students the
opportunities to seek the assistance from the teaching staffs over all contents
of the course.
Course Assessment
This course’s assessment consists of 2
problem sets, attendance & participation, one term project, one mid-term
exam
and one final exam. Grades
will be determined on the following basis:
|
Problem set |
10% |
2
sets, 5% each, individual |
|
Attendance &
Participation |
10% |
|
|
Term project |
20% |
Final
presentation 10%, Final report 10%, group based |
|
Mid-term exam |
20% |
90
mins, written exam, week 7 |
|
Final
exam |
40% |
3
hours, written exam |
Final mark
(100%) = Problem set + Term project + Attendance & Participation +
Mid-term
exam + Final exam
Course
Schedule
|
Class Topic |
Readings |
Remark |
|
|
1 |
Course Overview |
|
|
|
|
Part I: Basic
Knowledge about Data Warehousing and BI |
|
|
|
1 [Jan 13] |
Overview of Data
Warehousing & BI Invited Talk I by SAS |
[CB01] Chp.1 pp. 4 ~ 11 [CB02] Chp. 3 pp. 105 ~ 110 [RA04] |
|
|
2 [Jan 20] |
Dimensional Modeling
I: Basics |
[CB02] Chp. 3 pp. 110 ~ 122 [CB03] Chp. 6 pp. 233 ~ 248 |
|
|
3 [Jan 27] |
Dimensional Modeling
II: Advanced Topics Lab 1: Dimensional Modeling |
[CB03] Chp. 6 pp. 253 ~ 267 |
Problem set 1 out |
|
4 [Feb 3] |
Online Analytical
Processing (OLAP) Lab 2: OLAP analysis and Reporting |
[CB02] Chp. 3 pp. 123 ~ 126
pp. 135 ~ 136 [RA07] |
Deadline for group registration |
|
|
Part II: Data
Warehouse Architecture and Management |
|
|
|
5 [Feb 10] |
Extract, Transform,
Load (ETL) and Data Quality Lab 3a: ETL process |
[CB01] Chp. 10 pp. 256 ~ 259
pp. 267 ~ 276
pp. 281 ~ 283 [RA05] |
|
|
6 [Feb 17] |
Data Warehouse
Architecture Data Warehouse
Development and Management Tutorial I Lab 3b ETL process |
[CB02] Chp. 3
pp. 130 ~ 133
[CB01] Chp. 9
pp. 236 ~ 239
pp. 246 ~ 249 [RA01]
[RA02] |
|
|
7 [Feb 24] |
Mid Term Examination
(90 mins, Close book) & Term project consultation |
|
Problem set 2 out and Problem set 1 due |
|
8 |
Session break (Feb 27 ~ Mar 4) |
|
|
|
|
Part III:
Applications of DW/BI Systems |
|
|
|
9 [Mar 9] |
Decision making and
Business Intelligence I – Basics Team presentation for term project [Phase I] SAS Invited Talk II |
[CB03] Chp. 11 pp. 473 ~ 479 [CB02]
Chp. 1
pp. 21 ~ 27 [RB04] Chp.1
pp. 22 ~ 28 |
|
|
10 [Mar 16] |
Decision making and Business
Intelligence II – Turning data to firm performance Tutorial II Lab 4: SAS BI/DW introduction |
[CB02] Chp. 11 pp. 649 ~ 653 [RA06]
|
|
|
11 [Mar 23] |
Next Generation Data
Warehousing and BI Lab 5: SAS data integration toolkit |
Hand out & class Slides |
|
|
12 [Mar 30] |
Team project
presentation |
|
Problem set 2 due |
|
13 [Apr 6] |
Team project
presentation |
|
|
|
14 |
Student Study Week |
|
|
|
15 |
Final
Examination Date:
20 Apr 2012 (Friday) Time: 14:30
- 17:30 |
|
|
Problem Sets
·
Problem set 1
·
Problem set 2
Projects (TBA)
Textbooks & Reference
· [CB01] The Data
Warehouse Mentor: Practical Data Warehouse and Business Intelligence Insights,
R. Laberge, 2011, McGraw-Hill.
·
[CB02] Data Mining: Concepts and Techniques, J. Han
and M. Kamber, 2nd edition, 2006, Kaufmann Publishers
· [CB03] The Data Warehouse Lifecycle Toolkit: Practical
Techniques for Building Data Warehouse and Business Intelligence Systems,
R. Kimball, M. Ross, W. Thornthwaite, J. Mundy, B.
Becker, 2nd edition, 2007, Wiley
Reference
Books:
· [RB01] Fundamentals of Data Warehouses, M. Jarke, M.
Lenzerini, Y. Vassiliou, P. Vassilidis, 2nd
edition, 2003, Springer
· [RB02] Database Management Systems, R. Ramakrishnan
and J. Gehrke, 3rd edition, 2002, McGraw-Hill
· [RB03] Information Rules – A Strategic Guide to the
Network Economy, C. Shapiro and H. R. Varian, Harvard Business School Press,
1998.
· [RB04] Data Mining: Practical Machine Learning Tools
and Techniques, I. H. Witten and E. Frank, 2nd Edition, Elservier 2005
Reference Articles:
· [RA01] My Top
10 Assertions about Data Warehouses, M. Stonebraker, BLOG@CACM, 2010.
· [RA02]
Managerial Considerations, H. J. Watson and B. J. Haley, Communications of
the ACM, Vol. 41, No. 9, 1998.
· [RA03] Does
Data Warehouse End-User Metadata Add Value? N. Foshay, A. Mukherjee, A.
Taylor, Communications of the ACM, Vol. 50, No. 11, 2007.
·
[RA04] An Overview of Business Intelligence Technology, S. Chaudhuri, U. Dayal, V. Narasayya, Communications of the ACM, Vol.
54, No. 8, 2011.
· [RA05] Assessing Data Quality for Information
Products: Impact of Selection, Projection, and Cartesian product,
A. Parssian, S. Sarkar and V. S.
Jacob, Management Science, Vol. 50, No. 7, 2004.
· [RA06]
Competing on Analytics, Thomas H. Daveport, Harvard Business Review, January, 2006.
· [RA07] Data cube: a relational aggregation operator
generalizing group-by, cross-tab, and sub-totals,
Data Mining and Knowledge Discovery,
J. Gray, S. Chaudhuri, A. Bosworth, A. Layman, D. Reichart, M. Venkatrao, Vol.
1, No. 1. 1997.