Categories

There are currently no items in your shopping cart.

User Panel

# Data Science A-Z™: Real-Life Data Science Exercises Included

7.99 \$
Video Introducing this tutorial

Get Excited :
Welcome to Data Science A-Z™

What is Data Science? :
Intro (what you will learn in this section)
Profession of the future
Areas of Data Science
IMPORTANT: Course Pathways

--------------------------- Part 1: Visualisation --------------------------- :
Welcome to Part 1

Introduction to Tableau :
Intro (what you will learn in this section)
Installing Tableau Desktop and Tableau Public (FREE)
Challenge description + view data in file
Connecting Tableau to a Data file - CSV file
Navigating Tableau - Measures and Dimensions
Creating a calculated field
Section Recap
Tableau Basics
5 questions

How to use Tableau for Data Mining :
Intro (what you will learn in this section)
Get the Dataset + Project Overview
Connecting Tableau to an Excel File
How to visualise an ad-hoc A-B test in Tableau
Working with Aliases
Looking for anomalies
Handy trick to validate your approach / data
Section Recap

Advanced Data Mining With Tableau :
Intro (what you will learn in this section)
Creating bins & Visualizing distributions
Creating a classification test for a numeric variable
Combining two charts and working with them in Tableau
Validating Tableau Data Mining with a Chi-Squared test
Chi-Squared test when there is more than 2 categories
Visualising Balance and Estimated Salary distribution
Bonus: Chi-Squared Test (Stats Tutorial)
Bonus: Chi-Squared Test Part 2 (Stats Tutorial)
Section Recap
Part Completed

--------------------------- Part 2: Modelling --------------------------- :
Welcome to Part 2

Stats Refresher :
Intro (what you will learn in this section)
Types of variables: Categorical vs Numeric
Types of regressions
Ordinary Least Squares
R-squared

Simple Linear Regression :
Intro (what you will learn in this section)
Introduction to Gretl
Get the dataset
Import data and run descriptive statistics
Plotting and analysing the graph

Multiple Linear Regression :
Intro (what you will learn in this section)
Caveat: assumptions of a linear regression
Get the dataset
Dummy Variables
Dummy Variable Trap
Ways to build a model: BACKWARD, FORWARD, STEPWISE
Backward Elimination - Practice time
Using Adjusted R-squared to create Robust models
Interpreting coefficients of MLR
Section Recap
Logistic Regression
8 lectures
Intro (what you will learn in this section)
Get the dataset
Logistic regression intuition
False Positives and False Negatives
Confusion Matrix
Interpreting coefficients of a logistic regression
Building a robust geodemographic segmentation model
10 lectures
Intro (what you will learn in this section)
Get the dataset
What is geo-demographic segmenation?
Let's build the model - first iteration
Let's build the model - backward elimination: STEP-BY-STEP
Transforming independent variables
Creating derived variables
Checking for multicollinearity using VIF
Correlation Matrix and Multicollinearity Intuition
Model is Ready and Section Recap
10 lectures
Intro (what you will learn in this section)
Cumulative Accuracy Profile (CAP)
How to build a CAP curve in Excel
Assessing your model using the CAP curve
Get my CAP curve template
How to use test data to prevent overfitting your model
Applying the model to test data
Comparing training performance and test performance
Section Recap
7 lectures
Intro (what you will learn in this section)
Coefficients of a Logistic Regression - Plan of Attack (advanced topic)
Odds Ratio vs Coefficients in a Logistic Regression (advanced topic)
Section Recap
Model maintenance
5 lectures
Intro (what you will learn in this section)
What does model deterioration look like?
Why do models deteriorate?
Three levels of maintenance for deployed models
Section Recap
--------------------------- Part 3: Data Preparation ---------------------------
1 lecture
Welcome to Part 3
8 lectures
Intro (what you will learn in this section)
Working with Data
What is a Data Warehouse? What is a Database?
Setting up Microsoft SQL Server 2014 for practice
Important: Practice Database
ETL for Data Science - what is Extract Transform Load (ETL)?
Microsoft BI Tools: What is SSDT-BI and what are SSIS/SSAS/SSRS ?
Installing SSDT with MSVS Shell
ETL Phase 1: Data Wrangling before the Load
6 lectures
Intro (what you will learn in this section)
Two things you HAVE to do before the load
7 lectures
Intro (what you will learn in this section)
Starting and navigating an SSIS Project
Creating a flat file source task and OLE DB destination
Setting up your flat file source connection
Setting up your database connection and creating a RAW table
Handling errors during ETL (Phases 1 & 2)
16 lectures
Intro (what you will learn in this section)
How excel can mess up your data
Bulletproof Blueprint for Data Wrangling before the Load
SSIS Error: Text qualifier not specified
What do you do when your source file is corrupt? (Part 1)
What do you do when your source file is corrupt? (Part 2)
SSIS Error: Data truncation
Handy trick for finding anomalies in SQL
Automating Error Handling in SSIS: Conditional Split
Automating Error Handling in SSIS: Conditional Split (Level 2)
How to analyze the error files
Due Dilligence: the one thing you HAVE to do every time
Types of Errors in SSIS
Summary
Homework
SQL Programming for Data Science
17 lectures
Intro (what you will learn in this section)
Getting To Know MS SQL Management Studio
SELECT * Statement
Using the WHERE clause to filter data
How to use Wildcards / Regular Expressions in SQL (% and _)
Order By
Data Types in SQL
Implicit Data Conversion in SQL
Using Cast() vs Convert()
Working with NULLs
Understanding how LEFT, RIGHT, INNER, and OUTER joins work
Joins with duplicate values
Joining on multiple fields
Practicing Joins
ETL Phase 3: Data Wrangling after the load
16 lectures
Intro (what you will learn in this section)
RAW, WRK, DRV tables
Create your first Stored Proc in SQL
Executing Stored Procedures
Modifying Stored Procedures
Create table
Insert INTO
Check if table exists + drop table + Truncate
Intermediate Recap - Procs
Create the proc for the second file
Converting data on the fly
How to create a proc template
Archiving Procs
What you can do with these tables going forward [drv files etc.]
Handling errors during ETL (Phase 3)
12 lectures
Intro (what you will learn in this section)
Upload the data to RAW table
Create Stored Proc
How to deal with errors using the isnumeric() function
How to deal errors using the len() function
How to deal with errors using the isdate() function
Part Completed
ETL Error Handling "Vehicle Service" Project
--------------------------- Part 4: Communication ---------------------------
1 lecture
Welcome to Part 4
Working with people
8 lectures
Intro (what you will learn in this section)
Cross-departmental Work
Come to me with a Business Problem
Setting expectations and pre-project communication
Go and sit with them
The art of saying "No"
Sometimes you have to go to the top
Building a data culture
Presenting for Data Scientists
11 lectures
Intro (what you will learn in this section)
Case study
Analysing the intro
Intro dissection - recap
REAL Data Science Presentation Walkthrough - Make Your Audience Say "WOW"
My brainstorming method
How to present to executives
The truth is not always pretty
Passion and the Wow-factor
Bonus: my full presentation | LIVE 2015
Bonus: links to other examples of good storytelling
Homework Solutions
6 lectures
Advanced Data Mining with Tableau: Visualising Credit Score & Tenure
Advanced Data Mining with Tableau: Chi-Squared Test for Country
ETL Error Handling (Phases 1 and 2)
ETL Error Handling "Vehicle Service" Project (Part 1 of 3)
ETL Error Handling "Vehicle Service" Project (Part 2 of 3)
ETL Error Handling "Vehicle Service" Project (Part 3 of 3)
Bonus Lectures
1 lecture