PacktPub Mastering Big Data Analytics with PySpark

Python and Spark: A Match Made in Heaven:
Course Overview
Python versus Spark
Preparing for the Course
Connecting Jupyter to Spark

Working with PySpark:
Getting to Know Spark
The Power of Spark
The Power of Spark MLlib
Spark DataFrames
Spark Data Operations

Preparing Data Using Spark SQL:
Loading Data from CSV Files
Fixing Issues in Our Data - Part One
Fixing Issues in Our Data - Part Two
Grouping, Joining, and Aggregating - Part One
Grouping, Joining, and Aggregating - Part Two

Machine Learning with Spark MLlib:
Machine Learning with Spark
Building a Recommendation System with Spark MLlib - Part One
Building a Recommendation System with Spark MLlib - Part Two
Building a Recommendation System with Spark MLlib - Part Three
Finalizing our Recommendation System
What We Have Learned So Far

Classification and Regression:
Machine Learning with Spark
Machine Learning Pipelines
Running a Logistic Regression Pipeline
Parameters, Features, and Persistence
Frequent Pattern Mining and Statistics

Analyzing Big Data:
Natural Language Processing with Spark
Identifying Our Data
Data Preparation and Exploration
Creating Our Raw Training Data

Processing Natural Language in Spark:
Data Preparation and Regular Expressions
Data Cleaning and Transformation
Training a Sentiment Analysis Model - Part One
Training a Sentiment Analysis Model - Part Two

Machine Learning in Real-Time:
Fetching Data from Twitter
Spark Structured Streaming
Managing and Converting Streams
Assembling Our Streaming ML Solution
A Structured Approach to ML Streaming

The Power of PySpark:
Running Spark in Production
Running Spark at Scale
Tips, Tricks, and Take-Aways