Hadoop for Data Science Tips, Tricks, & Techniques

Video: Introducing this tutorial


What you should know
Exercise files
Environment setup

1. Working with Files

Organize files in HDFS
Upload files to HDFS
Move files in HDFS
Remove files in HDFS

2. Connecting to Hadoop

Explore Hive through Beeline
Access Hive from Python
Create aggregates in Hive
Select partitions in Hive

3. Complex Data Structures in Hive

Map data in Hive
Arrays in Hive
Structs in Hive
Create flat tables for Impala
Deconstruct Impala queries


Next steps