Spark Essentials with Scala
The only Spark Big Data online course on the web with actual big data.
Become a professional.
You probably know by now: Spark is the most popular computing engine for big data, the most maintained, and has a proven track record of performance. It can be up to 100 times faster than the old MapReduce paradigm for in-memory workloads, and it can easily be extended with machine learning, streaming capabilities, and much more.
If you're dealing with large amounts of data, learning Spark is a must.
The demand for Spark has skyrocketed, and companies are struggling to fill their Data Scientist positions. Scala and Spark are two of the best-paying technologies in the field. Forget the reported 120k salaries on PayScale and StackOverflow. I regularly see engineers and data scientists earning 150k+ per year, or charging thousands a day for consulting.
Whichever path you choose, learning Spark can be a game changer for your career, and this course will help you build those skills. Join this Spark Big Data online course and learn by writing code.
Work with real big data.
At the end of the course, we will dive into one of the biggest datasets publicly available and put everything we've learned to the test. Unlike any other material on the web (free or paid), this Spark course is the only place where you can really practice big data.
Everyone else runs a Spark job on one million records in a 20MB dataset. Why would you need Spark for that?
We do 1.4 BILLION car trips in a 400GB dataset. You don't fit that into most computers. That's the definition of big data.
This will be your true exercise to practice Spark with Scala. At the end of the project, we gather data insights worth millions of dollars for the company you're looking to help, and tens of thousands for you as a consultant and data scientist.
(picture courtesy of Ravi Shekhar)
Take the proven path.
As with the other Rock the JVM courses, this Spark and Big Data Essentials course will take you through a battle-tested path to Spark proficiency as a data scientist and engineer.
As always, I've
- deconstructed the complexity of Spark into bite-sized chunks that you can practice in isolation
- selected the essential concepts and exercises with the appropriate complexity
- sequenced the topics in increasing order of difficulty so that they "click" along the way
- applied everything in live code
Course Overview
- DataFrames Basics (23:09)
- DataFrames Basics: Exercises (8:16)
- How DataFrames Work (6:22)
- Data Sources (13:25)
- Data Sources, Part 2 + Exercises (27:43)
- DataFrame Columns and Expressions (24:33)
- Columns and Expressions: Exercises (17:12)
- DataFrame Aggregations (28:13)
- DataFrame Joins (20:23)
- Joins: Exercises (17:27)
Your Instructor
I'm a software engineer and the founder of Rock the JVM. I started the Rock the JVM project out of love for Scala and the technologies it powers - they are all amazing tools and I want to share as much of my experience with them as I can.
As of February 2024, I've taught Java, Scala, Kotlin and related tech (e.g. Cats, ZIO, Spark) to 100,000+ students at various levels, and I've held live training sessions for some of the best companies in the industry, including Adobe and Apple. I've also taught university students who now work at Google and Facebook (among others), I've held Hour of Code for 7-year-olds, and I've taught more than 35,000 kids to code.
I have a Master's Degree in Computer Science, and I wrote my Bachelor's and Master's theses on Quantum Computation. Before starting to learn programming, I won medals at international Physics competitions.
Get started now!
Risk-free: 100% money back guarantee.
If you're not happy with this Spark course, I want you to have your money back. If that happens, email me at [email protected] with a copy of your welcome email and I will refund the course.
Fewer than 0.1 percent of students have requested a refund, and every payment was returned in less than 72 hours.