Course Overview

Apache Spark has a thriving open-source community and is the most active Apache project at the moment proving it to be the uncontested winner in big data processing. Spark provides a faster and more general data processing platform letting you write code more quickly as you have over 80 high-level operators at your disposal. It lets you run programs up to 100x faster in memory, or 10x faster on disk than Hadoop. Today, Spark is being adopted by major players like Amazon, eBay, and Yahoo! In this course we are going to take a deep dive into the use cases, architecture, commonly used API’s, simple and a few advanced transformations and actions that are required to get onboarded and get started to process huge amounts of data quickly in an efficient manner. By the end of this course, you will have gained a significant amount of mastery over Apache Spark. You will have mastered all the core concepts since this hands-on session is designed just for you!

What You Will Learn

  • Deeper understanding of Apache Spark fundamentals
  • Efficiently implement and deliver high-performance Spark applications
  • Install and run Apache Spark on a desktop computer or on a cluster
  • Understand Best practices of working with Apache Spark in the field
  • Techniques for getting the most out of standard RDD transformations
  • Get a strong handle on advanced techniques to optimize and tune Apache Spark jobs
  • Inspect
  • tune
  • and debug your Spark operations with Spark configurations and Spark UI
  • Get familiar with batch and streaming APIs

Program Curriculum

  • What is Big Data?
  • Analytics
  • The Importance of Big Data
  • Chapter 1 Quiz
  • Where does Spark Fit in?
  • Use Case
  • $7 Million Cybersecurity Scholarship by EC-Council

  • In and Around Spark
  • Architecture
  • Chapter 2 Quiz

  • Hardware Requirements
  • Let’s Install Spark
  • Pre-requisites
  • Chapter 3 Quiz

  • SparkContext
  • Resilient Distributed Dataset (RDD)
  • SQLContext
  • SparkSession
  • Dataframe
  • Types of Spark Operations
  • Chapter 4 Quiz

  • Lazy Evaluation
  • Lambda Function
  • Spark UDFs (User Defined Functions)
  • Different Methods We Can Make Use of UDFs
  • Chapter 5 Quiz

  • Advanced Transformations
  • Actions
  • Window Functions in Spark
  • Chapter 4 Quiz
Load more modules

Instructor

Yogesh Nageswara Rao

Yogesh Nageswara Rao is an experienced data engineer and architect with an overall of 6+ years in the Big Data domain as a developer, architect which includes extensive work on Hadoop framework, Apache Spark, AWS, Splunk, Tableau & Looker. Yogesh has spent 6 years at Comcast Xfinity and CapitalOne, developing and managing the technology to ingest tera bytes of data per day on various innovative product lines and banking needs, and has been using Apache Spark from its initial days of existence.

Join over 1 Million professionals from the most renowned Companies in the world!

certificate

Empower Your Learning with Our Flexible Plans

Invest in your future with our flexible subscription plans. Whether you're just starting out or looking to enhance your expertise, there's a plan tailored to meet your needs. Gain access to in-demand skills and courses for your continuous learning needs.

Monthly Plans
Annual Plans
Save 20% with our annual plans!

Pro

Ideal for continuous learning, offering extensive resources with 600+ courses and diverse Learning Paths to enhance your skills.

$ 499.00
Billed annually or $59.00 billed monthly

What is included

  • 700+ Premium Short Courses
  • 50+ Structured Learning Paths
  • Validation of Completion with all courses and learning paths
  • New Courses added every month
Early Access Offer

Pro +

Experience immersive learning with Practice Labs, CTF Challenges, and exclusive EC-Council certifications for comprehensive skill-building.

$ 599.00
Billed annually or $69.00 billed monthly

Everything in Pro and

  • 800+ Practice Lab exercises with guided instructions
  • 150+ CTF Challenges with detailed walkthroughs
  • New Practice Labs and Challenges added every month
  • 3 Official EC-Council Essentials Certifications¹ (retails at $897!)
    Exclusive Bonus with Annual Plans

¹This plan includes Digital Forensics Essentials (DFE), Ethical Hacking Essentials (EHE), and Network Defense Essentials (NDE) certifications. No other EC-Council certifications are included.

Related Courses

1 of 8