Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning tasks. Coursera's Apache Spark skill catalogue teaches you about this powerful tool for handling big data analytics. You'll learn the fundamentals of Spark's distributed computing model, its powerful data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also delve into Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning tasks. Master these aspects to enhance your data science skills and solve complex computational problems in various industries.
32credentials
76courses

Related roles

Gain the knowledge and skills you need to advance.

  • This role has a $137,984 median salary ¹.

    description:

    A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a $137,984 median salary ¹.

    Offered by

    DeepLearning.AI_logo
    Amazon Web Services_logo
    Google Cloud_logo

Most popular

Trending now

New releases

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Results for "apache spark"

  • Status: Free Trial

    Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Distributed Computing, Performance Tuning, Data Transformation, Debugging

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Machine Learning, Generative AI, PySpark, Applied Machine Learning, Supervised Learning, Apache Hadoop, Data Pipelines, Unsupervised Learning, Feature Engineering, Data Processing, Extract, Transform, Load, Predictive Modeling, Data Transformation, Regression Analysis

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, MySQL, Data Pipelines, Scala Programming, Extract, Transform, Load, Customer Analysis, Apache Hadoop, Classification And Regression Tree (CART), Predictive Modeling, Applied Machine Learning, Data Processing, Advanced Analytics, Big Data, Apache Maven, Statistical Machine Learning, Unsupervised Learning, SQL, Apache, Python Programming

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, Customer Analysis, Big Data, Data Processing, Advanced Analytics, Statistical Modeling, Text Mining, Customer Insights, Data Mining, Data Transformation, Unstructured Data, Predictive Modeling, Simulation and Simulation Software, Data Manipulation, Marketing Analytics, Image Analysis, Risk Analysis

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Data Lakes, Analytics, Data Pipelines, Data Processing, Data Import/Export, Data Integration, Linux Commands, Data Mapping, Linux, File Systems, Text Mining, Data Management, Distributed Computing, Java, C++ (Programming Language)

  • Status: Free Trial

    Skills you'll gain: NoSQL, Apache Hadoop, Apache Spark, MongoDB, PySpark, Apache Hive, Databases, Apache Cassandra, Big Data, Machine Learning, Generative AI, IBM Cloud, Applied Machine Learning, Kubernetes, Supervised Learning, Distributed Computing, Docker (Software), Database Management, Data Pipelines, Scalability

What brings you to Coursera today?

  • Status: Free Trial
    Status: AI skills

    Skills you'll gain: NoSQL, Data Warehousing, Apache Hadoop, Extract, Transform, Load, Apache Airflow, Web Scraping, Linux Commands, Database Design, SQL, IBM Cognos Analytics, MySQL, Apache Spark, Database Administration, Data Store, Generative AI, Professional Networking, Data Import/Export, Python Programming, Data Analysis, Data Science

  • Status: Preview

    Skills you'll gain: PySpark, Apache Spark, Data Management, Distributed Computing, Apache Hadoop, Data Processing, Data Analysis, Exploratory Data Analysis, Python Programming, Scalability

  • Status: Free Trial

    LearnKartS

    Skills you'll gain: Apache Kafka, Apache Spark, Prometheus (Software), Data Pipelines, Distributed Computing, Real Time Data, Data Processing, Security Controls, Configuration Management, Application Performance Management, Performance Tuning, Encryption, Authorization (Computing), Authentications, Data Storage Technologies, Server Administration, Network Monitoring, File Management

  • Status: Free Trial

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Apache Spark, Apache Hadoop, Scala Programming, Distributed Computing, Big Data, Data Manipulation, Data Processing, Performance Tuning, Data Transformation, SQL, Data Analysis

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Scala Programming, Data Processing, Big Data, Applied Machine Learning, IntelliJ IDEA, Real Time Data, Graph Theory, Data Transformation, Development Environment, Distributed Computing, Build Tools, Regression Analysis, Performance Tuning

  • Status: Free Trial

    Johns Hopkins University

    Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Science, SQL, Query Languages, Java, Data Manipulation, Data Structures, Distributed Computing, Scripting Languages, Data Transformation, Performance Tuning

What brings you to Coursera today?

Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • Pearson
  • University of California San Diego
  • Amazon Web Services
  • Edureka