Databricks Certified Data Engineer Associate – Preparation

Wishlist Share
Share Course
Page Link
Share On Social Media

About Course

If you are interested in becoming a Certified Data Engineer Associate from Databricks, you have come to the right place! This study guide will help you with preparing for this certification exam.

By the end of this course, you should be able to:

  • Understand how to use and the benefits of using the Databricks Lakehouse Platform and its tools, including:
    • Data Lakehouse (architecture, descriptions, benefits)
    • Data Science and Engineering workspace (clusters, notebooks, data storage)
    • Delta Lake (general concepts, table management and manipulation, optimizations)
  • Build ETL pipelines using Apache Spark SQL and Python, including:
    • Relational entities (databases, tables, views)
    • ELT (creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs)
    • Python (facilitating Spark SQL with string manipulation and control flow, passing data between PySpark and Spark SQL)
  • Incrementally process data, including:
    • Structured Streaming (general concepts, triggers, watermarks)
    • Auto Loader (streaming reads)
    • Multi-hop Architecture (bronze-silver-gold, streaming applications)
    • Delta Live Tables (benefits and features)
  • Build production pipelines for data engineering applications and Databricks SQL queries and dashboards, including:
    • Jobs (scheduling, task orchestration, UI)
    • Dashboards (endpoints, scheduling, alerting, refreshing)
  • Understand and follow best security practices, including:
    • Unity Catalog (benefits and features)
    • Entity Permissions (data objects Privileges)

With the knowledge you gain during this course, you will be ready to take the certification exam.

I am looking forward to meeting you!

Show More

What Will You Learn?

  • Understand how to use Databricks Lakehouse Platform and its tools
  • Build ETL pipelines using Apache Spark SQL and Python
  • Process data incrementally in batch and streaming mode
  • Orchestrate production pipelines
  • Understand and follow best security practices in Databricks

Course Content

01 Introduction

  • Download Resource Files
  • 001 Course Overview
    01:36
  • 002 What is Databricks
    05:05
  • 003 Get started with Community Edition
    03:21
  • 004 Free trial on Azure
    03:39
  • 005 Exploring Workspace
    03:37
  • 006 Course Materials
    01:31
  • 007 Creating Cluster
    06:47
  • 008 Notebooks Fundamentals
    13:48
  • 009 Databricks Repos
    08:39

02 Databricks Lakehouse Platform

03 ELT with Spark SQL and Python

04 Incremental Data Processing

05 Production Pipelines

06 Data Governance

07 Certification Overview

Student Ratings & Reviews

No Review Yet
No Review Yet