Official resources from Spark and Databricks
Using Spark for practice
Certification
- Databricks Certified Associate Developer for Apache Spark — tips to get prepared for the exam
- Databricks Data Engineer Associate Exam Made Easy — A Comprehensive Guide
- Databricks Certification and Badging
- Databricks Certification Notes
- Study Guide for Databricks Certified Associate Developer for Apache Spark 3.0 Certification
Blogs and articles
- The Internals of Spark SQL
- Spark Core
- Spark 3 Array Functions
- How to solve the “large number of small files” problem in Spark
- THE BRICK LEARNING - Medium
- Databricks Cost Observability & Optimization AI Solution Using Databricks AI Framework
- Databricks Autoloader Cookbook
- Market Basket Analysis [at scale] - Spark
- Spark Tips. Partition Tuning
- Spark Partitions
- Apache Spark — Repartitioning 101
- S3 Cost Optimization
- Unity Catalog Best Practices
- Delta Lake Optimisation Guide
- Databricks - How to change a partition of an existing Delta table?
- Medallion Architecture