🧱
Databricks
Azure Databricks Platform & Governance
🧱
🧱
Databricks · Section 11 of 18

Azure Databricks Platform & Governance

🔒

This section is locked

Unlock every deep-dive, lab, mock interview, and memory map across all 10 topics.

View Plans — from ₹299/month

Already have a plan? Sign in

Azure Databricks Platform & Governance

💡 Interview Tip
Time: 6-7 hours | Priority: HIGH — Platform knowledge differentiates senior from mid-level Context: Azure cloud, GDPR compliance, multi-tenant data sharing, PII governance Approach: Every topic starts with simple explanation → then interview-level depth

SECTION 1: UNITY CATALOG (1.5 hours)

Q1: What is Unity Catalog? What is the object hierarchy?

Simple Explanation: In Databricks, you have hundreds of tables, ML models, files, and functions. Unity Catalog is the single place that manages ALL of them — who can access what, where data came from, how it's organized, and what changed.

Think of Unity Catalog as the security guard + librarian + receptionist of your entire data platform:

  • Security guard: Controls who can access which tables (access control)
  • Librarian: Organizes all data into catalogs/schemas so you can find it easily (discovery)
  • Receptionist: Keeps a log of who accessed what and when (auditing)

Why do we need it? Without Unity Catalog: Each team creates tables in random locations, no one knows who has access to what, PII data leaks because there's no control, and you can't trace where data came from. With Unity Catalog: One central place to govern everything — tables, ML models, files, permissions.

The hierarchy (how data is organized):

🗂️Metastore (top level — ONE per Azure region)
│ Think of this as: "The entire data universe for your region"
Catalog (logical grouping — like separate departments)
│ Examples: travel_prod, travel_dev, travel_staging
│ Think of this as: "A database server" or "an environment"
Schema (like a database inside the catalog)
│ Examples: bookings, passengers, flights, analytics
│ Think of this as: "A database" or "a topic area"
Table (your data — managed or external)
View (a saved query on top of tables)
Function