Databricks: Delta Tables – Deletion Vectors & Liquid Clustering
Delta Lake keeps improving with features that optimize performance and storage. Two of the most important recent features are Deletion Vectors and Liquid Clustering. Let’s explore both in detail with examples you…
Databricks: Delta Tables MERGE & UPSERT (SCD1 + Soft Deletes)
This tutorial covers how to perform upserts (MERGE) in Delta tables on Databricks, with both hard deletes and soft deletes (using SCD1 style). 1. 🔹 Introduction In…
Databricks: Delta Tables, Catalogs, Views, and Clones
This tutorial will walk you through core Delta Lake functionality in Databricks, including catalogs, schemas, tables, views, CTAS, deep clone, and shallow clone. Each section is backed…
Databricks – Catalog, Schemas & Tables with External Location
This is the core of Unity Catalog’s object model: the way Databricks resolves storage paths for managed tables depends on where you attach the external/managed location….
Databricks Lab – Managed vs External Tables + UNDROP (with External Location setup)
Databricks Unity Catalog Tutorial. Introduction (what we’ll build) You’ll learn to: What’s new in Databricks? (Updates &…
Databricks Lab – Working with Schemas and External Locations
We will: Unity Catalog has a 4-level hierarchy: Metastore → Catalog → Schema → Table 👉 Today we’ll create three schemas to see how Unity Catalog stores…
Databricks Lab – Catalog with External Location, & Storage Credentials in Unity Catalog
Good Read – https://dataopsschool.com/blog/databricks-catalog-schemas-tables-with-external-location/ 1. Create Catalog without External Location 2. Create Catalog with SQL 3. Drop Catalog and Drop Catalog Recursively 4. Create External Location in…
Databricks: Unity Catalog vs Catalogs vs Workspace vs Metastore
🔑 Unity Catalog vs Catalogs vs Workspace vs Metastore 1. Unity Catalog (UC) ✅ 👉 Analogy: National Library System – it governs all libraries in a country….
Databricks Components
Databricks Components Hierarchy 1. Account Level (Top Layer) 2. Governance & Data Management 3. Computation & Execution 4. Developer Interfaces 5. Data & AI Layers ✅ In…
Databricks: Service Principal in Databricks using Azure?
What Is a Service Principal in Databricks? A service principal is a specialized, non-human identity within Azure Databricks, designed exclusively for automation, integrations, and programmatic access. Service…
Databricks: What is Databricks workspace?
What Is a Databricks Workspace? A Databricks workspace is the core organizational environment in Databricks where teams perform all collaborative data engineering, data science, analytics, and machine…
Databricks: Set Up Metastore & Map Azure Storage Account with Access Connector, Enable Unity Catalog
This guide walks you through setting up a Unity Catalog metastore in Azure Databricks, connecting it securely to an Azure storage account using the Access Connector, validating…
Databricks: Step-by-Step Commands: Managed vs. External Table in Databricks
Below is a complete workflow—with working SQL and Python code—demonstrating how to create, manage, insert, read, and delete data for both Managed and External tables in Databricks….
Databricks: File Storage Options on Databricks
The main file storage options in Databricks are: Option Best Use Case Security/Governance Notes Unity Catalog Volumes Data, artifacts across workspaces Strong Recommended, scalable Workspace Files Notebooks,…
Databricks: Working with Different Types of Tables
Databricks supports several types of tables, each designed for distinct storage, management, and integration scenarios. The main table types are: Summary Table Table Type Storage/Location Management Formats…
Databricks: dbutils is a utility library
dbutils is a built-in utility module in Databricks notebooks (Python, Scala, R) that provides programmatic access to common workspace tasks, including interacting with the Databricks File System…
Databricks: Unity Catalog
Here’s the simplified definition of Unity Catalog: in short, it’s the “library catalog” and “security guard” for all your Databricks data and AI…
Databricks Account Console
The Databricks Account Console is the central, account-level management portal for Databricks — it’s where you control everything that spans multiple workspaces. Think of it as the…
Databricks Lab & Exercise – Notebook – Unity Catalog → schema → table
This is a “Databricks SQL Quickstart – 25 Commands” guide for first-time use in a notebook, following the Unity Catalog → schema → table workflow. I’ll…