Databricks: Delta Tables – Deletion Vectors & Liquid Clustering

Delta Lake keeps improving with features that optimize performance and storage. Two of the most important recent features are: Let’s explore both in detail with examples you…

Read More

Databricks: Delta Tables MERGE & UPSERT (SCD1 + Soft Deletes)

This tutorial covers how to perform upserts (MERGE) in Delta tables on Databricks, with both hard deletes and soft deletes (using SCD1 style). 1. 🔹 Introduction In…

Read More

Databricks: Delta Tables, Catalogs, Views, and Clones

This tutorial will walk you through core Delta Lake functionality in Databricks, including catalogs, schemas, tables, views, CTAS, deep clone, and shallow clone. Each section is backed…

Read More

Databricks – Catalog, Schemas & Tables with External Location

this is exactly the core of Unity Catalog’s object model. The way Databricks resolves storage paths for managed tables depends on where you attach the external/managed location….

Read More

Databricks Lab – Managed vs External Tables + UNDROP (with External Location setup)

Databricks Unity Catalog Tutorial Managed vs External Tables + UNDROP (with External Location setup) Introduction (what we’ll build) You’ll learn to: What’s new in Databricks? (Updates &…

Read More

Databricks Lab – Working with Schemas and External Locations

We will: Unity Catalog has a 4-level hierarchy: Metastore → Catalog → Schema → Table 👉 Today we’ll create three schemas to see how Unity Catalog stores…

Read More

Databricks Lab – Catalog with External Location, & Storage Credentials in Unity Catalog

Good Read – https://dataopsschool.com/blog/databricks-catalog-schemas-tables-with-external-location/ 1. Create Catalog without External Location 2. Create Catalog with SQL 3. Drop Catalog and Drop Catalog Recursively 4. Create External Location in…

Read More

Databricks: Unity Catalog vs Catalogs vs Workspace vs Metastore

🔑 Unity Catalog vs Catalogs vs Workspace vs Metastore 1. Unity Catalog (UC) ✅ 👉 Analogy: National Library System – it governs all libraries in a country….

Read More

Databricks Components

Databricks Components Hierarchy 1. Account Level (Top Layer) 2. Governance & Data Management 3. Computation & Execution 4. Developer Interfaces 5. Data & AI Layers ✅ In…

Read More

Databricks: Service Principal in Databricks using Azure?

What Is a Service Principal in Databricks? A service principal is a specialized, non-human identity within Azure Databricks, designed exclusively for automation, integrations, and programmatic access. Service…

Read More

Databricks: What is Databricks workspace?

What Is a Databricks Workspace? A Databricks workspace is the core organizational environment in Databricks where teams perform all collaborative data engineering, data science, analytics, and machine…

Read More

Databricks: Set Up Metastore & Map Azure Storage Account with Access Connector, Enable Unity Catalog

This guide walks you through setting up a Unity Catalog metastore in Azure Databricks, connecting it securely to an Azure storage account using the Access Connector, validating…

Read More

Databricks: Step-by-Step Commands: Managed vs. External Table in Databricks

Below is a complete workflow—with working SQL and Python code—demonstrating how to create, manage, insert, read, and delete data for both Managed and External tables in Databricks….

Read More

Databricks: File Storage Options on Databricks

The main file storage options in Databricks are: Option Best Use Case Security/Governance Notes Unity Catalog Volumes Data, artifacts across workspaces Strong Recommended, scalable Workspace Files Notebooks,…

Read More

Databricks: Working with Different Types of Tables

Databricks supports several types of tables, each designed for distinct storage, management, and integration scenarios. The main table types are: Summary Table Table Type Storage/Location Management Formats…

Read More

Databricks: dbutils is a utility library

dbutils is a built-in utility module in Databricks notebooks (Python, Scala, R) that provides programmatic access to common workspace tasks, including interacting with the Databricks File System…

Read More

Databricks: Unity Catalog

here’s the simplified definition of Unity Catalog: In short — it’s the “library catalog” and “security guard” for all your Databricks data and AI. If you want,…

Read More

Databricks Account Console

The Databricks Account Console is the central, account-level management portal for Databricks — it’s where you control everything that spans multiple workspaces. Think of it as the…

Read More

Databricks Lab & Excercise – Notebook – Unity Catalog → schema → table

let’s make this a “Databricks SQL Quickstart – 25 Commands” guide for first-time use in the Notebook with the Unity Catalog → schema → table workflow. I’ll…

Read More