Databricks: Delta Tables – Deletion Vectors & Liquid Clustering
Delta Lake keeps improving with features that optimize performance and storage. Two of the most important recent features are: Let’s explore both in detail with examples you…
Databricks: Delta Tables MERGE & UPSERT (SCD1 + Soft Deletes)
This tutorial covers how to perform upserts (MERGE) in Delta tables on Databricks, with both hard deletes and soft deletes (using SCD1 style). 1. 🔹 Introduction In…
Databricks: Delta Tables, Catalogs, Views, and Clones
This tutorial will walk you through core Delta Lake functionality in Databricks, including catalogs, schemas, tables, views, CTAS, deep clone, and shallow clone. Each section is backed…
Databricks – Catalog, Schemas & Tables with External Location
this is exactly the core of Unity Catalog’s object model. The way Databricks resolves storage paths for managed tables depends on where you attach the external/managed location….
Databricks Lab – Managed vs External Tables + UNDROP (with External Location setup)
Databricks Unity Catalog Tutorial Managed vs External Tables + UNDROP (with External Location setup) Introduction (what we’ll build) You’ll learn to: What’s new in Databricks? (Updates &…
Databricks Lab – Working with Schemas and External Locations
We will: Unity Catalog has a 4-level hierarchy: Metastore → Catalog → Schema → Table 👉 Today we’ll create three schemas to see how Unity Catalog stores…
Databricks Lab – Catalog with External Location, & Storage Credentials in Unity Catalog
Good Read – https://dataopsschool.com/blog/databricks-catalog-schemas-tables-with-external-location/ 1. Create Catalog without External Location 2. Create Catalog with SQL 3. Drop Catalog and Drop Catalog Recursively 4. Create External Location in…
Databricks: Unity Catalog vs Catalogs vs Workspace vs Metastore
🔑 Unity Catalog vs Catalogs vs Workspace vs Metastore 1. Unity Catalog (UC) ✅ 👉 Analogy: National Library System – it governs all libraries in a country….
Databricks Components
Databricks Components Hierarchy 1. Account Level (Top Layer) 2. Governance & Data Management 3. Computation & Execution 4. Developer Interfaces 5. Data & AI Layers ✅ In…
Tutorial: Data Democratization in the Context of DataOps
1. Introduction & Overview What is Data Democratization? Data Democratization is the process of making data accessible, understandable, and usable to everyone in an organization—without requiring deep…
Semantic Layer in DataOps: A Comprehensive Tutorial
Introduction & Overview What is a Semantic Layer? A semantic layer is a data abstraction layer that sits between raw data sources and business users, providing a…
Tutorial: Metrics Store in the Context of DataOps
1. Introduction & Overview What is a Metrics Store? A Metrics Store is a centralized repository designed to store, organize, and serve business metrics in a consistent,…
KPI Dashboard in the Context of DataOps – A Comprehensive Tutorial
1. Introduction & Overview What is a KPI Dashboard? A KPI Dashboard (Key Performance Indicator Dashboard) is a data visualization tool that consolidates and displays real-time business…
Self-Service Analytics in DataOps: A Comprehensive Tutorial
1. Introduction & Overview What is Self-Service Analytics? Self-Service Analytics (SSA) is an approach that empowers business users, analysts, and even non-technical stakeholders to access, explore, and…
Tutorial: Embedded Analytics in DataOps
1. Introduction & Overview What is Embedded Analytics? Embedded Analytics is the integration of analytical capabilities (like dashboards, reporting, and visualization) directly into applications, workflows, or business…
Comprehensive Tutorial on Looker in DataOps
1. Introduction & Overview What is Looker? Looker is a modern Business Intelligence (BI) and Data Analytics platform (acquired by Google in 2019, now part of Google…
Power BI in the Context of DataOps – A Comprehensive Tutorial
1. Introduction & Overview What is Power BI? Power BI is a business intelligence (BI) and data visualization tool by Microsoft that allows organizations to connect to…
Tutorial: Tableau in the Context of DataOps
1. Introduction & Overview What is Tableau? Tableau is a leading data visualization and business intelligence (BI) platform that helps teams transform raw data into interactive dashboards…
Comprehensive Tutorial on BI Tools in DataOps
1. Introduction & Overview What are BI Tools? Business Intelligence (BI) Tools are software applications that help organizations analyze, visualize, and report on data to make informed…