About DataOps Certified Professional (DOCP)
The DataOps Certified Professional (DOCP) is a globally recognized certification designed to validate practical expertise in DataOps principles, tools, and automated data pipeline management. Overview DOCP covers…
Top 10 DataOps Tools in 2025
What is DataOps? DataOps is an organizational practice (people + process + platforms) that applies DevOps and agile principles to the end-to-end data lifecycle—from ingestion and transformation…
Databricks Lab & Exercise
Databricks Account Console Databricks Lab – Create an Azure Databricks workspace Databricks: Set Up Metastore & Map Azure Storage Account with Access Connector, Enable Unity Catalog Databricks…
Databricks: User Management in Databricks
Introduction In Databricks, identities (users, groups, service principals) live at the account level and can be assigned to one or more workspaces. For Unity Catalog (UC), principals…
Databricks: Databricks Secret Management & Secret Scopes
Introduction Hard-coding credentials (DB passwords, API tokens, SAS keys, hosts) in notebooks or jobs is risky. In Databricks you store them as secrets inside a secret scope,…
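The redaction behavior that makes secret scopes safe can be sketched in plain Python. This toy `ToySecretScope` class is an invention for illustration only, not the Databricks API — in a notebook you would call `dbutils.secrets.get(scope=..., key=...)` instead:

```python
# Toy illustration of the secret-scope idea: a named scope maps keys to
# values; code can read the value, but listing and display never reveal it,
# mimicking how Databricks redacts secrets in notebook output.
# This is NOT the Databricks API.

class ToySecretScope:
    def __init__(self, name):
        self.name = name
        self._secrets = {}

    def put(self, key, value):
        self._secrets[key] = value

    def get(self, key):
        # Returns the real value for use in code (e.g. a JDBC password).
        return self._secrets[key]

    def list(self):
        # Listing shows key names only, never values.
        return sorted(self._secrets)

    def display(self, key):
        # Printing a secret is redacted, as in Databricks notebook output.
        return "[REDACTED]"

scope = ToySecretScope("jdbc")
scope.put("db-password", "s3cr3t")
print(scope.list())                  # ['db-password']
print(scope.display("db-password"))  # [REDACTED]
```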
Databricks: Truncate-and-Load as a streaming source, Full Refresh of a DLT pipeline, Workflow file-arrival triggers
Introduction Today we’ll cover four production patterns for Delta Live Tables (DLT): Truncate-Load table as Source for Streaming Tables (with skipChangeCommits) Problem: Your upstream system truncates a…
Databricks: hands-on tutorial for DLT Data Quality & Expectations
Here’s a complete, hands-on tutorial for DLT Data Quality & Expectations — including how to define rules, use warning / fail / drop actions, and monitor a…
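The three expectation actions the excerpt mentions can be sketched in plain Python. The function name `apply_expectation` is invented for this illustration; in DLT itself you would use the `@dlt.expect`, `@dlt.expect_or_drop`, and `@dlt.expect_or_fail` decorators:

```python
# Pure-Python sketch of DLT's three expectation actions:
#   warn -- keep the bad row, but count the violation
#   drop -- silently remove the bad row
#   fail -- abort the update on the first bad row

def apply_expectation(rows, predicate, action="warn"):
    kept, violations = [], 0
    for row in rows:
        if predicate(row):
            kept.append(row)
        else:
            violations += 1
            if action == "fail":
                raise ValueError(f"Expectation failed for row: {row}")
            if action == "warn":
                kept.append(row)   # warn keeps bad rows, just records them
            # "drop" keeps nothing for the failing row
    return kept, violations

rows = [{"id": 1, "age": 34}, {"id": 2, "age": -5}]
valid_age = lambda r: r["age"] >= 0

print(apply_expectation(rows, valid_age, "warn"))  # both rows kept, 1 violation
print(apply_expectation(rows, valid_age, "drop"))  # one row kept, 1 violation
```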
Databricks: DLT SCD2 & SCD1 table | Apply Changes | CDC | Back-loading SCD2 | Delete/Truncate SCD
Introduction Goal: Build a CDC-ready dimension pipeline in Delta Live Tables (DLT) that supports: Core ideas you’ll use What we’ll model How to build SCD1 or SCD2…
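The SCD2 bookkeeping behind "Apply Changes" can be sketched in plain Python. In DLT this is `APPLY CHANGES INTO` / `dlt.apply_changes(..., stored_as_scd_type=2)`; the helper below is only an invented illustration of the close-old-version, open-new-version mechanics:

```python
# Minimal pure-Python sketch of SCD2 "apply changes": each change closes
# the current version of a key (sets end_ts) and appends a new open version.

def apply_change_scd2(dim, key, attrs, ts):
    """dim: list of dicts with key, attrs, start_ts, end_ts (None = current)."""
    for row in dim:
        if row["key"] == key and row["end_ts"] is None:
            if row["attrs"] == attrs:
                return dim            # no-op: nothing actually changed
            row["end_ts"] = ts        # close the current version
    dim.append({"key": key, "attrs": attrs, "start_ts": ts, "end_ts": None})
    return dim

dim = []
apply_change_scd2(dim, "C1", {"city": "Pune"}, "2024-01-01")
apply_change_scd2(dim, "C1", {"city": "Mumbai"}, "2024-06-01")
# Two versions of C1 now exist: the Pune row is closed, Mumbai is current.
print(dim)
```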
Databricks: DLT Append Flow (Union) & Auto Loader
Pass parameters in a DLT pipeline | Generate tables dynamically This hands-on guide shows how to: We’ll build on your earlier DLT pipeline (Orders + Customers →…
Databricks: Delta Live Tables (DLT) Internals & Incremental Load
Delta Live Tables (DLT) Internals & Incremental Load Part 2: Add/Modify Columns | Rename Tables | Data Lineage This tutorial walks step by step through advanced Delta…
Databricks: DLT Introduction
Introduction Goal: Build a Delta Live Tables (DLT) pipeline that: What DLT gives you (why declarative matters): What we’ll build: What is Delta Live Tables (DLT)? How…
Databricks: Medallion Architecture in Data Lakehouse
Here’s a step-by-step tutorial with deep explanations + examples: 📘 Medallion Architecture in Data Lakehouse (Bronze, Silver, Gold Layers with Databricks) 1. 🔹 Introduction In a Data…
Databricks: Databricks Auto Loader Tutorial
🚀 Databricks Auto Loader Tutorial (with Schema Evolution Modes & File Detection Modes) Auto Loader in Databricks is the recommended way to ingest files incrementally and reliably…
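The incremental-ingest idea behind Auto Loader — process only files not seen before, tracked in a checkpoint — can be sketched with a plain directory scan. Auto Loader itself does this with the `cloudFiles` source and a `checkpointLocation`; the `seen` set below merely stands in for that checkpoint:

```python
import os, tempfile

# Toy sketch of incremental file ingestion: a checkpoint (here, a set)
# remembers files already processed, so each run picks up only new arrivals.

def ingest_new_files(directory, seen):
    new = sorted(f for f in os.listdir(directory) if f not in seen)
    seen.update(new)
    return new  # files to process this run

with tempfile.TemporaryDirectory() as d:
    seen = set()
    open(os.path.join(d, "a.json"), "w").close()
    print(ingest_new_files(d, seen))  # ['a.json']
    open(os.path.join(d, "b.json"), "w").close()
    print(ingest_new_files(d, seen))  # ['b.json']  (a.json is skipped)
```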
Databricks: Databricks COPY INTO Command – Idempotent & Exactly-Once Data Loading
1. 🔹 What is COPY INTO? 👉 For millions of files or complex directories, use Auto Loader instead. 2. 🔹 Setup: Managed Volume & Input Files Now we…
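The idempotent, exactly-once property in the title can be sketched in plain Python: COPY INTO tracks which files it has already loaded, so re-running the same command loads nothing new. The load-history set below stands in for the file metadata Delta keeps; it is an illustration, not the real mechanism:

```python
# Sketch of COPY INTO's exactly-once behavior: files already recorded in
# the load history are skipped, so a re-run of the same command is a no-op.

def copy_into(table, files, load_history):
    loaded = [f for f in files if f not in load_history]
    table.extend(loaded)
    load_history.update(loaded)
    return len(loaded)  # number of files loaded this run

table, history = [], set()
print(copy_into(table, ["f1.csv", "f2.csv"], history))  # 2
print(copy_into(table, ["f1.csv", "f2.csv"], history))  # 0  (idempotent re-run)
```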
Databricks: Databricks Workflows (Jobs, Tasks, Passing Values, If/Else, Re-runs, and Loops)
1. 🔹 Introduction to Workflows 2. 🔹 Jobs UI Overview When creating a job: 3. 🔹 Creating a Job (Example: Process Employee Data) Workflow: Notebook Setup 4….
Databricks: Custom Cluster Policies & Instance Pools in Databricks
1. 🔹 Why Policies and Pools? These features are critical in enterprise Databricks deployments to enforce compliance, control costs, and improve performance. 2. Custom Cluster Policies…
Databricks: Databricks Compute (Clusters, Access Modes, Policies, and Permissions)
1. What is Compute in Databricks? 2. Types of Compute in Databricks 🔹 All-Purpose Compute 🔹 Job Compute 🔹 Serverless Compute (preview/GA availability varies by region) 3….
Orchestrating and Scheduling Notebooks in Databricks
1. Introduction Databricks notebooks can be parameterized and orchestrated like workflows. You can: 2. Setup:…
Databricks: Databricks Utilities (dbutils) – Complete Guide
🔹 1. Introduction In Databricks, you often need to interact with: 👉 For these tasks, Databricks Utilities (dbutils) provide built-in helpers. Key points: 🔹 2. What is…
Databricks: Using Volumes in Databricks with Unity Catalog
🔹 1. Introduction In Databricks, we usually store tabular data in Delta tables (structured data).But what about: 👉 For these, Databricks introduces Volumes, which provide a governed,…