Skip to content
Azure
── Foundations
Cloud Computing Concepts
Azure Fundamentals
What is ADF?
Synapse Workspace Setup
ADF vs Synapse
DB vs DW + SQL Pools
── Storage & Networking
Blob Storage
ADLS Gen2
Azure SQL Database
Azure Networking
All File Formats
Azure RBAC Roles
Azure Connections and Authentication
── Pipelines
Metadata-Driven Pipeline
Synapse + Audit Logging
Parameterized Datasets
ADF Expressions
Incremental Loading
Unified Full+Incremental
Pipeline JSON Guide
Audit Logging Concepts
── SCD & Data Flows
SCD Types (0,1,2,3,6)
SCD Type 1 Full Load
SCD Type 1 Hash-Based
SCD Type 2 Pipeline
Combined SCD1+SCD2
Data Flows Guide
Data Flow Joins
── Triggers & CI/CD
ADF Triggers
Trigger Parameters
CI/CD GitHub
CI/CD Azure DevOps
CI/CD for Azure Data Factory and Synapse: Complete Hands-On Guide
── Integration Runtime
IR Types Guide
On-Prem Pipeline + SHIR
── Troubleshooting
Data Lake Cleanup
Common Pipeline Errors
Databricks
Databricks Intro & dbutils
Connecting to Blob/ADLS
Secret Scopes & Key Vault
Reading/Writing Formats
Delta Lake Deep Dive
Connecting to Azure SQL
External Tables & Unity Catalog
Delta Lake and PySpark Optimization
File Storage in Azure Databricks
Data Quality in Azure Databricks
Databricks Workflows and Jobs
The Medallion Architecture in Azure
Databricks Git Integration and CI/CD
SCD Type 1 & Type 2 with PySpark Delta MERGE
AutoLoader (cloudFiles)
Unity Catalog Deep Dive
SQL
SQL Fundamentals, Execution Order & WHERE Clauses
GROUP BY, HAVING & CASE WHEN
Subqueries & Performance
SQL Functions
SQL Joins
Window Functions
CTEs & Subqueries
DDL, DML, and Constraints
Indexes and Execution Plans
Views, Temp Tables & Variables
Stored Procedures & Triggers
Normalization & Star Schema
UNION, PIVOT & Dynamic SQL
Transactions & ACID
SQL Interview Practice (20 Qs)
Python & PySpark
Python for Data Engineers
PySpark Foundations
PySpark Architecture
PySpark Joins
FastAPI on AWS Lambda
PySpark Transformations
Lazy Evaluation in PySpark
PySpark Window Functions Deep Dive
AWS
AWS S3
Glue Data Catalog
AWS Amplify
AWS Cognito
Fabric
Microsoft Fabric for Data Engineers
Capacity, Workspaces, Items
OneLake Deep Dive
OneLake Shortcuts
Connections & Gateways
Lakehouse vs Warehouse
Lakehouse Practical Guide
Warehouse Practical Guide
Warehouse Advanced (COPY INTO, CTAS, DMVs)
Fabric Data Factory & Pipelines
Data Factory Expression Language
Triggers, Scheduling & Orchestration
Dataflow Gen2: Introduction
Dataflow Gen2: Advanced Transforms
Dataflow Gen2: Production Patterns
M Language (Power Query) Guide
Fabric Notebooks Deep Dive
Apache Spark in Fabric
Spark Structured Streaming
Delta Lake Table Properties
Materialized Lake Views (MLVs)
Mirrored Databases
Power BI Direct Lake
Real-Time Intelligence
RTI Deep Dive (Windows, KQL, Materialized Views)
KQL (Kusto Query Language) Complete Guide
Data Activator
Security & Governance
Administration & Cost
Monitoring & Troubleshooting
Optimization Guide
Git Integration & CI/CD Deployment Pipelines
Fabric REST APIs
DP-700 Certification Study Guide
AI/ML
Artificial Intelligence and Machine Learning
Linear & Logistic Regression
Decision Trees & Random Forests
XGBoost & Gradient Boosting
Model Evaluation Deep Dive
Feature Engineering
Fine-Tuning LLMs
Clustering Algorithms
Hyperparameter Tuning
More
Top 20 DE Interview Questions
Top 15 ADF Interview Questions
Parquet vs CSV vs JSON
Schema-on-Write vs Read
How Real Companies Receive Data
Table Creation and Governance
Azure
── Foundations
Cloud Computing Concepts
Azure Fundamentals
What is ADF?
Synapse Workspace Setup
ADF vs Synapse
DB vs DW + SQL Pools
── Storage & Networking
Blob Storage
ADLS Gen2
Azure SQL Database
Azure Networking
All File Formats
Azure RBAC Roles
Azure Connections and Authentication
── Pipelines
Metadata-Driven Pipeline
Synapse + Audit Logging
Parameterized Datasets
ADF Expressions
Incremental Loading
Unified Full+Incremental
Pipeline JSON Guide
Audit Logging Concepts
── SCD & Data Flows
SCD Types (0,1,2,3,6)
SCD Type 1 Full Load
SCD Type 1 Hash-Based
SCD Type 2 Pipeline
Combined SCD1+SCD2
Data Flows Guide
Data Flow Joins
── Triggers & CI/CD
ADF Triggers
Trigger Parameters
CI/CD GitHub
CI/CD Azure DevOps
CI/CD for Azure Data Factory and Synapse: Complete Hands-On Guide
── Integration Runtime
IR Types Guide
On-Prem Pipeline + SHIR
── Troubleshooting
Data Lake Cleanup
Common Pipeline Errors
Databricks
Databricks Intro & dbutils
Connecting to Blob/ADLS
Secret Scopes & Key Vault
Reading/Writing Formats
Delta Lake Deep Dive
Connecting to Azure SQL
External Tables & Unity Catalog
Delta Lake and PySpark Optimization
File Storage in Azure Databricks
Data Quality in Azure Databricks
Databricks Workflows and Jobs
The Medallion Architecture in Azure
Databricks Git Integration and CI/CD
SCD Type 1 & Type 2 with PySpark Delta MERGE
AutoLoader (cloudFiles)
Unity Catalog Deep Dive
SQL
SQL Fundamentals, Execution Order & WHERE Clauses
GROUP BY, HAVING & CASE WHEN
Subqueries & Performance
SQL Functions
SQL Joins
Window Functions
CTEs & Subqueries
DDL, DML, and Constraints
Indexes and Execution Plans
Views, Temp Tables & Variables
Stored Procedures & Triggers
Normalization & Star Schema
UNION, PIVOT & Dynamic SQL
Transactions & ACID
SQL Interview Practice (20 Qs)
Python & PySpark
Python for Data Engineers
PySpark Foundations
PySpark Architecture
PySpark Joins
FastAPI on AWS Lambda
PySpark Transformations
Lazy Evaluation in PySpark
PySpark Window Functions Deep Dive
AWS
AWS S3
Glue Data Catalog
AWS Amplify
AWS Cognito
Fabric
Microsoft Fabric for Data Engineers
Capacity, Workspaces, Items
OneLake Deep Dive
OneLake Shortcuts
Connections & Gateways
Lakehouse vs Warehouse
Lakehouse Practical Guide
Warehouse Practical Guide
Warehouse Advanced (COPY INTO, CTAS, DMVs)
Fabric Data Factory & Pipelines
Data Factory Expression Language
Triggers, Scheduling & Orchestration
Dataflow Gen2: Introduction
Dataflow Gen2: Advanced Transforms
Dataflow Gen2: Production Patterns
M Language (Power Query) Guide
Fabric Notebooks Deep Dive
Apache Spark in Fabric
Spark Structured Streaming
Delta Lake Table Properties
Materialized Lake Views (MLVs)
Mirrored Databases
Power BI Direct Lake
Real-Time Intelligence
RTI Deep Dive (Windows, KQL, Materialized Views)
KQL (Kusto Query Language) Complete Guide
Data Activator
Security & Governance
Administration & Cost
Monitoring & Troubleshooting
Optimization Guide
Git Integration & CI/CD Deployment Pipelines
Fabric REST APIs
DP-700 Certification Study Guide
AI/ML
Artificial Intelligence and Machine Learning
Linear & Logistic Regression
Decision Trees & Random Forests
XGBoost & Gradient Boosting
Model Evaluation Deep Dive
Feature Engineering
Fine-Tuning LLMs
Clustering Algorithms
Hyperparameter Tuning
More
Top 20 DE Interview Questions
Top 15 ADF Interview Questions
Parquet vs CSV vs JSON
Schema-on-Write vs Read
How Real Companies Receive Data
Table Creation and Governance
My account
[woocommerce_my_account]
Scroll to Top