[Day 14] Spark + Airflow Integration in Practice | Building Large-Scale ETL Pipelines
🏗️ Data Lakehouse Engineering 20-Day Intensive Course
DAY 14: Spark + Airflow Integration and Large-Scale ETL Pipelines

In this video, we learn how to build practical data pipelines by integrating Apache Spark and Apache Airflow.

✔ SparkSubmitOperator
✔ KubernetesPodOperator
✔ Data Quality Gate
✔ Slack Notification Integration
✔ Spark Job Orchestration
✔ Airflow-based ETL Automation
✔ Vault Integration and Security Configuration
✔ Large-Scale Data Processing Architecture

Airflow manages workflows, while Spark performs the actual data processing. In production data platforms, the two are combined to build scalable ETL pipelines. This lecture covers the operational-level structure, from Spark job execution to quality verification, failure notifications, and security configuration.

📌 Recommended for:
Data Engineers
Platform Engineers
DevOps Engineers
MLOps Engineers
Spark Operators
Airflow Operators

🔥 Key Content of This Video
How to Use SparkSubmitOperator
Using KubernetesPodOperator
Designing ETL Pipelines
Automating Data Quality Validation
Slack-based Failure Notifications
Airflow + Spark Integration

🧪 Hands-on Content
Running Airflow → SparkSubmitOperator
Building Bronze → Silver → Gold Pipelines
Implementing Data Quality Gates
Setting Up Slack Failure Notifications
Spark Job Monitoring
Automating Iceberg Loading

🔐 Security Content
Vault Agent Sidecar Integration
Kubernetes ServiceAccount Authentication
NetworkPolicy-based Communication Control
Masking Sensitive Information
Protecting Personal Information in Failure Notifications

#ApacheSpark #ApacheAirflow #Spark #Airflow #ETL #DataEngineering #Iceberg #Kubernetes #DataPipeline #DevOps #MLOps #DataEngineer #AIInfrastructure
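As a rough illustration of the Data Quality Gate idea mentioned in the description, here is a minimal sketch in plain Python. It is not taken from the video: the names `QualityReport`, `check_quality`, and `quality_gate`, and the thresholds `min_rows` and `max_null_ratio`, are all hypothetical. The assumption is that a Spark job has already computed simple metrics for a batch, and that the gate raises an exception when a threshold is violated. Called from an Airflow task, an uncaught exception marks the task as failed, which blocks downstream tasks under the default trigger rule.

```python
from dataclasses import dataclass


@dataclass
class QualityReport:
    """Hypothetical metrics a Spark job might emit for one batch."""
    row_count: int
    null_key_count: int


class QualityGateError(Exception):
    """Raised when a batch violates a threshold, so the calling task fails."""


def check_quality(report, min_rows=1, max_null_ratio=0.01):
    """Return a list of violations; an empty list means the batch may proceed."""
    violations = []
    if report.row_count < min_rows:
        violations.append(f"row_count {report.row_count} < {min_rows}")
    # Treat an empty batch as fully null so that it never passes silently.
    null_ratio = (
        report.null_key_count / report.row_count if report.row_count else 1.0
    )
    if null_ratio > max_null_ratio:
        violations.append(f"null_ratio {null_ratio:.4f} > {max_null_ratio}")
    return violations


def quality_gate(report, **thresholds):
    """Raise QualityGateError if any check fails; otherwise return None."""
    violations = check_quality(report, **thresholds)
    if violations:
        raise QualityGateError("; ".join(violations))
```

In a Bronze → Silver → Gold pipeline, a gate like this would typically sit between two stages (for example, after the Silver job), so that a bad batch never reaches the Gold tables. Keeping the checks in a pure function makes them easy to unit test outside of Airflow.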

