Amazon EMR Deep Dive and Best Practices - AWS Online Tech Talks

Amazon EMR is the industry-leading cloud big data platform for processing vast amounts of data using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. In this tech talk, we introduce you to Amazon EMR design patterns and architectural best practices. We show how EMR can help you run petabyte-scale analysis at less than half the cost of traditional on-premises solutions and over 3x faster than standard Apache Spark. You'll learn how to spin up and spin down clusters as needed for short jobs and how to create highly available clusters that automatically scale to meet demand. Discover how Apache Hudi simplifies building data pipelines. You will also learn how to run EMR clusters on AWS Outposts for on-premises or hybrid deployments.

Learning Objectives:
* Learn how to design a big data environment using best practices
* Find out the best way to support Apache Spark, Hive, HBase and other open source applications
* See how to choose the right approach for both short- and long-running jobs

To learn more about the services featured in this talk, please visit: https://aws.amazon.com/emr

Getting Started with Amazon Redshift - AWS Online Tech Talks

AWS re:Invent 2020: An introduction to data lakes and analytics on AWS

AWS EMR Tutorial [FULL COURSE in 60mins]

AWS re:Invent 2015 | (BDT208) A Technical Introduction to Amazon Elastic MapReduce

AWS re:Invent 2018: Amazon DynamoDB Deep Dive: Advanced Design Patterns for DynamoDB (DAT401)

High Performance Data Streaming with Amazon Kinesis: Best Practices and Common Pitfalls

AWS re:Invent 2018: Effective Data Lakes: Challenges and Design Patterns (ANT316)

AWS re:Invent 2020: Data lakes: Easily build, secure, & share with AWS Lake Formation

Kubernetes Zero to Hero: The Complete Beginner’s Guide (2025 Edition)

Everything You Need to Know About Big Data: From Architectural Principles to Best Practices

Building a Data Mesh Architecture with AWS Lake Formation - AWS Online Tech Talks

Simplify and Scale Data Engineering Pipelines with Delta Lake

Gemini CLI Essentials – Full Course

3. Apache Kafka Fundamentals | Apache Kafka Fundamentals

AWS re:Invent 2021 - Building a data lake on Amazon S3

Complete Terraform Course - From BEGINNER to PRO! (Learn Infrastructure as Code)

Deep Dive Into AWS Lake Formation - Level 300 (United States)

Power BI Data Modeling Crash Course Learn Fast and Build Smarter Models! Full Course

AWS Certified Cloud Practitioner COMPLETE STUDY GUIDE - 2024

Learn Snowflake in 2 Hours| High Paying Skills | Step by Step For Beginners