Judge the LLM Judge – Ensemble Testing of an LLM Application | Berlin Quality Engineering meetup

In the Berlin Quality Engineering Meetup, I demonstrate a process you can use to align an LLM-Judge with a domain expert's evaluation, with a live demo. I had to edit out the parts of the video where there was a lot of discussion within the room, that is not audible in the recording. Here is the repo to the code used during the demo: https://github.com/BeyondQuality/llmJ...

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan
▶︎

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

BloomIntent: Automating Search Evaluation with LLM-Generated Fine-Grained User Intents
▶︎

BloomIntent: Automating Search Evaluation with LLM-Generated Fine-Grained User Intents

CLAUDE CODE ADVANCED FULL COURSE (3 HOURS)
▶︎

CLAUDE CODE ADVANCED FULL COURSE (3 HOURS)

Headroom: A Context Optimization Layer for LLM Applications - Tejas Chopra, Netflix, Inc.
▶︎

Headroom: A Context Optimization Layer for LLM Applications - Tejas Chopra, Netflix, Inc.

Let's put aside Frontier AI Labs and Hyperscalers Cost Effective AI Inference for the Rest of Us   M
▶︎

Let's put aside Frontier AI Labs and Hyperscalers Cost Effective AI Inference for the Rest of Us M

Kanada – Katar  Highlights | Gruppe B, FIFA WM 2026 | sportstudio
▶︎

Kanada – Katar  Highlights | Gruppe B, FIFA WM 2026 | sportstudio

Stop Prompting Claude. Use Karpathy's Method Instead.
▶︎

Stop Prompting Claude. Use Karpathy's Method Instead.

Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]
▶︎

Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit
▶︎

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Full Archon Guide - Build AI Coding Harnesses That Actually Ship (LIVE)
▶︎

Full Archon Guide - Build AI Coding Harnesses That Actually Ship (LIVE)

The Complete Guide to Secret Hygiene for Java and Cloud Native Engineers   Martin Ladecký
▶︎

The Complete Guide to Secret Hygiene for Java and Cloud Native Engineers Martin Ladecký

Deep Dive into LLMs like ChatGPT
▶︎

Deep Dive into LLMs like ChatGPT

Software Testing Course – Playwright, E2E, and AI Agents
▶︎

Software Testing Course – Playwright, E2E, and AI Agents

Karpathy's Wiki vs. Open Brain. One Fails When You Need It Most.
▶︎

Karpathy's Wiki vs. Open Brain. One Fails When You Need It Most.

Data + AI Summit Keynote 2026 | Day 1
▶︎

Data + AI Summit Keynote 2026 | Day 1

Power Automate Tutorial ⚡ Beginner To Pro [Full Course]
▶︎

Power Automate Tutorial ⚡ Beginner To Pro [Full Course]

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026
▶︎

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026

Build a Complete Medical Chatbot with LLMs, LangChain, Pinecone, Flask & AWS 🔥
▶︎

Build a Complete Medical Chatbot with LLMs, LangChain, Pinecone, Flask & AWS 🔥

Keynote Rethinking Observability as a Platform Product   Kasper Borg Nissen
▶︎

Keynote Rethinking Observability as a Platform Product Kasper Borg Nissen