Lightning Interview "Troubleshooting Large Language Models"
Despite the skill and effort that goes into creating LLMs, every data scientist will run into issues and problems that they will have to troubleshoot. But, what is the best way to choose the correct metrics, fix errors, and deal with hallucinations? Talented Amber Roberts of Arize AI, will help us answer these questions and take us on an exploration of interpretability tools like Phoenix. Topics 1– Amber, can you introduce yourself and give tell us about your journey in Data Science and AI 2– Tell us about Arize AI and its mission. 3– What are the biggest challenges with large language models? 4– Why is troubleshooting large language models so difficult? 5– Describe your experience troubleshooting issues in large language models (LLMs), including open-source and proprietary models. What were the types of issues you encountered, and how did you approach troubleshooting them, especially in respect to the 9 parts of the LLMs? 6– How would you choose the right metrics for a specific LLM task to evaluate its performance? 7– Once we've identified an error in an LLM, how do we go about fixing it such as updating the model with new or corrected information? 8– Tell us about what constitutes a hallucination, what Causes Hallucinations and are there specific techniques or tools that can identify when a model is producing (unobvious) hallucinations? 9– The development of interpretability tools for LLMs is still in the early stages, but tell us about Phoenix? 10– How does Phoenix help the evaluation, troubleshooting, and fine-tuning of large language models (LLMs)? 11– Tell us about some of the tools Phoenix integrates with such as "llama-index" and LangChain? 12– Can you explain the key features of Phoenix and how they contribute to the effective management of machine learning models in production? 13– What types of challenges in AI model monitoring and observability does Phoenix address, and what solutions does it offer? 14– Looking ahead, what are the future development plans for Phoenix? 15– This is an incredibly exciting time to be in AI. Do you feel the path to a career in AI has changed and what advice would you give? Some useful links: Phoenix: AI Observability & Evaluation - https://docs.arize.com/phoenix/ Community Paper Reading: https://arize.com/resource/community-... Latest Arize Workshop: https://arize.com/resource/rag-time/ Arize docs: https://docs.arize.com/arize/ Vector DB Comparison - https://vdbs.superlinked.com/ Connect with Amber over Linkedin - / amber-roberts42

RAG & MCP Fundamentals – A Hands-On Crash Course

The TRUTH about AUKUS and the Victoria Barracks: What They're NOT TELLING You!

Designing MCP for Real-World Agents with Jeremiah Lowin

Stanford CS229 I Machine Learning I Building Large Language Models (LLMs)

Engineering an AI Platform: The Role of Data Science at Shipium

From Intelligence to Autonomy - Building the Infrastructure for the AI Workforce by Lake Dai

Emerging Accessibility Pros Fireside Chat

RAG Crash Course for Beginners

Communicating with AI: Teaching Machines What We Really Want with Michael Littman

Gemini CLI Essentials – Full Course

Ollama, Local AI, and Open Agents with Parth Sareen

HOLY ROSARY TODAY THURSDAY, JUNE 11, 2026 ST. JUDE THADDEUS & LUMINOUS MYSTERIES | DAILY HOLY ROSARY

Embodied AI: A Cognitive Shift, Not a Cosmetic Change by Mohammad Soltaniehha, PhD

The Uncomfortable Truth About AI “Reasoning” | World Science Festival

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

747: Technical Intro to Transformers and LLMs — with Kirill Eremenko

Why Great Marketing Doesn't Show Up in the Dashboard

Don't learn AI Agents without Learning these Fundamentals
![Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]](https://i.ytimg.com/vi/nYxf8ndIBE0/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLCDSuC2zfv72qnTbKu4dkMBDhkYUg)
Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]

