Reliability, Availability and Serviceability (RAS) Features on Linux Systems - Vandana Salve, Micron
Reliability, Availability and Serviceability (RAS) Features on Linux Systems - Vandana Salve, Micron Reliability, Availability and serviceability (RAS) is a concept used on the servers to measure their robustness. A system built with high levels of RAS is more fault-tolerant, self-correcting when it discovers corrupted data, and quick and easy to repair without disrupting operations. Reliability indicates the probability that the software or system will produce accurate/correct results consistently, according to its specifications. Availability indicates the probability that a system or software will be operational at any given time. Serviceability refers to the ease and speed with which a system can be fixed or maintained without disrupting operations. As the system scales, the higher is the importance of Reliability Availability and Serviceability (RAS) monitoring. This presentation covers the RAS monitoring features available in the Linux kernel. It also describes the Ras-daemon monitoring tool, which uses the special kernel traces generated by the Kernel to monitor fatal and non-fatal hardware errors that are detected by the CPU, by the memory controller and by the PCIe hardware.

AI Accelerators: Transforming Scalability & Model Efficiency

SELinux - Complete Linux Security & Hardening with Practical Examples
![eBPF: Unlocking the Kernel [OFFICIAL DOCUMENTARY]](https://i.ytimg.com/vi/Wb_vD3XZYOA/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLBxAuuCMJh_jEk7chBuiLFOR9oX5Q)
eBPF: Unlocking the Kernel [OFFICIAL DOCUMENTARY]

Exploring AMD's Error Correction RAS Engineering

Harness Engineering Masterclass: Technical Deep Dive on how to build Agentic Systems

RAG Crash Course for Beginners

Android 17 sucks. So I put Linux on a phone.

LF Live Maintainer Session: My Life as a Linux Kernel Developer and Maintainer with Julia Lawall

The Mind Behind Linux | Linus Torvalds | TED

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

How to use the ps Command | Linux Command Line Basics

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

the true reason C++ always wins

Super-KI? Die große Lüge der Tech-Konzerne

System Design Explained: APIs, Databases, Caching, CDNs, Load Balancing & Production Infra

Watch Ukrainian Drones OBLITERATE a Russian Jet

How DSP is Killing the Analog in SerDes

Linux Basiswissen für Einsteiger

What is SRE | Tasks and Responsibilities of an SRE | SRE vs DevOps

