<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>The Docs on Alb Kestrel</title><link>https://alb-kestrel-infra.pages.dev/posts/</link><description>Recent content in The Docs on Alb Kestrel</description><generator>Hugo 0.125.0</generator><language>en-us</language><lastBuildDate>Fri, 08 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://alb-kestrel-infra.pages.dev/posts/index.xml" rel="self" type="application/rss+xml"/><item><title>Hard Lessons in VRAM Allocation: SIGKILL and the OOM Killer</title><link>https://alb-kestrel-infra.pages.dev/posts/vram-management/</link><pubDate>Fri, 08 May 2026 00:00:00 +0000</pubDate><guid>https://alb-kestrel-infra.pages.dev/posts/vram-management/</guid><description>The Incident While testing the limits of my RX 6800 16GB on CachyOS, I attempted to load the Qwen2.5-Coder-32B model using llama-server.
The Data As my terminal logs show, the requested ROCm0 model buffer size was 16123.35 MiB. With only 319.04 MiB mapped to the CPU, the GPU was being pushed to its physical limit.
The Crash Because the model consumed nearly the entire 16 GB of VRAM, the Linux kernel faced a critical memory shortage, and the OOM killer responded by terminating the process with SIGKILL.</description></item></channel></rss>