Linux Systems & Automation Engineer
I work on Linux systems, real-time computing, automation, and applied AI tooling. My work spans low-level diagnostics, C/C++ and Python development, reproducible infrastructure, and debugging latency-sensitive systems where correctness and determinism are critical.
Lately I have been focused on learning Rust, minimizing vendor dependencies in developer workflows, and automation via RAG AI pipelines.
Reach me at bryan@ramos.codes
Notes from maintaining a TurboQuant llama.cpp fork and testing whether keeping hot MoE experts on the GPU could make a huge local model practical.
The machine I built for local AI work, why I built it, and why good-enough hardware changes how I use these tools.
Cloud AI is useful, but I want more of my day-to-day AI workflow on infrastructure I control.