Special seriesComplete5 parts

DGX Lab

Building a working AI research lab on a single small box, from intelligent gateway to RAG stack to honest benchmarks.

What you can stand up on one small but powerful desktop AI box in a few days: a routing gateway where simple heuristics beat ML by a wide margin, a shell tuned for ML workflows, a complete local RAG stack from model to vector store, and a benchmark pass that separates the paper numbers from the production reality.

This is the homelab lineage that later de-risked far bigger borrowed hardware. Read it in order, day one through the wrap-up.

The series

All parts

shipped14 min
DGX Lab: When Simple Heuristics Beat ML by 95,000x - Day 1
Building an intelligent AI gateway that routes requests 95,000x faster than ML while maintaining 90% accuracy—proving that smart heuristics can outperform deep learning.
shipped10 min
DGX Lab: Supercharge Your Shell with 50+ ML Productivity Aliases - Day 2
Transform your default shell into a productivity powerhouse with GPU monitoring shortcuts, smart aliases, and custom functions—setup in 5 minutes, benefit forever.
shipped16 min
DGX Lab: Building a Complete RAG Infrastructure - From Ollama to Qdrant to AnythingLLM - Day 3
Building a complete self-hosted RAG infrastructure with 8 integrated services on a single DGX workstation, supporting everything from medical AI fine-tuning to document Q&A.
shipped12 min
DGX Spark Benchmarks: 82,739 tokens/sec on Paper, the Production Reality
NVIDIA's DGX Spark benchmarks show 82,739 tokens/sec for training. After 6 days of intensive ML workloads and feedback from the HN community, here's what the benchmarks don't tell you about precision issues, memory fragmentation, and production workarounds.
shipped16 min
My AI Linux Expert: How Claude Code Suggested a 95,000x Faster Solution
When building an AI request router, my instinct was to use ML. Claude Code analyzed the test results, noticed the heuristics were already working, and suggested removing the ML model entirely—achieving 95,000x faster routing.

All parts

DGX Lab: When Simple Heuristics Beat ML by 95,000x - Day 1

DGX Lab: Supercharge Your Shell with 50+ ML Productivity Aliases - Day 2

DGX Lab: Building a Complete RAG Infrastructure - From Ollama to Qdrant to AnythingLLM - Day 3

DGX Spark Benchmarks: 82,739 tokens/sec on Paper, the Production Reality

My AI Linux Expert: How Claude Code Suggested a 95,000x Faster Solution

Get the next experiment