Poisoning the AI Key Vault: A Technical Deep Dive into the LiteLLM PyPI Attack

Susiloharjo

Poisoning the AI Key Vault: A Technical Deep Dive into the LiteLLM PyPI Attack In the rapidly evolving landscape of AI infrastructure, few libraries have become as central as LiteLLM. Acting as a universal proxy for hundreds of LLM providers, it handles the most sensitive secrets an organization possesses: API keys for OpenAI, Anthropic, Gemini, … Read more

Claude Code’s Compaction Engine: The Architecture of Long-Context Reasoning

Susiloharjo

Claude Code’s Compaction Engine: The Architecture of Long-Context Reasoning The fundamental challenge of modern AI agents is not just intelligence, but coherence over time. As an agent engages in a multi-hour session involving thousands of lines of terminal output, file edits, and tool calls, the context window—however vast—becomes a liability. A bloated context window leads … Read more

The 43-Point Perception Gap: Why AI Coding Assistants Are Quietly Sabotaging Developer Productivity

Susiloharjo

The 43-Point Perception Gap: Why AI Coding Assistants Are Quietly Sabotaging Developer Productivity The Uncomfortable Truth Behind the Hype In March 2026, METR (the Machine Intelligence Research Institute’s applied research arm) published a study that should have made every engineering leadership team pause. Their longitudinal analysis of experienced open-source software developers revealed a paradox that … Read more

The Vera Rubin Architecture: NVIDIA’s 2026 Answer to the Trillion-Parameter AI Factory

Susiloharjo

The Vera Rubin Architecture: NVIDIA’s 2026 Answer to the Trillion-Parameter AI Factory The NVIDIA Vera Rubin platform redefines trillion-parameter AI training with a 10x cost reduction, unified HBM4 memory, NVLink 6, and a dedicated Physical AI foundry. The Scale Problem Nobody Talks About Training a language model with a trillion parameters is not a software … Read more