All Articles

Speculative Decoding and the Model Choice: Lessons

Speculative decoding model differences.

Sep 12, 2025

Standing Up vLLM on a Single A10G: From First Boot to Dual-Model Deployment

Deploying vLLM with docker on AWS using terraform.

Sep 8, 2025

How to Lead High-Confidence, High-Certainty People Without Crushing Their Value

This article shows how leaders can channel strong, dominating confidence into team-strengthening collaboration.

Aug 22, 2025

Rough Notes - Product Observability

A part of Product strategy that isn’t just about shipping features fast.

Aug 19, 2025

Inference-Aware AI: Working Definitions

A glossary of terms that define the concept of inference-aware agents, breaking down the core ideas, agent types, awareness dimensions, and platform components behind cost-efficient AI systems.

Aug 11, 2025

A Hypothesis: Inference-Aware Agents Could Be the Next Big Leap in AI Efficiency

An introduction to the hypothesis that AI agents can be made faster, cheaper, and more effective through an inference-aware platform that optimizes how they decide, act, and use resources.

Aug 11, 2025

A VECTR-Guided Refactor with Cursor.

Let's review a real example i've had to refactor to make it easier to contend with.

Aug 4, 2025

The Duplication Dilemma: A VECTR Guide to Repeating Yourself

This article clarifies the "Don't Repeat Yourself" (DRY) principle within the context of VECTR

Aug 1, 2025

Scaling Engineering with AI from 0 to 50

What it really takes to scale an engineering team from 0 to 50 inside a 100+ person company in today’s AI-native world.

Jul 1, 2025

Logistic Regression from Scratch with Python (Full Implementation)

Logistic regression from scratch with notes and learnings.

Jun 16, 2025

VECTR: Velocity-Engineered Code for Rapid Teams

VECTR is a pragmatic software philosophy focused on speed, clarity, and context. It favors useful code, justified abstractions, and adaptive design over rigid rules.

Jun 1, 2025

Notes to Self: Hiring Playbook

No-fluff hiring guide for engineering leaders

May 12, 2025

Multiple Linear Regression from Scratch (with Diagnostics)

A from-scratch implementation of multiple linear regression using gradient descent, with full diagnostic plots and batch prediction on test data.

May 2, 2025

Simple Linear Regression on Housing Data (Notes)

Just some note on linear regression to come back later to.

Apr 29, 2025