argakiig/blog

notes on distributed systems, infrastructure, and production engineering

Short, practical writing on systems work: schedulers, data pipelines, observability, performance, infrastructure, and the tooling around it.

2026.07.21

AI Can Improve the Result While Weakening the Thinking

AI can raise the quality of an artifact while quietly weakening the judgment that produced it. The risk is real, but the outcome depends on how the tool is used and which parts of the work we continue to own.

2026.07.20

Inside the AI Harness

An AI harness is the runtime that turns a request into controlled and observable work. Following one failing test exposes the parts hidden behind the term.

2026.07.18

AI Makes Unplanned Projects Useful Exploratory Tools

AI makes proofs of concept cheap enough to participate in system design. Exploratory harnesses expose boundaries before the planned implementation begins.

2026.07.17

Representative Evals Can Miss a Real Change

Representative evaluation sets are essential for tuning, yet a small or corrupted suite can hide a real improvement or report one that does not exist.

2026.07.16

Distillation Makes Training Data Easier to Collect

Hand-authored training data is slow to create. Conversation history can provide high-signal raw material when it is curated with provenance, review, and clear boundaries.

2026.07.15

Tool Selection Is an Interface Contract

A tool-use evaluation can only judge model behavior after its harness recognizes the model's emitted tool-call format.

2026.07.14

Every Prompt Has a Training Cost

Prompt tokens are part of every training sequence. Reducing unnecessary context makes fine-tuning faster, cheaper, and easier to evaluate.

2026.07.13

Tuning Moves Through Small Experiments

Fine-tuning improves through controlled changes evaluated against explicit behavior and preserved experiment evidence.

2026.07.12

The Model Only Sees the Context You Send

A model can only act on memory that the application retrieves and includes in the current request. Storing a preference is only the first step.

2026.07.12

How the Model Knew What We Were Working On

A model can sound aware of ongoing work when retrieval finds the relevant memory and assembles it into the current context. A hybrid AST, syntax, and reranking path makes that retrieval faster and more precise.

2026.07.11

Why Streaming Alerts Need Explanations

Streaming alerts need durable evidence beyond threshold breaches, so operators can inspect what happened after the market has moved.

2026.07.10

Frontend Deployment Is Runtime Behavior

Frontend hosting becomes part of application behavior when it determines page speed, streaming, runtime configuration, and the shape of production failures.

2026.07.09

From Backfills to Live Streams

Batch and live market-data paths share domain concepts while relying on different runtime contracts for reconstruction, continuity, and failure recovery.

2026.07.08

The Hard Part of Market Data Is Making It Operable

Ingestion is only the visible part of a market-data system. The durable work is making moving data replayable, inspectable, bounded, safe, and explainable.

2026.07.07

The Market Data App Became a Distributed System

A market-data product becomes a distributed system when it starts depending on durable coordination, replayable work, live state, deployment boundaries, and explanations operators can trust.

2026.07.06

You Can't Dashboard What You Didn't Decide

Agent observability has to capture runtime decisions along with infrastructure health.

2026.07.05

Testing Strategy in the AI Era

When code volume outpaces human review capacity, testing strategy has to change. The old model doesn't scale.

2026.07.04

Build vs Buy Is an Architecture Decision

The total cost of a managed service includes the dependency, coupling, and operational surface you inherit.

2026.07.03

What Actually Happens During an Incident

The outage gets the attention. The first 15 minutes decide the outcome. The postmortem decides whether it happens again.

2026.07.02

Using Conway's Law Deliberately

Conway's Law can be used deliberately when team structure, ownership, and communication paths are part of the architecture discussion.

2026.07.01

Latency Is a Product Feature

Latency budgets are product decisions with architectural consequences.

2026.06.30

The Maintenance Cliff in AI-Generated Codebases

AI gets you to launch fast. What happens two years later when nobody understands the system it built?

2026.06.29

Managing Technical Debt Like a Budget

Technical debt only becomes manageable when teams distinguish strategic debt from accidental debt.

2026.06.28

Verification Is the New Bottleneck

AI made production cheap. It made verification expensive. That's the bottleneck now.

2026.06.27

What "Senior" Means When AI Flattens the Tactical Layer

When AI makes tactical competence cheap, the value of seniority shifts to judgment, verification, and teaching the meta-skill.

2026.06.26

Caching Is a Consistency Decision

Caching improves reads by adding consistency, ownership, and recovery questions to the system.

2026.06.25

Migration Is the Real Test of Architecture

You can build anything. Can you move it? Migration is where abstractions prove themselves or fail.

2026.06.24

The Best AI Wins End in Automation

A lot of useful AI work should not stay AI work forever. Once the task resolves into explicit, repetitive, mechanically checkable rules, the better runtime is usually deterministic automation.

2026.06.23

Agents Are Permission Systems

The hard part of deploying agents is not mostly reasoning quality. It is deciding what they are allowed to do, when they must ask, what they must prove before acting, and how you recover when they are wrong.

2026.06.22

Tactical Skill Is Not Enough

AI is compressing the market value of tactical execution. The people who stay relevant will be the ones who learn to direct the work, judge the output, and keep building real mastery instead of outsourcing the reps that create it.

2026.06.21

AI Review Is Not a New Discipline

A lot of the panic around reviewing AI output treats it like a new burden. It isn't. If you've spent years reviewing other people's code, libraries, and infrastructure decisions, you already know the job. AI just makes the need for that judgment harder to ignore.

2026.06.20

Value Is Not Valuation

AI can be useful without justifying current spending, pricing, or hype.

2026.06.19

Ownership Is Not a File

You can own a service and still abdicate responsibility for the outcomes it produces. Real ownership means caring about the consequences of your work, not just the artifacts.

2026.06.18

From Cloud AI to Local Models, Part 5 — Models That Know How Your Company Thinks

The strongest argument for local and self-hosted models is not cost. It is the ability to turn company-specific knowledge into working engineering infrastructure.

2026.06.17

From Cloud AI to Local Models, Part 4 — The Per-Token Tax on Engineering

Cloud AI pricing is easy to tolerate when AI is occasional. Once AI becomes part of every engineering workflow, the economics change.

2026.06.16

From Cloud AI to Local Models, Part 3 — Capability Is Not a Single Number

Model capability depends on the task, the context, the tools, and the evaluation loop. The model alone is not the product.

2026.06.15

From Cloud AI to Local Models, Part 2 — Fast at What?

A cloud model can be smarter and still make the workflow slower. Speed is not a benchmark number. It is a property of the whole system.

2026.06.14

From Cloud AI to Local Models, Part 1 — The Cloud Was the First AI Deployment Model

Hosted frontier models were the first practical way most teams touched AI. That does not make the cloud the final shape of AI-assisted engineering.

2026.06.13

The Face Changed, the Dependency Didn't

AI outages feel like a new kind of problem. They aren't. They're the same dependency risk we've always had, wearing a new label. The question that matters isn't whether AI is reliable. It's what happens to your business when the dependency disappears.

2026.06.12

When Intelligence Becomes a Commodity

For years software companies built moats around technology that was hard to build. AI is dissolving that assumption. When intelligence is rented instead of owned, the model stops being a moat and becomes infrastructure. The durable advantage is whatever is harder to copy than intelligence itself.

2026.06.11

The Model Myth: Why the Best Model Isn't the Winner

There's a growing consensus that whoever wields the best model wins the AI revolution. I don't buy it. Code is becoming a commodity, and when output is free its value approaches zero. The advantage was never the model. It's the understanding the model can't supply.

2026.06.10

The Goal Is Not To Be Needed

For years I thought the goal was to become the person everyone depends on. It isn't. If the system only works because you're there, you haven't built leverage, you've become its failure mode.

2026.06.09

The Most Expensive Bugs Never Reach Production

The bugs that bankrupt projects aren't runtime bugs. They're decisions. Nobody files a ticket for the wrong database or the wrong ownership model, and nobody sees a stack trace, yet they cost more than every syntax error combined.

2026.06.08

Code Review Isn't Dying. It's Moving Upstream.

People keep saying AI is killing code review. It isn't. The cheap part is being automated and the expensive part is being pushed up to where it always mattered most: the design, the boundary, the conversation before anyone opened an editor.

2026.06.07

Why Most Reliability Problems Are Organizational Problems

We love to blame the outage on a bad deploy or a flaky dependency. Dig into the postmortem and the real cause usually isn't code. It's who owned what, and who thought someone else did.

2026.06.06

Architecture Review > Code Review

Code review catches the bug in the function. Architecture review catches the bug in the plan, and the second one is almost always the one that sinks you.

2026.06.05

You Can't Prove the Bug Isn't There

A missing constraint sat in Zcash's Orchard pool for four years, surviving in-depth human audits. Four days after a new AI model shipped, an automated audit found it. When discovery accelerates like that, the security advantage stops being perfect code and becomes the speed from "found" to "users protected."

2026.06.05