ADR-016 · Permanent Record · Agent · Permanent

AI Capability Assessment — March 2026

#ai-capability #evidence #version-history #permanent
Date: 2026-03-29
Freshness: Permanent
Boundary: Never expires as historical record. Dated snapshot of AI capability.

Case Study Notice: This ADR is part of an illustrative case study demonstrating the YY Method™ Home Edition v2.3. Numbers are approximate and generalized. Math is illustrative only. Not financial, tax, or legal advice — consult qualified professionals before making any financial decisions. See ADR-017 for full framing notice.

Capture

This ADR documents what AI could and could not do in March 2026, based on direct evidence from this conversation — not benchmarks, not researcher claims, not marketing. A practitioner used AI for a real high-stakes financial planning task across several hours and kept a version-controlled record of every exchange, correction, and insight.

The artifact set — 16 ADRs across 17+ zip versions — is a dated, evidence-based capability assessment of AI at this moment in time.


What AI Could Do


What AI Could Not Do


The Division of Labor — Precise

Operator supplied: All structural tax law, entity rules, every strategic insight, every correction to model errors, every novel synthesis, domain knowledge that made mechanics meaningful.

Model supplied: Implementation mechanics for specific platforms, compliance thresholds, quantified arithmetic, a small number of planning principles not explicitly raised, real-time information lookup.

The operator caught four model errors:

  1. Salary assumption: corrected mid-conversation
  2. Employer contribution basis: 25% on total W-2 → 25% on bonus only
  3. Federal bracket: corrected to actual MFJ bracket upon confirming filing status
  4. S corp re-election: suggested without flagging the 5-year lockout
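
To show why error 2 mattered, here is a minimal arithmetic sketch of how the choice of basis changes the computed employer contribution. The salary and bonus figures are hypothetical and do not come from the case study, and treating "total W-2" as salary plus bonus is an assumption. Only the 25% rate and the two bases appear in the list above. This is illustrative math, not a statement of what any tax rule requires.

```python
# Hypothetical figures for illustration only; not taken from the case study.
SALARY = 100_000   # assumed base salary, whole dollars
BONUS = 40_000     # assumed bonus, whole dollars
RATE_PERCENT = 25  # the 25% rate named in error 2


def employer_contribution(basis: int, rate_percent: int = RATE_PERCENT) -> int:
    """Return the contribution computed on the given basis, in whole dollars."""
    return basis * rate_percent // 100


# Model's original basis: total W-2 (assumed here to be salary plus bonus).
on_total_w2 = employer_contribution(SALARY + BONUS)

# Operator's corrected basis: bonus only.
on_bonus_only = employer_contribution(BONUS)

# How far the uncorrected figure overstates the corrected one.
overstatement = on_total_w2 - on_bonus_only

print(on_total_w2, on_bonus_only, overstatement)
```

With these assumed inputs the uncorrected basis more than triples the contribution figure, which is the kind of gap that carries into every downstream calculation until it is caught.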

The Version History As Evidence

The 17+ zip versions produced across this conversation are not redundant. Each captures a distinct moment of evolution:

Version          Key Evolution
v1               Initial ADR set: wrong salary, wrong rate
v2-v3            Bracket correction: federal rate corrected, State rate added
v4               Employer contribution basis error corrected
v5-v6            Reserve rejection, DCA insight, 5-year lockout encoded
v7-v8            Multi-child aging-in pipeline, teaching moment
truthful-final   Full attribution rewrite: operator/model split documented honestly
complete-v2      HSA deployment, backdoor Roth, nondiscrimination risk

The v1 zip — where the model was calculating the wrong bonus against the wrong salary at the wrong marginal rate — is as important to preserve as the final. It shows where the model failed, where the operator corrected it, and how the document set converged on truth across iterations.
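
If every version is meant to survive unaltered, that can be made checkable. The sketch below records a SHA-256 digest for each zip in a folder, then compares two such manifests to flag any archive that has gone missing or changed. The function names and the folder layout are hypothetical; the source does not describe how the zips are stored.

```python
import hashlib
from pathlib import Path


def build_manifest(archive_dir: str) -> dict[str, str]:
    """Map each .zip filename in archive_dir to its SHA-256 hex digest."""
    manifest = {}
    for path in sorted(Path(archive_dir).glob("*.zip")):
        manifest[path.name] = hashlib.sha256(path.read_bytes()).hexdigest()
    return manifest


def changed_or_missing(old: dict[str, str], new: dict[str, str]) -> list[str]:
    """List archives recorded in old that are absent from new or whose digest differs."""
    return sorted(name for name, digest in old.items() if new.get(name) != digest)
```

A manifest built on the day of the session and stored alongside the archives would let a later reader confirm that the v1 zip, errors included, is the same file that was originally produced.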


Why This Record Matters

Most AI interactions in 2026 are transactional. Ask, answer, close. No record. No honest accounting of what the AI got right, what it got wrong, and what the human had to supply.

This artifact set is different. It is a precise, honest, evidence-based capability snapshot — produced not by a researcher with a benchmark but by a practitioner with a real deadline and real money at stake.

In five years, when the question is "what could AI actually do in early 2026?", this is a primary source. Not speculation. Not marketing. A dated, version-controlled record with the receipts.


Why-Not

Why not just remember it? Memory decays, compresses, and self-flatters. The version history preserves the actual evolution including the errors — which memory would smooth over. The YY Method requires scars. This is the scar record for an entire AI interaction.

Why not trust the AI's own account of its capabilities? An AI's account of its own capabilities is unreliable: it reflects training, not performance under real conditions. This ADR reflects performance. That is a different and more valuable thing.


Assumptions This Decision Depends On


Tribal Context

Operator supplied: The observation that the version history itself encodes the method's evolution, so that keeping all 17+ zips is not redundancy but a version-controlled capability record. The insight that this artifact set enables intelligent, evidence-based conversation about AI capabilities at this moment in time.

Model supplied: The structure of the capability split, the version history table, and the framing of the artifact as a primary source rather than a benchmark.

The meta-insight — that the conversation itself is the evidence — was the operator's.


Commit

This ADR is a permanent historical record. It documents AI capability as observed under real conditions on March 29, 2026 — not as claimed, not as benchmarked, but as demonstrated across a live high-stakes financial planning session with a practitioner who kept the receipts.

Preserve all zip versions. The evolution is the evidence. The scars are the truth.
