How it gets built with AI, what the real hours look like, and what only a human can do.
| Component | AI Agent | Human | Total | Notes |
|---|---|---|---|---|
| Frontend UI (intake panel, draft view, inline edit, confirm bar) | 5h | 2h | 7h | Draft view is the most complex โ badges, flags, disabled state logic |
| Backend parse endpoint + prompt construction | 4h | 2h | 6h | Injecting catalog + history into prompt correctly takes iteration |
| Prompt calibration | โ | 5โ8h | 5โ8h | โ ๏ธ The real work. Requires Brendon's actual emails to calibrate against. |
| Pipeline integration (intake โ createOrder โ packing โ QB) | 2h | 1h | 3h | Reuses existing pipeline โ minimal new code |
| Testing with real emails from Brendon's clients | 2h | 3h | 5h | โ ๏ธ Cannot skip. Synthetic examples don't catch real patterns. |
| Edge cases + refinement | 2h | 1.5h | 3.5h | โ |
| TOTAL | ~15h | ~15h | ~30h | ~2.5 weeks (Sprint 3) ยท includes 30% buffer throughout |
| Traditional Agency | This Approach (AI-Accelerated) |
|---|---|
| Dev writes parsing logic manually with regex/rules | LLM handles natural language โ no rule writing |
| New client format = new rule = new sprint | New client format = handled automatically |
| Estimated 3โ4 weeks for this feature alone | ~2 weeks including calibration |
| ~$15โ25K agency cost for this scope | ~11h human hours + AI inference cost |
| Hard to update parsing logic post-launch | Prompt update = immediate behavior change |