Burning tokens is equated with making progress. More conversations are treated as more "issues" handled. Coding sessions have changed into speccing, PRD, test plan, code plan, code generation and review pipelines. Only for every single piece of artifact to be double-checked by a human. This is considered "agentic engineering". And agentic engineers' token maxing is becoming the norm and is treated the same as "employee performance/efficiency".
It is like we've handed every engineer (and non-employee) uncapped credit, burning at dozens of dollars per minute. No one is talking about whether the costs are justified. No one accounts for the tokens (and money) spent. All for what feels more and more like busy work.
Am I the only one seeing this?
When artifacts are cheap, differentiation comes from quality.
I have deterministic and stochastic tests that run on each artifact. For those that have a high risk of "not the right thing", I manually review the artifacts. But if it's bog standard I just rely on the auto-gates to reject and get the agent to retry the artifact.
This gets me a high-volume pipeline that yes uses a lot of tokens, but at the same time doesn't overwhelm me. I only deal with things that genuinely need my attention. That's worth it for me, and not busywork.