* Or is this a problem better solved through deterministic verification and better tooling?
Would love to hear real-world perspectives from those working in robotics infrastructure, fleet management, or simulation , what’s actually working (or not)?
Basic arithmetic can meaningfully detect every error you just listed. AI probably cannot "beat the odds" against a simple integral function.
So I guess to answer your question, I think yes, the second, better tooling (and a ton of metrics data collected from the fleet with good versioning).