Anyone used LLM for UI E2E testing?
Did it work? Was it better than writing cucumber (or any other feature) tests?
We added ChatGPT Operator to UI testing, starting soon after it launched. It's only used as an extra testing step on top of everything we had previously used. A quick summary is: on the plus side, it sometimes gives us a much faster feedback cycle. On the minus side, it sometimes dives headfirst into advanced UI features, and can't find a way to backtrack when it makes a mistake there.