I tested every model available on huggingface.com and none of it made it into any kind of regular use for me. Useless, heavily biased replies even with so-called uncensored models and hallucinations are the main reasons. Nothing beats cloud llms when it comes to quality and unfortunately, if you have privacy sensitive data, you better not rely on AI to deal with it, because local LLMs won't make you happy anytime soon. You will just notice how much time you wasted hoping to achieve something with local LLMs that they won't deliver. I wrote this out of anger that nobody adresses this elephant in the room.
To the platform victor go the spoils. I've had the greatest leaps recently with Cursor, which not only is a terrific RAG application but integrates several models. Does anyone want to write and maintain that who is not in that business? No. Hence, platforms.
Eventually the marginal benefits might plateau in combination with enough optimizations to make local use outweigh any cloud models.
More specific and narrow use cases are a different matter.