HACKER Q&A
📣 emschwartz

Are you using a smaller LLM for anything?


Have you had any luck using a smaller model (<= 3B parameters) for anything? Every time I've poked around with them, they seem to stupid to follow the instructions I try to provide.

Curious if others have had any more luck and, if so, which model and for what use case.


  👤 dimgl Accepted Answer ✓
I'm currently using a 32B Qwen model for summarization and it's pretty good.