Besides switching to cheaper models, what have you personally used to reduce cost in real applications?
https://github.com/rtk-ai/rtk