Economical and Performant Deployment of Large Language Models | ZeroChapter