Prakash, Chandra2024-09-042024-09-042024-05http://hdl.handle.net/123456789/3596enLarge Language Models (LLMs)Model compressionSafeguarding performanceDISTDS1278AMTModel compression for large language modelsThesis