DP-LLM: Runtime Model Adaptation with Dynamic Layer-wise Precision Assignment
Published in The 39th Annual Conference on Neural Information Processing Systems (NeurIPS), 2025
Recommended citation: Sangwoo Kwon, Seong Hoon Seo, Jae W. Lee, and Yeonhong Park, The 39th Annual Conference on Neural Information Processing Systems (NeurIPS), December 2025. https://arxiv.org/abs/2508.06041
