Energy-efficiency advisor for LLM inference that uses empirical data across GPUs and quantization schemes to optimize batch size and precision and to reduce energy waste.