Energy efficiency advisor for LLM inference that uses empirical data across GPUs and quantization levels to optimize batch size and precision and to reduce energy waste.