Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models

In the real world, large language models (LLMs) can serve as assistants that help users accomplish their jobs, and can also support the development of advanced applications. For LLMs to be widely applied, inference efficiency is an essential concern, which has been widely studied in existing work, a...

Bibliographic details
Main authors: Chen, Yushuo; Tang, Tianyi; Xiang, Erge; Li, Linjiang; Zhao, Wayne Xin; Wang, Jing; Chai, Yunpeng; Wen, Ji-Rong
Format: Article
Language: English