CI

CueInference

Live inference demo

GLM 5.2

0.0 tok/s·avg 0.0

CI

How can I help you today?

Pick a model and send a prompt. Watch tokens stream in real time with live throughput in the header.

CueInference can make mistakes. Live tok/s shown in header.