summaryrefslogtreecommitdiff
path: root/examples/server/bench
AgeCommit message (Expand)Author
2024-05-16Revert "server bench: fix bench not waiting for model load (#7284)" (#7334)Pierrick Hymbert
2024-05-15server bench: fix bench not waiting for model load (#7284)Johannes Gäßler
2024-04-30ggml : add Flash Attention (#5021)Georgi Gerganov
2024-04-26bench: server add stop word for PHI-2 (#6916)Pierrick Hymbert
2024-04-06ci: bench: support sse and fix prompt processing time / server: add tokens us...Pierrick Hymbert
2024-04-04ci: bench: add more ftype, fix triggers and bot comment (#6466)Pierrick Hymbert
2024-03-27server: continuous performance monitoring and PR comment (#6283)Pierrick Hymbert
2024-03-09server: benchmark: chat/completions scenario and other llm servers comparison...Pierrick Hymbert