index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
examples
/
server
/
utils.hpp
Age
Commit message (
Expand
)
Author
2025-06-08
Fix non rpc build error (#506)
firecoperana
2025-06-08
Revert "Rpc improvement (#480)"
Iwan Kawrakow
2025-06-08
Rpc improvement (#480)
firecoperana
2025-06-08
Webui improvement (#481)
firecoperana
2024-08-12
Merge mainline - Aug 12 2024 (#17)
Kawrakow
2024-07-27
Merge mainline llama.cpp (#3)
Kawrakow
2024-06-08
server : smart slot selection using Longest Common Prefix (#7728)
sasha0552
2024-06-04
common : refactor cli arg parsing (#7675)
Georgi Gerganov
2024-05-13
change default temperature of OAI compat API from 0 to 1 (#7226)
Benjamin Findley
2024-05-08
JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Johannes Gäßler
2024-05-08
clean up json_value & server_log (#7142)
Xuan Son Nguyen
2024-04-21
llama : support Llama 3 HF conversion (#6745)
Pedro Cuenca
2024-04-06
ci: bench: support sse and fix prompt processing time / server: add tokens us...
Pierrick Hymbert
2024-04-03
server : handle exception on wrong type in request (#6452)
JH23X
2024-03-25
Server: clean up OAI params parsing function (#6284)
Xuan Son Nguyen
2024-03-23
server: flush stdout after logging in both text and json layout (#6253)
Pierrick Hymbert
2024-03-22
json-schema-to-grammar : fix order of props + non-str const/enum (#6232)
Olivier Chafik
2024-03-21
json-schema-to-grammar improvements (+ added to server) (#5978)
Olivier Chafik
2024-03-20
Server: Handle n_keep parameter in the request (#6174)
Karthick
2024-03-13
Server: Use multi-task for embeddings endpoint (#6001)
Xuan Son Nguyen
2024-03-11
Server: format error to json (#5961)
Xuan Son Nguyen
2024-03-11
server : maintain chat completion id for streaming responses (#5988)
Minsoo Cheong
2024-03-07
server : refactor (#5882)
Georgi Gerganov
2024-03-02
server: tests: passkey challenge / self-extend with context shift demo (#5832)
Pierrick Hymbert
2024-02-29
Server: normalize naming (#5779)
Xuan Son Nguyen
2024-02-25
server: logs - unified format and --log-format option (#5700)
Pierrick Hymbert
2024-02-25
server: concurrency fix + monitoring - add /metrics prometheus compatible end...
Pierrick Hymbert
2024-02-21
server: health: fix race condition on slots data using tasks queue (#5634)
Pierrick Hymbert
2024-02-20
Server: use llama_chat_apply_template (#5593)
Xuan Son Nguyen
2024-02-18
server : graceful server shutdown (#5244)
Daniel Hiltgen
2024-02-11
server : add llama2 chat template (#5425)
Xuan Son Nguyen
2024-01-27
sync : ggml
Georgi Gerganov
2024-01-26
server : refactored the task processing logic (#5065)
Xuan Son Nguyen