index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
examples
Age
Commit message (
Expand
)
Author
2024-02-09
llava : add requirements.txt and update README.md (#5428)
Daniel Bevenius
2024-02-09
server : fix prompt caching for repeated prompts (#5420)
Riley Stewart
2024-02-08
llava : add missing .py, and fix paths in README.md (#5414)
Daniel Bevenius
2024-02-08
llava: fix typo/formatting in README.md (#5405)
Daniel Bevenius
2024-02-07
llava-cli : always tokenize special tokens (#5382)
Xiao-Yong Jin
2024-02-07
server : update `/props` with "total_slots" value (#5373)
Justin Parker
2024-02-06
server : remove model.json endpoint (#5371)
Alexey Parfenov
2024-02-06
server : include total "num_slots" in props endpoint (#5349)
Justin Parker
2024-02-06
server : add `dynatemp_range` and `dynatemp_exponent` (#5352)
Michael Coppola
2024-02-06
server : various fixes for the prompt field in /completion (#5300)
Niall Coates
2024-02-05
server : allow to get default generation settings for completion (#5307)
Alexey Parfenov
2024-02-04
Adding some imatrix tools (#5302)
Kawrakow
2024-02-03
refactor : switch to emplace_back to avoid extra object (#5291)
Michael Klimenko
2024-02-02
perplexity : fix KL divergence calculations on Windows (#5273)
kalomaze
2024-02-02
[SYCL] update guide of SYCL backend (#5254)
Neo Zhang Jianyu
2024-02-01
add --no-mmap in llama-bench (#5257)
Neo Zhang Jianyu
2024-01-31
llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)
Georgi Gerganov
2024-01-31
llava : add MobileVLM support (#5132)
JidongZhang-THU
2024-01-31
format license text, restore apache license by legal suggestion (#5233)
Neo Zhang Jianyu
2024-01-31
support SYCL backend windows build (#5208)
Neo Zhang Jianyu
2024-01-30
kompute : llama-bench support and ggml_cpu_has_kompute() (#5226)
Jared Van Bortel
2024-01-30
Revert "server : change deps.sh xxd files to string literals (#5221)"
Georgi Gerganov
2024-01-30
server : fix context shift (#5195)
Georgi Gerganov
2024-01-30
server : change deps.sh xxd files to string literals (#5221)
JohnnyB
2024-01-30
SOTA 3-bit quants (#5196)
Kawrakow
2024-01-30
quantize : fix typo (#5211)
Vladimir Malyutin
2024-01-30
main : allow empty --prompt-cache file (#5176)
divinity76
2024-01-30
server : improve README (#5209)
Wu Jian Ping
2024-01-29
server : embeddings compatibility for OpenAI (#5190)
Wu Jian Ping
2024-01-28
ggml : add Vulkan backend (#2059)
0cc4m
2024-01-28
ggml : add unified SYCL backend for Intel GPUs (#2690)
Abhilash Majumder
2024-01-28
docker : add server-first container images (#5157)
Kyle Mistele
2024-01-27
llava : support for Yi-VL and fix for mobileVLM (#5093)
John
2024-01-27
sync : ggml
Georgi Gerganov
2024-01-27
Remove unused data and add fixes (#5154)
Michael Klimenko
2024-01-27
server : add self-extend support (#5104)
Maximilian Winter
2024-01-26
server : refactored the task processing logic (#5065)
Xuan Son Nguyen
2024-01-25
examples : make pydantic scripts pass mypy and support py3.8 (#5099)
Jared Van Bortel
2024-01-25
android : use release cmake build type by default (#5123)
Valentin Konovalov
2024-01-23
Additional KL-divergence statistics (#5081)
Kawrakow
2024-01-23
minor : clean-up some warnings and style (#5094)
Georgi Gerganov
2024-01-23
llama.vim : added api key support (#5090)
Michael Coppola
2024-01-22
KL-divergence (#5076)
Kawrakow
2024-01-22
llava : MobileVLM support (#4954)
XiaotaoChen
2024-01-22
imatrix : keep intermediate imatrix results (#5077)
Kawrakow
2024-01-22
finetune : print sample-start/include-sample-start (#5072)
Daniel Bevenius
2024-01-22
llama : add Q3_K_XS (#5060)
Kawrakow
2024-01-21
Add ability to evauate multiple choice tasks (#5047)
Kawrakow
2024-01-21
Slightly faster imatrix (#5050)
Kawrakow
2024-01-20
perplexity : fix MSVC build after #5020 (#5043)
Jared Van Bortel
[next]