index : ik_llama.cpp.git (main)
path: root/examples
| Age | Commit message | Author |
|---|---|---|
| 2024-01-25 | examples : make pydantic scripts pass mypy and support py3.8 (#5099) | Jared Van Bortel |
| 2024-01-25 | android : use release cmake build type by default (#5123) | Valentin Konovalov |
| 2024-01-23 | Additional KL-divergence statistics (#5081) | Kawrakow |
| 2024-01-23 | minor : clean-up some warnings and style (#5094) | Georgi Gerganov |
| 2024-01-23 | llama.vim : added api key support (#5090) | Michael Coppola |
| 2024-01-22 | KL-divergence (#5076) | Kawrakow |
| 2024-01-22 | llava : MobileVLM support (#4954) | XiaotaoChen |
| 2024-01-22 | imatrix : keep intermediate imatrix results (#5077) | Kawrakow |
| 2024-01-22 | finetune : print sample-start/include-sample-start (#5072) | Daniel Bevenius |
| 2024-01-22 | llama : add Q3_K_XS (#5060) | Kawrakow |
| 2024-01-21 | Add ability to evauate multiple choice tasks (#5047) | Kawrakow |
| 2024-01-21 | Slightly faster imatrix (#5050) | Kawrakow |
| 2024-01-20 | perplexity : fix MSVC build after #5020 (#5043) | Jared Van Bortel |
| 2024-01-19 | finetune : fix ggml_allocr lifetimes (tmp workaround) (#5033) | Uzo Nweke |
| 2024-01-19 | imatrix : add README.md | Georgi Gerganov |
| 2024-01-19 | winogrande: evaluate log-probs in parallel (#5036) | Kawrakow |
| 2024-01-19 | perplexity: avoid unnecessary alloocations and logit copies (#5035) | Kawrakow |
| 2024-01-19 | perplexity : faster Winogrande via batching (#5024) | Georgi Gerganov |
| 2024-01-18 | server : defer tasks when "slot unavailable" (#5018) | Xuan Son Nguyen |
| 2024-01-18 | imatrix : fix assert for src0 non-cont check | Georgi Gerganov |
| 2024-01-18 | perplexity : fix winogrande N tasks option | Georgi Gerganov |
| 2024-01-18 | HellaSwag: speed up by parallelizing log-prob evaluation (#5020) | Kawrakow |
| 2024-01-18 | perplexity : faster HellaSwag via batching (#5017) | Georgi Gerganov |
| 2024-01-18 | Add Winogrande evaluation (#5015) | Kawrakow |
| 2024-01-17 | imatrix : offload to GPU support (#4957) | Georgi Gerganov |
| 2024-01-16 | finetune : add training data file to log message (#4979) | Daniel Bevenius |
| 2024-01-16 | examples : add complete parallel function calling example (#4974) | Maximilian Winter |
| 2024-01-16 | perplexity : fix kv cache handling for hellaswag (#4981) | Georgi Gerganov |
| 2024-01-16 | android : introduce starter project example (#4926) | Neuman Vong |
| 2024-01-16 | examples : fix and improv docs for the grammar generator (#4909) | Maximilian Winter |
| 2024-01-16 | finetune : use LLAMA_FILE_MAGIC_GGLA (#4961) | Daniel Bevenius |
| 2024-01-16 | speculative : threading options (#4959) | stduhpf |
| 2024-01-14 | Add ability to use importance matrix for all k-quants (#4930) | Kawrakow |
| 2024-01-14 | 2-bit quantizations (#4897) | Kawrakow |
| 2024-01-13 | metal : remove old API (#4919) | Georgi Gerganov |
| 2024-01-13 | server : fix prompt caching with system prompt (#4914) | Georgi Gerganov |
| 2024-01-13 | llama : minimize size used for state save/load (#4820) | David Friehs |
| 2024-01-13 | main : add parameter --no-display-prompt (#4541) | Yann Follet |
| 2024-01-13 | server : fix deadlock that occurs in multi-prompt scenarios (#4905) | Ziad Ben Hadj-Alouane |
| 2024-01-13 | server : fix crash with multimodal models without BOS token (#4904) | makomk |
| 2024-01-12 | examples : add pydantic models to GBNF grammar generator (#4883) | Maximilian Winter |
| 2024-01-12 | llama : ggml-backend integration (#4766) | slaren |
| 2024-01-12 | export-lora : use LLAMA_FILE_MAGIC_GGLA (#4894) | Daniel Bevenius |
| 2024-01-12 | llama.swiftui : update models layout (#4826) | Zay |
| 2024-01-12 | Importance Matrix calculation (#4861) | Kawrakow |
| 2024-01-11 | server : fix infill when prompt is empty (#4833) | Georgi Gerganov |
| 2024-01-11 | main : better name for variable n_print (#4874) | Georgi Gerganov |
| 2024-01-11 | main : disable token count by default (#4874) | Georgi Gerganov |
| 2024-01-11 | llama : restore intended k-quants mixes for MoE models (#4872) | Kawrakow |
| 2024-01-11 | server : implement credentialed CORS (#4514) | Laura |