index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
examples
Age
Commit message (
Expand
)
Author
2023-10-04
finetune : readme fix typo (#3465)
Merrick Christensen
2023-10-03
main : consistent prefix/suffix coloring (#3425)
h-h-h-h
2023-10-03
llama : fix session saving/loading (#3400)
Georgi Gerganov
2023-10-02
gguf : general usability improvements (#3409)
cebtenzzre
2023-10-02
finetune : fix #3404 (#3437)
xaedes
2023-10-02
cmake : fix transient definitions in find pkg (#3411)
bandoti
2023-10-02
infill : add new example + extend server API (#3296)
vvhg1
2023-09-29
train : fix KQ_pos allocation (#3392)
Georgi Gerganov
2023-09-28
build : enable more non-default compiler warnings (#3200)
Cebtenzzre
2023-09-28
llama.cpp : split llama_context_params into model and context params (#3301)
slaren
2023-09-28
train : finetune LORA (#2632)
xaedes
2023-09-28
llama : custom attention mask + parallel decoding + no context swaps (#3228)
Georgi Gerganov
2023-09-27
make-ggml.py : compatibility with more models and GGUF (#3290)
Richard Roberson
2023-09-27
gguf : fix a few general keys (#3341)
Cebtenzzre
2023-09-27
metal : reusing llama.cpp logging (#3152)
Rickard Hallerbäck
2023-09-27
readme : add some recent perplexity and bpw measurements to READMES, link for...
BarfingLemurs
2023-09-23
llama-bench : add README (#3317)
slaren
2023-09-21
embedding : update README.md (#3224)
yuiseki
2023-09-20
llama : allow gguf RoPE keys to be overridden with defaults (#3240)
Cebtenzzre
2023-09-20
benchmark-matmult : do not use integer abs() on a float (#3277)
Cebtenzzre
2023-09-20
examples : fix benchmark-matmult (#1554)
Georgi Gerganov
2023-09-18
make : restore build-info.h dependency for several targets (#3205)
Cebtenzzre
2023-09-16
Fixing the last deviations from sentencepiece indicated by test-tokenizer-1 (...
goerch
2023-09-15
examples : add compiler version and target to build info (#2998)
Cebtenzzre
2023-09-15
check C++ code with -Wmissing-declarations (#3184)
Cebtenzzre
2023-09-15
sync : ggml (Metal F32 support + reduce ggml-alloc size) (#3192)
Georgi Gerganov
2023-09-15
llama : remove mtest (#3177)
Roland
2023-09-14
cmake : add relocatable Llama package (#2960)
bandoti
2023-09-14
speculative : add heuristic algorithm (#3006)
Leng Yue
2023-09-13
speculative: add --n-gpu-layers-draft option (#3063)
FK
2023-09-08
examples : make n_ctx warning work again (#3066)
Cebtenzzre
2023-09-08
build : do not use _GNU_SOURCE gratuitously (#2035)
Przemysław Pawełczyk
2023-09-07
fix some warnings from gcc and clang-tidy (#3038)
Cebtenzzre
2023-09-07
llama-bench : use two tokens in the warmup run for prompt evals (#3059)
slaren
2023-09-05
examples : replace fprintf to stdout with printf (#3017)
Cebtenzzre
2023-09-05
speculative : add grammar support (#2991)
Georgi Gerganov
2023-09-04
build : on Mac OS enable Metal by default (#2901)
Georgi Gerganov
2023-09-04
llama-bench : make cpp file non-executable (#2999)
Cebtenzzre
2023-09-04
server : add a subtle loading animation to the edit box (#2466)
Aarni Koskela
2023-09-03
speculative : PoC for speeding-up inference via speculative sampling (#2926)
Georgi Gerganov
2023-09-03
perplexity : fix ETA by warming up the model with an empty run
Georgi Gerganov
2023-09-03
examples : fix gpt-neox (#2943)
momonga
2023-09-02
server : avoid aniprompt in probabilities of final response (#2849)
Jhen-Jie Hong
2023-09-01
readme : quick start command fix (#2908)
ZHAOKAI WANG
2023-09-01
Allow quantize to only copy tensors, some other improvements (#2931)
Kerfuffle
2023-09-01
llama2c : rename function
Georgi Gerganov
2023-09-01
minor : add const qualifiers (#2853)
m3ndax
2023-09-01
build : fix most gcc and clang warnings (#2861)
Cebtenzzre
2023-09-01
llama2c : fix segfault and alloc-dealloc-mismatch (#2913)
Cebtenzzre
2023-08-31
scripts: Use local gguf package when running from repo (#2927)
Kerfuffle
[next]