| Age | Commit message (Expand) | Author | 
|---|---|---|
| 2024-01-31 | llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240) | Georgi Gerganov | 
| 2024-01-12 | llama : ggml-backend integration (#4766) | slaren | 
| 2023-12-01 | ggml : add ggml_soft_max_ext (#4256) | Georgi Gerganov | 
| 2023-10-29 | Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843) | Kerfuffle | 
| 2023-10-25 | batched-bench : print params at start | Georgi Gerganov | 
| 2023-10-18 | speculative : add tree-based sampling example (#3624) | Georgi Gerganov | 
| 2023-10-11 | batched : add bench tool (#3545) | Georgi Gerganov | 
