Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-09-03 | speculative : PoC for speeding-up inference via speculative sampling (#2926) | Georgi Gerganov | |
* speculative : initial example
* speculative : print encoding speed
* speculative : add --draft CLI arg
index : ik_llama.cpp.git | |