Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-09-05 | speculative : add grammar support (#2991) | Georgi Gerganov | |
* speculative : add grammar support * grammars : add json_arr.gbnf * grammar : add comments to new grammar file * grammar : remove one nested level * common : warm-up with 2 tokens - seems to work better * speculative : print draft token pieces * speculative : reuse grammar parser + better logs and comments * speculative : avoid grammar_mem * make : fix speculative build | |||
2023-09-03 | speculative : PoC for speeding-up inference via speculative sampling (#2926) | Georgi Gerganov | |
* speculative : initial example * speculative : print encoding speed * speculative : add --draft CLI arg |