summaryrefslogtreecommitdiff
path: root/examples/main/main.cpp
AgeCommit message (Expand)Author
2023-11-02build : link against build info instead of compiling against it (#3879)cebtenzzre
2023-10-29Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843)Kerfuffle
2023-10-23llama : remove token functions with `context` args in favor of `model` (#3720)Marcus Dunn
2023-10-22main : escape prompt for cfg_negative_prompt and consecutive inputs in main w...vvhg1
2023-10-20sampling : refactor init to use llama_sampling_params (#3696)Georgi Gerganov
2023-10-18speculative : add tree-based sampling example (#3624)Georgi Gerganov
2023-10-17llama : avoid fprintf in favor of LLAMA_LOG (#3538)Georgi Gerganov
2023-10-17tokenizer : special token handling (#3538)staviq
2023-10-11main : fix session loading bug (#3400)Georgi Gerganov
2023-10-11common : fix mirostat state when using multiple sequences (#3543)Kerfuffle
2023-10-03main : consistent prefix/suffix coloring (#3425)h-h-h-h
2023-10-03llama : fix session saving/loading (#3400)Georgi Gerganov
2023-09-28build : enable more non-default compiler warnings (#3200)Cebtenzzre
2023-09-28llama.cpp : split llama_context_params into model and context params (#3301)slaren
2023-09-28llama : custom attention mask + parallel decoding + no context swaps (#3228)Georgi Gerganov
2023-09-15examples : add compiler version and target to build info (#2998)Cebtenzzre
2023-09-15check C++ code with -Wmissing-declarations (#3184)Cebtenzzre
2023-09-15llama : remove mtest (#3177)Roland
2023-09-08examples : make n_ctx warning work again (#3066)Cebtenzzre
2023-09-08build : do not use _GNU_SOURCE gratuitously (#2035)Przemysław Pawełczyk
2023-09-07fix some warnings from gcc and clang-tidy (#3038)Cebtenzzre
2023-09-04build : on Mac OS enable Metal by default (#2901)Georgi Gerganov
2023-09-03speculative : PoC for speeding-up inference via speculative sampling (#2926)Georgi Gerganov
2023-09-03perplexity : fix ETA by warming up the model with an empty runGeorgi Gerganov
2023-08-30main : log file (#2748)staviq
2023-08-28YAML result logging + preset script (#2657)Johannes Gäßler
2023-08-27llama : more tokenizer fixes (#2810)Georgi Gerganov
2023-08-26main : fix bug (penalize_nl=false doesn't work) + suppress warning on mingw (...Dr. Tom Murphy VII Ph.D
2023-08-26Fix spm whitespaces (#2806)klosax
2023-08-24Fix for main example getting stuck when -n -2 and --interactive (#2767)Kerfuffle
2023-08-23llm : add Falcon support (#2717)Georgi Gerganov
2023-08-23main : insert bos if no tokens (#2727)klosax
2023-08-21gguf : new file format with flexible meta data (beta) (#2398)Georgi Gerganov
2023-08-10Add --n-predict -2 for stopping generation on full context (#2565)Christian Demsar
2023-08-04Add --simple-io option for subprocesses and break out console.h and cpp (#1558)DannyDaemonic
2023-07-25main : add `--in-prefix-bos` to prefix BOS to user inputs; keep EOS (#2304)Xiao-Yong Jin
2023-07-23llama : add grammar-based sampling (#1773)Evan Jones
2023-07-23llama : grouped-query attention + LLaMAv2 70B support (#2276)Georgi Gerganov
2023-07-22llama : optimize memory buffers (#2325)Georgi Gerganov
2023-07-21llama : remove cfg smooth factor as it is only a reparameterization of the gu...Guillaume "Vermeille" Sanchez
2023-07-15llama : add custom RoPE (#2054)Xiao-Yong Jin
2023-07-11llama : add classifier-free guidance (#2135)Bach Le
2023-07-10mpi : add support for distributed inference via MPI (#2099)Evan Miller
2023-07-06convert : update for baichuan (#2081)Judd
2023-06-29Use unsigned for random seed (#2006)Howard Su
2023-06-26ggml : add NUMA support (#1556)zrm
2023-06-24llama : make model stateless and context stateful (llama_state) (#1797)Didzis Gosko
2023-06-17minor : warning fixesGeorgi Gerganov
2023-06-16Fixed possible macro redefinition (#1892)FrankHB
2023-06-16build : fix and ignore MSVC warnings (#1889)Borislav Stanimirov