index : ik_llama.cpp.git (branch: main)
path: root/examples/embedding/embedding.cpp
Age
Commit message
Author
2024-08-12
Merge mainline - Aug 12 2024 (#17)
Kawrakow
2024-07-27
Merge mainline llama.cpp (#3)
Kawrakow
2024-06-21
llama : allow pooled embeddings on any model (#7477)
Douglas Hanley
2024-06-04
common : refactor cli arg parsing (#7675)
Georgi Gerganov
2024-05-22
common : normalize naming style (#7462)
Georgi Gerganov
2024-05-15
embedding : free the batch after execution (#7297)
dm4
2024-05-11
llama : add Jina Embeddings architecture (#6826)
Joan Fontanals
2024-04-09
BERT tokenizer fixes (#6498)
Jared Van Bortel
2024-03-27
embedding : show full embedding for single prompt (#6342)
howlger
2024-03-26
embedding : adjust `n_ubatch` value (#6296)
Minsoo Cheong
2024-03-14
embedding : add EOS token if not present (#899)
Georgi Gerganov
2024-03-14
embedding : print all resulting embeddings (#899)
Georgi Gerganov
2024-03-14
embedding : print cosine similarity (#899)
Georgi Gerganov
2024-03-13
llama : add pipeline parallelism support (#6017)
slaren
2024-03-09
server : normalize embeddings (#5956)
SeungWon Jeong
2024-03-04
llama : fix embeddings (#5796)
Georgi Gerganov
2024-02-16
ggml : add numa options (#5377)
bmwl
2024-02-13
llama : support batched embeddings (#5466)
Douglas Hanley
2024-02-11
Add support for BERT embedding models (#5423)
Douglas Hanley
2023-11-02
build : link against build info instead of compiling against it (#3879)
cebtenzzre
2023-09-28
llama.cpp : split llama_context_params into model and context params (#3301)
slaren
2023-09-28
llama : custom attention mask + parallel decoding + no context swaps (#3228)
Georgi Gerganov
2023-09-18
make : restore build-info.h dependency for several targets (#3205)
Cebtenzzre
2023-09-15
examples : add compiler version and target to build info (#2998)
Cebtenzzre
2023-09-08
examples : make n_ctx warning work again (#3066)
Cebtenzzre
2023-09-07
fix some warnings from gcc and clang-tidy (#3038)
Cebtenzzre
2023-08-27
llama : more tokenizer fixes (#2810)
Georgi Gerganov
2023-08-22
embedding : evaluate prompt in batches (#2713)
slaren
2023-08-21
gguf : new file format with flexible meta data (beta) (#2398)
Georgi Gerganov
2023-07-10
mpi : add support for distributed inference via MPI (#2099)
Evan Miller
2023-07-06
convert : update for baichuan (#2081)
Judd
2023-06-29
Use unsigned for random seed (#2006)
Howard Su
2023-06-26
ggml : add NUMA support (#1556)
zrm
2023-06-24
llama : make model stateless and context stateful (llama_state) (#1797)
Didzis Gosko
2023-06-16
build : fix and ignore MSVC warnings (#1889)
Borislav Stanimirov
2023-05-20
llama : add llama_init_backend() API (close #1527)
Georgi Gerganov
2023-05-16
define default model path once, sync path with readme (#1366)
András Salamon
2023-05-13
embedding : remove unused code (#1426)
Rinne
2023-05-02
examples : add llama_init_from_gpt_params() common function (#1290)
Ron Evans
2023-05-02
llama : allow 0 as a seed number. (#1275)
Robert Brisita
2023-05-01
Add git-based build information for better issue tracking (#1232)
DannyDaemonic
2023-04-16
examples: add missing <ctime> include for time() (#1011)
Pavol Rusnak
2023-04-10
Rewrite loading code to try to satisfy everyone:
comex
2023-03-25
Cleanup STL headers + fix embedding examples + minor stuff
Georgi Gerganov
2023-03-25
Overhaul the examples structure
Georgi Gerganov