path: root/examples/embedding/embedding.cpp
| Age | Commit message | Author |
|---|---|---|
| 2024-08-12 | Merge mainline - Aug 12 2024 (#17) | Kawrakow |
| 2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow |
| 2024-06-21 | llama : allow pooled embeddings on any model (#7477) | Douglas Hanley |
| 2024-06-04 | common : refactor cli arg parsing (#7675) | Georgi Gerganov |
| 2024-05-22 | common : normalize naming style (#7462) | Georgi Gerganov |
| 2024-05-15 | embedding : free the batch after execution (#7297) | dm4 |
| 2024-05-11 | llama : add Jina Embeddings architecture (#6826) | Joan Fontanals |
| 2024-04-09 | BERT tokenizer fixes (#6498) | Jared Van Bortel |
| 2024-03-27 | embedding : show full embedding for single prompt (#6342) | howlger |
| 2024-03-26 | embedding : adjust `n_ubatch` value (#6296) | Minsoo Cheong |
| 2024-03-14 | embedding : add EOS token if not present (#899) | Georgi Gerganov |
| 2024-03-14 | embedding : print all resulting embeddings (#899) | Georgi Gerganov |
| 2024-03-14 | embedding : print cosine similarity (#899) | Georgi Gerganov |
| 2024-03-13 | llama : add pipeline parallelism support (#6017) | slaren |
| 2024-03-09 | server : normalize embeddings (#5956) | SeungWon Jeong |
| 2024-03-04 | llama : fix embeddings (#5796) | Georgi Gerganov |
| 2024-02-16 | ggml : add numa options (#5377) | bmwl |
| 2024-02-13 | llama : support batched embeddings (#5466) | Douglas Hanley |
| 2024-02-11 | Add support for BERT embedding models (#5423) | Douglas Hanley |
| 2023-11-02 | build : link against build info instead of compiling against it (#3879) | cebtenzzre |
| 2023-09-28 | llama.cpp : split llama_context_params into model and context params (#3301) | slaren |
| 2023-09-28 | llama : custom attention mask + parallel decoding + no context swaps (#3228) | Georgi Gerganov |
| 2023-09-18 | make : restore build-info.h dependency for several targets (#3205) | Cebtenzzre |
| 2023-09-15 | examples : add compiler version and target to build info (#2998) | Cebtenzzre |
| 2023-09-08 | examples : make n_ctx warning work again (#3066) | Cebtenzzre |
| 2023-09-07 | fix some warnings from gcc and clang-tidy (#3038) | Cebtenzzre |
| 2023-08-27 | llama : more tokenizer fixes (#2810) | Georgi Gerganov |
| 2023-08-22 | embedding : evaluate prompt in batches (#2713) | slaren |
| 2023-08-21 | gguf : new file format with flexible meta data (beta) (#2398) | Georgi Gerganov |
| 2023-07-10 | mpi : add support for distributed inference via MPI (#2099) | Evan Miller |
| 2023-07-06 | convert : update for baichuan (#2081) | Judd |
| 2023-06-29 | Use unsigned for random seed (#2006) | Howard Su |
| 2023-06-26 | ggml : add NUMA support (#1556) | zrm |
| 2023-06-24 | llama : make model stateless and context stateful (llama_state) (#1797) | Didzis Gosko |
| 2023-06-16 | build : fix and ignore MSVC warnings (#1889) | Borislav Stanimirov |
| 2023-05-20 | llama : add llama_init_backend() API (close #1527) | Georgi Gerganov |
| 2023-05-16 | define default model path once, sync path with readme (#1366) | András Salamon |
| 2023-05-13 | embedding : remove unused code (#1426) | Rinne |
| 2023-05-02 | examples : add llama_init_from_gpt_params() common function (#1290) | Ron Evans |
| 2023-05-02 | llama : allow 0 as a seed number. (#1275) | Robert Brisita |
| 2023-05-01 | Add git-based build information for better issue tracking (#1232) | DannyDaemonic |
| 2023-04-16 | examples: add missing <ctime> include for time() (#1011) | Pavol Rusnak |
| 2023-04-10 | Rewrite loading code to try to satisfy everyone: | comex |
| 2023-03-25 | Cleanup STL headers + fix embedding examples + minor stuff | Georgi Gerganov |
| 2023-03-25 | Overhaul the examples structure | Georgi Gerganov |