ik_llama.cpp.git

Age	Commit message	Author
2024-06-03	llama : offload to RPC in addition to other backends (#7640)	Radoslav Gerganov
	* llama : offload to RPC in addition to other backends
	  - fix copy_tensor being called on the src buffer instead of the dst buffer
	  - always initialize views in the view_src buffer
	  - add RPC backend to Makefile build
	  - add endpoint to all RPC object names
	* add rpc-server to Makefile
	* Update llama.cpp
	Co-authored-by: slaren <slarengh@gmail.com>
2024-05-28	rpc : resource management rework (#7562)	Radoslav Gerganov
	* rpc : resource management rework
	* address review comments
2024-05-20	rpc : track allocated buffers (#7411)	Radoslav Gerganov
	* rpc : track allocated buffers
	  ref: #7407
	* rpc : pack rpc_tensor tightly
2024-05-17	rpc : set SO_REUSEADDR for the server socket (#7320)	Radoslav Gerganov
	ref: #7293
2024-05-16	rpc : add command line arg for specifying backend memory	Radoslav Gerganov
	ref: #7293
2024-05-14	ggml : add RPC backend (#6829)	Radoslav Gerganov
	* ggml : add RPC backend
	  The RPC backend proxies all operations to a remote server which runs a regular backend (CPU, CUDA, Metal, etc).
	* set TCP_NODELAY
	* add CI workflows
	* Address review comments
	* fix warning
	* implement llama_max_devices() for RPC
	* Address review comments
	* Address review comments
	* wrap sockfd into a struct
	* implement get_alignment and get_max_size
	* add get_device_memory
	* fix warning
	* win32 support
	* add README
	* readme : trim trailing whitespace
	* Address review comments
	* win32 fix
	* Address review comments
	* fix compile warnings on macos