summaryrefslogtreecommitdiff
path: root/common/sampling.h
diff options
context:
space:
mode:
authorAnton Sokolchenko <wsevendays@gmail.com>2025-06-19 09:18:21 +0200
committerGitHub <noreply@github.com>2025-06-19 10:18:21 +0300
commit39e17589a28670ea37b3c8aad36fea08c4eb9e97 (patch)
treea80059e213215daf6353d6f24c1e3a1c29fc44f2 /common/sampling.h
parentc6166b4020b80e8085816ff55dc8635aeb6260f2 (diff)
Update CMakeLists.txt to fix NDEBUG handling (#537)
without my change | PP | TG | N_KV | T_PP s | S_PP t/s | T_TG s | S_TG t/s | | --- | --- | ---- | ------ | -------- | ------ | -------- | ggml_backend_cuda_graph_compute: disabling CUDA graphs due to mul_mat_id ggml_backend_cuda_graph_compute: disabling CUDA graphs due to too many consecutive updates | 8192 | 2048 | 0 | 54.433 | 150.50 | 414.061 | 4.95 | | 8192 | 2048 | 8192 | 64.162 | 127.68 | 428.767 | 4.78 | after my change to CMakeLists.txt | PP | TG | N_KV | T_PP s | S_PP t/s | T_TG s | S_TG t/s | |-------|--------|--------|----------|----------|----------|----------| | 8192 | 2048 | 0 | 58.363 | 140.36 | 405.040 | 5.06 | | 8192 | 2048 | 8192 | 63.752 | 128.50 | 423.548 | 4.84 | | 8192 | 2048 | 16384 | 69.712 | 117.51 | 431.367 | 4.75 |
Diffstat (limited to 'common/sampling.h')
0 files changed, 0 insertions, 0 deletions