summaryrefslogtreecommitdiff
path: root/examples/main-cmake-pkg
diff options
context:
space:
mode:
authoragray3 <agray3@users.noreply.github.com>2024-05-15 14:44:49 +0100
committerGitHub <noreply@github.com>2024-05-15 15:44:49 +0200
commitdc020985b8755dd6aa93a2f002f43c3ede808cce (patch)
treea4be81a8ce9f08fbafbc92c3e38ee892192bfe91 /examples/main-cmake-pkg
parent344f9126cc0d15891fde9472fe40b8572628ad7d (diff)
Avoid unnecessarily disabling CUDA graphs (#7302)
As discussed in PR #6766, CUDA graphs were being disabled in the presence of long prompts. This fixes the issue by avoiding the consective update counter from incrementing unnecessarily for tokens in which cuda graphs are disabled due to batch size > 1.
Diffstat (limited to 'examples/main-cmake-pkg')
0 files changed, 0 insertions, 0 deletions