How can I use CUDA streams in llamacpp? - Stack Overflow

Posted by programmeradmin on 2025-02-10

I see support for CUDA streams in the ggml CUDA implementation (for example, here .cpp/blob/master/ggml/src/ggml-cuda/softmax.cu#L172 and here .cpp/blob/master/ggml/src/ggml-cuda/common.cuh#L674 ), but it seems to be of no use, since ggml_backend_cuda_context.stream() always returns stream #0: .cpp/blob/master/ggml/src/ggml-cuda/common.cuh#L683
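To make the concern concrete, here is a minimal, CUDA-free model of the pattern I am describing. It is only a sketch of my reading of the linked lines, not the actual ggml code: `mock_stream`, `mock_cuda_context`, `launch_op`, and the two size constants are hypothetical names invented for illustration, and `mock_stream` merely stands in for a real `cudaStream_t`.

```cpp
#include <array>
#include <memory>

// Hypothetical stand-in for cudaStream_t so the sketch builds without CUDA.
struct mock_stream {
    int device;
    int index;
};

// Assumed limits, chosen only for this illustration.
constexpr int kMaxDevices = 2;
constexpr int kMaxStreams = 4;

// Hypothetical model of a backend context that owns per-device streams.
struct mock_cuda_context {
    int device = 0;
    std::array<std::array<std::unique_ptr<mock_stream>, kMaxStreams>, kMaxDevices> streams;

    // Lazily create the stream for (dev, index) on first use.
    mock_stream* stream(int dev, int index) {
        auto& slot = streams[dev][index];
        if (!slot) {
            slot = std::make_unique<mock_stream>(mock_stream{dev, index});
        }
        return slot.get();
    }

    // Default accessor: always resolves to stream index 0 of the current device.
    mock_stream* stream() { return stream(device, 0); }
};

// Hypothetical op launcher that only ever asks for the default accessor.
int launch_op(mock_cuda_context& ctx) {
    return ctx.stream()->index;
}
```

In this model the context can hand out other streams through `stream(dev, index)`, but any caller that goes through `stream()` ends up on index 0, which is the behavior I am asking about.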

Am I right? Is there really no way to use CUDA streams in llamacpp?
