最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

google cloud platform - What is the scope and meaning of GCP rate limits for Gemini? - Stack Overflow

programmeradmin3浏览0评论

This page mentions "rate limits"

What doest the (current) 4M tokens/minute limit apply to? Is it per project? Per region? I am looking for a precise definition of it. To me, the online documentation mixes quotas with limits yet stating that these are distinct concepts.

This is what Gemini itself says about it. I want to double-check with a human:

It's per project. This means that the limit applies to all requests made to the model from a single Google Cloud project, regardless of the region.

发布评论

评论列表(0)

  1. 暂无评论