最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

latency - How to Reduce OpenAI Azure Response Time for Structured Output Using GPT-4o Mini (Fine-Tuned Model)? - Stack Overflow

programmeradmin0浏览0评论

I am using an Azure OpenAI GPT-4o Mini fine-tuned model to generate structured responses (e.g., JSON format). However, the response time is higher than expected, and I am looking for ways to optimize it. Expected: Lower response time (~1-2s) while maintaining structured and accurate output.

与本文相关的文章

发布评论

评论列表(0)

  1. 暂无评论