Cloudflare just released their AI inference offering along with a partnership with Hugging Face.
In this tutorial I show you how to use their REST API for inference as well as how to use the AI Gateway product, which I am most excited for.
The AI Gateway product lets you cache, rate limit, and log errors/responses/tokens for inference endpoints from Hugging Face, OpenAI, Replicate, and Cloudflare.
Check out the blog post from Cloudflare which includes pricing info: https://blog.cloudflare.com/workers-ai/
#ai #cloudflare