A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://developers.cloudflare.com/ai-gateway/integrations/worker-binding-methods/ below:

AI Gateway Binding Methods · Cloudflare AI Gateway docs

AI Gateway Binding Methods

Last reviewed: 3 months ago

This guide provides an overview of how to use the latest Cloudflare Workers AI Gateway binding methods. You will learn how to set up an AI Gateway binding, access new methods, and integrate them into your Workers.

1. Add an AI Binding to your Worker

To connect your Worker to Workers AI, add the following to your Wrangler configuration file:

{

"ai": {

"binding": "AI"

}

}

This configuration sets up the AI binding accessible in your Worker code as env.AI.

If you're using TypeScript, run wrangler types whenever you modify your Wrangler configuration file. This generates types for the env object based on your bindings, as well as runtime types.

2. Basic Usage with Workers AI + Gateway

To perform an inference task using Workers AI and an AI Gateway, you can use the following code:

const resp = await env.AI.run(

"@cf/meta/llama-3.1-8b-instruct",

{

prompt: "tell me a joke",

},

{

gateway: {

id: "my-gateway",

},

},

);

Additionally, you can access the latest request log ID with:

const myLogId = env.AI.aiGatewayLogId;

3. Access the Gateway Binding

You can access your AI Gateway binding using the following code:

const gateway = env.AI.gateway("my-gateway");

Once you have the gateway instance, you can use the following methods:

3.1. patchLog: Send Feedback

The patchLog method allows you to send feedback, score, and metadata for a specific log ID. All object properties are optional, so you can include any combination of the parameters:

gateway.patchLog("my-log-id", {

feedback: 1,

score: 100,

metadata: {

user: "123",

},

});

3.2. getLog: Read Log Details

The getLog method retrieves details of a specific log ID. It returns an object of type Promise<AiGatewayLog>. If this type is missing, ensure you have run wrangler types.

const log = await gateway.getLog("my-log-id");

3.3. getUrl: Get Gateway URLs

The getUrl method allows you to retrieve the base URL for your AI Gateway, optionally specifying a provider to get the provider-specific endpoint.

// Get the base gateway URL

const baseUrl = await gateway.getUrl();

// Output: https://gateway.ai.cloudflare.com/v1/my-account-id/my-gateway/

// Get a provider-specific URL

const openaiUrl = await gateway.getUrl("openai");

// Output: https://gateway.ai.cloudflare.com/v1/my-account-id/my-gateway/openai

The getUrl method is particularly useful for integrating with popular AI SDKs:

OpenAI SDK:

import OpenAI from "openai";

const openai = new OpenAI({

apiKey: "my api key", // defaults to process.env["OPENAI_API_KEY"]

baseURL: await env.AI.gateway("my-gateway").getUrl("openai"),

});

Vercel AI SDK with OpenAI:

import { createOpenAI } from "@ai-sdk/openai";

const openai = createOpenAI({

baseURL: await env.AI.gateway("my-gateway").getUrl("openai"),

});

Vercel AI SDK with Anthropic:

import { createAnthropic } from "@ai-sdk/anthropic";

const anthropic = createAnthropic({

baseURL: await env.AI.gateway("my-gateway").getUrl("anthropic"),

});

3.4. run: Universal Requests

The run method allows you to execute universal requests. Users can pass either a single universal request object or an array of them. This method supports all AI Gateway providers.

Refer to the Universal endpoint documentation for details about the available inputs.

const resp = await gateway.run({

provider: "workers-ai",

endpoint: "@cf/meta/llama-3.1-8b-instruct",

headers: {

authorization: "Bearer my-api-token",

},

query: {

prompt: "tell me a joke",

},

});

With these AI Gateway binding methods, you can now:

These methods offer greater flexibility and control over your AI integrations, empowering you to build more sophisticated applications on the Cloudflare Workers platform.


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4