OpenAI Agent

Use OpenAI models for chat, structured output, vision, and safety checks.

What it does

Sends prompts to OpenAI models for chat-style responses, structured data, images, or vision analysis.
Provides moderation to screen content and embeddings for search or matching.

Common uses

Draft or summarize text from orders, tickets, or files.
Extract structured fields from freeform text using the structured output action.
Create embeddings for semantic search or deduplication.
Generate or edit images and answer questions about images.

Key actions from the spec

Chat and text generation: chat completions and freeform text completions with temperature and stop controls.
Structured output: enforce a JSON schema for reliable field extraction.
Vision: describe or answer questions about images, and generate or modify images from prompts.
Embeddings: create vector embeddings from text.
Moderation: check text against safety policies before sending it onward.

Setup notes

Requires an OpenAI connection or API key with access to your chosen models.
Pick models (for example, gpt-4o) and adjust tokens or temperature based on how creative versus precise you need the output.
Provide images as base64 or a reachable URL when using vision actions.