OpenAI Agent

Use OpenAI models for chat, structured output, vision, and safety checks.

What it does

  • Sends prompts to OpenAI models for chat-style responses, structured data, images, or vision analysis.
  • Provides moderation to screen content and embeddings for search or matching.

Common uses

  • Draft or summarize text from orders, tickets, or files.
  • Extract structured fields from freeform text using the structured output action.
  • Create embeddings for semantic search or deduplication.
  • Generate or edit images and answer questions about images.

Key actions from the spec

  • Chat and text generation: chat completions and freeform text completions with temperature and stop controls.
  • Structured output: enforce a JSON schema for reliable field extraction.
  • Vision: describe or answer questions about images, and generate or modify images from prompts.
  • Embeddings: create vector embeddings from text.
  • Moderation: check text against safety policies before sending it onward.

Setup notes

  • Requires an OpenAI connection or API key with access to your chosen models.
  • Pick models (for example, gpt-4o) and adjust tokens or temperature based on how creative versus precise you need the output.
  • Provide images as base64 or a reachable URL when using vision actions.