
Prompting: Systematically iterate on prompts.
- Unified prompt management dashboard
- Side-by-side model comparisons
- Native support for tools, structured outputs, and OpenAPI specs

Workflows: The IDE for building steerable, agent-like systems.
- Visual graph builder for orchestrating complex AI systems
- Support for any model, custom code, and map/reduce functions
- Built-in features like loops, parallelism, and error handling

Workflows SDK: A full-stack SDK and GUI for building AI apps with precision and flexibility.
- Bi-directional sync between code and UI
- Fine-grained control flow and global state management
- Customizable with Docker and streaming support

Evaluations: Test-driven development for AI.
- Build scalable test suites via UI, API, or CSV
- Use pre-built or custom metrics for evaluation
- Track quality, cost, latency, and regressions over time

Retrieval: Powerful RAG infrastructure without the complexity.
- Simple APIs for uploading and querying unstructured data
- Tweak chunking, embeddings, and search for advanced use cases
- Supports various file types, including tables and images

Deployments: Update AI systems without redeploying your app.
- One-click deploy across any model or provider
- Staging environments for safe iteration
- Scalable inference endpoints for production use

Observability: Full visibility into AI system performance.
- Audit logs, debugging tools, and feedback capture
- Evaluation loops on live traffic
- Dashboards to track cost, latency, errors, and trends