Reference Guide

AI Agents

Cross-platform comparison of agent platforms: Anthropic Cowork & Claude Agent SDK, Salesforce Agentforce, Microsoft Copilot Studio, Google Antigravity, OpenAI AgentKit, and LangChain/LangGraph. Pricing, models, tool integration, deployment, and observability.

← Back to Reference Hub

Two complementary surfaces. Claude Agent SDK (Python & TypeScript) is a code-first toolkit for building autonomous agents on the same runtime that powers Claude Code — subagents, hooks, skills, and native MCP. Cowork is Anthropic's user-facing agent product, now GA across all paid tiers, where non-developers spin up multi-step research, writing, and analysis agents. Claude Managed Agents (public beta April 2026) hosts production agents on Anthropic's infra without you running the runtime yourself.

Models: Claude Opus 4.7, Sonnet 4.6, Haiku 4.5 — switchable per agent or task
SDK: subagents with isolated context, hooks for guaranteed-execution gates, skills folder, native MCP
Cowork: Pro $20/mo, Max 5x $100/mo, Max 20x $200/mo, Team Premium $100/seat, Enterprise custom
Cowork enterprise controls: SCIM-based RBAC, spend limits, OpenTelemetry observability
Managed Agents (beta): $0.08 / session-hour + standard API token costs
1M-token context on supported surfaces; prompt caching dramatically reduces token costs for long-running agents

Limitations: Single-vendor models — Anthropic only (no GPT, Gemini). Cowork is an end-user product, not a developer building block — you can't customize its UI. Managed Agents is still beta; Cowork's enterprise role/access tooling is newer than competitors. SDK is opinionated around Claude's runtime conventions, which is great if you live there and friction if you don't.

Code-First SDKManaged RuntimeAnthropic-Only

Salesforce's autonomous AI agent platform, built on the Atlas Reasoning Engine. Designed for enterprises that want agents grounded in CRM data, customer history, and Data Cloud unstructured sources. Every Atlas decision — planning, tool selection, action, reflection — is logged for audit, debugging, and behavior tuning.

Atlas Reasoning Engine: plan → act → evaluate loop with full step-level audit logging
Agentforce Script: hybrid agents combining deterministic workflows with LLM reasoning
Agentforce Voice: AI voice agents across phone, web, and mobile channels
Intelligent Context: low-code pipeline for unstructured/multimodal data grounding
Native to Salesforce CRM, Data Cloud, Slack, MuleSoft — pre-wired to enterprise data
Six pricing routes: Foundations (free starter), $2/conversation, Flex Credits ($500/100K = ~$0.10/action), per-user Add-ons (unlimited usage), Agentforce 1 Editions ($550/user/mo), and Service Cloud bundles

Limitations: Sticker shock at the high end — full Agentforce 1 Editions run $550/user/mo and complete deployments commonly land in $125–$650/user/mo territory. Best ROI sits inside the Salesforce ecosystem; outside it, the value proposition narrows. Implementation typically requires Salesforce Architects or partner help. Pricing model variety is flexible but adds forecasting complexity.

Enterprise CRMSalesforce-Centric

Microsoft's low-code platform for building, deploying, and managing AI agents across Microsoft 365, Teams, Power Platform, and standalone channels. Drag-and-drop topic authoring with code escape hatches; pre-built connectors to Dataverse, SharePoint, Dynamics, and 1,500+ Power Platform connectors.

Multi-model: built-in Microsoft models, plus Azure Foundry BYO (GPT, Claude, Gemini, open-weight)
Pricing: Copilot Credit packs at $200 / 25,000 credits / month, or pay-as-you-go meter
Copilot Credit Pre-Purchase Plan for predictable enterprise spend
Native integration with Microsoft 365 Copilot, Teams, Outlook, SharePoint, Dynamics 365
Copilot Tuning, Copilot Connectors, agent governance via Microsoft Purview
Strong enterprise posture: tenant isolation, EU Data Boundary, sovereign cloud options

Limitations: Credit consumption is opaque — specific actions have varying credit costs that aren't always obvious in advance. BYO Foundry models bill separately from Copilot Credits, complicating cost forecasting. The deepest value lives inside Microsoft 365; outside that ecosystem, you're paying for integrations you may not use. Heavy reliance on Power Platform conventions can feel limiting to code-first developers.

Low-CodeM365 NativeCredit Pricing

Google's two-pronged agent strategy. Antigravity (launched November 2025) is the agent-first IDE built around a "Manager Surface" for spawning multiple parallel coding agents with built-in browser self-verification. Vertex AI Agent Builder / Agent Engine is the production runtime for enterprise agents on Google Cloud, with managed sessions, memory, tool governance, and code execution.

Models: Gemini 3 / 3.1 Pro by default; Claude Sonnet 4.6, Claude Opus 4.6/4.7, GPT-OSS 120B in Antigravity; Vertex Model Garden adds Anthropic + Mistral + open-weight
Antigravity: Manager Surface for parallel agents, browser preview with verifiable artifacts, MCP support (added early 2026)
Vertex Agent Engine: managed runtime with sessions, memory, tool governance; $0.0864 / vCPU-hr + $0.0090 / GB-hr for code interpreter
Antigravity pricing: free public preview (rate-limited), AI Pro $20/mo, AI Ultra $249.99/mo
Vertex pricing: token-based (matches Gemini API rates), pay-as-you-go on GCP
Enterprise: VPC-SC, CMEK, IAM, audit logging, regional endpoints (Vertex side)

Limitations: Two distinct products with different audiences — Antigravity is coding-IDE-focused and still in public preview, while Vertex Agent Builder is the production answer but carries GCP setup and billing complexity. Antigravity rate limits have tightened post-launch and credit pricing on paid tiers is opaque. Vertex requires GCP comfort. The two surfaces don't yet share a unified developer story.

Agent IDEProduction RuntimeFree Preview (IDE)

OpenAI's full agent toolkit, built on top of the Responses API. Agent Builder is a visual canvas with drag-and-drop nodes, guardrails, and version history. Agents SDK (Python & JS) is the code-first equivalent for multi-agent workflows. ChatKit embeds chat-based agent UIs into your product. Connector Registry centrally manages tools and data connections. Assistants API remains for legacy workloads but Agent Builder is the recommended path for new work.

Models: GPT-5.5, GPT-5, GPT-5 mini, GPT-5-Codex, o-series reasoning models
Agent Builder: visual canvas with versioning, eval configuration, preview runs
Agents SDK: lightweight multi-agent framework (open source), handoffs, tracing
ChatKit: drop-in chat UI components for embedding agents in your product
Connector Registry: admin-managed tool/data connections across OpenAI products
Pricing: token-based at model rates (no separate API fees); tool meters: Code Interpreter $0.03/session, File Search $0.10/GB/day, web search included in tool budget
Built-in evaluation: datasets, trace grading, automated prompt optimization, third-party model support

Limitations: Tightest fit when you're already on OpenAI models — multi-vendor orchestration requires you to wire it up yourself. Agent Builder visual canvas is newer than enterprise alternatives like Copilot Studio. Pricing is straightforward at the API level but tool meters and storage can add up on large file-search workloads. Assistants API is being deprioritized in favor of the Responses-API-based stack — new builds should default to AgentKit.

Visual + CodeOpen SDK

The dominant open-source agent stack. LangGraph (MIT license, free) is a low-level graph-based orchestration framework for stateful, multi-actor agents. LangSmith is the paid observability and deployment layer — tracing, evals, and managed agent hosting. LangSmith Deployment (formerly LangGraph Platform, renamed October 2025) runs your agents at scale.

Provider-agnostic: works with Anthropic, OpenAI, Google, Mistral, open-weight, anything with an API
LangGraph: stateful graphs, single/multi/hierarchical agents, human-in-the-loop checkpoints, native streaming, built-in memory
LangSmith Developer: free, 5,000 traces/mo, 14-day retention, 1 seat
LangSmith Plus: $39/seat/mo, 10,000 traces, $2.50/1K overage, custom dashboards, evals
LangSmith Enterprise: custom pricing, SSO, dedicated support, custom retention
LangSmith Deployment: $0.005 per Deployment Run, custom for high volume
Largest open-source agent ecosystem — thousands of integrations, templates, and community tools

Limitations: Code-first only — no drag-and-drop builder. Steeper learning curve than the managed platforms; you assemble the pieces yourself. LangChain (the broader framework) has historical reputation issues for over-abstraction; LangGraph is the cleaner, more focused successor. Self-hosting requires real infrastructure work; LangSmith Deployment exists exactly to avoid this. No native enterprise CRM/M365 integrations — you build them.

Open SourceFree TierCode-Only

Best for: Enterprises whose IT, HR, customer service, or shared services backbone runs on ServiceNow and want autonomous agents that reason over the existing CMDB, knowledge bases, and process records — not just chat with end users. Strongest case where ticket volume is high and process patterns are stable.

Thousands of pre-configured agents shipped across ITSM, HRSD, CSM, Field Service, Finance, Legal, Procurement, and industry verticals
AI Agent Studio (low-code / natural language) for custom agents; Build Agent for IDE-out development
Plan / act / reflect orchestration grounded in Workflow Data Fabric and Context Engine
AI Control Tower for centralized governance, observability, and cost attribution
BYO LLM on Prime tier (April 2026 retiering); Now LLMs and partner models on lower tiers
Industry agent libraries: card disputes, insurance claims, telecom service activation, public-sector case workflows

Limitations: Agent quality is heavily dependent on Workflow Data Fabric and Context Engine grounding — without a clean data layer and modeled policies, agents hallucinate or refuse to answer. Biggest payoff when ServiceNow is already the system of record for the workflow being automated; less compelling for orgs running ServiceNow only as an ITSM tool. Pricing remains NDA — included in Foundation / Advanced / Prime tiers but token-pool overage can compound quickly.

Workflow-NativePrime tier for BYO-LLM

Capability	Cowork / Claude SDK	Agentforce	Copilot Studio	Antigravity / Vertex	OpenAI AgentKit	LangGraph + LangSmith	ServiceNow AI Agents
Free / starter tier	Cowork on Pro $20/mo	Foundations free	Tenant license required	Antigravity free preview	API + free tools quota	LangGraph MIT free	No (SN tenant required)
Entry paid tier	$20/mo (Pro)	$2 / conversation	$200 / 25K credits / mo	$20/mo (AI Pro)	Token-based only	$39/seat/mo (LangSmith Plus)	NDA (Foundation+)
Enterprise pricing	Custom (Enterprise)	$125–$650/user/mo	Pre-purchase commit	Custom (Vertex)	Token + tool meters	Custom (LangSmith Ent.)	NDA (sales-quoted)
Heavy-user model	Max 20x $200/mo	Agentforce 1 Editions $550/user	Copilot Credit packs	AI Ultra $249.99/mo	Pure usage-based	Per-run $0.005 (Deployment)	Prime tier
No-code / low-code	Cowork only (end-user)	Agent Builder + Script	Drag-and-drop topics	Manager Surface (IDE)	Visual Agent Builder	Code-only	AI Agent Studio (natural lang.)
Code-first SDK	Claude Agent SDK (Py/TS)	Apex + APIs	Power Fx + APIs	Vertex SDK	Agents SDK (Py/JS)	LangGraph (Py/JS)	Build Agent (IDE-out)
Visual canvas	No	Yes	Yes (full)	No	Yes (Agent Builder)	No	AI Agent Studio
Model selection	Anthropic only	Atlas + BYO via Models API	Built-in + Foundry BYO	Gemini + Claude + GPT-OSS	OpenAI only (eval supports 3rd-party)	Any provider	Now LLMs + partners; BYO on Prime
Frontier model access	Opus 4.7, Sonnet 4.6, Haiku 4.5	Atlas + chosen	GPT-5.5, Claude, Gemini	Gemini 3 Pro, Claude, GPT-OSS	GPT-5.5, GPT-5-Codex, o-series	Whatever you connect	Now LLMs / partner routed
Native MCP support	First-class	Yes	Connector Hub	Antigravity (2026), Vertex tools	Connector Registry	Yes	No public MCP docs
Pre-built connectors	Via MCP only	CRM + Data Cloud + Slack + MuleSoft	1,500+ Power Platform	Vertex Model Garden	Connector Registry	Largest OSS ecosystem	ITSM/HRSD/CSM/Field/Finance/Legal
Code execution / sandbox	Bash + Computer Use	Apex + Flows	Power Automate	Vertex code interpreter	$0.03/session	Bring your own	Now Platform Flows
Context window	200K–1M (Claude)	Per-model	Per-model	1M (Gemini 3 Pro)	Per-model	Per-model	CMDB/KB/WDF-grounded
Long-term memory	Skills + prompt cache	Data Cloud grounding	Dataverse + M365 Graph	Vertex Agent Engine sessions	Vector store + File Search	Built-in memory primitives	Workflow Data Fabric + Context Engine
Human-in-the-loop	Hooks + plan mode	Approval flows	Adaptive cards	Manager Surface	Guardrails + approvals	Native checkpoints	AI Control Tower escalation
Managed runtime	Managed Agents (beta)	Salesforce platform	M365 / Power Platform	Vertex Agent Engine	OpenAI Platform	LangSmith Deployment	Now Platform (SaaS)
Self-host option	Run SDK anywhere	No	No	Vertex on GCP only	No	Yes (full self-host)	No (SaaS only)
End-user UI included	Cowork app	Agent UX in Salesforce	Teams / M365 / web	Antigravity IDE	ChatKit embeds	Build your own	EmployeeWorks + record pages
Tracing / observability	OpenTelemetry (Cowork)	Atlas step-level audit	Purview + Application Insights	Vertex eval + logging	Trace grading + datasets	LangSmith (full traces)	AI Control Tower
Eval framework	SDK + custom	Built-in	Built-in	Vertex Evaluation Service	Auto prompt optimization	First-class evals	AI Control Tower metrics
SSO / SCIM / RBAC	SCIM RBAC (Cowork)	Native Salesforce	Entra ID native	Vertex IAM	SSO + RBAC	SSO on Enterprise	Native ServiceNow
Compliance posture	SOC 2, HIPAA (Ent.)	SOC, HIPAA, FedRAMP, Hyperforce	SOC, HIPAA, FedRAMP, EU Boundary	VPC-SC, CMEK, HIPAA (Vertex)	SOC 2, HIPAA	SOC 2 (LangSmith)	FedRAMP, SOC 2, HIPAA

The biggest split is where the data already lives. Agentforce is dramatically more valuable if your customer data is in Salesforce. Copilot Studio is dramatically more valuable if your work happens in Microsoft 365. AgentKit and Cowork shine for product-feature agents and end-user assistants. LangGraph wins when no managed platform fits and you need full control or multi-provider orchestration. Pricing models are not comparable apples-to-apples. Agentforce charges per conversation or per action. Copilot Studio charges per credit (with opaque per-action consumption). OpenAI charges per token plus tool meters. LangGraph is free but you pay for whatever models and infrastructure you use. Build a back-of-envelope cost model with your expected volume before picking on price alone — sticker prices are misleading at scale. Antigravity is mainly a coding-agent IDE, not a general production agent platform. For Google-side production agents, Vertex AI Agent Builder / Agent Engine is the answer — we list them together since they share the same Gemini 3 model lineup and Google's enterprise governance, but they target different jobs.

Customer-data-aware sales / service agents on top of CRMSalesforce Agentforce

Internal productivity agents inside Teams, Outlook, SharePointMicrosoft Copilot Studio

Embed an agent UI directly into your SaaS productOpenAI AgentKit + ChatKit

Code-first multi-agent system with full provider choiceLangGraph

Build a coding agent on Claude's runtime (subagents, hooks, skills)Claude Agent SDK

Non-developer wants to spin up a research/writing agentAnthropic Cowork

Production agent without managing infrastructureClaude Managed Agents or LangSmith Deployment

Visual canvas with versioning and built-in evalsOpenAI Agent Builder

Strict compliance: HIPAA, FedRAMP, EU data boundaryAgentforce, Copilot Studio, or Vertex

Multi-provider model orchestration (Claude + GPT + Gemini)LangGraph or Copilot Studio

Self-hosted, on-premise agent runtimeLangGraph (only option)

Enterprise audit trail of every agent decisionAgentforce (Atlas logging) or LangSmith

Voice agents on phone / mobile / web channelsSalesforce Agentforce Voice

Long-context agent loops (1M-token windows)Claude Agent SDK or Vertex (Gemini 3 Pro)

Fastest path from idea to working prototypeOpenAI Agent Builder or Cowork

Maximum cost predictability at high volumeCopilot Credit Pre-Purchase or Agentforce Add-ons

Build coding agents in an IDE with parallel executionGoogle Antigravity

Our Recommendation

For most of our client work, the answer follows where the data lives. Salesforce shops get Agentforce — native CRM and Data Cloud grounding plus Atlas's audit trail justify the premium. Microsoft shops get Copilot Studio for the same reason on the M365 side. For product-embedded agents — chat in your own app, AI features in your SaaS — OpenAI AgentKit is the fastest path. When we're building multi-step agentic systems on Claude (research, code, ops), we reach for the Claude Agent SDK directly — subagents, hooks, and skills are uniquely powerful. And when no managed platform fits the shape of the problem — multi-provider orchestration, on-prem deployment, or unusual control flows — LangGraph + LangSmith is the open-source backbone we trust.

AI Agents

Anthropic Cowork & Claude Agent SDK

Salesforce Agentforce

Microsoft Copilot Studio

Google Antigravity (+ Vertex Agent Builder)

OpenAI AgentKit (+ Assistants API)

LangChain / LangGraph + LangSmith

ServiceNow AI Agents

Our Recommendation