arcade-mcp

Author	SHA1	Message	Date
Francisco Or Something	dc4607daa4	feat(telemetry): add developer messages to tool error spans (#831 ) ## Summary - Add shared span attributes for tool error diagnostics, including developer-facing messages when present. - Wire those attributes through MCP server, worker RunTool, and HTTP CallTool spans while keeping default MCP response content public-only. - Cover no-leak response behavior, non-recording spans, outputless worker responses, and the shared attribute contract. ## Verification - `uv run ruff format ...` - `uv run ruff check ...` - `uv run pytest -W ignore libs/tests/arcade_mcp_server/test_debug_exposure_integration.py libs/tests/core/test_log_extras.py libs/tests/worker/test_worker_base.py` Made with [Cursor](https://cursor.com) <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Adds new telemetry attributes that propagate tool error messages (including optional developer_message) into active spans across MCP server and worker execution paths; risk is mainly around potential leakage of sensitive developer messages into tracing backends and changes to observability contracts. > > Overview > Adds a shared `arcade_core.log_extras.build_tool_error_span_attributes()` helper and wires it into tool error paths so the current OpenTelemetry span is annotated with stable `tool_error_*` attributes (including `developer_message` when present). > > MCP tool calls now record these span attributes on failure while keeping default MCP response content sanitized, and `arcade-serve` records the same attributes on both `RunTool` and HTTP `CallTool` spans (handling `output=None`). Versions and dependency constraints are bumped to consume the new core helper, with tests added/updated to lock the span-attribute contract and verify behavior for non-recording spans and no-leak responses. > > <sup>Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit 33a53991d72140a662152f508dc53e9b769b9f07. Bugbot is set up for automated code reviews on this repo. Configure [here](https://www.cursor.com/dashboard/bugbot).</sup> <!-- /CURSOR_SUMMARY -->	2026-04-29 20:41:07 -03:00
Francisco Or Something	d9812621de	feat: add NetworkTransportError for no-response HTTP failures (#823 ) ## Summary - Adds `NetworkTransportError` — a new sibling to `UpstreamError` under `ToolExecutionError` — for failures where no complete HTTP response was received from the upstream service (timeouts, connection errors, pool exhaustion, DNS failures, decoding issues, redirect exhaustion) - Routes client-construction bugs (`InvalidURL`, `UnsupportedProtocol`, `MissingSchema`, `SSLError`, `InvalidHeader`, etc.) to existing `FatalToolError` instead of `UpstreamError` - Adds 3 new `ErrorKind` values: `NETWORK_TRANSPORT_RUNTIME_TIMEOUT`, `_UNREACHABLE`, `_UNMAPPED` — operationally distinct telemetry slices matching the UpstreamError pattern - `UpstreamError` is unchanged and reserved for real HTTP responses with status codes Addresses Eric's feedback on #820: the `include_status_code=False` post-init null-out workaround is replaced by a clean class hierarchy where `NetworkTransportError.status_code` is natively `None`. ### Changes \| File \| What \| \|---\|---\| \| `arcade-core/errors.py` \| 3 new `ErrorKind` values, `NetworkTransportError` class, `is_network_transport_error` helper \| \| `arcade-tdk/providers/http/error_adapter.py` \| Full rewrite of httpx + requests exception routing with 3-way split \| \| `arcade-tdk/providers/graphql/error_adapter.py` \| `TransportConnectionFailed`/`TransportProtocolError` → `NetworkTransportError` \| \| `arcade-tdk/errors.py`, `arcade-mcp-server/exceptions.py` \| Re-exports \| \| `pyproject.toml` × 3 \| Version bumps: core 4.7.0, tdk 3.7.0, mcp-server 1.20.0 \| \| Tests × 3 \| 33 new tests, 3 updated (2659 passed, 0 failures) \| ### Exception routing table \| Exception \| Target \| Kind \| can_retry \| \|---\|---\|---\|---\| \| `httpx.HTTPStatusError`, `requests.HTTPError` (with response) \| `UpstreamError` \| status-derived \| status-derived \| \| `httpx.TimeoutException`, `requests.Timeout` \| `NetworkTransportError` \| `TIMEOUT` \| ✅ \| \| `httpx.TransportError`, `requests.ConnectionError` \| `NetworkTransportError` \| `UNREACHABLE` \| ✅ \| \| `httpx.DecodingError`, `TooManyRedirects`, fallback \| `NetworkTransportError` \| `UNMAPPED` \| varies \| \| `httpx.InvalidURL`/`UnsupportedProtocol`/`LocalProtocolError`, `requests.MissingSchema`/`SSLError`/etc. \| `FatalToolError` \| `TOOL_RUNTIME_FATAL` \| ❌ \| ### Engine companion PR ArcadeAI/monorepo — `feat/network-transport-error-kinds` adds the 3 `ErrorKind` constants to Go schemas + OpenAPI docs. No engine logic changes needed (ErrorKind is a string alias, retry uses `can_retry` flag only, telemetry auto-slices). ## Test plan - [x] 2659 existing tests pass (0 failures) - [x] 33 new routing + class tests added - [x] mypy clean on arcade-core, arcade-tdk - [ ] Verify engine telemetry dashboard auto-surfaces new `NETWORK_TRANSPORT_` kinds after deploy 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk* > Changes the error taxonomy and classification helpers used for retries/telemetry, so misclassification could affect operational behavior, but the change is additive and covered by new tests. > > Overview > Adds a new error category for outbound request failures that never yield a complete upstream response: `NetworkTransportError` (sibling to `UpstreamError`) plus `ErrorKind.NETWORK_TRANSPORT_RUNTIME_{TIMEOUT,UNREACHABLE,UNMAPPED}` and matching `is_network_transport_error` classification helpers on both `ToolkitError` and the wire-model `ToolCallError`. > > Re-exports `NetworkTransportError` from `arcade-tdk` and `arcade-mcp-server`, bumps package versions (`arcade-core` 4.7.0, `arcade-tdk` 3.7.0, `arcade-mcp-server` 1.20.0) and dependency minimums, and expands `core/test_errors.py` to cover the new kind invariants/defaults and classification behavior. > > <sup>Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit d2b89078729c6a67ba42684dc98445352238bc1d. Bugbot is set up for automated code reviews on this repo. Configure [here](https://www.cursor.com/dashboard/bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 18:29:13 -03:00
Francisco Or Something	1492c80fc5	TOO-627: Improve error messages for agents and Datadog (#814 ) ## Summary - Improve tool call error messages across 4 libraries (arcade-core, arcade-tdk, arcade-mcp-server, arcade-serve) so agents can self-correct and Datadog can facet on structured fields - Guard empty error messages, enrich input validation errors with field-level detail, fix `@tool` decorator fallback formatting, surface `additional_prompt_content` in MCP responses, and add structured log extras for Datadog - Addresses the 3 worst error patterns: generic "Error in tool input deserialization", bare `KeyError` values, and empty `FatalToolError` messages Linear: TOO-627 Plan: `docs/plans/2026-04-08-improve-error-messages-handoff.md` ## Tasks - [ ] Task 1: Guard empty error messages (arcade-core) - [ ] Task 2: Enrich input validation error messages (arcade-core) - [ ] Task 3: Improve `@tool` decorator error fallback (arcade-tdk) - [ ] Task 4: Fix MCP agent-facing error response (arcade-mcp-server) - [ ] Task 5: Add structured log extras in BaseWorker (arcade-serve) - [ ] Task 6: Add structured log extras in MCP server (arcade-mcp-server) ## Test plan - [ ] Each task has dedicated unit tests verifying the new behavior - [ ] `make test` passes after all tasks - [ ] `make check` (ruff + mypy) passes - [ ] Verify the 3 worst error patterns now produce actionable messages 🤖 Generated with [Claude Code](https://claude.com/claude-code) <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Touches cross-library error formatting and logging behavior used in production tool execution paths; while mostly additive/guardrails, it changes agent-visible messages and Datadog log facets, which could impact client expectations and alerting. > > Overview > Improves tool-call error handling across core/runtime, MCP transport, worker transport, and the TDK to make agent-visible failures more actionable while reducing sensitive-data leakage. > > In `arcade-core`, empty error messages now get placeholders, `ToolOutputFactory.fail` defaults blank messages, and input validation errors are rewritten as field-level summaries that intentionally omit rejected values (avoiding Pydantic echo of secrets). The `@tool` fallback in `arcade-tdk` no longer surfaces `str(exception)` to agents; it returns exception type-only* in `message` while preserving full detail in `developer_message`. > > Adds a shared `build_tool_error_log_extra` helper and updates `arcade-serve` + `arcade-mcp-server` to emit consistent structured WARNING logs (`error_*`, `tool_name`, optional toolkit/version) for Datadog, while MCP error responses now append `additional_prompt_content` and force `structuredContent=None` on failures per spec. Includes extensive new tests and bumps package versions (`arcade-core` 4.6.2, `arcade-tdk` 3.6.1, `arcade-mcp-server` 1.19.3, `arcade-serve` 3.2.3). > > <sup>Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit e5c7ebcaf56176cfbd8e6d1f2b6295352abd0ec0. Bugbot is set up for automated code reviews on this repo. Configure [here](https://www.cursor.com/dashboard/bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 20:10:51 -03:00
Eric Gustin	3204201360	fix: TypedDict total=False output breaks validation (#816 ) When a tool’s output TypedDict uses total=False, MCP clients reject the response with: ``` MCP error -32602: Structured content does not match the tool's output schema ``` Note that the bug also exists for the Engine transport (/worker/tools/execute), but since the engine doesn't validate the output schema, the bug never surfaced. This PR addresses the problem holistically (MCP and Engine) in preparation for a future where the Engine transport validates output schemas. Two bugs combined to cause this: 1. Schema: The outputSchema had no required array and declared all fields as strict types (e.g. "type": "string"), making every field look mandatory and non-null. 2. Serialization: model_dump() on TypedDict-derived Pydantic models emitted None for absent optional fields. A tool returning {"name": "hello"} produced {"name": "hello", "optional_field": null} which is a value the schema forbids. <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Adjusts core schema generation and MCP JSON Schema conversion for TypedDicts, affecting how tool input/output contracts are emitted and validated across clients; mistakes could break compatibility or validation behavior. > > Overview > Fixes MCP/engine validation failures for `TypedDict(total=False)` outputs by ensuring absent optional keys are omitted from serialized output and that emitted schemas correctly describe required vs optional keys. > > `arcade-core` now tracks `required_keys`/`inner_required_keys` and per-field `nullable` in `ValueSchema`, derives required sets from TypedDict `__required_keys__`, and unwraps `Optional[T]` to support optional nested TypedDicts; TypedDict-derived Pydantic models now `model_dump(exclude_unset=True)` to avoid leaking missing fields as `null`. > > `arcade-mcp-server` JSON Schema conversion now emits `required` arrays (including for arrays of objects), supports `nullable` by generating `type: [<type>, "null"]` (and `enum` including `None`), and treats nullable top-level objects as valid unwrapped output schemas. Adds focused unit/end-to-end tests plus an expanded example server demonstrating total-false, mixed required/optional, nullable, and optional-nested TypedDict outputs, and bumps package versions/dependencies accordingly. > > <sup>Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit 53fe8365f613053599130520b75f30b614b465ca. Bugbot is set up for automated code reviews on this repo. Configure [here](https://www.cursor.com/dashboard/bugbot).</sup> <!-- /CURSOR_SUMMARY -->	2026-04-09 17:47:57 -07:00
jottakka	fe8ddfd500	[TOO-326] Windows papercuts (#768 ) <!-- CURSOR_SUMMARY --> > [!NOTE] > Medium Risk > Touches authentication/login flow, credentials-file permissions, and subprocess lifecycle behavior across platforms; while mostly defensive, regressions could impact login or process management on Windows/macOS runners. > > Overview > Improves Windows/cross-platform reliability across the CLI and MCP server: OAuth login now binds the callback server to `127.0.0.1`, avoids slow loopback reverse-DNS, adds a configurable callback timeout (`--timeout` + env default), and opens URLs via a Windows-friendly `_open_browser` to avoid flashing console windows. > > Centralizes CLI output via a shared `console` that forces UTF-8 on Windows, standardizes UTF-8 file reads/writes throughout, tightens credentials-file permissions on Windows using `icacls`, and adds shared Windows subprocess helpers for no-window process creation and graceful termination (used by `deploy`, MCP reload, and usage-tracking worker). > > Updates client configuration UX/robustness (Windows AppData resolution via `platformdirs`, Cursor config path fallbacks + compatibility writes, overwrite warnings, absolute `uv` path for GUI clients, safer path display) and improves `deploy` child-process handling to avoid pipe-buffer deadlocks while giving better debug-aware error messages. > > Expands CI to run tests on Linux/Windows/macOS, adds a no-auth CLI integration workflow, disables usage tracking in toolkits CI, and adds extensive regression tests for Windows signals, subprocess cleanup, UTF-8, and config-path edge cases; bumps `arcade-core` to `4.4.2` and `arcade-mcp-server` to `1.17.2` (with updated dependency pin). > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit 0fabd8ca1cd647039ba6ddbdf3f7809c330bab9e. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY -->	2026-02-25 13:18:16 -03:00
Eric Gustin	a4160dd9fe	Four bug fixes (#754 ) 1. Resolves [TOO-363](https://linear.app/arcadedev/issue/TOO-363/arcade-deploy-fails-when-additional-deps-are-added-to-the-server). 2. Resolves [TOO-364](https://linear.app/arcadedev/issue/TOO-364/arcade-cores-tool-skip-logic-is-missing-case-for-direct-execution). 3. Resolves [TOO-358](https://linear.app/arcadedev/issue/TOO-358/missing-evals-error-message-shows-wrong-command). 4. Resolves [TOO-365](https://linear.app/arcadedev/issue/TOO-365/arcade-evals-unit-tests-are-hanging). <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Medium Risk > Medium risk because it changes how `arcade deploy` spawns the server process and adjusts toolkit discovery skip logic, which can affect deployments and tool discovery; however, the changes are small and covered by new unit/integration tests. > > Overview > `arcade deploy` now starts the validation server using the project’s `.venv` interpreter (via `find_python_interpreter`) instead of the CLI’s own `sys.executable`, preventing missing dependency failures when the CLI is installed in an isolated env. > > `arcade-core`’s `Toolkit.tools_from_directory` skip logic is hardened to also skip the currently executing entrypoint by module name (`__main__.__spec__.name`) when file paths don’t match (e.g., bundled execution). CLI error printing now escapes plain messages to avoid rich markup issues, and `arcade-evals` lock acquisition accepts an optional timeout default. > > Adds unit tests for the new toolkit skip behavior and an integration test that boots the MCP server via direct Python invocation to mirror deployment behavior, and bumps `arcade-core`, `arcade-mcp-server`, and root dependency versions accordingly. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit e7785634c231c059f2e0bd1bc73a56bd7470a494. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY -->	2026-01-29 15:12:06 -08:00
jottakka	98fad93d21	Adding MCP Servers supports to Arcade Evals (#689 ) # MCP Server Tool Evaluation Support ## Overview Add support for evaluating tools from remote MCP servers without requiring Python callables. Enables direct evaluation of any MCP-compatible tool server. ## What's New ### Core Features - `MCPToolRegistry`: Evaluate tools from a single MCP server - `CompositeMCPRegistry`: Evaluate tools from multiple MCP servers simultaneously - Automatic loaders: `load_from_stdio()` and `load_from_http()` to fetch tools from running servers - Automatic namespacing: Tools prefixed with server name (e.g., `server_tool_name`) - Smart name resolution: Use short names if unique, full names if ambiguous - OpenAI strict mode: Automatic schema conversion prevents parameter hallucinations ### Usage Automatic Loading: ```python from arcade_evals import load_from_stdio, MCPToolRegistry # Load tools automatically from MCP server tools = load_from_stdio(["npx", "-y", "@modelcontextprotocol/server-github"]) registry = MCPToolRegistry(tools) ``` Single MCP Server: ```python from arcade_evals import MCPToolRegistry, ExpectedToolCall registry = MCPToolRegistry(mcp_tools) suite = EvalSuite(catalog=registry) suite.add_case( expected_tool_calls=[ ExpectedToolCall(tool_name="tool_name", args={...}) ] ) ``` Multiple MCP Servers: ```python from arcade_evals import CompositeMCPRegistry, load_from_stdio # Load from multiple servers github_tools = load_from_stdio(["npx", "-y", "@modelcontextprotocol/server-github"]) slack_tools = load_from_stdio(["npx", "-y", "@modelcontextprotocol/server-slack"]) composite = CompositeMCPRegistry( tool_lists={ "github": github_tools, "slack": slack_tools, } ) suite = EvalSuite(catalog=composite) suite.add_case( expected_tool_calls=[ ExpectedToolCall(tool_name="github_list_issues", args={...}) ] ) ``` ## Implementation ### Files Changed - `libs/arcade-evals/arcade_evals/registry.py` (NEW): Registry abstractions and implementations - `libs/arcade-evals/arcade_evals/loaders.py` (NEW): Automatic tool loading from MCP servers - `libs/arcade-evals/arcade_evals/eval.py` (MODIFIED): Enhanced `ExpectedToolCall` and evaluation logic - `libs/arcade-evals/arcade_evals/__init__.py` (MODIFIED): Exported new registries and loaders ### Key Technical Details - Added `BaseToolRegistry` interface for abstraction - `MCPToolRegistry` handles single server tools - `CompositeMCPRegistry` manages multiple servers with collision detection - `load_from_stdio()` and `load_from_http()` for automatic tool discovery - Fixed name normalization bug: MCP tools use underscores (not dots) - Optimized tool copying: 2.5x faster via shallow copy ## Testing - ✅ 41 tests passing (25 new tests added) - ✅ `test_eval_mcp_registry.py`: MCPToolRegistry functionality - ✅ `test_eval_composite_mcp.py`: CompositeMCPRegistry with multiple servers - ✅ Verified backward compatibility with Python tools ## Backward Compatibility ✅ 100% backward compatible - No breaking changes ## Breaking Changes None <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Adds end-to-end eval UX: examples, a robust CLI runner, and rich outputs. > > - New examples: `eval_arcade_gateway.py`, `eval_stdio_mcp_server.py`, `eval_http_mcp_server.py`, `eval_comprehensive_comparison.py` with timeouts, error handling, and track-based comparisons; detailed `README.md` > - CLI runner: `arcade_cli/evals_runner.py` to execute evals/capture in parallel with progress, error isolation, failed-only filtering, context inclusion, and multi-provider/model support > - Output formatters: `arcade_cli/formatters/` (txt, md, html, json) for evals and capture; comparative and multi-model HTML with tabs and context rendering > - Display refactor: `display.py` now supports writing multiple formats, failed-only disclaimers, include-context, and improved console summaries > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit ff8acf9c34a6b61462a019a1ee9df081006517d0. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> --------- Co-authored-by: Francisco Liberal <francisco@arcade.dev> Co-authored-by: Mateo Torres <torresmateo@gmail.com>	2026-01-07 20:26:23 -03:00
Eric Gustin	c15c07e12f	Better Handling of MCP-specific `Context` usage for managed servers (#679 ) Since servers managed by Arcade use the `/worker` routes under the hood, tools that use MCP-specific properties of `Context` will fail. This PR helps reduce the 'blast radius' of the above fact. For properties that were deemed 'non-critical' to the execution of a deployed tool, we simply no-op. For properties that were deemed 'critical' to the execution of a deployed tool, we raise an error that informs the caller that the feature is not supported for Arcade managed servers. - Non-critical property: A context property that returns None - Critical property: A context property that may return something that could be necessary for a tool execution to succeed.	2025-11-07 10:26:56 -08:00
Eric Gustin	e727af3a21	Fix MCP capabilities, examples, tests, and more (#657 ) # PR Description Consider this PR the result of a full pass through of this repository. ## Add helper for adding tools to an `MCPApp` You can now add all of the tools in a module to an `MCPApp` via `app.add_tools_from_module(...)` ## Edit what `arcade new` generates First, I updated the backend to use hatchling. Second, the structure generated before this PR was simple, but did not create a proper Python module. This hindered developers in the following ways: 1. Difficult to add the tools in your server to an evaluation suite 2. Difficult to add more than one tool to an MCPApp at a time 3. All other niceties that come with being able to import modules ``` # Before server/ ├── .env.example ├── server.py └── pyproject.toml ``` This PR updates the structure generated such that a valid Python module is generated: ``` # After server/ ├── pyproject.toml └── src/ └── server/ ├── __init__.py ├── .env.example └── server.py ``` ## Fix Tool Chaining `self._ctx.server.executor.run(...)` was being called, but `MCPServer` does not have an instance of `ToolExecutor` (and it's not intended to be an instance anyways). I updated `Tool.call_raw` to pass the programmatic tool call through the `MCPServer._handle_call_tool`. This means that the programmatic tool calls now go through the same steps that a typical tool call (initiated by the MCP client) would. This means that toolA, which specifies requirementsA, is permitted to call toolB, which specifies requirementsB, without needing to explicitly declare or satisfy requirementsB. I believe this is acceptable because the secrets and/or auth token associated with toolB's `Context` are not exposed to toolA, and the secrets and/or auth token associated with toolA's `Context` are not exposed to toolB. ## Fix User Elicitation 1. The read & write streams were created with a maximum queue size of 0. I increased this to 100. 2. I updated `ServerSession`'s run loop to both read messages from the stream & process them concurrently. This enables server initiated requests (like user elicitation and progress reporting) to be handled while tools are being executed. Otherwise, the server initiated requests would wait for the tool to finish executing and the tool execution would wait for the server initiated request to finish. 3. ## Fix Progress Reporting Progress tokens sent by the client were not being stored. Therefore there was no way to notify a client with progress updates. I am now storing the `progressToken`, along with other `_meta` sent from the client, in the `ServerSession`'s `_request_meta`. I am setting `_request_meta` whenever the `MCPServer` is handling an incoming message from a client. ## Fix handling of server names with spaces Before: Server name: "The simple server name" Tool name: whisper_secret Name seen by client: "The_simple_server_name_WhisperSecret" After Server name: "The simple server name" Tool name: whisper_secret Name seen by client: "TheSimpleServerName_WhisperSecret" ## Add Integration Tests The stdio integration test is much more comprehensive than the http integration test. These tests will let me sleep a bit more at night ## Add Example MCP Servers Example servers for sampling, user-elicitation, progress reporting, logging, tool chaining, combining prebuilt tools with custom tools, tool secrets, tool auth, evaluations, and more! ## Add Docker template Added a Docker template for running an MCP server in Docker (and removed the old docker stuff)	2025-10-30 11:59:00 -07:00
Eric Gustin	49e53d2b33	Server start events (#635 ) 1. Refactored the core usage logic from `arcade_cli` to `arcade_core` 2. Add "MCP server started" event As always, opt out by setting `ARCADE_USAGE_TRACKING` to 0.	2025-10-22 16:14:52 -07:00
Eric Gustin	3d2665d36c	Rename some 'toolkit' references to 'server' (#624 ) There are many more instances of toolkit within this repo, but the goal of this PR is to get rid of user facing references as much as possible. --------- Co-authored-by: Nate Barbettini <nate@arcade.dev>	2025-10-14 18:42:27 -07:00
Eric Gustin	3424ec8219	MCP Local (#563 ) Versions: * arcade-mcp\==1.0.0rc1 * arcade-mcp-server\==1.0.0rc1 * arcade-core\==2.5.0rc1 * arcade-tdk\==2.6.0rc1 * arcade-serve\==2.2.0rc1 ### Summary Adds first-class MCP support across Arcade, introduces a new MCP server and CLI, unifies the project under the arcade-mcp name, overhauls templates/scaffolding, and improves developer tooling, secrets management, and examples. ### Highlights - MCP Server & Core - New MCP server with stdio and HTTP/SSE transports, session management, resumability, and lifecycle handling. - FastAPI-like `MCPApp` for building servers with lazy init; integrated worker+MCP HTTP app option. - Middleware system (logging and error handling), robust exception hierarchy, and Pydantic-based settings. - Async-safe managers for tools, resources, and prompts backed by registries and locks. - Developer-facing, transport-agnostic runtime context interfaces (logs, tools, prompts, resources, sampling, UI, notifications). - Conversion from Arcade ToolDefinition to MCP tool schema; OpenAI JSON tool schema converter. - Parser supports `@app.tool`/`@app.tool(...)` decorators. - CLI - New `mcp` command to run MCP servers with stdio or HTTP/SSE. - New `secret` command to set/list/unset tool secrets (supports .env input, preserves original casing for lookups). - `new` command refactored; option to create a full toolkit package with scaffolding. - `chat` command removed. - `serve.py` imports updated to `arcade_serve.fastapi.telemetry`; version retrieval now uses `arcade-mcp`. - `show.py` refactor to use new local catalog utilities. - `display_tool_details` improved: adds “Default” column and handles nested properties. - Configuration & Discovery - New `configure.py` to set up Claude Desktop, Cursor, and VS Code to connect to local or Arcade Cloud MCP servers. - Discovery utilities to find/install toolkits, build `ToolCatalog`s, analyze files for tools, load kits from directories (pyproject parsing), and build minimal toolkits. - Better handling of provider API key resolution and evaluation suite loading. - Templates & Scaffolding - Reorganized template structure (minimal vs full); moved `.pre-commit-config.yaml`, `.ruff.toml`, license, Makefile, README, tests, and tools layout to correct paths. - Minimal template adds `.env.example` for runtime secret injection. - Template pyproject updated for MCP servers; includes sample server with greeting and secret-reveal tools. - Authorization flow in templates simplified. - Repo-wide Renaming & Examples - Migrates references from `arcade-ai` to `arcade-mcp` across READMEs, scripts, and package metadata. - Examples updated (LangChain/LangGraph/AI SDK/TypeScript) and package name changed to `arcade-mcp-sdk`. - Evals & Core Utilities - Evals now use OpenAI tooling format (`OpenAIToolList`, `to_openai`); `tool_eval` takes `provider_api_key`. - Core utilities: fixed `does_function_return_value` by dedenting before parse; version bump to `2.5.0rc1` and dependency cleanup. - Tooling & CI - `setup-uv-env` action splits toolkit vs contrib dependency installation. - Pre-commit: excludes `libs/arcade-mcp-server/mkdocs.yml` and `libs/tests/` from YAML and Ruff hooks; Ruff per-file ignores (e.g., C901 in `libs/*/.py`, TRY400 in server docs paths). - Makefile updates for uv env setup, quality checks, tests, builds, and new `shell` target. - Added Makefile to MCP server library to streamline dev workflow. - Cleanup - Removed `claude.json` config. - Simplified stdio entrypoint; removed unused imports (`arcade_gmail`, `arcade_search`). ### Breaking Changes - CLI: `chat` command removed; use `mcp`, `secret`, and updated `new`. - Naming: All users should update references from `arcade-ai` to `arcade-mcp`. - Templates: File paths moved; downstream scripts referencing old template locations may need updates. ### Getting Started - Run an MCP server: - `arcade mcp --stdio --toolkits your_toolkit` - `arcade mcp --http --toolkits your_toolkit` - Manage secrets: - `arcade secret set your_toolkit KEY=value` - `arcade secret list your_toolkit` - `arcade secret unset your_toolkit KEY` - Configure clients: - `arcade configure` to set up Claude Desktop, Cursor, and VS Code for local/Arcade Cloud MCP. --------- Co-authored-by: Sam Partee <sam@arcade-ai.com> Co-authored-by: Shub <125150494+shubcodes@users.noreply.github.com>	2025-09-25 15:28:15 -07:00
Eric Gustin	f4558ef3a8	Tool Error Handling (#539 ) # Improvements to Arcade TDK Error Handling I tried my very best to not make any breaking changes in this PR. So, you will notice various "Deprecation" notices throughout. ### Instructions for PR reviewers 1. Pull down this PR's branch 2. Pull down the Engine's tool error handling PR's branch 3. Update your installed arcadepy to have the following: - In `arcadepy/resources/tools/tools.py`, if you want to test out including stacktraces, then you need to update `ToolsResource.execute` to accept a `include_error_stacktrace` argument and also include the "include_error_stacktrace" argument to the POST to the Engine inside of the function's execute method's body. - In `arcadepy/types/execute_tool_response.py` add the following enum ```py class ErrorKind(str, Enum): """Error kind that is comprised of - the who (toolkit, tool, upstream) - the when (load time, definition parsing time, runtime) - the what (bad_definition, bad_input, bad_output, retry, context_required, fatal, etc.)""" TOOLKIT_LOAD_FAILED = "TOOLKIT_LOAD_FAILED" TOOL_DEFINITION_BAD_DEFINITION = "TOOL_DEFINITION_BAD_DEFINITION" TOOL_DEFINITION_BAD_INPUT_SCHEMA = "TOOL_DEFINITION_BAD_INPUT_SCHEMA" TOOL_DEFINITION_BAD_OUTPUT_SCHEMA = "TOOL_DEFINITION_BAD_OUTPUT_SCHEMA" TOOL_RUNTIME_BAD_INPUT_VALUE = "TOOL_RUNTIME_BAD_INPUT_VALUE" TOOL_RUNTIME_BAD_OUTPUT_VALUE = "TOOL_RUNTIME_BAD_OUTPUT_VALUE" TOOL_RUNTIME_RETRY = "TOOL_RUNTIME_RETRY" TOOL_RUNTIME_CONTEXT_REQUIRED = "TOOL_RUNTIME_CONTEXT_REQUIRED" TOOL_RUNTIME_FATAL = "TOOL_RUNTIME_FATAL" UPSTREAM_RUNTIME_BAD_REQUEST = "UPSTREAM_RUNTIME_BAD_REQUEST" UPSTREAM_RUNTIME_AUTH_ERROR = "UPSTREAM_RUNTIME_AUTH_ERROR" UPSTREAM_RUNTIME_NOT_FOUND = "UPSTREAM_RUNTIME_NOT_FOUND" UPSTREAM_RUNTIME_VALIDATION_ERROR = "UPSTREAM_RUNTIME_VALIDATION_ERROR" UPSTREAM_RUNTIME_RATE_LIMIT = "UPSTREAM_RUNTIME_RATE_LIMIT" UPSTREAM_RUNTIME_SERVER_ERROR = "UPSTREAM_RUNTIME_SERVER_ERROR" UPSTREAM_RUNTIME_UNMAPPED = "UPSTREAM_RUNTIME_UNMAPPED" UNKNOWN = "UNKNOWN" ``` - In `arcadepy/types/execute_tool_response.py` add the following fields to OutputError: ```py kind: ErrorKind status_code: Optional[int] = None stacktrace: Optional[str] = None extra: Optional[dict[str, Any]] = None ``` ### Example Client Usage ```py # Example of handling an upstream rate limit error = response.output.error if error and error.kind == ErrorKind.UPSTREAM_RUNTIME_RATE_LIMIT: sleep_time = error.retry_after_ms / 1000 time.sleep(sleep_time) # and then execute again ``` ```py # Examples of determining what type of runtime error it is error = response.output.error if error: is_retryable_error = error.kind == ErrorKind.TOOL_RUNTIME_RETRY is_a_bug_in_the_tool = error.kind == ErrorKind.TOOL_RUNTIME_FATAL is_additional_context_required = error.kind == ErrorKind.TOOL_RUNTIME_CONTEXT_REQUIRED ``` ### Example Tool Usage ```py # EXAMPLE 1 letting Arcade handle upstream error handling for you reddit_client.post(params) # Arcade's httpx adapter will handle error handling for you! # ------------------------------------ # EXAMPLE 2 handling upstream bad request yourself, but letting Arcade handle the rest try: reddit_client.post(params) except httpx.HTTPStatusError as e: if e.status_code == 400: raise UpstreamError("My extra custom message) from e raise ``` ```py # EXAMPLE 1 letting Arcade handle it for you risky_element = my_risky_list[42] # Arcade will raise a FatalToolError for you # ------------------------------------ # EXAMPLE 2 handling it yourself for extra flexibility try: risky_element = my_risky_list[42] except IndexError as e: raise FatalToolError("My extra custom message") from e ``` ### Non-runtime Error Message Examples Example ToolkitLoadError Messages: ``` - [TOOLKIT_LOAD_FAILED] ToolkitLoadError when loading toolkit 'sample_tool': Could not import module mock_module. Reason: Mock import error - [TOOLKIT_LOAD_FAILED] ToolkitLoadError when loading toolkit 'test_toolkit': Tool 'ValidTool' in toolkit 'test_toolkit' already exists in the catalog. ``` Example ToolDefinitionError Messages ``` - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_missing_description': Tool 'tool_missing_description' is missing a description - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_with_invalid_secret_type': Secret keys must be strings (error in tool ToolWithInvalidSecretType). - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_with_empty_secret': Secrets must have a non-empty key (error in tool ToolWithEmptySecret). - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_with_invalid_metadata_type': Metadata must be strings (error in tool ToolWithInvalidMetadataType). - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_with_metadata_requiring_auth_without_auth': Tool ToolWithMetadataRequiringAuthWithoutAuth declares metadata key 'client_id', which requires that the tool has an auth requirement, but no auth requirement was provided. Please specify an auth requirement. - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_with_empty_metadata': Metadata must have a non-empty key (error in tool ToolWithEmptyMetadata). - [TOOL_DEFINITION_BAD_DEFINITION] ToolDefinitionError in definition of tool 'tool_with_unsupported_param_type': Unsupported parameter type: <class 'test_catalog.MyFancyTestClass'> ``` Example ToolInputSchemaError Messages ``` - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_missing_input_parameter_annotation': Parameter 'input_text' is missing a description - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_no_type_annotation': Parameter param has no type annotation. - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_invalid_param_name': Invalid parameter name: '123invalid' is not a valid identifier. Identifiers must start with a letter or underscore, and can only contain letters, digits, or underscores. - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_too_many_annotations': Parameter param: Annotated[str, 'name', 'desc', 'extra'] has too many string annotations. Expected 0, 1, or 2, got 3. - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_required_union_param': Parameter param is a union type. Only optional types are supported. - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_non_callable_default_factory': Default factory for parameter param: Annotated[str, 'Parameter'] = FieldInfo(annotation=NoneType, required=False, default_factory=str) is not callable. - [TOOL_DEFINITION_BAD_INPUT_SCHEMA] ToolInputSchemaError in definition of tool 'tool_with_multiple_tool_contexts': Only one ToolContext parameter is supported, but tool tool_with_multiple_tool_contexts has multiple. ``` Example ToolOutputSchemaError Messages ``` - [TOOL_DEFINITION_BAD_OUTPUT_SCHEMA] ToolOutputSchemaError in definition of tool 'tool_missing_return_type_hint': Tool 'ToolMissingReturnTypeHint' must have a return type - [TOOL_DEFINITION_BAD_OUTPUT_SCHEMA] ToolOutputSchemaError in definition of tool 'tool_with_unsupported_output_type': Unsupported output type '<class 'test_catalog.MyFancyTestClass'>'. Only built-in Python types, TypedDicts, Pydantic models, and standard collections are supported as tool output types. ``` ### Runtime Error Message Examples Example Tool Runtime Error Messages ``` - [TOOL_RUNTIME_FATAL] FatalToolError during execution of tool 'get_posts_in_subreddit': list index out of range - [TOOL_RUNTIME_CONTEXT_REQUIRED] ContextRequiredToolError during execution of tool 'get_posts_in_subreddit': Ambiguous username. Please provide a more specific username - [TOOL_RUNTIME_RETRY] RetryableToolError during execution of tool 'get_posts_in_subreddit': Retry with subreddit=learnpython or subreddit=learnprogramming ``` Example Upstream Runtime Error Messages ``` - [UPSTREAM_RUNTIME_RATE_LIMIT] UpstreamRateLimitError during execution of tool 'get_posts_in_subreddit': 429 Client Error: Too Many Requests - [UPSTREAM_RUNTIME_BAD_REQUEST] UpstreamError during execution of tool 'get_posts_in_subreddit': 400 Client Error: Bad request. Missing 'id' parameter. - [UPSTREAM_RUNTIME_BAD_REQUEST] UpstreamError during execution of tool 'search_files': Upstream Google API error: Invalid value '-23'. Values must be within the range: [value: 1\n, value: 1000\n] ```	2025-09-10 10:45:18 -07:00
Sam Partee	e188fc6ae9	Improve Typedict and Basemodel support (#523 ) Improve Pydantic and Typedict support and add a bunch of tets. 1. Fixed the test failure where TypeDict was being serialized as a list of tuples instead of a dict by: - Adding proper handling for BaseModel instances in the output.py file - Converting BaseModel results (from TypeDict conversion) to dicts using model_dump() - Handling lists containing BaseModel objects 2. Fixed None handling to ensure None results are converted to empty strings as expected 3. Updated the schema.py to allow dict and list types in ToolCallOutput.value 4. new tests - TypeDict output execution tests - Output factory tests - Executor tests with TypeDict support - Schema validation tests The key changes were: - In ``arcade_core/output.py``: Added BaseModel conversion logic in the success method - In ``arcade_core/schema.py``: Changed ToolCallOutput.value type from list[str] to list to support complex types TODO - [ ] Confirm engine compatibility without changes made to engine --------- Co-authored-by: Eric Gustin <eric@arcade.dev>	2025-08-27 16:50:09 -07:00
Sterling Dreyer	7888dc505e	Fix venv files not being found (#525 )	2025-08-01 12:12:35 -07:00
Sterling Dreyer	3f5c7aa6ba	Error on invalid toolkit file (#510 ) Changes toolkit loading and deployments to error if there are syntax errors in the file	2025-07-28 16:17:06 -07:00
Sam Partee	27a6cd31a3	Support Tool Output in ValueSchema of ToolDefinition (#487 ) ## Before ### Tool: ``GoogleNews.SearchNewsStories`` ```python @tool(requires_secrets=["SERP_API_KEY"]) async def search_news_stories( context: ToolContext, keywords: Annotated[ str, "Keywords to search for news articles. E.g. 'Apple launches new iPhone'.", ], country_code: Annotated[ CountryCode \| None, "2-character country code to search for news articles. " "E.g. 'us' (United States). " f"Defaults to '{DEFAULT_GOOGLE_NEWS_COUNTRY}'.", ] = None, language_code: Annotated[ LanguageCode, "2-character language code to search for news articles. E.g. 'en' (English). " f"Defaults to '{DEFAULT_GOOGLE_NEWS_LANGUAGE}'.", ] = DEFAULT_GOOGLE_NEWS_LANGUAGE, limit: Annotated[ int \| None, "Maximum number of news articles to return. Defaults to None " "(returns all results found by the API).", ] = None, ) -> Annotated[dict[str, Any]]: """Search for news articles related to a given query.""" ... ``` ### Tool Definition: ``GoogleNews.SearchNewsStories`` ``` { "name": "SearchNewsStories", "fully_qualified_name": "GoogleNews.SearchNewsStories", "description": "Search for news articles related to a given query.", "toolkit": { "name": "GoogleNews", "description": "Arcade.dev LLM tools for getting new via Google News", "version": "2.0.0" }, "input": { "parameters": [ { "name": "keywords", "required": true, "description": "Keywords to search for news articles. E.g. 'Apple launches new iPhone'.", "value_schema": { "val_type": "string", "inner_val_type": null, "enum": null, }, "inferrable": true }, { "name": "country_code", "required": false, "description": "2-character country code to search for news articles. E.g. 'us' (United States). Defaults to 'None'.", "value_schema": { "val_type": "string", "inner_val_type": null, "enum": null, }, "inferrable": true }, { "name": "language_code", "required": false, "description": "2-character language code to search for news articles. E.g. 'en' (English). Defaults to 'en'.", "value_schema": { "val_type": "string", "inner_val_type": null, "enum": null, }, "inferrable": true }, { "name": "limit", "required": false, "description": "Maximum number of news articles to return. Defaults to None (returns all results found by the API).", "value_schema": { "val_type": "integer", "inner_val_type": null, "enum": null, }, "inferrable": true } ] }, "output": { "description": "News search results with article details.", "available_modes": [ "value", "error" ], "value_schema": { "val_type": "json" } }, "requirements": { "authorization": null, "secrets": [ { "key": "serp_api_key" } ], "metadata": null }, "deprecation_message": null }, ``` ## After ### Enhanced Tool: ``GoogleNews.SearchNewsStories`` ```python """Type definitions for Google News API responses and parameters.""" from typing_extensions import TypedDict CountryCode = str LanguageCode = str class SearchNewsParams(TypedDict): """Input parameters for searching news articles.""" keywords: str """Search query terms to find relevant news articles \ (e.g., 'Apple launches new iPhone').""" country_code: CountryCode \| None """Optional 2-letter country code to filter news by region \ (e.g., 'us' for United States, 'uk' for United Kingdom).""" language_code: LanguageCode \| None """Optional 2-letter language code to filter news by language \ (e.g., 'en' for English, 'es' for Spanish).""" limit: int \| None """Optional maximum number of news articles to return. \ If not specified, returns all results from the API.""" class SourceInfo(TypedDict, total=False): """Information about the news source/publication.""" name: str """Name of the publication (e.g., 'CNN', 'BBC News', 'The New York Times').""" icon: str """URL to the source's favicon or logo image.""" authors: list[str] """List of author names for the article, if available.""" class NewsResult(TypedDict, total=False): """Individual news article from the Google News API response.""" position: int """Ranking position of this result in the search results.""" title: str """Headline or title of the news article.""" link: str """Full URL to the original news article.""" source: SourceInfo """Information about the publication source.""" date: str """Publication date and time (e.g., '2 hours ago', 'Dec 15, 2023').""" snippet: str """Brief excerpt or summary from the article content.""" thumbnail: str """URL to a high-resolution thumbnail image for the article.""" thumbnail_small: str """URL to a low-resolution thumbnail image for the article.""" story_token: str """Token for accessing full coverage of this news story across multiple sources.""" stories: list["NewsResult"] """Related news stories from other sources covering the same topic.""" highlight: dict """Additional highlighted information about the story.""" class SearchMetadata(TypedDict, total=False): """Metadata about the search request and processing.""" id: str """Unique identifier for this search request within SerpApi.""" status: str """Current processing status ('Processing', 'Success', or 'Error').""" json_endpoint: str """URL to retrieve the JSON results for this search.""" created_at: str """Timestamp when the search request was created.""" processed_at: str """Timestamp when the search request was processed.""" google_news_url: str """Original Google News URL that would return these results.""" total_time_taken: float """Total time in seconds taken to process this search.""" class SearchParameters(TypedDict, total=False): """Parameters used for the search request.""" engine: str """Search engine used (always 'google_news' for this API).""" q: str """Search query string.""" gl: str """Country code used for geographic filtering.""" hl: str """Language code used for language filtering.""" topic_token: str """Token for accessing specific news topics (e.g., 'World', 'Business', 'Technology').""" publication_token: str """Token for accessing news from specific publishers.""" class MenuLink(TypedDict): """Navigation link for news categories or topics.""" title: str """Display text for the menu item (e.g., 'Technology', 'Sports', 'Business').""" topic_token: str """Token to access this specific topic or category.""" serpapi_link: str """SerpApi URL to search within this topic.""" class TopStoriesLink(TypedDict): """Link to top stories section.""" topic_token: str """Token to access top stories.""" serpapi_link: str """SerpApi URL to retrieve top stories.""" class GoogleNewsResponse(TypedDict, total=False): """Complete response from the Google News API.""" search_metadata: SearchMetadata """Metadata about the search request and processing.""" search_parameters: SearchParameters """Parameters that were used for this search.""" news_results: list[NewsResult] """List of news articles matching the search criteria.""" menu_links: list[MenuLink] """Navigation links to different news categories and topics.""" top_stories_link: TopStoriesLink """Link to access top stories.""" title: str """Title of the page or topic being displayed.""" class SimplifiedNewsResult(TypedDict): """Simplified news article format for tool output.""" title: str """Headline of the news article.""" link: str """URL to the full article.""" source: str \| None """Name of the publication source.""" date: str \| None """When the article was published.""" snippet: str \| None """Brief excerpt from the article.""" class SearchNewsOutput(TypedDict): """Output format for the search_news_stories tool.""" news_results: list[SimplifiedNewsResult] """List of news articles in simplified format.""" @tool(requires_secrets=["SERP_API_KEY"]) async def search_news_stories( context: ToolContext, keywords: Annotated[ str, "Keywords to search for news articles. E.g. 'Apple launches new iPhone'.", ], country_code: Annotated[ CountryCode \| None, "2-character country code to search for news articles. " "E.g. 'us' (United States). " f"Defaults to '{DEFAULT_GOOGLE_NEWS_COUNTRY}'.", ] = None, language_code: Annotated[ LanguageCode, "2-character language code to search for news articles. E.g. 'en' (English). " f"Defaults to '{DEFAULT_GOOGLE_NEWS_LANGUAGE}'.", ] = DEFAULT_GOOGLE_NEWS_LANGUAGE, limit: Annotated[ int \| None, "Maximum number of news articles to return. Defaults to None " "(returns all results found by the API).", ] = None, ) -> Annotated[SearchNewsOutput, "News search results with article details."]: """Search for news articles related to a given query.""" ... ``` ### Enhanced Tool Definition: ``GoogleNews.SearchNewsStories`` ```json { "name": "SearchNewsStories", "fully_qualified_name": "GoogleNews.SearchNewsStories", "description": "Search for news articles related to a given query.", "toolkit": { "name": "GoogleNews", "description": "Arcade.dev LLM tools for getting new via Google News", "version": "2.0.0" }, "input": { "parameters": [ { "name": "keywords", "required": true, "description": "Keywords to search for news articles. E.g. 'Apple launches new iPhone'.", "value_schema": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": null }, "inferrable": true }, { "name": "country_code", "required": false, "description": "2-character country code to search for news articles. E.g. 'us' (United States). Defaults to 'None'.", "value_schema": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": null }, "inferrable": true }, { "name": "language_code", "required": false, "description": "2-character language code to search for news articles. E.g. 'en' (English). Defaults to 'en'.", "value_schema": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": null }, "inferrable": true }, { "name": "limit", "required": false, "description": "Maximum number of news articles to return. Defaults to None (returns all results found by the API).", "value_schema": { "val_type": "integer", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": null }, "inferrable": true } ] }, "output": { "description": "News search results with article details.", "available_modes": [ "value", "error" ], "value_schema": { "val_type": "json", "inner_val_type": null, "enum": null, "properties": { "news_results": { "val_type": "array", "inner_val_type": "json", "enum": null, "properties": null, "inner_properties": { "title": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": "Headline of the news article." }, "link": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": "URL to the full article." }, "source": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": "Name of the publication source." }, "date": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": "When the article was published." }, "snippet": { "val_type": "string", "inner_val_type": null, "enum": null, "properties": null, "inner_properties": null, "description": "Brief excerpt from the article." } }, "description": "List of news articles in simplified format." } }, "inner_properties": null, "description": null } }, "requirements": { "authorization": null, "secrets": [ { "key": "serp_api_key" } ], "metadata": null }, "deprecation_message": null }, ``` --------- Co-authored-by: Eric Gustin <eric@arcade.dev>	2025-07-24 15:32:35 -07:00
Eric Gustin	856606f38c	Remove arcade_ prefix requirement and add entry point toolkit discovery (#485 ) ## Summary This PR removes the requirement that all toolkits must have the arcade_ prefix and introduces a more flexible toolkit discovery system using Python entry points. ### 🏷️ Flexible Toolkit Naming * Community toolkits: Only add arcade_ prefix when the user is in arcade-ai/toolkits/ directory and explicitly chooses to create a community contribution. * External toolkits: No prefix requirement - developers can name their toolkits however they want * Toolkit names are now determined by user choice rather than enforced automatically ### 🔍 Entry Point Discovery * Added find_arcade_toolkits_from_entrypoints() method to discover toolkits via entry points * Entry point group: arcade_toolkits with name: toolkit_name * Updated pyproject.toml template to include entry point configuration * Entry point discovery takes precedence over prefix-based discovery for deduplication ### 📦 Backward Compatibility * Existing arcade_* prefixed toolkits continue to work via find_arcade_toolkits_from_prefix() find_all_arcade_toolkits() now combines both discovery methods * Deduplication logic prefers entry point toolkits over prefix-based ones when package names match ### 🛠️ `arcade new` Template Updates * pyproject.toml template for `arcade new` now includes entry point configuration: [project.entry-points.arcade_toolkits] ### 🔧 Minor Improvements * Refactored _strip_arcade_prefix() into a separate method for reusability * Updated variable naming for clarity (community_toolkit → is_community_toolkit) ### Benefits * Developer Freedom: Toolkit developers are no longer forced to use the arcade_ prefix. They are also no longer forced to use the package name as the toolkit name. * Cleaner Naming: External toolkits can use more natural names (e.g., my_company_toolkit instead of arcade_my_company_toolkit) * Better Discovery: Entry points provide a more standard Python mechanism for plugin discovery * Flexible Distribution: Toolkits can be distributed with any package name while still being discoverable ### Testing * Added comprehensive tests for the new entry point functionality * Tests cover edge cases like deduplication, error handling, and backward compatibility ### Version Bumps arcade-core: 2.0.0 → 2.1.0 arcade-ai: 2.0.5 → 2.1.0 This change makes the Arcade toolkit ecosystem more flexible and developer-friendly while maintaining full backward compatibility with existing toolkits. --------- Co-authored-by: Mateo Torres <mateo@arcade.dev>	2025-07-16 09:51:21 -07:00
Sam Partee	b6b4cd0a4c	🏗️ Restructure: Multi-Package Architecture + uv Migration (#412 ) ### Overview Major restructuring from monolithic `arcade-ai` package to modular library architecture with standardized uv-based dependency management. ![arcade-ai Monorepo (2)](https://github.com/user-attachments/assets/25f102b0-bb87-4a04-9701-d227d05664b1) ### New Package Structure - `arcade-tdk` - Lightweight toolkit development kit (core decorators, auth) - `arcade-core` - Core execution engine and catalog functionality - `arcade-serve` - FastAPI/MCP server components - `arcade-ai` - Meta package that includes CLI functionality. Optionally include evals via the `evals` extra. Optionally include all packages via the `all` extra. ### Key Benefits - Lighter Dependencies: Toolkits now depend only on `arcade-tdk` (~2 deps) vs full `arcade-ai` (~30+ deps) - Faster Builds: uv provides 10-100x faster dependency resolution and installation - Better Modularity: Clear separation of concerns, consumers import only what they need - Standard Tooling: Eliminates custom poetry scripts, uses standard Python packaging ### Migration Impact - All 20 toolkits converted from poetry → uv with `arcade-tdk` dependencies plus `arcade-ai[evals]` and `arcade-serve` dev dependencies. When developing locally, devs should install toolkits via `make install-local`. - Modern Python 3.10+ type hints throughout - Standardized build system with hatchling backend - Enhanced Makefile with robust toolkit management commands - Removed `arcade dev` CLI command - Reduce the number of files created by `arcade new` and add an option to not generate a tests and evals folder. This foundation enables faster development cycles and cleaner dependency chains for the growing toolkit ecosystem. ### Todo After this PR is merged - [ ] Post-merge workflow(s) (release & publish containers, etc) - [ ] Release order plan. @EricGustin suggests releasing in the following order: 1. `arcade-core` version 0.1.0 2. `arcade-serve` version 0.1.0 and `arcade-tdk` version 0.1.0 3. `arcade-ai` version 2.0.0 4. Patch release for all toolkits (all changes in toolkits are internal refactors) - [ ] [Update docs](https://github.com/ArcadeAI/docs/pull/318) --------- Co-authored-by: Eric Gustin <eric@arcade.dev> Co-authored-by: Eric Gustin <34000337+EricGustin@users.noreply.github.com>	2025-06-11 16:48:17 -07:00

19 commits