feat(arcade-core): opt-in debug leak flags for toolkit authors (#826)

## Summary Adds two strictly opt-in env vars that let toolkit developers see `developer_message` / `stacktrace` content *in* the agent-facing error message while debugging. Off by default; activation requires a specific acknowledgement string, not a boolean — `true`/`1` is explicitly rejected with a warning log. - `ARCADE_UNSAFE_DEBUG_LEAK_DEVELOPER_MESSAGE_TO_AGENT` - `ARCADE_UNSAFE_DEBUG_LEAK_STACKTRACE_TO_AGENT` - Magic ack: `yes-i-accept-leaking-internals-to-the-agent` Everything goes through a single funnel — `ToolOutputFactory.fail` / `fail_retry` in `arcade_core/output.py` — so the behavior covers both the MCP server path and the Arcade Worker path with no call-site changes. A loud `logger.warning` fires once per process on activation, and a big header comment in `output.py` tells future maintainers not to add more flags of this shape (debug info belongs in `logger.debug`, not in a field that gets shipped to the model and often to end users). Bumps `arcade-core` 4.6.2 → 4.7.0. Non-breaking, additive. ## Why Today the project does a lot of work to keep `developer_message` and `stacktrace` off the agent's context. That's the right default, but it makes iterating on a new toolkit painful — you end up adding temporary logging or rebuilds just to see what blew up. This gives toolkit authors a safe, ugly, loud-on-activation escape hatch. ## Safety design - Two separate flags so you only leak what you need. - Magic string (not a boolean) activates the flag. Boolean-style values are rejected and log a pointer to `output.py`. - First activation logs a `WARNING` identifying the flag and the risk. - Flags documented only in `CLAUDE.md`, not in the public README. - Top-of-file banner in `output.py` explicitly tells maintainers not to add more flags of this shape. ## Test plan - [x] Existing test suite passes (1154 tests — `libs/tests/{core,tool,arcade_mcp_server}`). - [x] End-to-end smoke test against the built `arcade_core-4.7.0` wheel, driven through `ToolExecutor.run` (same path toolkits hit). Covered cases: - flags off → message unchanged - `ARCADE_UNSAFE_..._DEVELOPER_MESSAGE_TO_AGENT=true` → flag rejected, warning logged, message unchanged - `ARCADE_UNSAFE_..._DEVELOPER_MESSAGE_TO_AGENT=<magic>` → `[DEBUG] developer_message: ...` appended - both flags with magic, `ToolRuntimeError` path → developer_message appended (stacktrace absent because `ToolRuntimeError.stacktrace()` returned `None`, which is existing behavior) - stacktrace flag with magic, generic `Exception` path → full `traceback.format_exc()` appended, activation `WARNING` visible Made with [Cursor](https://cursor.com)  --- > [!NOTE] > **Medium Risk** > Adds an opt-in path to include `developer_message` and stacktraces in agent-facing MCP error messages, which could leak sensitive data if misconfigured; safeguards (magic ack string + CI/pre-commit guard) reduce but don’t eliminate risk. > > **Overview** > Adds `arcade_mcp_server/_debug_exposure.py` with two env-gated debug flags that, only when set to a specific acknowledgement string, append `developer_message` and/or `stacktrace` into the agent-visible MCP tool error `message` (and logs one-shot warnings on rejection/activation). > > Wires this into the MCP error path in `MCPServer._handle_call_tool`, documents the flags in `CLAUDE.md`, bumps `arcade-mcp-server` to `1.21.0`, and adds unit + integration tests plus a pre-commit hook and GitHub Actions workflow (`scripts/check_debug_leak_flags_off.py`) to ensure the magic ack string can’t be committed outside a small allowlist. > > <sup>Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit 30e242c454128ec7cc62e169c2afd116be735cb5. Bugbot is set up for automated code reviews on this repo. Configure [here](https://www.cursor.com/dashboard/bugbot).</sup>
2026-04-25 11:40:26 -03:00 · 2026-04-25 11:40:26 -03:00 · 70515e3356
commit 70515e3356
parent 40e05af27c
9 changed files with 696 additions and 1 deletions
--- a/.github/workflows/check-debug-leak-flags.yml
+++ b/.github/workflows/check-debug-leak-flags.yml
@ -0,0 +1,30 @@
 name: Debug Leak Flag Guard
 # Ensures the debug-exposure flags in arcade_mcp_server/_debug_exposure.py cannot be
 # activated by anything shipped in committed files. The flags only activate
 # when the env var is set to one specific acknowledgement string, so we just
 # need to guarantee that string never appears outside its allowlist. See
 # scripts/check_debug_leak_flags_off.py for details.
 on:
  push:
    branches:
      - main
  pull_request:
    types: [opened, synchronize, reopened, ready_for_review]
 jobs:
  guard:
    name: Debug leak flag guard
    runs-on: ubuntu-latest
    steps:
      - name: Check out
        uses: actions/checkout@v4
      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Verify debug-leak flags stay off
        run: python scripts/check_debug_leak_flags_off.py
--- a/.pre-commit-config.yaml
+++ b/.pre-commit-config.yaml
@ -20,3 +20,16 @@ repos:
        exclude: "(.*/templates/.*|libs/tests/.*)"
      - id: ruff-format
        exclude: "(.*/templates/.*|libs/tests/.*)"
  - repo: local
    hooks:
      - id: check-debug-leak-flags
        name: "Guard: unsafe debug-leak flags must stay off"
        description: >-
          Fails if the activation acknowledgement string for the unsafe
          debug-leak flags in arcade_core/output.py appears in any tracked
          file outside its small allowlist.
        entry: python scripts/check_debug_leak_flags_off.py
        language: python
        pass_filenames: false
        always_run: true
--- a/CLAUDE.md
+++ b/CLAUDE.md
@ -253,6 +253,17 @@ The `arcade` CLI (`libs/arcade-cli/arcade_cli/main.py`) is typer-based. Key comm
 | `ARCADE_DISABLE_AUTOUPDATE=1` | Disable CLI auto-update checks |
 | Any non-`MCP_`/`_` prefixed var | Automatically available as a tool secret via `context.get_secret()` |
 ### Debug-only flags: expose error internals in tool error responses (toolkit authors)
 When set, these flags append `developer_message` and/or the tool stacktrace to the `message` field of the MCP tool error response — useful while debugging a toolkit, because most MCP clients render only `message` and drop `developer_message`. Use ONLY for local debugging. Both require the exact value `yes-i-accept-leaking-internals-to-the-agent` (nothing else is accepted — `true`, `1`, etc. are rejected and log a warning). Each logs a loud WARNING on first activation. Implemented in `libs/arcade-mcp-server/arcade_mcp_server/_debug_exposure.py`.
 | Env var | Effect when set to the magic ack value |
 |---------|-----------------------------------------|
 | `ARCADE_DEBUG_EXPOSE_DEVELOPER_MESSAGE_IN_TOOL_ERROR_RESPONSES` | Appends `developer_message` to the error response `message` field |
 | `ARCADE_DEBUG_EXPOSE_STACKTRACE_IN_TOOL_ERROR_RESPONSES` | Appends the tool stacktrace to the error response `message` field |
 **Never enable in production.** The `message` field is returned verbatim to whoever called the tool — LLMs, transcripts, end-user UIs, and anything else downstream.
 ## Project Layout
 - `libs/arcade-*/` — Core libraries, each with own `pyproject.toml` (except cli/evals → root)
--- a/libs/arcade-mcp-server/arcade_mcp_server/_debug_exposure.py
+++ b/libs/arcade-mcp-server/arcade_mcp_server/_debug_exposure.py
@ -0,0 +1,84 @@
 """
 Debug-only escape hatch for MCP tool error responses.
 MCP clients typically render only the ``message`` field of a tool error
 response, dropping ``developer_message`` and ``stacktrace``. That makes
 server-side iteration painful when a tool is failing. The flags in this
 module let a toolkit author opt in to appending those internals to the
 ``message`` field while debugging.
 DEBUG-ONLY. Activating these flags can leak paths, tokens, or PII to
 callers. Don't add more flags of this shape — put debug info in logs
 instead.
 """
 from __future__ import annotations
 import logging
 import os
 _logger = logging.getLogger(__name__)
 # Acknowledgement string a developer must set as the env value. Picked to be
 # impossible to set by mistake — no sane config management or CI will ever
 # emit this string.
 _DEBUG_LEAK_MAGIC = "yes-i-accept-leaking-internals-to-the-agent"
 _ENV_EXPOSE_DEVELOPER_MESSAGE = "ARCADE_DEBUG_EXPOSE_DEVELOPER_MESSAGE_IN_TOOL_ERROR_RESPONSES"
 _ENV_EXPOSE_STACKTRACE = "ARCADE_DEBUG_EXPOSE_STACKTRACE_IN_TOOL_ERROR_RESPONSES"
 # One-shot warning state per flag. The rejection warning (truthy but not the
 # magic string) and the activation warning (magic string set) are tracked in
 # *separate* sets so that fixing a misconfigured flag within the same process
 # still fires the critical activation warning.
 _warned_rejected: set[str] = set()
 _warned_activated: set[str] = set()
 def _leak_enabled(env_var: str) -> bool:
    raw = os.environ.get(env_var)
    if raw is None:
        return False
    if raw.strip() != _DEBUG_LEAK_MAGIC:
        # A value is set but it isn't the magic ack. Treat as off and, if it
        # looks like someone tried a boolean, nudge them via a log so the
        # silence isn't confusing.
        if raw.strip().lower() in {"1", "true", "yes", "on"} and env_var not in _warned_rejected:
            _warned_rejected.add(env_var)
            _logger.warning(
                "%s is set to a truthy value but not to the required "
                "acknowledgement string. Flag remains OFF. "
                "See arcade_mcp_server/_debug_exposure.py.",
                env_var,
            )
        return False
    if env_var not in _warned_activated:
        _warned_activated.add(env_var)
        _logger.warning(
            "%s is ENABLED. Tool error internals will be appended to the "
            "`message` field of MCP tool error responses. This can leak paths, "
            "tokens, or PII to callers. DO NOT USE IN PRODUCTION.",
            env_var,
        )
    return True
 def augment_error_message_for_debug(
    message: str,
    developer_message: str | None,
    stacktrace: str | None,
 ) -> str:
    """Append debug internals to ``message`` when the corresponding env flags are set.
    This is a no-op in the default case (both flags off), and also a no-op when
    the flags are set to anything other than the activation ack string. See
    module docstring for the full rationale.
    """
    extras: list[str] = []
    if developer_message and _leak_enabled(_ENV_EXPOSE_DEVELOPER_MESSAGE):
        extras.append(f"developer_message: {developer_message}")
    if stacktrace and _leak_enabled(_ENV_EXPOSE_STACKTRACE):
        extras.append(f"stacktrace:\n{stacktrace}")
    if not extras:
        return message
    return f"{message}\n\n[DEBUG] " + "\n\n[DEBUG] ".join(extras)
--- a/libs/arcade-mcp-server/arcade_mcp_server/server.py
+++ b/libs/arcade-mcp-server/arcade_mcp_server/server.py
@ -29,6 +29,7 @@ from arcade_core.schema import ToolAuthRequirement as CoreToolAuthRequirement
 from arcadepy import ArcadeError, AsyncArcade
 from arcadepy.types.auth_authorize_params import AuthRequirement, AuthRequirementOauth2
 from arcade_mcp_server._debug_exposure import augment_error_message_for_debug
 from arcade_mcp_server.context import Context, get_current_model_context, set_current_model_context
 from arcade_mcp_server.convert import convert_content_to_structured_content, convert_to_mcp_content
 from arcade_mcp_server.exceptions import NotFoundError, ToolRuntimeError
@ -933,6 +934,11 @@ class MCPServer:
                    error_text = error.message
                    if error.additional_prompt_content:
                        error_text += f"\n\n{error.additional_prompt_content}"
                    error_text = augment_error_message_for_debug(
                        error_text,
                        error.developer_message,
                        error.stacktrace,
                    )
                    content = convert_to_mcp_content(error_text)
                    self._log_tool_call_error(tool_name, error)
                else:
--- a/libs/arcade-mcp-server/pyproject.toml
+++ b/libs/arcade-mcp-server/pyproject.toml
@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 [project]
 name = "arcade-mcp-server"
-version = "1.20.0"
+version = "1.21.0"
 description = "Model Context Protocol (MCP) server framework for Arcade.dev"
 readme = "README.md"
 authors = [{ name = "Arcade.dev" }]
--- a/libs/tests/arcade_mcp_server/test_debug_exposure.py
+++ b/libs/tests/arcade_mcp_server/test_debug_exposure.py
@ -0,0 +1,162 @@
 """Tests for the debug-exposure escape hatch in ``arcade_mcp_server/_debug_exposure.py``."""
 import logging
 import pytest
 from arcade_mcp_server import _debug_exposure as debug_exposure
 from arcade_mcp_server._debug_exposure import augment_error_message_for_debug
 _LEAK_MAGIC = "yes-i-accept-leaking-internals-to-the-agent"
 _ENV_DEV_MSG = "ARCADE_DEBUG_EXPOSE_DEVELOPER_MESSAGE_IN_TOOL_ERROR_RESPONSES"
 _ENV_STACKTRACE = "ARCADE_DEBUG_EXPOSE_STACKTRACE_IN_TOOL_ERROR_RESPONSES"
@pytest.fixture(autouse=True)
 def _reset_leak_warn_state(monkeypatch):
    """Clear the per-process one-shot warning state so each test starts clean.
    Both flags emit loud warnings (rejection and activation) one-shot per flag.
    Without a reset, later tests would silently lose coverage of those branches
    because the module-level tracking sets are already populated from earlier
    tests.
    """
    monkeypatch.delenv(_ENV_DEV_MSG, raising=False)
    monkeypatch.delenv(_ENV_STACKTRACE, raising=False)
    debug_exposure._warned_rejected.clear()
    debug_exposure._warned_activated.clear()
    yield
    debug_exposure._warned_rejected.clear()
    debug_exposure._warned_activated.clear()
 def test_no_leak_by_default():
    """With both flags unset, message must not be augmented."""
    out = augment_error_message_for_debug(
        "public error",
        developer_message="secret internals",
        stacktrace="Traceback...\n  line",
    )
    assert out == "public error"
@pytest.mark.parametrize("bad_value", ["true", "1", "yes", "on", "TRUE", "True"])
 def test_rejects_boolean_activation(monkeypatch, caplog, bad_value):
    """Any truthy-looking value that isn't the magic string must be rejected."""
    monkeypatch.setenv(_ENV_DEV_MSG, bad_value)
    with caplog.at_level(logging.WARNING, logger="arcade_mcp_server._debug_exposure"):
        out = augment_error_message_for_debug(
            "public error", developer_message="secret internals", stacktrace=None
        )
    assert out == "public error"
    assert any(
        "set to a truthy value but not to the required" in rec.message for rec in caplog.records
    )
 def test_rejects_random_non_magic_value(monkeypatch, caplog):
    """A non-boolean-looking value that isn't the magic string is silently off."""
    monkeypatch.setenv(_ENV_DEV_MSG, "debug-please")
    with caplog.at_level(logging.WARNING, logger="arcade_mcp_server._debug_exposure"):
        out = augment_error_message_for_debug(
            "public error", developer_message="secret internals", stacktrace=None
        )
    assert out == "public error"
    assert not any(
        "set to a truthy value but not to the required" in rec.message for rec in caplog.records
    )
 def test_developer_message_flag_enabled(monkeypatch, caplog):
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    with caplog.at_level(logging.WARNING, logger="arcade_mcp_server._debug_exposure"):
        out = augment_error_message_for_debug(
            "public error", developer_message="secret internals", stacktrace="trace"
        )
    assert "public error" in out
    assert "[DEBUG] developer_message: secret internals" in out
    # Stacktrace flag is off → stacktrace must NOT be in the augmented text.
    assert "trace" not in out.replace("public error", "")
    assert any("is ENABLED" in rec.message for rec in caplog.records)
 def test_stacktrace_flag_enabled(monkeypatch):
    monkeypatch.setenv(_ENV_STACKTRACE, _LEAK_MAGIC)
    out = augment_error_message_for_debug(
        "public error",
        developer_message="secret internals",
        stacktrace="Traceback (most recent call last):\n  File ...",
    )
    assert "public error" in out
    assert "[DEBUG] stacktrace:" in out
    assert "File ..." in out
    # Developer-message flag off → dev message must NOT leak.
    assert "secret internals" not in out
 def test_both_flags_enabled(monkeypatch):
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    monkeypatch.setenv(_ENV_STACKTRACE, _LEAK_MAGIC)
    out = augment_error_message_for_debug(
        "public error", developer_message="dev info", stacktrace="trace info"
    )
    assert "[DEBUG] developer_message: dev info" in out
    assert "[DEBUG] stacktrace:\ntrace info" in out
 def test_flag_enabled_but_no_content_to_leak(monkeypatch):
    """Flag on but developer_message/stacktrace are None → message unchanged."""
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    monkeypatch.setenv(_ENV_STACKTRACE, _LEAK_MAGIC)
    out = augment_error_message_for_debug("public error", None, None)
    assert out == "public error"
 def test_activation_warning_emitted_once_per_process(monkeypatch, caplog):
    """Second call with the flag on must NOT emit another activation warning."""
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    with caplog.at_level(logging.WARNING, logger="arcade_mcp_server._debug_exposure"):
        augment_error_message_for_debug("a", developer_message="dev", stacktrace=None)
        first_count = sum("is ENABLED" in r.message for r in caplog.records)
        augment_error_message_for_debug("b", developer_message="dev", stacktrace=None)
        second_count = sum("is ENABLED" in r.message for r in caplog.records)
    assert first_count == 1
    assert second_count == 1  # one-shot per process
 def test_rejection_does_not_suppress_later_activation_warning(monkeypatch, caplog):
    """Regression: once a truthy-but-non-magic value has been rejected for a
    flag, correcting the value to the magic string within the same process
    must still emit the critical "ENABLED ... DO NOT USE IN PRODUCTION"
    warning. Previously both paths shared one state set, so the activation
    warning was silently swallowed in this scenario.
    """
    with caplog.at_level(logging.WARNING, logger="arcade_mcp_server._debug_exposure"):
        monkeypatch.setenv(_ENV_DEV_MSG, "true")
        out_rejected = augment_error_message_for_debug(
            "public error", developer_message="secret internals", stacktrace=None
        )
        assert "[DEBUG]" not in out_rejected
        rejection_count = sum(
            "set to a truthy value but not to the required" in r.message for r in caplog.records
        )
        assert rejection_count == 1
        monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
        out_activated = augment_error_message_for_debug(
            "public error", developer_message="secret internals", stacktrace=None
        )
        assert "[DEBUG] developer_message: secret internals" in out_activated
        activation_count = sum("is ENABLED" in r.message for r in caplog.records)
        assert activation_count == 1, (
            "activation warning must fire even after the rejection warning "
            "has already been emitted for the same flag in this process"
        )
 def test_magic_value_ignores_surrounding_whitespace(monkeypatch):
    """Leading/trailing whitespace around the magic string still activates the flag."""
    monkeypatch.setenv(_ENV_DEV_MSG, f"  {_LEAK_MAGIC}  ")
    out = augment_error_message_for_debug(
        "public error", developer_message="secret internals", stacktrace=None
    )
    assert "[DEBUG] developer_message: secret internals" in out
--- a/libs/tests/arcade_mcp_server/test_debug_exposure_integration.py
+++ b/libs/tests/arcade_mcp_server/test_debug_exposure_integration.py
@ -0,0 +1,263 @@
 """End-to-end integration tests for the MCP debug-exposure escape hatch.
 These complement the pure-function unit tests in ``test_debug_exposure.py`` by
 exercising the full MCP tool-call path:
    tool raises -> ToolExecutor.run -> ToolOutputFactory.fail ->
    MCPServer._call_tool -> augment_error_message_for_debug ->
    CallToolResult.content[0].text
 This is the path every real MCP client hits, so it's where regressions in the
 wire-up (wrong call site, wrong argument order, missing import, etc.) would
 actually surface. The unit tests can't catch those because they call the pure
 function directly.
 """
 from typing import Annotated
 import pytest
 import pytest_asyncio
 from arcade_core.catalog import MaterializedTool, ToolCatalog, ToolMeta, create_func_models
 from arcade_core.errors import FatalToolError
 from arcade_core.schema import (
    InputParameter,
    ToolDefinition,
    ToolInput,
    ToolkitDefinition,
    ToolOutput,
    ToolRequirements,
    ValueSchema,
 )
 from arcade_mcp_server import _debug_exposure as debug_exposure
 from arcade_mcp_server import tool
 from arcade_mcp_server.server import MCPServer
 from arcade_mcp_server.settings import MCPSettings
 from arcade_mcp_server.types import CallToolRequest, CallToolResult, JSONRPCResponse
 _LEAK_MAGIC = "yes-i-accept-leaking-internals-to-the-agent"
 _ENV_DEV_MSG = "ARCADE_DEBUG_EXPOSE_DEVELOPER_MESSAGE_IN_TOOL_ERROR_RESPONSES"
 _ENV_STACKTRACE = "ARCADE_DEBUG_EXPOSE_STACKTRACE_IN_TOOL_ERROR_RESPONSES"
@pytest.fixture(autouse=True)
 def _reset_leak_state(monkeypatch):
    monkeypatch.delenv(_ENV_DEV_MSG, raising=False)
    monkeypatch.delenv(_ENV_STACKTRACE, raising=False)
    debug_exposure._warned_rejected.clear()
    debug_exposure._warned_activated.clear()
    yield
    debug_exposure._warned_rejected.clear()
    debug_exposure._warned_activated.clear()
 # ---- Tool definitions used by the integration tests -------------------------
@tool
 def raises_fatal_tool_error(
    query: Annotated[str, "A query"],
 ) -> Annotated[str, "Result"]:
    """Simulates a toolkit author's tool failing with a rich error."""
    raise FatalToolError(
        message="Failed to fetch results",
        developer_message=f"HTTP 503 on upstream endpoint for query={query!r}",
    )
@tool
 def raises_unhandled_exception(
    query: Annotated[str, "A query"],
 ) -> Annotated[str, "Result"]:
    """Simulates a toolkit author's tool crashing with an unexpected exception.
    The executor's generic `except Exception` branch populates the stacktrace
    via `traceback.format_exc()`, which is what the stacktrace flag leaks.
    """
    raise ValueError(f"unexpected crash for query={query!r}")
 def _materialized(func, name: str) -> MaterializedTool:
    definition = ToolDefinition(
        name=name,
        fully_qualified_name=f"TestToolkit.{name}",
        description=f"{name} integration fixture",
        toolkit=ToolkitDefinition(name="TestToolkit", description="", version="1.0.0"),
        input=ToolInput(
            parameters=[
                InputParameter(
                    name="query",
                    required=True,
                    description="A query",
                    value_schema=ValueSchema(val_type="string"),
                ),
            ]
        ),
        output=ToolOutput(
            description="Result",
            value_schema=ValueSchema(val_type="string"),
        ),
        requirements=ToolRequirements(),
    )
    input_model, output_model = create_func_models(func)
    return MaterializedTool(
        tool=func,
        definition=definition,
        meta=ToolMeta(module=func.__module__, toolkit="TestToolkit"),
        input_model=input_model,
        output_model=output_model,
    )
@pytest.fixture
 def erroring_catalog() -> ToolCatalog:
    catalog = ToolCatalog()
    mt1 = _materialized(raises_fatal_tool_error, "raises_fatal_tool_error")
    mt2 = _materialized(raises_unhandled_exception, "raises_unhandled_exception")
    catalog._tools[mt1.definition.get_fully_qualified_name()] = mt1
    catalog._tools[mt2.definition.get_fully_qualified_name()] = mt2
    return catalog
@pytest_asyncio.fixture
 async def erroring_server(erroring_catalog) -> MCPServer:
    settings = MCPSettings()
    settings.middleware.mask_error_details = False
    server = MCPServer(
        catalog=erroring_catalog,
        name="Integration Debug Exposure Server",
        version="0.0.0",
        settings=settings,
    )
    await server.start()
    try:
        yield server
    finally:
        await server.stop()
 async def _call(erroring_server: MCPServer, tool_name: str) -> CallToolResult:
    message = CallToolRequest(
        jsonrpc="2.0",
        id=1,
        method="tools/call",
        params={"name": f"TestToolkit.{tool_name}", "arguments": {"query": "ping"}},
    )
    response = await erroring_server._handle_call_tool(message)
    assert isinstance(response, JSONRPCResponse)
    assert isinstance(response.result, CallToolResult)
    assert response.result.isError is True
    assert response.result.structuredContent is None
    return response.result
 # ---- Integration tests ------------------------------------------------------
@pytest.mark.asyncio
 async def test_integration_baseline_no_leak(erroring_server):
    """Default state: the agent sees ONLY the sanitized message."""
    result = await _call(erroring_server, "raises_fatal_tool_error")
    text = result.content[0].text
    assert "Failed to fetch results" in text
    assert "[DEBUG]" not in text
    assert "HTTP 503" not in text
    assert "query='ping'" not in text
@pytest.mark.asyncio
 async def test_integration_boolean_rejected_no_leak(erroring_server, monkeypatch, caplog):
    """Boolean-looking values are rejected by the MCP boundary too."""
    monkeypatch.setenv(_ENV_DEV_MSG, "true")
    import logging
    with caplog.at_level(logging.WARNING, logger="arcade_mcp_server._debug_exposure"):
        result = await _call(erroring_server, "raises_fatal_tool_error")
    text = result.content[0].text
    assert "Failed to fetch results" in text
    assert "[DEBUG]" not in text
    assert "HTTP 503" not in text
    assert any(
        "set to a truthy value but not to the required" in r.message for r in caplog.records
    )
@pytest.mark.asyncio
 async def test_integration_developer_message_flag_leaks_through_mcp(
    erroring_server, monkeypatch
 ):
    """When the flag is set to the magic value, the MCP response `content`
    carries `developer_message` alongside the sanitized message."""
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    result = await _call(erroring_server, "raises_fatal_tool_error")
    text = result.content[0].text
    assert "Failed to fetch results" in text
    assert "[DEBUG] developer_message:" in text
    assert "HTTP 503 on upstream endpoint for query='ping'" in text
    # Stacktrace flag is off — stacktrace must NOT leak.
    assert "[DEBUG] stacktrace:" not in text
@pytest.mark.asyncio
 async def test_integration_stacktrace_flag_leaks_traceback_through_mcp(
    erroring_server, monkeypatch
 ):
    """Unhandled exceptions go through the executor's generic except branch,
    which populates a real stacktrace. With the flag on, that stacktrace must
    appear in the MCP response content."""
    monkeypatch.setenv(_ENV_STACKTRACE, _LEAK_MAGIC)
    result = await _call(erroring_server, "raises_unhandled_exception")
    text = result.content[0].text
    # The generic-exception branch wraps the message with the tool name.
    assert "raises_unhandled_exception" in text
    assert "[DEBUG] stacktrace:" in text
    assert "Traceback" in text
    assert "ValueError" in text
    assert "unexpected crash for query='ping'" in text
@pytest.mark.asyncio
 async def test_integration_both_flags_leak_through_mcp(erroring_server, monkeypatch):
    """Both flags together on an unhandled exception: developer_message (from
    `str(e)` in the executor) AND the stacktrace both reach the MCP content."""
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    monkeypatch.setenv(_ENV_STACKTRACE, _LEAK_MAGIC)
    result = await _call(erroring_server, "raises_unhandled_exception")
    text = result.content[0].text
    assert "[DEBUG] developer_message:" in text
    assert "unexpected crash for query='ping'" in text
    assert "[DEBUG] stacktrace:" in text
    assert "Traceback" in text
@pytest.mark.asyncio
 async def test_integration_success_path_unaffected_by_flags(
    tool_catalog, mcp_settings, monkeypatch
 ):
    """Sanity check: even with both flags on, SUCCESSFUL tool responses are
    not touched. The augmentation only runs on the error branch."""
    monkeypatch.setenv(_ENV_DEV_MSG, _LEAK_MAGIC)
    monkeypatch.setenv(_ENV_STACKTRACE, _LEAK_MAGIC)
    server = MCPServer(
        catalog=tool_catalog,
        name="Success Path Server",
        version="0.0.0",
        settings=mcp_settings,
    )
    await server.start()
    try:
        response = await server._handle_call_tool(
            CallToolRequest(
                jsonrpc="2.0",
                id=1,
                method="tools/call",
                params={"name": "TestToolkit.test_tool", "arguments": {"text": "hi"}},
            )
        )
    finally:
        await server.stop()
    assert isinstance(response, JSONRPCResponse)
    assert isinstance(response.result, CallToolResult)
    assert response.result.isError is False
    assert response.result.structuredContent is not None
    for item in response.result.content:
        assert "[DEBUG]" not in getattr(item, "text", "")
--- a/scripts/check_debug_leak_flags_off.py
+++ b/scripts/check_debug_leak_flags_off.py
@ -0,0 +1,126 @@
 #!/usr/bin/env python3
 # ruff: noqa: S603, S607
 #
 # This script shells out to `git` via PATH on purpose: it runs inside
 # pre-commit and GitHub Actions, both of which guarantee git on PATH, and
 # hard-coding an absolute path would break portability. The subprocess
 # invocations here pass only constant argv lists, so S603/S607 don't apply.
 """
 Guard: the debug-exposure flags in ``arcade_mcp_server/_debug_exposure.py``
 must never ship in the "on" state through committed files.
 The two env vars
    ARCADE_DEBUG_EXPOSE_DEVELOPER_MESSAGE_IN_TOOL_ERROR_RESPONSES
    ARCADE_DEBUG_EXPOSE_STACKTRACE_IN_TOOL_ERROR_RESPONSES
 only activate when set to one specific acknowledgement string. Therefore we
 only need to guarantee that string never appears in the tree outside a tiny
 allowlist of files (the source that defines it, the tests that exercise it,
 the developer doc, and this guard itself).
 This script is run both as a pre-commit hook and as a dedicated CI workflow.
 Exit codes:
  0  OK — flags cannot be activated by anything in the tree.
  1  FAIL — the magic string was found in a non-allowlisted file.
  2  Infrastructure error (e.g. ``git ls-files`` unavailable).
 """
 from __future__ import annotations
 import subprocess
 import sys
 from pathlib import Path
 # The activation ack string. Kept as the sole constant so updating it in one
 # place (arcade_mcp_server/_debug_exposure.py) also updates the guard.
 MAGIC = "yes-i-accept-leaking-internals-to-the-agent"
 # Files that are *allowed* to mention the magic string. Everything else is a
 # hard fail. Paths are relative to the repository root and use forward slashes.
 ALLOWLIST: frozenset[str] = frozenset({
    # The source of truth for the flags.
    "libs/arcade-mcp-server/arcade_mcp_server/_debug_exposure.py",
    # Unit tests for the pure augmentation function.
    "libs/tests/arcade_mcp_server/test_debug_exposure.py",
    # Integration tests for the MCP-boundary wire-up.
    "libs/tests/arcade_mcp_server/test_debug_exposure_integration.py",
    # Developer documentation for the flags.
    "CLAUDE.md",
    # This guard itself.
    "scripts/check_debug_leak_flags_off.py",
 })
 def _repo_root() -> Path:
    try:
        out = subprocess.check_output(
            ["git", "rev-parse", "--show-toplevel"],
            text=True,
            stderr=subprocess.DEVNULL,
        )
    except (subprocess.CalledProcessError, FileNotFoundError):
        print("check_debug_leak_flags_off: not a git checkout", file=sys.stderr)
        raise SystemExit(2) from None
    return Path(out.strip())
 def _tracked_files(root: Path) -> list[str]:
    try:
        out = subprocess.check_output(
            ["git", "-C", str(root), "ls-files"],
            text=True,
            stderr=subprocess.DEVNULL,
        )
    except (subprocess.CalledProcessError, FileNotFoundError):
        print("check_debug_leak_flags_off: git ls-files failed", file=sys.stderr)
        raise SystemExit(2) from None
    return [line for line in out.splitlines() if line]
 def main() -> int:
    root = _repo_root()
    failures: list[str] = []
    for rel in _tracked_files(root):
        if rel in ALLOWLIST:
            continue
        path = root / rel
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8", errors="ignore")
        except OSError:
            continue
        if MAGIC in text:
            failures.append(rel)
    if failures:
        print("Debug-leak flag guard: FAIL", file=sys.stderr)
        print("", file=sys.stderr)
        print(
            "The activation acknowledgement string for the unsafe debug-leak "
            "flags was found in files that must never contain it:",
            file=sys.stderr,
        )
        for f in failures:
            print(f"  - {f}", file=sys.stderr)
        print("", file=sys.stderr)
        print(
            "These env vars must stay off by default everywhere the repo ships. "
            "If you need to iterate locally, export the magic value in your "
            "shell only — never commit it.",
            file=sys.stderr,
        )
        print(
            "See libs/arcade-mcp-server/arcade_mcp_server/_debug_exposure.py "
            "for the full rationale.",
            file=sys.stderr,
        )
        return 1
    print("Debug-leak flag guard: OK")
    return 0
 if __name__ == "__main__":
    sys.exit(main())