arcade-mcp

Author	SHA1	Message	Date
Eric Gustin	2fe907e5dd	Release Google Toolkit version 1.0.0 (#280 ) Updates to the Google Toolkit: https://github.com/ArcadeAI/arcade-ai/pull/264 (google patch) https://github.com/ArcadeAI/arcade-ai/pull/188 (google minor) https://github.com/ArcadeAI/arcade-ai/pull/272 (google major) https://github.com/ArcadeAI/arcade-ai/pull/269 (google minor) https://github.com/ArcadeAI/arcade-ai/pull/265 (google major)	2025-03-12 14:25:50 -07:00
Renato Byrro	ac0f5aa10c	Search Google Drive documents and retrieve contents (#265 ) This tool will be useful in scenarios akin to RAG, where someone wants to ask questions or request the production of a summary, for instance, about a bunch of documents related to a particular topic. Currently, to fulfill such requests, the LLM needs to first `list_documents`, then `get_document_by_id` for each document. We also implement a utility functions to return documents in Markdown and HTML, since the Drive API JSON is verbose and would waste too many tokens unnecessarily. Limitations: the Markdown/HTML utilities do not handle table of contents (which I think aren't really useful here), headers, footers, or footnotes. --- This PR deprecates `list_documents` and implements `search_documents`, apart from `search_and_retrieve_documents`). This configuration makes it easier for LLMs to understand when to call each tool. Both tools had their interfaces refactored to remove Google API-specific arguments that were confusing LLMs sometimes, such as "corpora" and "support_all_drives". It now accepts arguments that better relate to expected user requests. --------- Co-authored-by: Eric Gustin <eric@arcade.dev>	2025-03-07 18:42:12 -03:00
Renato Byrro	2135101acd	Tool to retrieve file tree structure from Google 'My Drive' and 'Shared Drives' (#269 )	2025-03-07 18:02:09 -03:00
Nate Barbettini	e9ee3bba40	fix: Use tool secrets in toolkits (#271 ) ~~Note: Don't merge until the correct secrets have been added to Arcade Cloud.~~ Ready to merge, the feature is already on its way to prod. --------- Co-authored-by: Eric Gustin <eric@arcade.dev>	2025-03-04 13:35:36 -08:00
Renato Byrro	75da4bf8b0	Make search_contacts interface more LLM-friendly (#272 ) Break down `search_contacts` into `search_contacts_by_name` and `search_contacts_by_email`. The search_contacts' `query` argument was not clear enough for LLMs.	2025-02-28 16:47:03 -03:00
Alex Salazar	7b1110f2b7	Alex gmail improvements (#188 ) Improved gmail toolkit. Added support for threading in draft replies, multipart email parsing, and label management. Fixed the DateRange parameter issue in list_emails_by_headers. Added logging and removed print statements. Created custom exceptions for each specific google toolkit. ----- Summary of changes by @byrro: - Fixed minor bug related to the `date_range` argument of `list_emails_by_header` - A few utility functions (`build_email_message`, `build_reply_recipients`, `build_reply_body`) to centralize logic and remove repeated code from email-sending tools - New `reply_to_email` tool (apart from `write_draft_reply_email`, implemented by Alex) to keep the toolkit consistent - Evals and unit tests - Handling of reply-to (only sender) and reply-to-all recipients - Removed some unnecessary debug messages, which Alex had added to replace print statements - Removed HTML handling implemented by Alex in `write_draft_reply_email` > I think we should either support HTML across all applicable tools or not at all; I decided to remove it and leave this feature for a future PR. --------- Co-authored-by: Renato Byrro <rmbyrro@gmail.com>	2025-02-27 11:56:32 -03:00
Nate Barbettini	62173da343	Update scope on Google.ListDocuments (#264 ) Updating the scope on this unreleased tool to be more granular.	2025-02-20 21:36:46 -08:00
Eric Gustin	eeb47dbec5	[Toolkit Release] Weekly Toolkit Release 02-20-25 (#261 ) # Weekly Toolkit Release 02-20-25 Previous Toolkit Release: #248 ## Google Toolkit Minor Release https://github.com/ArcadeAI/arcade-ai/pull/249 #259 ## Slack Toolkit Minor Release #254 ## X Toolkit Patch Release #256	2025-02-20 13:27:35 -08:00
Nate Barbettini	274e63c9e5	Polish up Google.SearchContacts and CreateContact (#259 ) Adding the missing polish (evals, tests) for #249	2025-02-20 08:53:12 -08:00
Renato Byrro	8efa9a51df	Slack tools to retrieve messages & metadata from multi-person DM conversation (#254 )	2025-02-19 16:51:45 -03:00
Nate Barbettini	becd86da0c	Google toolkit: Search and create contacts POC (#249 ) This is an initial sketch of what Contacts (People) API tools could look like. But I haven't yet thought like an MX Engineer @byrro 😉	2025-02-18 17:27:33 -08:00
Eric Gustin	7d45a99722	X Toolkit: Handle no tweets returned in search (#256 ) ## PR Description Search ([see API docs here](https://docs.x.com/x-api/posts/recent-search)) returns a 'data' field that maps to a list of tweets returned. We've observed that the 'data' field is not present if no tweets match the search. This PR handles that case safely.	2025-02-18 13:28:33 -08:00
Eric Gustin	49ff013e80	[Toolkit Release] Weekly Toolkit Release 02-13-25 (#248 ) # Weekly Toolkit Release 02-13-25 Previous Toolkit Release: #236 ## Associated PRs * Minor #241	2025-02-13 13:06:11 -08:00
Renato Byrro	a00bd4734e	Tool to retrieve Slack messages from a DM conversation with a given username (#241 ) Currently, retrieving DMs with a given username requires several actions: first get the current user's ID; list all users and find the ID of the username; then scan all DM conversations and find the one with the current user's ID and the username's ID, to finally retrieve the messages using that conversation ID. This tool abstracts all that in a single call. PS: we'll implement a similar tool for multi-person DM conversations in a subsequent PR.	2025-02-10 09:01:04 -03:00
Eric Gustin	be2539602f	Evals New Features (#208 ) # PR Description This PR adds ~~four~~ three improvements to evals. ~~## 1. Add parameterized eval cases~~ ~~Adds a new method named `add_parameterized_case`. Just like pytest’s parameterized tests, eval cases can be parameterized with multiple user messages. Adds a case to the `EvalSuite` for each user message. All cases have the same expected tool call(s), params, additional_messages. This reduces duplicate code and makes it easy to observe how a model performs based on increasingly more difficult prompts.~~ ```python """ NO LONGER IN THIS PR user_messages = [ "Call the delete tweet by id tool with the tweet ID '148975632'.", "Delete the tweet with ID '148975632'.", "I don't want to have this tweet (148975632) on my account anymore.", "do the opposite of post for https://x.com/x/status/148975632", ] suite.add_parameterized_case( name="Delete a tweet by ID", user_messages=user_messages, expected_tool_calls=[ ExpectedToolCall( func=delete_tweet_by_id, args={"tweet_id": "148975632"}, ) ], critics=[ BinaryCritic( critic_field="tweet_id", weight=1.0, ), ], ) """ ``` ~~PASSED Delete a tweet by ID (user_message 1 of 4) -- Score: 100.00%~~ ~~PASSED Delete a tweet by ID (user_message 2 of 4) -- Score: 100.00%~~ ~~PASSED Delete a tweet by ID (user_message 3 of 4) -- Score: 100.00%~~ ~~FAILED Delete a tweet by ID (user_message 4 of 4) -- Score: 0.00%~~ ~~Summary -- Total: 4 -- Passed: 3 -- Failed: 1~~ ## 2. Parameters that are not explicitly criticized are assigned a `NoneCritic`. A NoneCritic has no effect on the evaluation results and does not actually evaluate. Parameters that have a NoneCritic will be displayed as ‘un-criticized’ in the evaluation summary (if `-d` flag is used). ![image](https://github.com/user-attachments/assets/300756ec-9b53-436a-9cf9-fc61d0b00c01) ## 3. Add a hardcoded `seed` parameter for evals. The seed parameter aides in receiving (mostly) consistent outputs - aiding in reproducibility for evaluations. ## 4. Disallow more than one critic for the same field. Raises a `ValueError` if more than one critic is assigned to a field. --------- Co-authored-by: Eric Gustin <eric@arcade-ai.com>	2025-02-05 15:22:08 -08:00
Renato Byrro	149c25d967	Fix missing scopes in tools that call other tools (#240 )	2025-02-03 17:39:34 -08:00
Eric Gustin	aaf1dbd795	[Toolkit Release] Weekly Toolkit Release 01-29-25 (#236 ) # Weekly Toolkit Release 01-29-25 Previous Toolkit Release: #222 ## Associated PRs Slack Patch: #232 GitHub Patch: #227	2025-01-29 09:51:16 -08:00
Eric Gustin	ce2fb0f6c1	Update Examples & Various Renames (#233 ) # PR Description * This PR updates code in `examples/` to be compatible with version 1.0.0 * This PR removes the Spotify examples since the Arcade hosted worker doesn't currently cataloge the Spotify toolkit. We can reintroduce these examples when it does. * This PR performs various renames across the codebase for `arcade-ai.com` --> `arcade.dev` and `Arcade AI` --> `Arcade`	2025-01-28 17:17:29 -08:00
Eric Gustin	3657fc79b6	Whitelist Toolkit Release Managers (#234 ) # PR Description The `github.event.pull_request.author_association` in the "Prevent Unauthorized Version Updates" workflow was returning inconsistent results by saying that MEMBERS were CONTRIBUTORS. This PR moves away from `author_association` in favor of a whitelist text file containing the GitHub usernames of authorized toolkit release managers. A toolkit release manager has the following special permissions: * Can change the version of an existing toolkit * Can delete an existing toolkit * Can rename an existing toolkit	2025-01-27 14:35:45 -08:00
Renato Byrro	27d8aa7f43	Fix bug in slack tool pagination (#232 ) The `get_conversation_metadata_by_name` tool retrieves conversation metadata from another tool, `list_conversations_metadata`, but was accessing the `next_cursor` using the Slack API response dict structure, instead of the tool response structure. As a result, in that tool, the tool would never actually paginate to the second page. This PR fixes it and also adjust tests to capture the issue appropriately.	2025-01-27 12:23:00 -08:00
Renato Byrro	aa0cd02fe9	Make starred = True by default in Github tool (#227 ) `starred` is a required argument of the `arcade_github.activity.set_starred` tool, but when it is not provided in the tool call, the engine is somehow passing it with a falsy value, instead of raising an error. the falsy value makes the tool unstar a repo by default, which is not the desired behavior. we're setting the `starred` arg to True in the tool interface to prevent that.	2025-01-24 13:50:30 -08:00
Eric Gustin	ca90b31262	Update README and LICENSE (#220 ) Updates README to point to updated URLs --------- Co-authored-by: Nate Barbettini <nate@arcade-ai.com>	2025-01-23 19:43:48 -08:00
Eric Gustin	d5d6942ed1	Weekly Toolkit Release (#222 ) # Relevant PRs Google: #207 Spotify: #204   Slack: #162 Also relaxes arcade-ai dependency for all toolkits	2025-01-23 18:46:05 -08:00
Eric Gustin	66e54d7cde	Slack Tools (#162 ) implements additional tools for Slack related to retrieving conversations metadata, list of members, history of messages, as well as sending messages to private/public channels and DMs / multi-person DMs. --------- Co-authored-by: Eric Gustin <eric@arcade-ai.com> Co-authored-by: Renato Byrro <rmbyrro@gmail.com>	2025-01-23 18:15:52 -08:00
Eric Gustin	48c9870eac	Removed unused param (#207 ) # PR Description For the `Google.GetThread` tool, we had a parameter named `metadata_headers`. This parameter only makes a difference if the format is "metadata", but the tool will never have the format "metadata". So, the input parameter is useless. This parameter should have never been added to the tool and we should remove it before public beta.	2025-01-16 09:41:26 -08:00
Eric Gustin	e314ac5ed5	Remove deprecated eval imports (#206 ) # PR Description Continuation of PR #196	2025-01-15 17:40:25 -08:00
Renato Byrro	62327c30a7	Pytests for Spotify tools (#204 )	2025-01-14 12:45:32 -08:00
Eric Gustin	8795871d51	Check if toolkit version changed before attempting publish (#198 ) # PR Description Changes to a toolkit without changes to the toolkit's version fail the 'Publish Toolkit' workflow with `HTTP Error 400: File already exists ('arcade_zoom-0.1.7.tar.gz', with blake2_256 hash '02183cda607f06616e7edb17e3d22bc11d1d83b074b3e44066b78ec72602fb37'). See https://pypi.org/help/#file-name-reuse for more information.`, for example. This PR adds the `--skip-existing` flag to `poetry publish` to avoid attempting to publish an existing version. Skips slack notification if publish is skipped. The `grep`'d string comes from https://github.com/python-poetry/poetry/blob/main/src/poetry/publishing/uploader.py#L246-L249	2025-01-13 10:00:24 -08:00
Renato Byrro	c5e29693e7	Remove tools relying on deprecated Spotify endpoints (#196 ) Spotify deprecated several endpoints in Nov, 2024. Two of them were being used in Tracks tools. We're removing those from the Toolkit. Spotify announcement: https://developer.spotify.com/blog/2024-11-27-changes-to-the-web-api Archive: https://archive.is/LMBe5	2025-01-08 16:40:36 -03:00
Renato Byrro	cd837a363d	Separate tools & helper funcs in separate files (#192 ) Separates utility and helper functions, as well as constant values (e.g. base URLs), in dedicated files, apart from tools files.	2025-01-08 16:39:25 -03:00
Renato Byrro	cd1fb648bd	Return media attachments metadata when retrieving X tweets (#191 ) Modifies X tweet tools to return metadata about media attachments (photo, GIF or video) when retrieving a tweet by ID, username or keywords. The tool will always return media attachments by default. Since it's only metadata, it shouldn't add significant network overhead to existing implementations of the tool. My guess is more often than not people will want this info included. When not needed, it doesn't hurt to include by default. It'd be annoying to have to ask the LLM to include it every time they need.	2025-01-08 16:36:43 -03:00
Eric Gustin	b2bdfe2459	Least Privileged Scope for Update Calendar (#195 )	2025-01-08 10:03:04 -08:00
Eric Gustin	d5067af023	Bump toolkit versions (#194 )	2025-01-07 13:32:36 -08:00
Eric Gustin	feb83c95ca	Pin poetry to 1.8.5 (#193 ) # PR Description Poetry released v2 with many breaking changes a couple days ago. The `install-poetry` action that our workflows use default to that v2 version, so many of our workflows are failing. This PR forces that action to use poetry version 1.8.5 and also uses 1.8.5 for toolkits A ticket to migrate to 2.0.0 has been filed for future work	2025-01-07 13:21:55 -08:00
Eric Gustin	ab889f9f1d	Lint all toolkits (#183 ) # PR Description * Adds/updates the following files to all toolkits: - `.pre-commit-config.yaml` - `.ruff.toml` - `LICENSE` - `Makefile` - `pyproject.toml` * Lint all toolkits such that they pass `make check` and `make test` (a total doozy). This includes adding some unit tests and evals. * Github workflow for testing toolkits before merge into main (courtesy of @sdreyer) * Added a QOL improvement for tool developers for when they need to get the context's auth token. * Minor updates to `arcade new` template.	2024-12-20 09:49:45 -08:00
Sterling Dreyer	950a8600f8	Remove lock flag from check (#181 )	2024-12-19 11:36:32 -08:00
Sterling Dreyer	1512d0699e	Testing for Math Toolkit (#180 )	2024-12-19 11:28:03 -08:00
Sterling Dreyer	70faf7af5a	Test Version Override (#179 ) Testing to make sure pypi versions don't override. This should fail	2024-12-19 11:13:18 -08:00
Eric Gustin	7c228a59d5	Update Evals SDK (#175 ) # PR Description This PR renames `ExpectedToolCall` to `NamedExpectedToolCall` and then creates a new dataclass called `ExpectedToolCall`. `ExpectedToolCall` can be passed to the `EvalSuite.add_case` and `EvalSuite.extend_case` methods. 1. Enhance `EvalSuite.add_case` and `EvalSuite.extend_case` by accepting a list of `ExpectedToolCall` as their `expected_tool_calls` input parameter. This helps create a scaffolding for developers. Previously, the expected type was `list[tuple[Callable, dict[str, Any]]]`, which is still valid for backward compatibility. ```python # Before (still valid for backward compatibility) expected_tool_calls=[ ( adjust_playback_position, { "absolute_position_ms": 10000, }, ) ] # After expected_tool_calls=[ ExpectedToolCall( func=adjust_playback_position, args={"absolute_position_ms": 10000}, ) ] ``` 2. Removed any references to arcade.core in toolkits directory. 3. Some linting for import organization.	2024-12-19 10:29:13 -08:00
Eric Gustin	02eee63884	X Toolkit: Support tweets longer than 280 characters (#171 ) # PR Description * Update `search_recent_tweets_by_username`, `search_recent_tweets_by_keywords`, and `lookup_tweet_by_id` to support long tweets. Previously, only the first 280 characters of the tweet's text were returned by the tool.	2024-12-13 09:04:50 -08:00
Eric Gustin	00d5babcd7	Add next_token to X Search tools (#169 ) # PR Description Adds an optional `next_token` input parameter to the `X.SearchRecentTweetsByUsername` and `X.SearchRecentTweetsByKeywords` tools. This allows users to paginate through tweets. A `next_token` is provided in the tools's response. For example, to access the `next_token` when using the `tools.execute`, you can do `next_token = response.output.value["meta"].get("next_token", None)` and then pass it to the tool on your next call through the tools' `next_token` input parameter.	2024-12-10 12:35:20 -08:00
Sam Partee	bebfcab1e9	Add `lookup_tweet_by_id` to X Toolkit (#165 ) This PR introduces the `lookup_tweet_by_id` tool to the X toolkit, enabling users to retrieve tweet details by tweet ID. This enhancement extends the toolkit's capabilities, allowing for more comprehensive interactions with the X (Twitter) API. Key Changes: - Added `lookup_tweet_by_id` Tool: - Implemented the `lookup_tweet_by_id` function in `tools/tweets.py`, which allows users to fetch tweet information using a tweet ID. - Included error handling for API response codes and expanded URLs in tweets to assist language models in avoiding hallucinations due to shortened URLs. - Enhanced Toolkit Structure: - Added several configuration files to the X toolkit to establish a standardized project structure, which in the future will be generated by `arcade new`. These include: - `.pre-commit-config.yaml`: Defines pre-commit hooks for code quality checks. - `.ruff.toml`: Configuration for the Ruff linter. - `LICENSE`: MIT License file for the toolkit. - `Makefile`: Contains common commands for building, testing, and linting the toolkit. - Updated Makefile: - Added `make check-toolkits` command to the top-level `Makefile`. This command runs code quality tools for each toolkit that contains a `Makefile`. Additional Notes: - Tests: - Added unit tests for the new `lookup_tweet_by_id` tool in `tests/test_tweets.py`. - Included tests for the user lookup functionality in `tests/test_users.py`. - Linting and Code Quality: - Configured pre-commit hooks and Ruff linter to enforce code standards. - Updated the `pyproject.toml` file with development dependencies for testing and linting. - --------- Co-authored-by: Eric Gustin <eric@arcade-ai.com>	2024-11-27 17:07:12 -08:00
Eric Gustin	2798cc0820	Add Gmail Thread Tools (#159 ) # PR Description 1. This PR adds three new tools: - GetThread (by ID) - ListThreads - SearchThreads 2. This PR updates the return type for various Gmail tools from str to dict. 3. This PR adds evals and tests for the added tools	2024-11-20 11:26:09 -08:00
Eric Gustin	8b46e4f7f9	Add Code Sandbox Tools (#114 ) # PR Description This PR creates a new toolkit called CodeSandbox. This toolkit has two tools: 1. `RunCode`: Creates an E2B sandbox and runs the provided code in that sandbox. Returns the execution logs, result, and errors. Supports Python, JavaScript, R, Java, and Bash code. 2. `CreateStaticMatplotlibChart`: Creates a sandbox, runs the provided python code that uses matplotlib, and returns the base64 encoded image of the chart along with any logs or errors. - I recommend not using `tool_choice="generate"` since the return object contains a base64 image can be a lot of tokens that will not provide much value to a generate's response. Example of creating a pie chart: ```python import base64 import json import os from openai import OpenAI def call_tool_with_openai(client: OpenAI) -> dict: response = client.chat.completions.create( messages=[ { "role": "user", "content": "There are 17 red apples, 4 green apples, and 10 yellow apples. Create a pie chart for this data.", }, ], model="gpt-4o-mini", user="you@example.com", tools=["CodeSandbox.CreateStaticMatplotlibChart"], tool_choice="execute", ) return response arcade_api_key = os.environ.get("ARCADE_API_KEY") cloud_host = "http://localhost:9099/v1" openai_client = OpenAI( api_key=arcade_api_key, base_url=cloud_host, ) chat_result = call_tool_with_openai(openai_client) tool_call_id = chat_result.choices[0].message.tool_calls[0].id content = json.loads(chat_result.choices[0].message.content) base64_image = content[tool_call_id]["value"]["base64_image"] image_data = base64.b64decode(base64_image) with open("output_image.png", "wb") as image_file: image_file.write(image_data) ```	2024-11-15 13:29:52 -08:00
Eric Gustin	081865733a	Add examples (#136 ) ## PR Description This PR adds 7 examples. * `call_a_tool_directly_with_auth.py` - Simple example that uses Arcade client to execute a tool that lists Gmail emails * `call_a_tool_directly.py` - Simple example that uses Arcade client to execute a tool that adds two numbers together * `call_a_tool_with_llm.py` - Simple example that uses the LLM api to star the arcade-ai repository * `get_auth_token.py` - Simple example that gets a Google auth token and then calls the Google API * `call_multiple_tools_directly_with_auth.py` - A more involved example that directly calls multiple spotify tools sequentially * `call_multiple_tools_with_llm.py` - A more involved example that uses an llm to call multiple spotify tools sequentially * `simple_chatbot.py` - Simple chatbot that uses arcade tools and has history --------- Co-authored-by: Nate Barbettini <nathanaelb@gmail.com>	2024-11-06 11:02:41 -08:00
Eric Gustin	6d1bc6c084	Random int and random float tools (#148 ) As requested by D&D fans	2024-11-06 09:28:05 -08:00
Eric Gustin	bc393db305	Return success message if playback is altered (#145 ) # PR Description Previously, if a tool adjusted the playback state, then the tool would return the current playback state after the modification had occurred. The problem with this approach was that Spotify would not update the playback state in time (sometimes), so the tools were returning stale data!	2024-11-04 17:22:10 -08:00
Eric Gustin	efee9589fa	More Spotify Tools (#140 ) # PR Description This PR adds three new spotify tools that are natural language friendly. 1. `search` - Search Spotify Catalog information 2. `play_artist_by_name` - Gets 5 songs by the specified artist and plays them. Uses `search`, and `start_tracks_playback_by_id` under the hood 3. `play_track_by_name` - Plays the specified song, optionally provide the artist name who plays the song. Uses `search`, and `start_tracks_playback_by_id` under the hood	2024-11-01 13:15:43 -07:00
Eric Gustin	c8e686c04e	Add tools to Spotify Toolkit (#132 ) # PR Description 1. `adjust_playback_position` - Adjust the playback position within the currently playing track 2. `skip_to_previous_track` - Skip to the previous track in the user's queue, if any 3. `skip_to_next_track` - Skip to the next track in the user's queue, if any 4. `pause_playback` - Pause the currently playing track, if any 5. `resume_playback` - Resume the currently playing track, if any 6. `start_tracks_playback_by_id` - Start playback of a list of tracks (songs) 7. `get_playback_state` - Get information about the user's current playback state, including track or episode, and active device 8. `get_currently_playing` - Get information about the user's currently playing track 9. `get_track_from_id` - Get information about a track 10. `get_recommendations` - Get track (song) recommendations based on seed artists, genres, and tracks, and multiple target audio stats 11. `get_tracks_audio_features` - Get audio features for a list of tracks (songs) ---------------------- My favorite feature of this toolkit is 1. Start playing my favorite song 2. Get the song that I'm currently playing 3. Get audio features of that song 4. Ask for recommended songs that are similar to it 5. Jam out ------------	2024-10-30 18:26:39 -07:00
Eric Gustin	ddaeb4db53	Add list stargazers tool (#130 ) # PR Description As a celebration for arcade-ai becoming open sourced, this PR adds a tool to list the stargazers for a particular repository ### Example `arcade chat -h localhost` usage: ![image](https://github.com/user-attachments/assets/c4ba9ce6-d3ec-461b-b356-72e78a09249b)	2024-10-30 18:13:36 -07:00

1 2

91 commits