cstoolbox

Local 2025-09-01 00:59:47 0

CSToolbox provides features like web search, web content crawling, and chart generation via the MCP protocol.

CSToolbox is an extension toolkit for ChatSpeed. It provides features like web search, web content crawling, and chart generation via the MCP protocol.

Features

? Web Search - Supports multiple search engines (Google, Bing, Baidu, etc.)
?️ Web Crawling - Extracts structured content from web pages (Supports Markdown/HTML format)
? Chart Generation - Quickly generates various data visualization charts, supporting line charts, bar charts, and pie charts
? PDF Processing - Downloads documents from PDF URLs and extracts text content

How to Use

MCP Client Configuration

{
  "mcpServers": {
    "cstoolbox": {
      "command": "uvx",
      "args": [
        "cstoolbox"
      ],
      "env": {
        "CS_LOG_LEVEL": "DEBUG",
        "CS_LOG_DIR": "logs",
        "CS_PROXY": "http://localhost:15154",
        "CS_BROWSER_TZ": "Asia/Shanghai",
        "CS_BROWSER_LANG": "zh-CN",
        "CS_REGION": "com",
        "CS_HEADLESS": "true",
        "CS_USER_DATA_DIR": null,
        "CS_BROWSER_TYPE": "chromium",
        "CS_EXECUTABLE_PATH": "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome"
      }
    }
  }
}

Environment Variable Descriptions

CS_LOG_LEVEL: Log level. Possible values are DEBUG, INFO, WARNING, ERROR, CRITICAL. Defaults to INFO.
CS_LOG_DIR: Log directory. Defaults to logs.
CS_PROXY: Proxy server address. A proxy is needed in some regions where search engines like google.com or bing.com are inaccessible. Additionally, some websites have regional restrictions and cannot be accessed without a proxy.
CS_BROWSER_TZ: Timezone. Defaults to Etc/UTC.
CS_BROWSER_LANG: Browser language. Defaults to en-US.
CS_REGION: Search engine region. Possible values include com, cn, us, uk, etc. Defaults to com.
CS_HEADLESS: Whether to enable headless mode. Defaults to true.
CS_BROWSER_TYPE: Browser type. Possible values are chromium, firefox, webkit. Defaults to chromium.
CS_EXECUTABLE_PATH: Browser executable file path. Defaults to empty. You can use this to specify the path to a browser already installed on your system. This allows leveraging the browser's existing state data (like login status, cookies, etc.). If you specify CS_EXECUTABLE_PATH, ensure it matches the CS_BROWSER_TYPE.
CS_USER_DATA_DIR: User data directory. If you specify CS_EXECUTABLE_PATH, it is recommended to set CS_USER_DATA_DIR to the parent directory of the browser's "Profile Path".

How to find Chrome's Executable Path and Profile Path

Open the Chrome browser.
Enter chrome://version/ in the address bar.
On the page, you will find the "Executable Path". It will look similar to /Applications/Google Chrome.app/Contents/MacOS/Google Chrome. This is the path you should set for CS_EXECUTABLE_PATH.
Find the 'Profile Path', which will look similar to /Users/xxx/Library/Application Support/Google/Chrome/Default. The value for CS_USER_DATA_DIR should be /Users/xxx/Library/Application Support/Google/Chrome (note: this path should exclude the final Default part).

⚠️ Important Notes

Ensure the proxy is correctly configured when using google or bing for web searches in certain regions.
It is best not to set CS_EXECUTABLE_PATH to the browser you are currently using, as this can cause conflicts. If you are used to using google chrome, you can install other Chromium-based browsers like Edge or Brave. Conversely, if you are used to using the edge browser, it is highly recommended that you install google chrome.

{
  "mcpServers": {
    "cstoolbox": {
      "command": "uvx",
      "args": [
        "cstoolbox"
      ],
      "env": {
        "CS_PROXY": "http://{your-proxy-server}",
        "CS_HEADLESS": "false",
        "CS_BROWSER_TYPE": "chromium",
        "CS_EXECUTABLE_PATH": "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome",
        "CS_USER_DATA_DIR": "~/Library/Application Support/Google/Chrome"
      }
    }
  }
}

Python Usage Example

For more Python usage examples, please refer to the tests/mcp_client.py file.

from pathlib import Path

from mcp import ClientSession, StdioServerParameters, types
from mcp.client.stdio import stdio_client

project_root = Path(__file__).resolve().parent.parent

# Create server parameters for stdio connection
server_params = StdioServerParameters(
    command="python",
    args=[f"{project_root}/src/cstoolbox/main.py"],  # Path to cstoolbox mcp main.py
    env={
        "CS_PROXY": "http://localhost:15154",
        "CS_BROWSER_TZ": "Asia/Shanghai",
        "CS_BROWSER_LANG": "zh-CN"
    },
)


# Optional: create a sampling callback
async def handle_sampling_message(
    message: types.CreateMessageRequestParams,
) -> types.CreateMessageResult:
    return types.CreateMessageResult(
        role="assistant",
        content=types.TextContent(
            type="text",
            text="Hello, world! from model",
        ),
        model="gpt-3.5-turbo",
        stopReason="endTurn",
    )


async def run():
    async with stdio_client(server_params) as (read, write):
        async with ClientSession(read, write, sampling_callback=handle_sampling_message) as session:
            # Initialize the connection
            await session.initialize()

            # List available tools
            tools = await session.list_tools()

            # Call a web search tool
            result = await session.call_tool(
                "web_search",
                arguments={"provider": "bing", "kw": "deepseek r2", "number": 10, "page": 1, "time_period": "month"},
            )
            print(result)

if __name__ == "__main__":
    import asyncio

    asyncio.run(run())

Development

Clone the source code repository:

git clone https://github.com/aidyou/cstoolbox.git
cd cstoolbox

Install uv macOS/Linux

curl -LsSf https://astral.sh/uv/install.sh | sh

Windows Use irm to download and install:

powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

Create and activate a venv environment:
```
uv venv --python=python3.12
source .venv/bin/activate
# On Windows use: .venvScriptsactivate
```
(Note: Added Windows activation command hint for completeness)
Install dependencies:
```
uv pip install .
```
Start the test server:
```
mcp dev src/cstoolbox/main.py
```
Now you can perform functional tests via http://127.0.0.1:6274/#tools

HTTP API Testing Add the following configuration to your .vscode/launch.json file:

{
    "version": "0.2.0",
    "configurations": [
        {
            "name": "http dev",
            "type": "debugpy",
            "request": "launch",
            "module": "cstoolbox.http_api",
            "args": [
            ],
            "console": "integratedTerminal",
            "env": {
                "PYTHONPATH": "${workspaceFolder}/src",
                "CS_BROWSER_TZ": "Asia/Shanghai",
                "CS_BROWSER_LANG": "zh-CN",
                "CS_LOG_LEVEL": "DEBUG",
                "CS_PROXY": "http://localhost:15154"
            },
            "python": "${workspaceFolder}/.venv/bin/python"
            // On Windows, adjust python path: "${workspaceFolder}.venvScriptspython.exe"
        }
    ]
}

(Note: Added Windows python path hint for completeness)

Adjust the CS_* settings in the env configuration according to your actual environment. After starting the debug session in VS Code, you can test using the following endpoints:

Search: curl http://localhost:12321/chp/web_search?provider=google&kw=deepseek+r2&number=10&page=1
Content Crawling: curl http://localhost:12321/chp/web_crawler?url=https://medium.com/@lbq999/deepseek-r2-is-around-the-corner-c449a41bfec6

License

This project is open-sourced under the MIT License. You are free to use, modify, and distribute this software.

[
  {
    "description": "Perform web search using the specified search engine provider",
    "inputSchema": {
      "properties": {
        "kw": {
          "description": "Keywords for search",
          "title": "Kw",
          "type": "string"
        },
        "number": {
          "default": 10,
          "description": "Search results per page",
          "maximum": 50,
          "minimum": 1,
          "title": "Number",
          "type": "integer"
        },
        "page": {
          "default": 1,
          "description": "Page number",
          "maximum": 10,
          "minimum": 1,
          "title": "Page",
          "type": "integer"
        },
        "provider": {
          "default": "bing",
          "description": "Name of provider",
          "enum": [
            "google",
            "bing",
            "baidu",
            "google_news",
            "baidu_news"
          ],
          "title": "Provider",
          "type": "string"
        },
        "time_period": {
          "default": "",
          "description": "Time range filter. Default: empty (no time filter).",
          "enum": [
            "day",
            "week",
            "month",
            "year",
            ""
          ],
          "title": "Time Period",
          "type": "string"
        }
      },
      "required": [
        "kw"
      ],
      "title": "web_searchArguments",
      "type": "object"
    },
    "name": "web_search"
  },
  {
    "description": "Extract and return structured data from the provided URL",
    "inputSchema": {
      "properties": {
        "format": {
          "default": "markdown",
          "description": "Data format",
          "enum": [
            "markdown",
            "html"
          ],
          "title": "Format",
          "type": "string"
        },
        "remove_link": {
          "default": true,
          "description": "Whether to remove links from the content",
          "title": "Remove Link",
          "type": "boolean"
        },
        "url": {
          "description": "Url to extract data",
          "title": "Url",
          "type": "string"
        }
      },
      "required": [
        "url"
      ],
      "title": "web_crawlerArguments",
      "type": "object"
    },
    "name": "web_crawler"
  },
  {
    "description": "Download PDF file and extract its textual content",
    "inputSchema": {
      "properties": {
        "url": {
          "description": "URL of the PDF document",
          "title": "Url",
          "type": "string"
        }
      },
      "required": [
        "url"
      ],
      "title": "pdfArguments",
      "type": "object"
    },
    "name": "pdf"
  },
  {
    "description": "Generate data visualization using specified plot type",
    "inputSchema": {
      "properties": {
        "data": {
          "additionalProperties": true,
          "description": "Plot data, e.g., {'x': [1, 2], 'y': [4, 5]} for line or bar, {'labels': ['A', 'B'], 'values': [30, 70]} for pie",
          "title": "Data",
          "type": "object"
        },
        "plot_type": {
          "description": "Type of plot",
          "enum": [
            "line",
            "bar",
            "pie"
          ],
          "title": "Plot Type",
          "type": "string"
        },
        "title": {
          "default": "",
          "description": "Plot title",
          "title": "Title",
          "type": "string"
        },
        "x_label": {
          "default": "",
          "description": "Label for x-axis",
          "title": "X Label",
          "type": "string"
        },
        "y_label": {
          "default": "",
          "description": "Label for y-axis",
          "title": "Y Label",
          "type": "string"
        }
      },
      "required": [
        "plot_type",
        "data"
      ],
      "title": "plotArguments",
      "type": "object"
    },
    "name": "plot"
  }
]