textin mcp

Local 2025-09-01 00:43:19 0

A server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.


TextIn MCP Server is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.

TextIn MCP Server

Tools

  • recognition_text
  • Text recognition from images, Word documents, and PDF files.
  • Input: path (string), the path to the file
  • Return: Text of the document.

  • doc_to_markdown
  • Convert images, PDFs, and Word documents to Markdown.
  • Input: path (string), the path to the file
  • Return: Markdown of the document.

  • general_information_extration
  • Automatically and intelligently extract key information from documents.
  • Input: path (string), the path to the file
  • Return: The key information as JSON.
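
As a rough illustration of how a client might invoke one of the tools above, the sketch below builds the JSON-RPC 2.0 `tools/call` message defined by the Model Context Protocol. The helper name `buildToolCall`, the request id, and the file path are hypothetical and not part of this server. The tool names are copied exactly as the server lists them, including the spelling of `general_information_extration`.

```typescript
// Tool names as listed in this README (spelling kept exactly as exposed).
const TEXTIN_TOOLS = [
  "recognition_text",
  "doc_to_markdown",
  "general_information_extration",
] as const;

type TextinTool = (typeof TEXTIN_TOOLS)[number];

interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: TextinTool; arguments: { path: string } };
}

// Hypothetical helper: builds the message an MCP client would send
// to call one of the three tools with its single `path` argument.
function buildToolCall(tool: TextinTool, path: string, id: number): ToolCallRequest {
  if (path.trim() === "") {
    throw new Error("path must be a non-empty file path");
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: tool, arguments: { path } },
  };
}

// Example: ask the server to convert a (placeholder) PDF to Markdown.
const markdownCall = buildToolCall("doc_to_markdown", "/path/to/document.pdf", 1);
console.log(JSON.stringify(markdownCall, null, 2));
```

In practice an MCP client library or host application usually constructs this message for you; the sketch only shows what the call looks like on the wire.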

Setup

APP_ID and APP_SECRET

Click here to register for a TextIn account.

Get your TextIn APP_ID and APP_SECRET by following the instructions here.

NPX

{
  "mcpServers": {
    "textin-ocr": {
      "command": "npx",
      "args": [
        "-y",
        "@intsig/server-textin"
      ],
      "env": {
        "APP_ID": "<YOUR_APP_ID>",
        "APP_SECRET": "<YOUR_APP_SECRET>",
        "MCP_SERVER_REQUEST_TIMEOUT": "600000"
      },
      "timeout": 600
    }
  }
}
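
If you prefer to generate this configuration from a script rather than edit it by hand, a minimal sketch might look like the following. The helpers `requireCredential` and `buildTextinConfig` are hypothetical; the keys and values simply mirror the NPX configuration above, and the credentials in the example are placeholders.

```typescript
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
  timeout: number;
}

interface McpConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Reject empty values and "<YOUR_...>" placeholders left over from the template.
function requireCredential(label: string, value: string): string {
  if (value.trim() === "" || value.charAt(0) === "<") {
    throw new Error(label + " is not set");
  }
  return value;
}

// Hypothetical helper: returns the same structure as the NPX configuration.
function buildTextinConfig(appId: string, appSecret: string): McpConfig {
  return {
    mcpServers: {
      "textin-ocr": {
        command: "npx",
        args: ["-y", "@intsig/server-textin"],
        env: {
          APP_ID: requireCredential("APP_ID", appId),
          APP_SECRET: requireCredential("APP_SECRET", appSecret),
          MCP_SERVER_REQUEST_TIMEOUT: "600000",
        },
        timeout: 600,
      },
    },
  };
}

// Placeholder credentials for illustration only; substitute your own.
const exampleConfig = buildTextinConfig("example-app-id", "example-app-secret");
console.log(JSON.stringify(exampleConfig, null, 2));
```

Failing early on an unfilled placeholder avoids starting the server with credentials that TextIn would reject.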

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

[
  {
    "description": "Text recognition from images, Word documents, and PDF files.",
    "inputSchema": {
      "properties": {
        "path": {
          "description": "Read the complete contents of a file from the file system. ",
          "format": "file-path",
          "type": "string"
        }
      },
      "required": [
        "path"
      ],
      "type": "object"
    },
    "name": "recognition_text"
  },
  {
    "description": "Automatically and intelligently extract key information from documents.",
    "inputSchema": {
      "properties": {
        "path": {
          "description": "Read the complete contents of a file from the file system. ",
          "format": "file-path",
          "type": "string"
        }
      },
      "required": [
        "path"
      ],
      "type": "object"
    },
    "name": "general_information_extration"
  },
  {
    "description": "Convert images, PDFs, and Word documents to Markdown.",
    "inputSchema": {
      "properties": {
        "path": {
          "description": "Read the complete contents of a file from the file system. ",
          "format": "file-path",
          "type": "string"
        }
      },
      "required": [
        "path"
      ],
      "type": "object"
    },
    "name": "doc_to_markdown"
  }
]
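
All three tools in the schema above share the same input shape: an object with one required string property, `path`. The sketch below shows one way a client could check arguments against that shape before sending a call. The names `ToolInputSchema`, `pathSchema`, and `checkArguments` are hypothetical, and the check covers only the `required` list and primitive `type` values, not full JSON Schema validation.

```typescript
interface ToolInputSchema {
  type: "object";
  properties: Record<string, { type: string; format?: string; description?: string }>;
  required: string[];
}

// The input schema shared by the three tools, copied from the listing above.
const pathSchema: ToolInputSchema = {
  type: "object",
  properties: { path: { type: "string", format: "file-path" } },
  required: ["path"],
};

// Hypothetical helper: returns a list of problems, empty when the arguments pass.
// Simplification: compares `typeof` against the schema type, which is only
// adequate for primitive types such as "string".
function checkArguments(schema: ToolInputSchema, args: Record<string, unknown>): string[] {
  const problems: string[] = [];
  for (const key of schema.required) {
    if (!(key in args)) {
      problems.push(`missing required property "${key}"`);
    }
  }
  for (const key of Object.keys(args)) {
    const prop = schema.properties[key];
    if (prop !== undefined && typeof args[key] !== prop.type) {
      problems.push(`property "${key}" should be of type ${prop.type}`);
    }
  }
  return problems;
}

console.log(checkArguments(pathSchema, { path: "/path/to/invoice.pdf" })); // prints []
console.log(checkArguments(pathSchema, {}));
```

For anything beyond this single-property schema, a full JSON Schema validator would be the safer choice.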