Skip to content

A Model Context Protocol (MCP) server that helps large language models index, search, and analyze code repositories with minimal setup

License

Notifications You must be signed in to change notification settings

johnhuang316/code-index-mcp

Repository files navigation

Code Index MCP

Code Index MCP is a Model Context Protocol server that enables large language models (LLMs) to index, search, and analyze code in project directories.

Features

  • Index and navigate project file structures
  • Search for specific patterns in code
  • Get detailed file summaries
  • Analyze code structure and complexity
  • Support for multiple programming languages
  • Persistent storage of project settings

Installation

This project uses uv for environment management and dependency installation.

  1. Ensure you have Python 3.10 or later installed

  2. Install uv (recommended):

    # Windows
    powershell -c "irm https://astral.sh/uv/install.ps1 | iex"
    # macOS/Linux
    curl -LsSf https://astral.sh/uv/install.sh | sh
  3. Getting the code:

    # Clone the repository
    git clone https://github.com/your-username/code-index-mcp.git

Usage

Running the Server Directly

# Run directly with uv - no additional dependency installation needed
uv run run.py

UV will automatically handle all dependency installations based on the project's configuration.

Using Docker

You can also use Code Index MCP as a containerized tool with Docker:

# Build the Docker image
docker build -t code-index-mcp .

# Use the container as a tool to analyze your code
docker run --rm -i code-index-mcp

This containerized approach works well with Claude Desktop, which treats MCP servers as on-demand processes rather than persistent servers. Claude Desktop will start the container when needed and communicate with it via stdio, keeping it running only for the duration of the session.

When using the containerized version, you'll need to set the project path explicitly using the set_project_path tool, just like in the non-containerized version.

Integrating with Claude Desktop

You can easily integrate Code Index MCP with Claude Desktop:

Option 1: Using UV (Direct Installation)

  1. Ensure you have UV installed (see installation section above)

  2. Find or create the Claude Desktop configuration file:

    • Windows: %APPDATA%\Claude\claude_desktop_config.json
    • macOS/Linux: ~/Library/Application Support/Claude/claude_desktop_config.json
  3. Add the following configuration (replace with your actual path):

    For Windows:

    {
      "mcpServers": {
        "code-indexer": {
          "command": "uvx",
          "args": [
             "C:\\Users\\username\\path\\to\\code-index-mcp"
           ]
        }
      }
    }

    For macOS/Linux:

    {
      "mcpServers": {
        "code-indexer": {
          "command": "uvx",
          "args": [
             "/Users/username/path/to/code-index-mcp"
           ]
        }
      }
    }

    Note: The tool can also be run directly with uv run run.py which may be useful for development and debugging.

Option 2: Using Docker (Containerized)

  1. Build the Docker image as described in the Docker section above

  2. Find or create the Claude Desktop configuration file (same locations as above)

  3. Add the following configuration:

    {
      "mcpServers": {
        "code-indexer": {
          "command": "docker",
          "args": [
             "run",
             "-i",
             "--rm",
             "code-index-mcp"
           ]
        }
      }
    }

    Note: This configuration allows Claude Desktop to start the containerized MCP tool on demand.

  4. Restart Claude Desktop to use Code Indexer for analyzing code projects

Claude Desktop will start the MCP server as an on-demand process when needed, communicate with it via stdio, and keep it running only for the duration of your session.

Basic Workflow

  1. Set Project Path (required first step):

    • When using for the first time, you must set the project path to analyze
    • Through Claude command: "I need to analyze a project, help me set up the project path"
    • Provide the complete project directory path
  2. Code Search:

    • Search for specific keywords or patterns: "Search for 'function name' in the project"
    • Filter by file type: "Search for 'import' in all .py files"
  3. File Analysis:

    • Analyze specific files: "Analyze the file src/main.py"
    • Get file summaries: "Give me a list of functions in utils/helpers.js"
  4. Project Navigation:

    • View project structure: "Show me the structure of this project"
    • Find files matching specific patterns: "Find all test_*.py files"

Technical Details

Persistent Storage

All index and settings data are stored in the system temporary directory, in a subfolder specific to each project:

  • Windows: %TEMP%\code_indexer\[project_hash]\
  • Linux/macOS: /tmp/code_indexer/[project_hash]/

Each project's data includes:

  • config.json: Project configuration information
  • file_index.pickle: File index data
  • content_cache.pickle: File content cache

This approach ensures that:

  1. Different projects' data are kept separate
  2. The data is automatically cleaned up by the OS when no longer needed
  3. In containerized environments, the data is stored in the container's temporary directory

Dependency Management with UV

Code Index MCP uses UV for dependency management, which provides several advantages:

  • Automatic dependency resolution based on project requirements
  • Faster package installation and environment setup
  • Consistent dependency versions via the lock file

Supported File Types

The following file types are currently supported for indexing and analysis:

  • Python (.py)
  • JavaScript/TypeScript (.js, .ts, .jsx, .tsx)
  • Java (.java)
  • C/C++ (.c, .cpp, .h, .hpp)
  • C# (.cs)
  • Go (.go)
  • Ruby (.rb)
  • PHP (.php)
  • Swift (.swift)
  • Kotlin (.kt)
  • Rust (.rs)
  • Scala (.scala)
  • Shell (.sh, .bash)
  • HTML/CSS (.html, .css, .scss)
  • Markdown (.md)
  • JSON (.json)
  • XML (.xml)
  • YAML (.yml, .yaml)

Security Considerations

  • File path validation prevents directory traversal attacks
  • Absolute path access is not allowed
  • Project path must be explicitly set, with no default value
  • Index data is stored in the system temporary directory, not in the project directory
  • Each project's data is stored in a separate directory based on the project path's hash

Contributing

Contributions via issues or pull requests to add new features or fix bugs are welcome.


For documentation in Chinese, please see README_zh.md.

About

A Model Context Protocol (MCP) server that helps large language models index, search, and analyze code repositories with minimal setup

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 10