MCP Servers

LM Studio MCP Server

Enables AI assistants to manage LM Studio models, including listing, loading, and unloading models through the LM Studio API.

README

LM Studio MCP Server

An MCP (Model Context Protocol) server that provides AI assistants with control over LM Studio models. This server enables remote model management including listing, loading, and unloading models through the LM Studio API.

Features

Health Check: Verify connectivity to LM Studio
List Downloaded Models: View all LLM models available in your LM Studio library
List Loaded Models: See which models are currently loaded in memory
Load Models: Load models into memory with configurable parameters
Unload Models: Remove specific model instances from memory
Get Model Info: Retrieve detailed information about loaded models

Prerequisites

Node.js 18.0.0 or higher
LM Studio running with the local server enabled

Installation

# Clone the repository
git clone <repository-url>
cd lm-studio-mcp-server

# Install dependencies
npm install

Configuration

The server connects to LM Studio using environment variables:

Variable	Default	Description
`LMSTUDIO_BASE_URL`	(derived)	Full WebSocket URL for LM Studio
`LMSTUDIO_HOST`	`127.0.0.1`	LM Studio host (used if BASE_URL not set)
`LMSTUDIO_PORT`	`1234`	LM Studio port (used if BASE_URL not set)

Usage

Running Modes

Development (uses tsx for TypeScript execution):

npm start
# or with file watching
npm run dev

Production (uses compiled JavaScript):

npm run build
npm run start:prod

Docker:

# Pull the published image
docker pull portertech/lm-studio-mcp-server:latest

# Run (connects to LM Studio on host machine)
docker run -i --rm portertech/lm-studio-mcp-server:latest

# Run with custom LM Studio host
docker run -i --rm \
  -e LMSTUDIO_HOST=192.168.1.100 \
  -e LMSTUDIO_PORT=1234 \
  portertech/lm-studio-mcp-server:latest

MCP Client Configuration

Claude

Add to your claude_desktop_config.json:

Using npx (recommended for installed packages):

{
  "mcpServers": {
    "lmstudio": {
      "command": "npx",
      "args": ["@portertech/lm-studio-mcp-server"],
      "env": {
        "LMSTUDIO_HOST": "127.0.0.1",
        "LMSTUDIO_PORT": "1234"
      }
    }
  }
}

Using local development:

{
  "mcpServers": {
    "lmstudio": {
      "command": "npx",
      "args": ["tsx", "/path/to/lm-studio-mcp-server/src/index.ts"],
      "env": {
        "LMSTUDIO_HOST": "127.0.0.1",
        "LMSTUDIO_PORT": "1234"
      }
    }
  }
}

Using Docker:

{
  "mcpServers": {
    "lmstudio": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e",
        "LMSTUDIO_HOST=127.0.0.1",
        "-e",
        "LMSTUDIO_PORT=1234",
        "portertech/lm-studio-mcp-server:latest"
      ]
    }
  }
}

Note: For Docker on macOS/Windows connecting to LM Studio on the host machine, use LMSTUDIO_HOST=host.docker.internal.

Available Tools

All tools return a consistent response envelope:

{
  success: boolean;
  message: string;
  data?: T;           // Present on success
  error?: {           // Present on failure
    code: string;
    message: string;
  };
}

Error Codes

Code	Description
`MODEL_NOT_FOUND`	Requested model does not exist
`MODEL_NOT_LOADED`	Model is not currently loaded
`CONNECTION_FAILED`	Cannot connect to LM Studio
`INVALID_INPUT`	Invalid parameters provided
`LOAD_FAILED`	Failed to load model
`UNLOAD_FAILED`	Failed to unload model
`UNKNOWN`	Unexpected error

`health_check`

Check connectivity to LM Studio server.

Parameters: None

Returns: Connection status and base URL

`list_models`

List all downloaded LLM models available in LM Studio.

Parameters: None

Returns: Array of model info objects with:

modelKey: Model identifier for loading
path: Relative path to the model
displayName: Human-readable model name
sizeBytes: Size in bytes
architecture: Model architecture (if available)
quantization: Quantization type (if available)

`list_loaded_models`

List all currently loaded models in memory.

Parameters: None

Returns: Array of loaded model info with:

identifier: Instance identifier
modelKey: Model key
path: Model path
displayName: Human-readable name
sizeBytes: Size in bytes
vision: Whether model supports vision
trainedForToolUse: Whether model was trained for tool use

`load_model`

Load a model into memory.

Parameters:

model (required): Model key to load (e.g., llama-3.2-3b-instruct)
identifier (optional): Custom identifier for the loaded instance
contextLength (optional): Context window size in tokens (minimum: 1)
evalBatchSize (optional): Batch size for token processing (minimum: 1)

Returns: Success status with loaded model details (identifier, modelKey, path)

`unload_model`

Unload a model from memory.

Parameters:

identifier (required): Identifier of the loaded model to unload

Returns: Success status

`get_model_info`

Get detailed information about a loaded model.

Parameters:

identifier (required): Identifier of the loaded model

Returns: Model details including identifier, modelKey, path, displayName, sizeBytes, contextLength

Development

# Build the project
npm run build

# Run in development mode with auto-reload
npm run dev

# Type check without emitting
npm run typecheck

# Run tests
npm test

# Lint
npm run lint

# Format
npm run format:check
npm run format

Release Process

The project includes a make release command for automated releases:

# Create a new release (runs CI, sets version, commits, tags, publishes to npm and Docker Hub)
make release VERSION=<version>

# Example:
make release VERSION=1.0.5

This runs the full release pipeline:

CI checks (lint, typecheck, test)
Sets version in package.json
Commits the version bump
Creates an annotated git tag (v<version>)
Publishes to npm
Builds and pushes Docker images to Docker Hub

Project Structure

src/
├── index.ts              # MCP server entry point
├── client.ts             # LM Studio client wrapper
├── types.ts              # Shared types and result helpers
└── tools/
    ├── index.ts          # Tool exports
    ├── health-check.ts   # Health check tool
    ├── list-models.ts    # List downloaded models
    ├── list-loaded-models.ts
    ├── load-model.ts
    ├── unload-model.ts
    └── get-model-info.ts

Architecture

Consistent Results: All tools return the same ToolResult<T> envelope
Safe Wrappers: Tool handlers are wrapped to catch exceptions and return error payloads
Lazy Config: Environment variables are read at runtime, not module load
Singleton Client: Single LM Studio client instance is reused

License

ISC

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured