Sozio Business
Sozio Business is a standalone local MCP server for combining private company data with 22,635 profiled Swiss open-data resources.
README
Sozio Business
Sozio Business is a standalone local MCP server for combining private company data with 22,635 profiled Swiss open-data resources. It extends Sozio Full with private imports, local profiling, reviewed join mappings, output controls, and the synthetic AlpenMarkt AG demonstration.
Boundary
The server contains no language model and makes no internal LLM calls. Claude, Codex, or another MCP client performs research design, SQL generation, review, method selection, interpretation, and writing. Sozio Business performs:
- local public/private catalog search;
- private CSV, JSON, Parquet, XLSX, and SQLite import;
- local Parquet normalization and profiling;
- automatic private/public join suggestions with explicit confirmation;
- reviewed private/public join mappings and a local JSONL audit trail;
- read-only DuckDB SQL and joins;
- scipy tests and statsmodels regressions;
- OpenAlex bibliographic retrieval;
- local scientific reports and reproduction bundles.
Private files, normalized data, profiles, mappings, and reports remain in the configured local state directory. Public providers and OpenAlex receive normal HTTP requests when their data is requested.
Privacy modes
SOZIO_BUSINESS_PRIVACY_MODE accepts:
standard: schema, samples, distinct values, and result rows are available;protected(default): private samples and distinct values are suppressed, queries referencing sensitive columns are blocked, result rows are capped, and groups below the minimum size are blocked;local_only: applies protected output controls and records that a local model is required. The MCP server cannot technically verify the client's model.
Additional settings:
$env:SOZIO_BUSINESS_MINIMUM_GROUP_SIZE = "5"
$env:SOZIO_BUSINESS_MAXIMUM_PRIVATE_ROWS = "100"
Protected mode reduces accidental disclosure but is not a complete legal or statistical disclosure-control system. Real personal data requires a documented governance and security review.
Development
uv sync --extra dev
uv run sozio-business doctor
uv run sozio-business setup-demo
uv run sozio-business mcp-server
MCP Bundle
manifest.json uses the MCPB 0.4 UV runtime, so the host resolves the locked
Python dependencies, including compiled DuckDB/statistics packages. Build the
installable bundle with:
npx --yes @anthropic-ai/mcpb validate
npx --yes @anthropic-ai/mcpb pack
The bundle includes the complete local public profile catalog. Private imports,
profiles, normalized files, reports, mappings, and audit.jsonl are written to
the state directory selected during installation.
setup-demo generates and imports five synthetic AlpenMarkt datasets:
| Resource | Format | Grain |
|---|---|---|
| stores | XLSX | one row per store |
| sales_daily | CSV | store, date, product group |
| employees_monthly | Parquet | store and month |
| campaigns | JSON | one campaign |
| energy_monthly | SQLite | store and month |
The company and all records are synthetic. manager_email uses the reserved
.invalid domain and demonstrates sensitive-column controls.
Recommended demonstration
Ask the MCP client:
- Set up AlpenMarkt AG and list the private profiles.
- Find a public municipal population resource that joins through the BFS municipality number.
- Inspect and explicitly confirm the join grain and key types.
- Calculate store revenue per resident by municipality.
- Estimate an OLS model for daily revenue using sales area, discount, weekend, campaign, and product group.
- Explain assumptions and limitations, then create a scientific report and reproduction bundle.
See examples/ALPENMARKT_ANALYSES.md for detailed workflows.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.