arrivia-recs
Provides AI-driven, partner-aware travel recommendations with auditability, integrating member context and read-only partner policy rules.
README
arrivia Agentic Travel Recommendations API
Internal service for AI-driven, partner-aware travel recommendations that power an "AI Concierge" experience inside partner booking flows. The service combines member context with read-only partner rules and returns recommendations with an audit block so multi-tenant policy decisions stay traceable.
Goals, constraints, and delivery expectations come from the program brief in Prompt.md (maintained outside this repository).
Constraints From The Brief
- Stack: ship inside arrivia's existing containerized footprint. No new infrastructure layer or third-party platform is introduced here.
- Partner configuration is read-only: this service only reads partner rules and enforces whatever the partner config service returns.
- Four-week first step: scope is limited to what one engineer can credibly ship first and own on call.
- On-call ownership: prefer simple behavior, explicit failures, and clear auditability over hidden fallback logic.
- Multi-tenant boundaries: partner isolation matters, and policy decisions must remain reviewable during incident response.
Fastest Reviewer Path
This is the recommended challenge-evaluation path because it avoids host/Docker DNS mismatches and uses the checked-in WireMock stubs.
- Create the Compose env file from the committed template, then start the API plus mocked upstreams:
Copy-Item .env.example .env -Force
docker compose --profile mocks up --build
Docker Compose reads .env for the API container. .env.example is the tracked template; edit .env only if you need local overrides.
v0 deployment topology for first rollout:
- run a single active API replica
- treat session-cap tracking as same-machine shared state only
- Horizontal scale is deferred until the service has a shared distributed session-state layer
- see
docs/examples/v0-rollout.yamlfor the checked-in rollout guardrail
- Verify the service is up:
curl -s http://127.0.0.1:8080/health
curl -s http://127.0.0.1:8080/ready
- Call the primary REST route with the committed demo member and session:
curl -s http://127.0.0.1:8080/v1/recommendations `
-H "Content-Type: application/json" `
-d '{"member_id":"m1","session_id":"review-session-1"}'
Expected reviewer checks for the sample response:
partner_idisp1- the response includes an
auditblock - the mock partner policy excludes cruise offers
- the mock partner policy caps the session at
3recommendations
- Optional shipped CLI demo over the same primary route:
arrivia-recs-demo --member-id m1 --session-id review-session-1
- If you want to inspect the mock fixtures directly, see:
mocks/member-service/mappings/member-m1.jsonmocks/partner-config-service/mappings/partner-p1-policy.json
MCP Quick Start
The MCP server is a stdio process exposed by python -m arrivia_recs.mcp.server.
Canonical reviewer-facing surfaces for this challenge:
- Primary HTTP contract:
POST /v1/recommendations - Primary MCP tool:
get_travel_recommendations
If you run the mocks in Docker but run MCP on the host, point the MCP process at localhost instead of the Compose service hostnames:
python -m venv .venv
.\.venv\Scripts\Activate.ps1
pip install -e ".[dev]"
$env:MEMBER_SERVICE_BASE_URL="http://127.0.0.1:8081"
$env:PARTNER_CONFIG_BASE_URL="http://127.0.0.1:8082"
python -m arrivia_recs.mcp.server
From any MCP-capable client, register the server with this command:
{
"mcpServers": {
"arrivia-recs": {
"command": "python",
"args": ["-m", "arrivia_recs.mcp.server"],
"cwd": "."
}
}
}
Then discover and invoke:
- List tools and confirm
get_travel_recommendationsis available. - Call
get_travel_recommendationswith{"member_id":"m1","session_id":"review-session-1"}. - Confirm the MCP JSON response includes
partner_id,recommendations, andaudit.
The MCP tool forwards to the same RecommendationService used by POST /v1/recommendations, so the main reviewer path is to compare the MCP output with the REST output for the same member_id and session_id.
Reviewer-visible MCP transcript:
docs/examples/mcp-stdio-transcript.md
Repository-backed MCP proof path:
python -m pytest tests/test_mcp_stdio_smoke.py -q
That smoke test spawns the real python -m arrivia_recs.mcp.server stdio process, uses the official mcp client SDK to list tools, and invokes get_travel_recommendations against temporary upstream mock servers. The transcript file above mirrors that proof in reviewer-friendly form.
For v0, REST and MCP session-cap parity is only guaranteed when both are evaluated in the same single-machine rollout topology. Cross-host or horizontally scaled session consistency is intentionally deferred.
Local Setup
Requirements: Python 3.11+. Docker is optional, but the challenge's fastest successful path is Docker plus WireMock.
python -m venv .venv
.\.venv\Scripts\Activate.ps1
pip install -e ".[dev]"
Bare-metal API run:
$env:MEMBER_SERVICE_BASE_URL="http://127.0.0.1:8081"
$env:PARTNER_CONFIG_BASE_URL="http://127.0.0.1:8082"
python -m uvicorn arrivia_recs.main:app --reload --host 127.0.0.1 --port 8080
The .env.example defaults point at Compose-only hostnames (member-service and partner-config-service). Those names do not resolve on bare metal unless you override them as shown above.
Minimal CLI demo against a running API:
arrivia-recs-demo --member-id m1 --session-id review-session-1 --base-url http://127.0.0.1:8080
API-only Docker path:
Copy-Item .env.example .env -Force
docker compose up --build
Reviewer-complete Docker path with mocks:
Copy-Item .env.example .env -Force
docker compose --profile mocks up --build
Quality Checks
pytest
ruff check .
python -m compileall src tests
Judge Proof
This submission is intentionally scoped to a single-active-replica v0. It does not claim horizontally scaled session-cap consistency.
- v0 runs as one active replica only
- same-machine session-cap parity is shipped
- cross-host or multi-replica consistency is intentionally later work
Copy-paste proof command:
python -m pytest tests/test_mcp_stdio_smoke.py tests/test_scope_contracts.py -q
Reviewer map:
docs/examples/judge-proof.md
What Ships In v0
- FastAPI service with
/health,/ready, andPOST /v1/recommendations - Read-only member and partner-policy adapters backed by WireMock for local development
- Recommendation flow that applies partner caps and exclusions with an auditable
auditblock - MCP server that exposes
get_travel_recommendations - Docker-based local run path and Python package layout under
src/arrivia_recs/ - Minimal CLI (
arrivia-recs-demo) for end-to-end reviewer smoke tests against the primary API - Explicit failure handling for invalid input and upstream failures on the primary REST surface
- Single-active-replica rollout with same-machine session-cap semantics for reviewer and first-production use
- Checked-in rollout guardrail at
docs/examples/v0-rollout.yaml
What Comes Later
- Live upstream authn/authz and production secrets management
- Shared distributed session-budget state for horizontal scale across hosts/replicas
- Richer ranking, inventory-aware offers, and experimentation support
- Production telemetry, SLOs, and deployment manifests tuned for arrivia environments
Four-Week Delivery Plan
| Week | Focus | Exit criteria |
|---|---|---|
| Week 1 | Foundation: repo layout, settings, health/readiness, Docker/Compose, env docs | App boots locally and /health plus /ready respond |
| Week 2 | Core REST plus policy: member and partner reads, caps/exclusions, auditable responses | POST /v1/recommendations respects partner rules and tests cover policy edges |
| Week 3 | MCP: expose recommendation tooling to agent clients | Agent can discover get_travel_recommendations and invoke it against a running stack |
| Week 4 | Hardening plus handoff: docs, failure-path polish, quality pass | pytest, ruff, and compile checks pass and the operator path is documented |
Ralphy And Codex Workflow
This repository keeps the implementation plan in YAML so Ralphy can drive simultaneous multi-agent execution with the Codex engine.
Prerequisites:
npm install -g ralphy-cli- an authenticated Codex CLI or Codex desktop session available to Ralphy
Challenge-review workflow example:
ralphy --codex --yaml docs/examples/codex-ralphy-review.yaml --parallel --max-parallel 3 -v
What that proves for the submission:
- orchestration is declared in checked-in YAML
- Ralphy runs multiple review tracks simultaneously via
parallel_group - Codex is the execution engine for those agents
- the workflow converges on one verification pass before submission
That example YAML is reviewer-oriented rather than internal-only. It shows the minimum parallel review loop needed to inspect prompt compliance, runtime behavior, and proof artifacts before final verification.
Phase-by-phase project workflow:
ralphy --codex --yaml prd/01-plan.yaml --parallel --max-parallel 3 -v
ralphy --codex --yaml prd/02-setup.yaml --parallel --max-parallel 3 -v
ralphy --codex --yaml prd/03-core-service.yaml --parallel --max-parallel 3 -v
ralphy --codex --yaml prd/04-mcp-ship.yaml --parallel --max-parallel 3 -v
Relevant docs:
docs/plan/runbook.mddocs/plan/mcp-tools.mddocs/examples/codex-ralphy-review.yamldocs/examples/judge-proof.mddocs/examples/mcp-stdio-transcript.md
Cursor Compatibility Note
This repository does not depend on Cursor-specific orchestration. The shipped multi-agent workflow is Ralphy plus Codex because that is the clearest way to show YAML-based simultaneous agent execution in the repo itself.
If you open the repo in Cursor, the practical reviewer path is still straightforward:
- run the Docker plus mocks flow from this README
- run the MCP stdio server from
python -m arrivia_recs.mcp.server - use the checked-in docs and tests as the proof surface
Cursor can act as an editor or MCP client for the same service, but the checked-in orchestration artifact for this submission is docs/examples/codex-ralphy-review.yaml, not a Cursor-only workflow.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.