Google Gemini — Smart Model Fallback for Claude Desktop

A lightweight MCP (Model Context Protocol) server that connects Claude Desktop to the Google Gemini API. Automatically selects the best available Gemini model using a tier-based fallback strategy with intelligent quota tracking — so your AI tools keep working even when individual models hit rate limits.

Built on @modelcontextprotocol/sdk with zero Gemini-specific dependencies (uses Node.js built-in fetch).

Features

3 MCP tools — ask_gemini, list_models, gemini_status
Smart model cache — tracks quota (RPM/RPD), availability, and TTL per model in memory
Quota-aware fallback — reads Retry-After header and quota type (per-minute vs per-day), skips blocked models automatically
Structured prompts — optional context[] blocks (skill / data / text) prepended before the prompt
Tier-based selection — models ranked by tier; best available tier selected automatically on every call
Configurable model list — edit dist/models.json to add, remove, or re-rank models without rebuilding
Zero Gemini deps — uses built-in Node.js fetch (Node 18+)

Requirements

Node.js 18+
Google Gemini API key — Get one at Google AI Studio

Build

The build script installs dependencies, compiles the bundle, copies models.json to dist/, runs the full test suite, and cleans up node_modules.

Windows:

build.cmd YOUR_GEMINI_API_KEY

Linux / macOS:

chmod +x build.sh
./build.sh YOUR_GEMINI_API_KEY

If GEMINI_API_KEY is already set in your environment, you can omit the argument:

build.cmd

./build.sh

After a successful build, dist/ is fully self-contained:

dist/
  mcp.js        — bundled server (single file, no node_modules needed)
  models.json   — model tier configuration (safe to edit without rebuilding)

Claude Desktop Setup

Open the Claude Desktop configuration file:

Windows: %APPDATA%\Claude\claude_desktop_config.json
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json

Add the following entry:

{
  "mcpServers": {
    "gemini-bridge": {
      "command": "node",
      "args": ["C:/absolute/path/to/dist/mcp.js"],
      "env": {
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}

Restart Claude Desktop. All three tools will be available in every conversation.

Model Configuration (`dist/models.json`)

Models are ranked by tier — the server always picks the lowest available tier. Edit this file at any time without rebuilding:

[
  { "id": "gemini-2.5-pro",        "tier": 1, "desc": "best reasoning, complex tasks" },
  { "id": "gemini-2.5-flash",      "tier": 2, "desc": "fast, capable, balanced" },
  { "id": "gemini-2.5-flash-lite", "tier": 3, "desc": "lightweight, high quota" },
  { "id": "gemini-2.0-flash",      "tier": 4, "desc": "fallback, stable" }
]

To use a custom path: set GEMINI_MODELS_PATH=/your/path/models.json in the environment.

Tools

`ask_gemini`

Sends a prompt to Gemini. Automatically selects the best available model by tier, or uses the model you specify.

Parameter	Type	Description
`prompt`	string (required)	The question or instruction
`model`	string (optional)	Override model ID, e.g. `"gemini-2.5-pro"`
`context`	array (optional)	Structured context blocks, max 5

context block:

{ "type": "skill|data|text", "text": "..." }

Composed prompt format when context is provided:

[skill]
You are a senior Node.js engineer.

[data]
{ "version": "2.0" }

[prompt]
Review this code for bugs.

Response — always a JSON string in content[0].text:

{ "ok": true,  "text": "...", "model_used": "gemini-2.5-pro" }
{ "ok": false, "error": "quota",   "retry": false, "best_retry_in": "43s" }
{ "ok": false, "error": "blocked", "retry": false, "detail": "..." }
{ "ok": false, "error": "timeout", "retry": true }

Always check ok before using text.

`list_models`

Returns the model list with current cache status. No API calls made.

[
  { "id": "gemini-2.5-pro",        "tier": 1, "status": "ok",        "retry_in": null },
  { "id": "gemini-2.5-flash",      "tier": 2, "status": "quota_rpm", "retry_in": "43s" },
  { "id": "gemini-2.5-flash-lite", "tier": 3, "status": "unknown",   "retry_in": null }
]

Status values: ok | quota_rpm | quota_rpd | error | unknown

Use this to decide which model to pass to ask_gemini.

`gemini_status`

Actively probes first N models and warms up the cache. Stops at the first successful model.

Parameter	Type	Description
`limit`	integer (optional)	Number of models to probe, default 3

Use for debugging or cache warmup. For a quick overview without API calls, use list_models instead.

Environment Variables

Variable	Default	Description
`GEMINI_API_KEY`	—	API key (required)
`GEMINI_FETCH_TIMEOUT_MS`	`30000`	Per-request timeout in milliseconds
`GEMINI_TTL_OK_MS`	`300000`	Cache TTL for healthy models (default 5 min)
`GEMINI_MODELS_PATH`	`dist/models.json`	Custom path to `models.json`

Project Structure

src/
  mcp.js              — entry point (Server + StdioTransport)
  Config.js           — API key, timeouts, TTL, models path
  GeminiClient.js     — callGemini(), probeModel(), parse429()
  ModelCache.js       — in-memory cache per model
  Tools/
    AskGemini.js
    ListModels.js
    GeminiStatus.js
  Utils/
    composePrompt.js
dist/
  mcp.js              — bundled output (esbuild, single file)
  models.json         — model tier configuration
models.json           — source of truth (copied to dist/ during build)
test/
  unit.js             — unit tests (ModelCache, composePrompt), no API calls
  integration.js      — integration tests via stdio JSON-RPC

Running Tests

# Unit tests only (no API key needed)
node test/unit.js

# Integration tests against src/ (requires GEMINI_API_KEY)
node test/integration.js src

# Integration tests against dist/
node test/integration.js dist

# Full suite (same as npm test)
npm test

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
dist		dist
src		src
test		test
.gitignore		.gitignore
README.md		README.md
build.cmd		build.cmd
build.sh		build.sh
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Google Gemini — Smart Model Fallback for Claude Desktop

Features

Requirements

Build

Claude Desktop Setup

Model Configuration (`dist/models.json`)

Tools

`ask_gemini`

`list_models`

`gemini_status`

Environment Variables

Project Structure

Running Tests

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Google Gemini — Smart Model Fallback for Claude Desktop

Features

Requirements

Build

Claude Desktop Setup

Model Configuration (dist/models.json)

Tools

ask_gemini

list_models

gemini_status

Environment Variables

Project Structure

Running Tests

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Model Configuration (`dist/models.json`)

`ask_gemini`

`list_models`

`gemini_status`

Packages