AI Provider for WebLLM

Run a large language model inside the browser — no API key, no cloud, no per-token cost — and use it as a first-class AI Provider for the WordPress AI Client.

AI Provider for WebLLM registers WebLLM as a client-side provider for the WordPress AI Client. The model runs in the visitor's browser via WebGPU, so text generation is private by default and costs nothing per request — there is no server or third-party API in the loop.

Why use it

No API key, no bill. Inference happens on the user's GPU. Nothing is sent to a cloud provider.
Private by design. Prompts and completions never leave the browser.
A real AI Client provider. It shows up in Settings → AI (Connectors) like any other provider, so existing AI Client code can target it.
Optional server-side bridge. An open wp-admin tab can act as a worker, letting PHP-initiated generation run in that browser.

Requirements

WordPress 7.0 or newer (the AI Client ships with WordPress 7.0)
PHP 7.4 or newer
The WordPress AI Client (WordPress\AiClient) must be available — it ships with WordPress 7.0. Without it the provider registers nothing and stays dormant.
A browser with WebGPU support (recent Chrome is the most reliable) for the actual inference.
A secure context — the site must be served over HTTPS or localhost. Browsers disable WebGPU on plain-HTTP origins.

Installation

Download or clone this repository into wp-content/plugins/ai-provider-for-webllm.
Activate AI Provider for WebLLM from the Plugins screen.
Make sure a plugin or feature providing the WordPress AI Client is also active.

cd wp-content/plugins
git clone https://github.com/ProgressPlanner/ai-provider-for-webllm.git

Quick start

Go to Settings → WebLLM.
Pick a model. The list loads live from WebLLM's own catalogue, ordered by size — smaller models download faster and use less memory.
Save. The first generation downloads the model once and caches it in the browser.

That's it. Anywhere the AI Client is used in the browser, WebLLM is now a selectable provider.

How it works

WebLLM is a client-side provider, which makes it different from cloud providers:

The PHP classes here only describe the provider and its models so the AI Client and the Settings → AI Connectors screen recognise it. They never call an API.
Actual inference runs in the browser through WebLLM's WebGPU runtime (see assets/js).
The provider declares an api_key auth method only because core's Connectors screen surfaces credential-based connectors. The key is hardcoded to a sentinel (not-needed), so users are never asked for one.

Optional: server-side generation via the browser bridge

PHP cannot push work to a browser, so the plugin ships an opt-in bridge for PHP-initiated generation:

Enable In-browser worker on Settings → WebLLM.
Keep a wp-admin tab open. It downloads the selected model (once, then cached) and polls for jobs.
When PHP requests a generation, the job is queued in a database table; the worker tab claims it, runs the model, and posts the result back over REST.

This only works while a worker tab with the model loaded is connected. There is no headless path — cron and WP-CLI have no browser, so server-initiated generation fails loudly when no worker is available. PHP also blocks while waiting for the browser to answer (up to the timeout), so prefer browser-initiated use where you can.

Two filters tune the bridge:

// Capability required to operate the worker. Default: manage_options.
add_filter( 'ai_provider_webllm_worker_capability', fn() => 'edit_posts' );

// Seconds PHP waits for a worker result before timing out. Default: 120.
add_filter( 'ai_provider_webllm_timeout', fn() => 60 );

Contributing

Contributions are welcome. Please read CONTRIBUTING.md for development setup, coding standards, and the pull request process. By participating you agree to our Code of Conduct.

Support

For help, see SUPPORT.md. To report a security issue, follow SECURITY.md — please do not open a public issue for vulnerabilities.

License

GPL-2.0-or-later. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github		.github
assets		assets
src		src
.distignore		.distignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.stylelintrc.json		.stylelintrc.json
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
composer.json		composer.json
composer.lock		composer.lock
package.json		package.json
phpcs.xml.dist		phpcs.xml.dist
phpstan.neon.dist		phpstan.neon.dist
plugin.php		plugin.php
readme.txt		readme.txt
uninstall.php		uninstall.php

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Provider for WebLLM

Why use it

Requirements

Installation

Quick start

How it works

Contributing

Support

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Provider for WebLLM

Why use it

Requirements

Installation

Quick start

How it works

Contributing

Support

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages