tileserver-rs includes a browser-local AI assistant that lets you interact with your maps using natural language. The LLM runs entirely in your browser via WebGPU — no API keys, no cloud services, no token costs. Your data never leaves your machine.
How It Works
The AI chat is powered by WebLLM, which runs open-source language models directly in your browser using WebGPU acceleration. The assistant can:
- Navigate the map — fly to cities, countries, or coordinates
- Query features — find what's visible in the viewport or search tile data
- Modify styles — change colors, opacity, visibility of map layers
- Highlight features — temporarily highlight features matching a filter
- Inspect data — get layer schemas, source statistics, and spatial queries
The entire conversation stays in your browser. Chat history is persisted to localStorage via TanStack DB and survives page refreshes.
Requirements
WebGPU is required. Use Chrome 113+, Edge 113+, or any Chromium-based browser with WebGPU support. Firefox and Safari do not fully support WebGPU yet.
| Requirement | Minimum | Recommended |
|---|---|---|
| Browser | Chrome 113+ | Chrome 120+ |
| GPU VRAM | 2 GB | 6 GB |
| Free RAM | 2 GB | 8 GB |
| Disk Space | 1 GB | 5 GB (for larger models) |
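Before loading a model, the viewer has to confirm WebGPU is actually available. A minimal detection sketch (the real check may differ; the function takes a navigator-like object so it can also be exercised outside a browser):

```typescript
// Minimal WebGPU feature detection, written against an injected
// navigator-like object so it can run outside the browser too.
interface NavigatorLike {
  gpu?: unknown;
}

function hasWebGPU(nav: NavigatorLike): boolean {
  // Supporting browsers (Chrome/Edge 113+) expose navigator.gpu;
  // the property is absent or null elsewhere.
  return typeof nav === "object" && nav !== null && "gpu" in nav && nav.gpu != null;
}
```

In the browser you would call `hasWebGPU(navigator)` and show a fallback message when it returns false.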
Getting Started
- Open any style in the map viewer (e.g., http://localhost:8080/styles/protomaps-light/)
- Press ⌘K (Mac) or Ctrl+K (Windows/Linux) to open the chat palette
- On first use, the model downloads and compiles for your GPU (~30 seconds)
- Type a message or select a suggested prompt
Subsequent loads are fast (~2–5 seconds) because the compiled model is cached in IndexedDB.
Suggested Prompts
The chat palette shows suggested prompts when the conversation is empty:
- "Fly to Paris, France and show me the Eiffel Tower area"
- "What layers are available on this map?"
- "Take me to Tokyo, Japan at a good zoom level"
- "Show me the entire Mediterranean Sea region"
Available Models
tileserver-rs ships with four pre-configured models. Choose based on your hardware and needs:
| Model | Size | Tool Calling | Best For |
|---|---|---|---|
| Hermes 3 8B (default) | 4.9 GB | ✅ Native | Full tool support, best accuracy |
| Hermes 2 Pro 8B | 5.0 GB | ✅ Native | Alternative tool-capable model |
| Qwen 2.5 3B | 2.0 GB | ❌ Text fallback | Lower VRAM, basic navigation |
| Qwen 2.5 1.5B | 1.0 GB | ❌ Text fallback | Minimal hardware, basic navigation |
Tool-capable models (Hermes) use native OpenAI-format function calling — the LLM decides which tool to invoke and the UI executes it automatically. Non-tool models (Qwen) use a text-based fallback where the LLM emits structured action blocks that are parsed and executed client-side.
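As a sketch of the two invocation styles: a native tool declaration in OpenAI function-calling format (shown for fly_to, mirroring its parameter table below), and a parser for a text-based action block. The `<action>` tag syntax here is a hypothetical stand-in, not the exact fallback format the app uses:

```typescript
// 1. Native function calling: the tool is declared in OpenAI format and a
//    tool-capable model returns a structured tool_call the UI executes.
const flyToTool = {
  type: "function",
  function: {
    name: "fly_to",
    description: "Animate the map camera to a specific location",
    parameters: {
      type: "object",
      properties: {
        lng: { type: "number", description: "Longitude (-180 to 180)" },
        lat: { type: "number", description: "Latitude (-90 to 90)" },
        zoom: { type: "number", description: "Zoom level 0-22" },
      },
      required: ["lng", "lat"],
    },
  },
};

// 2. Text fallback: a non-tool model emits a tagged block in plain text,
//    which the client parses and executes itself. The <action> syntax is
//    a made-up illustration of the pattern.
function parseActionBlock(text: string): { tool: string; args: unknown } | null {
  const match = text.match(/<action>([\s\S]*?)<\/action>/);
  if (!match) return null;
  try {
    return JSON.parse(match[1]) as { tool: string; args: unknown };
  } catch {
    return null; // malformed JSON inside the block
  }
}
```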
You can switch models at any time using the model selector dropdown in the chat palette.
Map Tools
The AI assistant has access to 13 tools organized into three categories: navigation, styling, and data queries.
Navigation Tools
fly_to
Animate the map camera to a specific location.
"Fly to the Colosseum in Rome"
"Show me downtown Manhattan at zoom 15"
"Go to coordinates 139.7, 35.7 with a 45° bearing"
| Parameter | Type | Required | Description |
|---|---|---|---|
| lng | number | ✅ | Longitude (-180 to 180) |
| lat | number | ✅ | Latitude (-90 to 90) |
| zoom | number | | Zoom level 0–22 (default 12) |
| bearing | number | | Bearing in degrees (default 0) |
| pitch | number | | Pitch 0–85° (default 0) |
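A dependency-free sketch of how these arguments might be defaulted and clamped to the documented ranges (the project lists zod for tool input validation; this is illustrative only, not the actual schema):

```typescript
// Clamp and default fly_to arguments to the ranges in the table above.
interface FlyToArgs {
  lng: number;
  lat: number;
  zoom?: number;
  bearing?: number;
  pitch?: number;
}

function normalizeFlyTo(args: FlyToArgs): Required<FlyToArgs> {
  const clamp = (v: number, lo: number, hi: number) => Math.min(hi, Math.max(lo, v));
  return {
    lng: clamp(args.lng, -180, 180),
    lat: clamp(args.lat, -90, 90),
    zoom: clamp(args.zoom ?? 12, 0, 22), // default 12
    bearing: args.bearing ?? 0,          // degrees, default 0
    pitch: clamp(args.pitch ?? 0, 0, 85) // default 0
  };
}
```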
fit_bounds
Fit the map camera to a bounding box — useful for countries, regions, or areas.
"Show me all of Japan"
"Zoom to the Mediterranean Sea"
"Fit the map to the continental United States"
| Parameter | Type | Required | Description |
|---|---|---|---|
| west | number | ✅ | West longitude |
| south | number | ✅ | South latitude |
| east | number | ✅ | East longitude |
| north | number | ✅ | North latitude |
| padding | number | | Padding in pixels (default 50) |
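The camera math behind a bounds fit can be sketched as follows, assuming MapLibre's 512px world tile. This is illustrative, not tileserver-rs's actual implementation:

```typescript
// Find the largest zoom at which the padded bounding box still fits
// the viewport: project to Web Mercator fractions, then solve
// worldPixels = 512 * 2^zoom for each axis and take the tighter one.
function zoomForBounds(
  west: number, south: number, east: number, north: number,
  viewportW: number, viewportH: number, padding = 50,
): number {
  // Web Mercator Y as a 0..1 fraction (0 at the north edge).
  const mercY = (lat: number) => {
    const r = (lat * Math.PI) / 180;
    return (1 - Math.log(Math.tan(r) + 1 / Math.cos(r)) / Math.PI) / 2;
  };
  const fracX = (east - west) / 360;
  const fracY = mercY(south) - mercY(north); // south is lower on screen
  const usableW = viewportW - 2 * padding;
  const usableH = viewportH - 2 * padding;
  const zx = Math.log2(usableW / (512 * fracX));
  const zy = Math.log2(usableH / (512 * fracY));
  return Math.min(zx, zy);
}
```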
get_map_state
Get the current map center, zoom, bearing, pitch, and visible layers. The assistant uses this to understand what you're looking at before making changes.
Styling Tools
set_layer_visibility
Show or hide a map layer by its ID.
"Hide the water layer"
"Show the buildings layer"
"Turn off all label layers"
set_layer_paint
Change a paint property of a map layer — color, opacity, width, and more.
"Make the water dark blue"
"Set the building fill opacity to 0.5"
"Change road line width to 3"
set_layer_filter
Apply a MapLibre filter expression to a layer. Only features matching the filter are shown.
"Only show parks in the landuse layer"
"Filter buildings taller than 50 meters"
add_highlight
Temporarily highlight features matching a filter with a colored circle. Highlights auto-remove after 8 seconds.
"Highlight all hospitals"
"Show me where the schools are in red"
generate_style
Apply multiple style changes at once from a natural language description.
"Make this a dark mode map"
"Give me a satellite-style color scheme"
"Make all text labels larger"
Data Query Tools
These tools query the tile data served by tileserver-rs — either from the rendered viewport or directly from the vector tile sources.
query_rendered_features
Query features currently visible in the map viewport. Returns properties and geometry type.
"What features are visible right now?"
"Show me the properties of buildings in view"
"List all points of interest I can see"
get_source_schema
Get the schema of a tile source — available layers, field names and types, zoom range, and bounds.
"What layers are in the openmaptiles source?"
"Show me the fields available in the buildings layer"
get_source_stats
Get statistics for a tile source — bounds, zoom range, layer count, attribution.
"What's the zoom range of this data source?"
"Show me the attribution for the terrain data"
spatial_query
Query features from a tile source within a bounding box. This queries the actual vector tile data on the server, not just what's rendered in the viewport.
"Find all buildings within 1km of the Eiffel Tower"
"What points of interest are in this area?"
| Parameter | Type | Required | Description |
|---|---|---|---|
| source | string | ✅ | Source ID to query |
| bbox | number[] | | Bounding box [west, south, east, north] |
| zoom | number | | Tile resolution (default 14) |
| layers | string[] | | Layer IDs to query |
| limit | number | | Max features to return (default 100) |
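The tile arithmetic behind such a query can be sketched with the standard slippy-map formulas: which XYZ tiles at the requested zoom cover the bounding box. This is illustrative, not the server's exact code:

```typescript
// Standard lon/lat -> XYZ tile conversion (Web Mercator).
function lonLatToTile(lon: number, lat: number, zoom: number): { x: number; y: number } {
  const n = 2 ** zoom;
  const r = (lat * Math.PI) / 180;
  const x = Math.floor(((lon + 180) / 360) * n);
  const y = Math.floor(((1 - Math.log(Math.tan(r) + 1 / Math.cos(r)) / Math.PI) / 2) * n);
  return { x, y };
}

// Enumerate every tile at `zoom` that intersects the bbox.
function tilesForBbox(west: number, south: number, east: number, north: number, zoom: number) {
  const tl = lonLatToTile(west, north, zoom); // top-left tile
  const br = lonLatToTile(east, south, zoom); // bottom-right tile
  const tiles: Array<{ x: number; y: number; z: number }> = [];
  for (let x = tl.x; x <= br.x; x++)
    for (let y = tl.y; y <= br.y; y++) tiles.push({ x, y, z: zoom });
  return tiles;
}
```

The server can then decode only those tiles and filter their features against the exact bbox and optional layer list.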
get_overlays
List all user-dropped file overlays on the map — file names, formats, feature counts, colors, and visibility. Works with files dragged onto the map viewer (GeoJSON, KML, GPX, CSV, Shapefiles, PMTiles).
Keyboard Shortcuts
| Shortcut | Action |
|---|---|
| ⌘K / Ctrl+K | Toggle chat palette |
| Esc | Close chat palette |
| Enter | Send message |
Architecture
```
User Input → Chat Palette (Vue)
        ↓
  useLlmPanel (composable)
        ↓
  useLlmChat → useChat({ connection: stream(adapter) })
        ↓
  stream() adapter — converts WebLLM → AG-UI protocol events
        ↓
  WebLLM engine (browser-local, WebGPU)
    ├── chat.completions.create({ stream: true })
    └── Tool calls (fly_to, set_layer_paint, etc.)
        ↓
  AG-UI events stream back to TanStack AI Vue
        ↓
  Tool results auto-executed client-side
        ↓
  Chat Palette renders messages + tool results
```
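The adapter's role can be sketched as an async generator that consumes OpenAI-style streaming chunks and re-emits UI events. The chunk and event shapes below are simplified stand-ins, not the actual WebLLM or AG-UI types:

```typescript
// Simplified stand-in shapes: real WebLLM chunks and AG-UI events differ.
interface Chunk {
  delta?: { content?: string };
  finish_reason?: string | null;
}
type UiEvent =
  | { type: "text-delta"; text: string }
  | { type: "done" };

// Convert a stream of completion chunks into UI events as they arrive.
async function* toUiEvents(chunks: AsyncIterable<Chunk>): AsyncGenerator<UiEvent> {
  for await (const chunk of chunks) {
    if (chunk.delta?.content) yield { type: "text-delta", text: chunk.delta.content };
    if (chunk.finish_reason) yield { type: "done" };
  }
}
```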
Key packages:
- @mlc-ai/web-llm — Browser-local LLM inference via WebGPU
- @tanstack/ai-vue — Vue integration with useChat hook
- zod — Tool input schema validation
Chat Persistence
Chat messages are automatically saved to your browser's localStorage via TanStack DB. This includes:
- All user and assistant messages
- Tool call records (which tool was called, with what arguments)
- Spatial query results
Messages persist across page refreshes and browser restarts. They are stored locally and never sent to any server.
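The persistence pattern can be sketched as a serialize/deserialize round trip against the localStorage interface. The real app goes through TanStack DB, and the storage key below is made up:

```typescript
// Simplified message shape and a key-value interface matching localStorage.
interface ChatMessage {
  role: "user" | "assistant";
  content: string;
}
interface KvStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

const STORAGE_KEY = "chat-history"; // hypothetical key name

function saveMessages(store: KvStore, messages: ChatMessage[]): void {
  store.setItem(STORAGE_KEY, JSON.stringify(messages));
}

function loadMessages(store: KvStore): ChatMessage[] {
  const raw = store.getItem(STORAGE_KEY);
  return raw ? (JSON.parse(raw) as ChatMessage[]) : [];
}
```

In the browser, `window.localStorage` satisfies the KvStore interface directly.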
Troubleshooting
"WebGPU is not supported"
Your browser doesn't support WebGPU. Use Chrome 113+ or Edge 113+. On macOS, Safari technology previews have partial WebGPU support but may not work reliably with WebLLM.
Model download is slow
The first download transfers 1–5 GB depending on the model. After the first download, the compiled model is cached in IndexedDB and loads in 2–5 seconds. Try a smaller model (Qwen 2.5 1.5B at 1 GB) if bandwidth is limited.
Tool calls don't execute
If the assistant describes an action but doesn't execute it, you may be using a non-tool model (Qwen). Switch to Hermes 3 8B for native tool calling support. The Qwen models use a text-based fallback that only supports basic navigation (fly_to and fit_bounds).
Chat palette won't open
Make sure you're on a style viewer page (/styles/{style}/). The AI chat is only available in the map viewer, not on the home page or data inspector.
GPU out of memory
Try a smaller model. Qwen 2.5 1.5B requires only ~1 GB of VRAM. Close other GPU-intensive tabs or applications. On systems with shared GPU memory (integrated graphics), ensure enough system RAM is available.
Privacy & Security
- No data leaves your browser — the LLM runs entirely via WebGPU
- No API keys required — no OpenAI, Anthropic, or cloud AI accounts needed
- No token costs — inference is free, unlimited, forever
- Chat history is local — stored in localStorage, never uploaded
- Server-side tools query your own data — spatial queries go to your tileserver-rs instance, not any external service
Live Demo
Try the AI chat on the live demo at demo.tileserver.app. Open any style, press ⌘K, and start talking to the map.
Next Steps
- Quickstart — Set up tileserver-rs
- Configuration — All configuration options
- Vector Tiles — Serving vector tiles
- File Drop — Drag & drop files onto the map