History

Franz Rolfsvaag 34e78d69c3 Add Lumi AI, birthday plugin, and persistent updates		2026-06-11 06:35:43 +02:00
..
backend	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
data	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
public	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
templates	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
tests	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
views	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
cmds.json	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
index.js	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
models_manifest.json	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
plugin.json	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
README.md	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00
runtime_manifest.json	Add Lumi AI, birthday plugin, and persistent updates	2026-06-11 06:35:43 +02:00

README.md

Lumi AI

lumi_ai is a standalone Lumi plugin that manages a local llama.cpp inference process and adds a scoped AI Assistant to the WebUI.

Install and configure

Place this directory at plugins/lumi_ai/.
Restart Lumi.
Open Plugins -> Lumi AI in the sidebar.
Download the managed runtime and a compatible model.
Select the model, configure visibility and instructions, then save.
Start the runtime and enable AI.

The settings page is always registered as an admin-only item in the Plugins sidebar section. The assistant pill is injected separately above the profile footer and follows the configured admin, moderator, and user visibility controls.

Storage

Every writable path is confined to plugins/lumi_ai/data/:

config/: settings and runtime state
models/: verified GGUF models
runtime/: extracted llama.cpp runtime
logs/: runtime logs
metrics/: usage and audit records
rag/, cache/, tmp/: plugin-local working data

Downloads are written to data/tmp/, verified against a pinned SHA-256 digest, and only then moved or extracted into their final plugin-local directory.

Runtime and downloads

Models use pinned Hugging Face repository commits. The runtime uses a pinned official ggml-org/llama.cpp GitHub release because the llama.cpp project does not publish authoritative multi-platform runtime archives on Hugging Face. This is the only download-source exception; the archive URL, version, size, and SHA-256 are pinned in runtime_manifest.json.

The runtime binds only to 127.0.0.1 on an ephemeral port. It is never exposed on 0.0.0.0.

Before loading a model, Lumi AI runs llama-server --help as a smoke test. Failed launches and exits are decoded into plugin-local diagnostics, including Windows NTSTATUS values such as 0xC0000005 / STATUS_ACCESS_VIOLATION. The admin page provides remediation steps, raw stdout/stderr tails, model verification, and a redacted diagnostics bundle.

The test console no longer exposes a user-editable scope label. Clearly unrelated requests are rejected deterministically, while ambiguous requests are passed to the scoped Lumi system prompt instead of being rejected by a fixed keyword list.

Plugin API

Other Lumi plugins can use:

const ai = global.lumiFrameworks?.ai;
const health = await ai.health();
const result = await ai.generate({
  message: "Summarize this Lumi event.",
  user: requestingUser,
  sessionId: requestSessionId,
  scope: "my_plugin"
});

Available functions:

generate
classify
summarize
route_tool
health
capabilities
metrics_summary
registerContext
unregisterContext
registerTool

AI tools must provide an owning plugin, a synchronous permission check, a fixed argument schema, and an established workflow handler. Model output cannot execute SQL, shell commands, file operations, or arbitrary URLs.

Tool registration

ai.registerTool({
  tool_id: "example.action",
  display_name: "Example action",
  description: "Runs an existing plugin workflow.",
  owning_plugin: "example",
  required_role: "user",
  required_permission: "example.action.self",
  permission_check: ({ user, arguments: args }) => canRunWorkflow(user, args),
  schema: { target: "string", amount: "integer" },
  confirmation_required: true,
  risk_level: "sensitive",
  audit_category: "example",
  workflow_handler: ({ arguments: args, user, initiated_via_ai, ai_request_id }) =>
    existingWorkflow({ ...args, actor: user, initiated_via_ai, ai_request_id })
});

Verification

Run:

node plugins/lumi_ai/tests/verify.js

The verification covers path confinement, traversal rejection, assistant role access, tool schema and permission checks, user/session confirmation ownership, expiry, action attribution, audit recording, queue limits, refusal behavior, and runtime resume persistence.