How It Works / Local & Privacy

Your knowledge, on your terms

Local-first architecture is a design principle, not a compromise. Every conversation, memory, and document is a file on your filesystem. What you route to a cloud provider travels directly from your machine to theirs — Memorandai runs no servers that intercept or hold your data.

Local Architecture

All Memorandai data — conversations, memories, keystones, documents — lives on your filesystem. No cloud sync, no remote servers, no third-party storage. When you route a prompt to a cloud AI provider, it goes directly to that provider under their terms. Memorandai is never in the middle.

You can browse, back up, port, or delete any of it at any time — standard file formats on your filesystem, nothing proprietary in between.

Data Management

The Data screen under Library → Exporters gives you a curated export path on top of the filesystem-level access. Pick what to back up by category — Projects (flow graphs, workspaces), Memories (conversation shards, keystones, ingested documents), Notebooks (pages, domains, embedded images), Settings (API keys, model preferences, feature flags), Profile (identity and AI-generated profile) — and Memorandai writes a single zip you can copy to another machine, save to cloud storage, or hand to a backup tool.

Restore is a drop-target away: drag a previous backup zip into the Restore panel and Memorandai re-hydrates the categories you choose to bring over.

No proprietary container, no vendor lock-in. The export is a standard zip of the same files you could move manually — Data Management is just the convenient shortcut.

Bring Your Own Keys

Use your own API keys from OpenAI, Anthropic, Google, and xAI. You pay the providers directly at their published rates. Memorandai never sees your keys, never adds markup, and encrypts them at rest.

Built-In Local Inference

Download and run local GGUF models in a few clicks — GPU-accelerated when available, CPU-friendly otherwise. Free for local-only conversations and useful for sessions you'd rather keep entirely on-device. Memorandai works best with cloud providers configured for the full feature set; local inference is a complement, not a complete replacement.

Configurable tool access on models that support it
No internet required for local-only conversations after initial model downloads
Same canvas workflow as cloud models — local nodes slot into the same Flows
Curated, rotating recommendations — current slate: Qwen 3 8B, Llama 3.1 8B Instruct, Gemma 3 4B Instruct, and Qwen 3 1.7B — all one-click downloadable in the app
Custom download — paste any HuggingFace model URI or direct .gguf URL for models beyond the curated set

Local Model Support

Memorandai supports four cloud chat providers (Anthropic, OpenAI, Google, xAI) and local GGUF inference via node-llama-cpp. Local coverage is comprehensive for chat workflows, partial for some background processing, and absent for features that require strict JSON output, vision, or cloud-anchored orchestration. The matrix below names every feature explicitly.

Full Local Support

Chat workflows and most companion features run end-to-end on local models with quality on par with cloud for typical use.

Feature	Notes
Local Chat node (Flows mode)	Native llama.cpp via node-llama-cpp v3, GGUF models loaded into VRAM. Tool calling via grammar-constrained generation, capped at 20 user-selected tools.
Chat View (linear chat mode)	Local routing branch alongside the four cloud providers.
Roundtable (local participant)	Loaded local model fans out alongside cloud participants. One-shot, no tools. Synthesis stays cloud-only (Helios runs the synthesis pass).
Reflection node	Margin notes in the local model's voice when attached to a local response. Prose-only, no JSON fragility.
Auto-memo / nightly consolidation	Routes local first when a model is loaded; cloud fallback on error or empty result.
Thread brief generation	Tries local first, then gpt-4o-mini, then haiku.

Partial — Cloud Recommended

Background tasks where local inference is wired up but quality varies. Open prose works; tasks requiring strict JSON output are fragile on smaller local models.

Feature	Notes
Self-notes (Dream Cycle, per-genus)	Generally works. Open-ended prose.
Helios convergence self-note	Generally works. Open prose. Loses some Helios voice — the brief is built for the user's preferred cloud model.
Identity brief generation	Generally works. Long-form prose. Quality drop noticeable but functional.
Profile identity-summary	Generally works. Explicit local branch.
Helios Keystone Revision Cycle	Fragile. Requires parseable JSON disposition array.
Keystone mining (Dream Cycle phase)	Fragile. Requires parseable JSON with category, importance, and confidence fields.
Memory quality scoring	Fragile. Strict JSON output.
Archive review scoring	Fragile. Strict JSON output.

Cloud Only

Features that require cloud provider APIs by construction — orchestration paths bound to a cloud preferred model, hardcoded SDK calls, vision support, and image generation.

Feature	Why cloud-only
Helios chat	Orchestrator hardcoded to a cloud preferred model.
Per-provider chat nodes (Claude, GPT, Gemini, Grok)	Bound to their provider by definition.
Roundtable synthesis	Helios runs the synthesis pass; cloud-only by construction.
Daily Insight (regular and Helios voice modes)	Direct cloud SDK calls.
Keystone miner (dedicated route)	Direct cloud SDK calls. Distinct from the Dream Cycle keystone-mining phase, which is in the partial-support table above.
Helios proposal miner	Direct cloud SDK calls.
Timeline scan	Direct cloud SDK calls.
Knowledge distillation	Hardcoded provider list.
Idea miner	Direct cloud SDK calls.
MCP server builder	Hardcoded to Claude with Anthropic tool use.
Custom node builder	Hardcoded to Claude with Anthropic tool use.
Connectivity test	Tests cloud provider reachability by definition.
Image generation (DALL-E 3, GPT Image 2, Imagen 4, Grok Aurora)	No local image-gen path.
Image attachments in chat (vision)	Local inference has no vision support; image attachments only work on the four cloud chat nodes.

Matrix reflects Memorandai 0.8.8.

MCP Servers

Extend Memorandai with Model Context Protocol servers. Install community servers or build your own. Each server adds new tools that every AI model on your canvas can use. The extensibility layer for power users.

Custom Nodes

Build your own node types with MCP command transport. Define input forms, connect to external services, stream results back to the canvas. Your workflow, your tools.

Encryption & Security

API keys are encrypted at rest using machine-specific keys. No telemetry, no analytics, no tracking. The app does not phone home. Memorandai never sees, stores, or transmits your data — what reaches a cloud model is your call alone.

Offline-Friendly License

Activate once online. After that, your license works offline — no subscription server to check in with, no recurring validation. Local-only chat works fully offline; cloud sessions and background features (Dream Cycle, Daily Insight, keystone mining) require Internet and API keys.

Take control of your knowledge

Your data, your keys, your machine. No compromises.

Become a Founding Member