How It Works / Local & Privacy
Your knowledge, on your terms
Local-first architecture is a design principle, not a compromise. Every conversation, memory, and document is a file on your filesystem. What you route to a cloud provider travels directly from your machine to theirs — Memorandai runs no servers that intercept or hold your data.
Local Architecture
All Memorandai data — conversations, memories, keystones, documents — lives on your filesystem. No cloud sync, no remote servers, no third-party storage. When you route a prompt to a cloud AI provider, it goes directly to that provider under their terms. Memorandai is never in the middle.
You can browse, back up, port, or delete any of it at any time — standard file formats on your filesystem, nothing proprietary in between.
Data Management
The Data screen under Library → Exporters gives you a curated export path on top of the filesystem-level access. Pick what to back up by category — Projects (flow graphs, workspaces), Memories (conversation shards, keystones, ingested documents), Notebooks (pages, domains, embedded images), Settings (API keys, model preferences, feature flags), Profile (identity and AI-generated profile) — and Memorandai writes a single zip you can copy to another machine, save to cloud storage, or hand to a backup tool.
Restore is a drop-target away: drag a previous backup zip into the Restore panel and Memorandai re-hydrates the categories you choose to bring over.
No proprietary container, no vendor lock-in. The export is a standard zip of the same files you could move manually — Data Management is just the convenient shortcut.
Bring Your Own Keys
Use your own API keys from OpenAI, Anthropic, Google, and xAI. You pay the providers directly at their published rates. Memorandai never sees your keys, never adds markup, and encrypts them at rest.
Built-In Local Inference
Download and run local GGUF models in a few clicks — GPU-accelerated when available, CPU-friendly otherwise. Free for local-only conversations and useful for sessions you'd rather keep entirely on-device. Memorandai works best with cloud providers configured for the full feature set; local inference is a complement, not a complete replacement.
- Configurable tool access on models that support it
- No internet required for local-only conversations after initial model downloads
- Same canvas workflow as cloud models — local nodes slot into the same Flows
- Curated, rotating recommendations — current slate: Qwen 3 8B, Llama 3.1 8B Instruct, Gemma 3 4B Instruct, and Qwen 3 1.7B — all one-click downloadable in the app
- Custom download — paste any HuggingFace model URI or direct .gguf URL for models beyond the curated set
Local Model Support
Memorandai supports four cloud chat providers (Anthropic, OpenAI, Google, xAI) and local GGUF inference via node-llama-cpp. Local coverage is comprehensive for chat workflows, partial for some background processing, and absent for features that require strict JSON output, vision, or cloud-anchored orchestration. The matrix below names every feature explicitly.
Full Local Support
Chat workflows and most companion features run end-to-end on local models with quality on par with cloud for typical use.
| Feature | Notes |
|---|---|
| Local Chat node (Flows mode) | Native llama.cpp via node-llama-cpp v3, GGUF models loaded into VRAM. Tool calling via grammar-constrained generation, capped at 20 user-selected tools. |
| Chat View (linear chat mode) | Local routing branch alongside the four cloud providers. |
| Roundtable (local participant) | Loaded local model fans out alongside cloud participants. One-shot, no tools. Synthesis stays cloud-only (Helios runs the synthesis pass). |
| Reflection node | Margin notes in the local model's voice when attached to a local response. Prose-only, no JSON fragility. |
| Auto-memo / nightly consolidation | Routes local first when a model is loaded; cloud fallback on error or empty result. |
| Thread brief generation | Tries local first, then gpt-4o-mini, then haiku. |
Partial — Cloud Recommended
Background tasks where local inference is wired up but quality varies. Open prose works; tasks requiring strict JSON output are fragile on smaller local models.
| Feature | Notes |
|---|---|
| Self-notes (Dream Cycle, per-genus) | Generally works. Open-ended prose. |
| Helios convergence self-note | Generally works. Open prose. Loses some Helios voice — the brief is built for the user's preferred cloud model. |
| Identity brief generation | Generally works. Long-form prose. Quality drop noticeable but functional. |
| Profile identity-summary | Generally works. Explicit local branch. |
| Helios Keystone Revision Cycle | Fragile. Requires parseable JSON disposition array. |
| Keystone mining (Dream Cycle phase) | Fragile. Requires parseable JSON with category, importance, and confidence fields. |
| Memory quality scoring | Fragile. Strict JSON output. |
| Archive review scoring | Fragile. Strict JSON output. |
Cloud Only
Features that require cloud provider APIs by construction — orchestration paths bound to a cloud preferred model, hardcoded SDK calls, vision support, and image generation.
| Feature | Why cloud-only |
|---|---|
| Helios chat | Orchestrator hardcoded to a cloud preferred model. |
| Per-provider chat nodes (Claude, GPT, Gemini, Grok) | Bound to their provider by definition. |
| Roundtable synthesis | Helios runs the synthesis pass; cloud-only by construction. |
| Daily Insight (regular and Helios voice modes) | Direct cloud SDK calls. |
| Keystone miner (dedicated route) | Direct cloud SDK calls. Distinct from the Dream Cycle keystone-mining phase, which is in the partial-support table above. |
| Helios proposal miner | Direct cloud SDK calls. |
| Timeline scan | Direct cloud SDK calls. |
| Knowledge distillation | Hardcoded provider list. |
| Idea miner | Direct cloud SDK calls. |
| MCP server builder | Hardcoded to Claude with Anthropic tool use. |
| Custom node builder | Hardcoded to Claude with Anthropic tool use. |
| Connectivity test | Tests cloud provider reachability by definition. |
| Image generation (DALL-E 3, GPT Image 2, Imagen 4, Grok Aurora) | No local image-gen path. |
| Image attachments in chat (vision) | Local inference has no vision support; image attachments only work on the four cloud chat nodes. |
Matrix reflects Memorandai 0.8.8.
MCP Servers
Extend Memorandai with Model Context Protocol servers. Install community servers or build your own. Each server adds new tools that every AI model on your canvas can use. The extensibility layer for power users.
Custom Nodes
Build your own node types with MCP command transport. Define input forms, connect to external services, stream results back to the canvas. Your workflow, your tools.
Encryption & Security
API keys are encrypted at rest using machine-specific keys. No telemetry, no analytics, no tracking. The app does not phone home. Memorandai never sees, stores, or transmits your data — what reaches a cloud model is your call alone.
Offline-Friendly License
Activate once online. After that, your license works offline — no subscription server to check in with, no recurring validation. Local-only chat works fully offline; cloud sessions and background features (Dream Cycle, Daily Insight, keystone mining) require Internet and API keys.
Take control of your knowledge
Your data, your keys, your machine. No compromises.