admin-valentin/open-notebook

Fork 0

Rousseau Vincent b3452582c9

Remove reference to OLLAMA_NUM_GPU

2026-04-15 16:26:14 +02:00

7.4 KiB

Raw Blame History

Quick Start - Local & Private (5 minutes)

Get Open Notebook running with 100% local AI using Ollama. No cloud API keys needed, completely private.

Prerequisites

Docker Desktop installed
- Download here
- Already have it? Skip to step 2
Local LLM - Choose one:
- Ollama (recommended): Download here
- LM Studio (GUI alternative): Download here

Step 1: Choose Your Setup (1 min)

Local Machine (Same Computer)

Everything runs on your machine. Recommended for testing/learning.

Remote Server (Raspberry Pi, NAS, Cloud VM)

Run on a different computer, access from another. Needs network configuration.

Step 2: Create Configuration (1 min)

Create a new folder open-notebook-local and add this file:

docker-compose.yml:

services:
  surrealdb:
    image: surrealdb/surrealdb:v2
    command: start --user root --pass password --bind 0.0.0.0:8000 rocksdb:/mydata/mydatabase.db
    user: root
    ports:
      - "8000:8000"
    volumes:
      - ./surreal_data:/mydata

  open_notebook:
    image: lfnovo/open_notebook:v1-latest
    pull_policy: always
    ports:
      - "8502:8502"  # Web UI (React frontend)
      - "5055:5055"  # API (required!)
    environment:
      # Encryption key for credential storage (required)
      - OPEN_NOTEBOOK_ENCRYPTION_KEY=change-me-to-a-secret-string

      # Database (required)
      - SURREAL_URL=ws://surrealdb:8000/rpc
      - SURREAL_USER=root
      - SURREAL_PASSWORD=password
      - SURREAL_NAMESPACE=open_notebook
      - SURREAL_DATABASE=open_notebook
    volumes:
      - ./notebook_data:/app/data
    depends_on:
      - surrealdb
    restart: always

  ollama:
    image: ollama/ollama:latest
    ports:
      - "11434:11434"
    volumes:
      - ./ollama_models:/root/.ollama
    restart: always
    # Optional: set GPU support if available
    #deploy:
    #  resources:
    #    reservations:
    #      devices:
    #        - driver: nvidia
    #          count: 1
    #          capabilities: [gpu]

Edit the file:

Replace change-me-to-a-secret-string with your own secret (any string works)

Step 3: Start Services (1 min)

Open terminal in your open-notebook-local folder:

docker compose up -d

Wait 10-15 seconds for all services to start.

Step 4: Download a Model (2-3 min)

Ollama needs at least one language model. Pick one:

# Fastest & smallest (recommended for testing)
docker exec open-notebook-local-ollama-1 ollama pull mistral

# OR: Better quality but slower
docker exec open-notebook-local-ollama-1 ollama pull neural-chat

# OR: Even better quality, more VRAM needed
docker exec open-notebook-local-ollama-1 ollama pull llama2

This downloads the model (will take 1-5 minutes depending on your internet).

Step 5: Access Open Notebook (instant)

Open your browser:

http://localhost:8502

You should see the Open Notebook interface.

Step 6: Configure Ollama Provider (1 min)

Go to Settings → API Keys
Click Add Credential
Select provider: Ollama
Give it a name (e.g., "Local Ollama")
Enter the base URL: http://ollama:11434
Click Save
Click Test Connection — should show success
Click Discover Models → Register Models

Step 7: Configure Local Model (1 min)

Go to Settings → Models
Set:
- Language Model: ollama/mistral (or whichever model you downloaded)
- Embedding Model: ollama/nomic-embed-text (auto-downloads if missing)
Click Save

Step 8: Create Your First Notebook (1 min)

Click New Notebook
Name: "My Private Research"
Click Create

Step 9: Add Local Content (1 min)

Click Add Source
Choose Text
Paste some text or a local document
Click Add

Step 10: Chat With Your Content (1 min)

Go to Chat
Type: "What did you learn from this?"
Click Send
Watch as the local Ollama model responds!

Verification Checklist

Docker is running
You can access http://localhost:8502
Ollama credential is configured and tested
Models are registered
You created a notebook
Chat works with local model

All checked? You have a completely private, offline research assistant!

Advantages of Local Setup

No API costs - Free forever
No internet required - True offline capability
Privacy first - Your data never leaves your machine
No subscriptions - No monthly bills

Trade-off: Slower than cloud models (depends on your CPU/GPU)

Troubleshooting

"ollama: command not found"

Docker image name might be different:

docker ps  # Find the Ollama container name
docker exec <container_name> ollama pull mistral

Model Download Stuck

Check internet connection and restart:

docker compose restart ollama

Then retry the model pull command.

"Address already in use" Error

docker compose down
docker compose up -d

Low Performance

Check if GPU is available:

# Show available GPUs
docker exec open-notebook-local-ollama-1 ollama ps

# Enable GPU in docker-compose.yml

Then restart: docker compose restart ollama

Adding More Models

# List available models
docker exec open-notebook-local-ollama-1 ollama list

# Pull additional model
docker exec open-notebook-local-ollama-1 ollama pull neural-chat

Next Steps

Now that it's running:

Add Your Own Content: PDFs, documents, articles (see 3-USER-GUIDE)
Explore Features: Podcasts, transformations, search
Full Documentation: See all features
Scale Up: Deploy to a server with better hardware for faster responses
Benchmark Models: Try different models to find the speed/quality tradeoff you prefer

Alternative: Using LM Studio Instead of Ollama

Prefer a GUI? LM Studio is easier for non-technical users:

Download LM Studio: https://lmstudio.ai
Open the app, download a model from the library
Go to "Local Server" tab, start server (port 1234)
In Open Notebook, go to Settings → API Keys
Click Add Credential → Select OpenAI-Compatible
Enter base URL: http://host.docker.internal:1234/v1
Enter API key: lm-studio (placeholder)
Click Save, then Test Connection
Configure in Settings → Models → Select your LM Studio model

Note: LM Studio runs outside Docker, use host.docker.internal to connect.

Going Further

Switch models: Change in Settings → Models anytime
Add more models:
- Ollama: Run ollama pull <model>, then re-discover models from the credential
- LM Studio: Download from the app library
Deploy to server: Same docker-compose.yml works anywhere
Use cloud hybrid: Keep some local models, add cloud provider credentials for complex tasks

Common Model Choices

Model	Speed	Quality	VRAM	Best For
mistral	Fast	Good	4GB	Testing, general use
neural-chat	Medium	Better	6GB	Balanced, recommended
llama2	Slow	Best	8GB+	Complex reasoning
phi	Very Fast	Fair	2GB	Minimal hardware

Need Help? Join our Discord community - many users run local setups!

7.4 KiB Raw Blame History