Creates the API layer for Open Notebook Creates a services API gateway for the Streamlit front-end Migrates the SurrealDB SDK to the official one Change all database calls to async New podcast framework supporting multiple speaker configurations Implement the surreal-commands library for async processing Improve docker image and docker-compose configurations
3.3 KiB
CLAUDE.md
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Project Overview
Open Notebook is an open-source, privacy-focused alternative to Google's Notebook LM. It's a research assistant that allows users to manage research, generate AI-assisted notes, and interact with content through Streamlit UI and REST API, backed by SurrealDB.
Development Commands
Environment Setup
# Copy environment templates
cp .env.example .env
cp .env.example docker.env
# Install dependencies
uv sync
uv pip install python-magic
Running the Application
# Start SurrealDB (required)
make database
# or: docker compose up -d surrealdb
# Start API backend (port 5055)
make api
# or: uv run run_api.py
# or: uv run --env-file .env uvicorn api.main:app --host 0.0.0.0 --port 5055
# Start Streamlit UI (port 8502)
make run
# or: uv run --env-file .env streamlit run app_home.py
Code Quality
# Run linter with auto-fix
make ruff
# or: ruff check . --fix
# Run type checking
make lint
# or: uv run python -m mypy .
Docker Commands
# Full stack deployment
docker compose --profile multi up
# Build multi-platform image
make docker-build
# Release with version tag
make docker-release
Architecture Overview
Three-Layer Architecture
- Frontend: Streamlit UI (
app_home.pyand/pages/) - API: FastAPI backend (
/api/) on port 5055 - Database: SurrealDB graph database
Key Directories
/open_notebook/domain/: Domain models (notebook, models, transformation)/open_notebook/graphs/: LangGraph processing (chat, ask, source, transformation)/open_notebook/database/: SurrealDB repository pattern/api/: REST API endpoints/pages/: Streamlit UI pages/migrations/: Database migrations
Data Storage
/data/uploads/: User-uploaded files/data/podcasts/: Generated podcasts/data/sqlite-db/: LangGraph checkpoints/surreal_data/: SurrealDB files
AI Provider Integration
The project uses the Esperanto library for multi-provider AI support:
- Language models: OpenAI, Anthropic, Google, Groq, Ollama, Mistral, DeepSeek, xAI, OpenRouter
- Embeddings: OpenAI, Google, Ollama, Mistral, Voyage
- Speech: OpenAI, Groq, ElevenLabs, Google TTS
Model configuration is centralized through ModelManager class in /open_notebook/domain/models.py.
Database Operations
Uses SurrealDB with async operations:
# Create record
await repo_create(table: str, data: dict)
# Upsert (merge) record
await repo_upsert(table: str, record_id: Union[str, RecordID], data: dict)
# Query
await repo_query("SELECT * FROM table WHERE field = $value", {"value": "example"})
# Delete
await repo_delete(record_id)
Content Processing Pipeline
- Content ingestion (files, URLs, text) via
/open_notebook/graphs/source.py - Text extraction using Content-Core library
- Embedding generation for semantic search
- Transformation workflows in
/open_notebook/graphs/transformation.py
Testing Approach
Check README or search codebase for test configuration before running tests. The project uses uv for all Python operations.
API Documentation
Interactive API docs available at http://localhost:5055/docs when API is running. Comprehensive endpoints for notebooks, sources, notes, search, models, transformations, and embeddings.