open-notebook/docs/podcast.md
Luis Novo d7b0fff954
Api podcast migration (#93)
Creates the API layer for Open Notebook
Creates a services API gateway for the Streamlit front-end
Migrates the SurrealDB SDK to the official one
Change all database calls to async
New podcast framework supporting multiple speaker configurations
Implement the surreal-commands library for async processing
Improve docker image and docker-compose configurations
2025-07-17 08:36:11 -03:00

6 KiB

Podcast Generator

Open Notebook's Podcast Generator creates professional, multi-speaker podcasts from your research content. With our Episode Profile system, you can generate high-quality podcasts in just 3 clicks - no complex configuration required.

🎯 More Flexible: Unlike Google Notebook LM's fixed 2-host format, Open Notebook supports 1-4 speakers with complete customization of personalities, voices, and conversation styles.

🎬 3-Click Podcast Generation

Step 1: Choose Episode Profile

Select from pre-configured podcast styles:

  • Tech Discussion: 2 technical experts discussing complex topics
  • Solo Expert: Single expert explaining concepts in an accessible way
  • Business Analysis: Business-focused panel discussion
  • Interview Style: Host interviewing a subject matter expert

Or configure your own.

Step 2: Name Your Episode

Give your podcast a descriptive name that reflects the content.

Step 3: Generate

Click "Generate Podcast" and continue using Open Notebook while your podcast processes in the background (2-3 minutes).

Podcast Generation

🎙️ Episode Profiles vs Traditional Setup

Before (15+ Fields):

  • Manual speaker role configuration
  • Complex conversation style settings
  • Detailed voice and personality setup
  • Dialogue structure customization
  • Provider and model selection

Now (Episode Profiles):

  • Pre-configured professional templates
  • Optimized speaker combinations
  • Battle-tested conversation flows
  • One-click generation with optional customization

🔄 Background Processing

Non-Blocking Experience

  • Podcasts generate in the background
  • Continue your research while processing
  • Simple status tracking without complexity
  • Desktop notifications when complete

Job Status Tracking

Monitor your podcast generation:

  • Pending: Job queued for processing
  • Running: Currently generating (outline → transcript → audio)
  • Completed: Ready to listen and download
  • Failed: Error details for troubleshooting

Podcast Status

🎨 Customization Options

Speaker Configurations

  • Solo Format: Single expert with rich personality
  • Dual Speakers: Two complementary perspectives
  • Panel Discussion: 3-4 speakers with diverse viewpoints
  • Interview Style: Host + guest dynamic

Voice & Personality

Each speaker profile includes:

  • Voice Selection: Choose from multiple TTS providers
  • Personality Traits: Optimized speaking styles
  • Backstory: Rich character development
  • Role Definition: Clear expert positioning

Content Adaptation

  • Automatic Briefing: Context-aware content adaptation
  • Segment Structure: Optimized for engagement
  • Conversation Flow: Natural dialogue patterns
  • Fact Integration: Seamless research incorporation

🛠️ Advanced Features

Multi-Provider Support

Choose your preferred AI and TTS providers:

  • Language Models: OpenAI, Anthropic, Google, Groq, Ollama
  • Text-to-Speech: OpenAI, Google TTS, ElevenLabs
  • Local Processing: Full Ollama support for privacy

Custom Episode Profiles

Create your own profiles by combining:

  • Speaker configurations (1-4 speakers)
  • AI model preferences
  • Default briefing templates
  • Segment count and structure

Episode Management

  • Library View: All episodes organized by notebook
  • Audio Player: Integrated playback with controls
  • Download Options: Export MP3 for offline listening
  • Metadata: Generation details and settings used

📱 Mobile & Accessibility

Audio-First Design

Perfect for:

  • Commuting and travel
  • Exercise and walking
  • Multitasking scenarios
  • Visual accessibility needs

Quality Features

  • Professional Audio: High-quality TTS with natural speech
  • Consistent Pacing: Optimized for comprehension
  • Clear Diction: Enhanced pronunciation and clarity
  • Background Processing: No interruption to workflow

🔧 Technical Architecture

Background Worker System

  • Async Processing: Non-blocking podcast generation
  • Queue Management: Reliable job processing
  • Error Recovery: Automatic retry and detailed logging
  • Scalable Design: Foundation for future features

Integration Points

  • Content Pipeline: Seamless notebook content integration
  • Search Integration: Generate podcasts from search results
  • Transformation System: Part of larger content processing workflow
  • API Access: Full programmatic control via REST API

🎧 Sample Podcasts

Listen to examples of what Open Notebook can create:

Check out our podcast sample

Generated using custom Episode Profile with ElevenLabs voices and interview format

🚀 Getting Started

  1. Setup: Ensure you have API keys configured for your preferred providers
  2. Initialize: Click "Initialize Default Profiles" on first use
  3. Select Content: Choose notebook with research content
  4. Generate: Pick profile → name episode → generate
  5. Listen: Audio appears in episode list when complete

Pro Tips

Content Optimization

  • Rich Source Material: More content = better podcast discussions
  • Clear Topics: Focused content creates more engaging conversations
  • Mixed Media: Combine text, links, and documents for depth

Profile Selection

  • Tech Content: Use "Tech Discussion" for technical deep-dives
  • Business Content: Use "Business Analysis" for strategic discussions
  • Educational: Use "Solo Expert" for clear explanations
  • General: Use "Interview Style" for broad topic exploration

Workflow Integration

  • Research → Generate: Create podcasts during active research
  • Review Sessions: Generate summaries of completed research
  • Learning Path: Create series with consistent Episode Profiles
  • Sharing: Export episodes for team knowledge sharing

The Podcast Generator establishes Open Notebook as a superior alternative to Google Notebook LM with unmatched flexibility, quality, and user control.