Back to Blog
FeatureProduct
February 9, 2026
15 min read

Voice-Powered Learning: Cereby's New Audio Capabilities

Designing voice-first study workflows for speaking and listening, not only typing and reading

The Problem: Text-Only Study Workflows Exclude Real Usage Patterns

Most learning products optimize for keyboard-first usage, but real study behavior often happens while commuting, walking, or multitasking. In those contexts, text input becomes the bottleneck.

We needed to support voice-native interaction patterns without sacrificing context quality or response reliability.


What We Shipped

  1. Dictation Mode — Speak naturally to Cereby AI using push-to-talk
  2. AI-Powered Podcast Generation — Transform any study material into professional audio content

Together, these features extend Cereby from text-first assistance to multimodal learning workflows.

Dictation Mode: Your Voice, Cereby's Intelligence

What is Dictation Mode?

Dictation Mode transforms Cereby AI into a voice-first assistant. Instead of typing questions or requests, you can simply speak naturally and have your words instantly transcribed and processed by Cereby AI.

Key Features:
  • Push-to-Talk Interface — Hold to speak, release to send
  • Natural Speech Recognition — Speak naturally, no special commands needed
  • Instant Transcription — See your words appear in real-time
  • Seamless Integration — Works with all Cereby AI features
  • Visual Feedback — Pulsing animations show when you're recording
  • Mobile-Optimized — Touch-friendly interface for on-the-go learning

Why Dictation Mode Matters

1. Learning While Moving

Traditional study sessions require sitting at a desk with a keyboard. Dictation Mode frees you to learn anywhere:

  • Walking to class — Ask Cereby to quiz you on today's material
  • On the bus — Request a quick review of key concepts
  • At the gym — Have Cereby explain complex topics while you work out
  • Doing chores — Turn cleaning time into study time
  • Before bed — Review flashcards without screens
Real-World Scenario:

A chemistry student with an exam tomorrow can use Dictation Mode while walking her dog, getting quizzed on organic chemistry functional groups without interrupting her routine.

2. Faster Than Typing

Speaking is significantly faster than typing — the average person speaks 150 words per minute but types only 40 words per minute. This 4x speed advantage means quick questions get instant answers and complex explanations don't require tedious typing.

3. Accessibility for All

Dictation Mode makes Cereby accessible to students who:

  • Have motor disabilities or typing difficulties
  • Suffer from repetitive strain injuries
  • Are dyslexic and struggle with spelling
  • Have visual impairments
  • Are ESL students who speak better than they type
  • Simply prefer verbal communication

Education should be accessible to everyone. Dictation Mode removes barriers.

4. Natural Learning Flow

Conversations feel more natural when spoken. Dictation Mode enables:

  • Spontaneous questions — Ask as thoughts occur
  • Natural follow-ups — "Tell me more about that" feels natural spoken
  • Emotional context — Voice conveys confusion, excitement, understanding
  • Thinking out loud — Verbalize your thought process to Cereby

How to Use Dictation Mode

Getting Started

  1. Open Cereby AI in your Cereby workspace
  2. Click the User icon (👤) in the chat input area
  3. Dictation Mode activates — A push-to-talk button appears
  4. Hold the button while speaking
  5. Release when done — Your speech is transcribed and sent to Cereby AI

Best Practices

Speak Clearly, But Naturally:
  • Use your normal conversational tone
  • Pause briefly between sentences for better accuracy
  • Avoid background noise when possible
Structure Your Requests:
  • State the topic first
  • Then ask your question
  • Add specifics if needed
Use Natural Language:
  • ✅ Good: "Explain photosynthesis in simple terms"
  • ✅ Good: "Quiz me on my weak points in calculus"
  • ✅ Good: "Create notes on quantum physics for my exam next week"

Advanced Techniques

Multi-Part Questions: Hold the button and speak your entire multi-part question, including all context and specifications. Contextual Follow-Ups: After Cereby responds, ask follow-up questions that reference the previous conversation. Content Generation: Request podcast generation, note creation, or any other Cereby feature using voice.

AI-Powered Podcast Generation: Your Personal Study Radio

What is Podcast Generation?

Podcast Generation transforms written study materials into professional, AI-narrated audio content. Cereby AI analyzes your content, writes an engaging script optimized for audio learning, and generates high-quality speech using advanced text-to-speech technology.

Key Features:
  • 6 Professional Voices — Choose the voice that helps you learn best
  • 3 Podcast Styles — Lecture, conversation, or storytelling
  • Customizable Duration — 1 to 15 minutes
  • Speed Control — Adjust playback from 0.25x to 4.0x
  • Smart Script Generation — AI optimizes content for audio comprehension
  • Automatic Timestamps — Navigate to specific sections easily
  • Multi-Source Support — Generate from notes, files, web articles, or custom text

Why Podcasts for Learning?

1. Audio Learning is Powerful

Research shows that auditory learning:

  • Improves retention — Hearing information engages different neural pathways
  • Enables repetition — Listen multiple times without screen fatigue
  • Reduces cognitive load — Audio allows passive review while doing other tasks
  • Enhances comprehension — Natural pacing and intonation aid understanding
  • Supports memorization — Auditory memory is distinct from visual memory

2. Learning in Dead Time

Everyone has "dead time" — moments when you can't read but can listen:

  • Commuting — 30-60 minutes daily for many students
  • Exercise — Gym, running, walking
  • Household chores — Cooking, cleaning, laundry
  • Before sleep — Audio review without blue light
  • Getting ready — Morning routine becomes study time
Impact Calculation:

Average student dead time: 2 hours/day × 5 days/week = 10 hours/week × 4 weeks = 40 hours of extra study time before an exam.

That's an entire week of studying you didn't even know you had.

3. Different Perspective, Better Understanding

The same concept explained differently can unlock understanding. Podcasts provide:

  • Narrative structure — Information flows as a story
  • Conversational tone — Feels like someone explaining to you personally
  • Verbal emphasis — Stress and intonation highlight key points
  • Natural pacing — Time to process between concepts
  • Engaging delivery — Professional narration keeps attention

4. Review Without Screen Fatigue

Digital eye strain affects 50-90% of students who use computers extensively. Podcasts offer:

  • Screen-free learning — Give your eyes a break
  • Reduced blue light — Better for evening study sessions
  • Less mental fatigue — Audio feels less demanding than reading
  • Posture flexibility — Learn while standing, walking, stretching

Voice Selection: Finding Your Perfect Narrator

Cereby offers 6 distinct AI voices, each with unique characteristics:

Alloy (Default)

  • Tone: Neutral, clear, professional
  • Best for: General study content, technical subjects
  • Character: Reliable teacher explaining concepts systematically

Echo

  • Tone: Calm, soothing, measured
  • Best for: Complex material requiring focus, bedtime review
  • Character: Patient tutor helping you understand difficult concepts

Fable

  • Tone: Warm, engaging, expressive
  • Best for: Storytelling style podcasts, historical narratives
  • Character: Enthusiastic storyteller bringing content to life

Onyx

  • Tone: Deep, authoritative, commanding
  • Best for: Lecture-style content, formal academic material
  • Character: Professor delivering a compelling lecture

Nova

  • Tone: Bright, energetic, friendly
  • Best for: Conversation-style podcasts, lighter topics
  • Character: Study buddy explaining concepts in a fun way

Shimmer

  • Tone: Soft, articulate, refined
  • Best for: Language learning, detailed explanations
  • Character: Articulate tutor with perfect pronunciation
Pro Tip: Try different voices for different subjects:
  • STEM subjects → Alloy or Onyx (clear and authoritative)
  • Humanities → Fable or Nova (engaging and expressive)
  • Languages → Shimmer (clear pronunciation)
  • Late-night review → Echo (calming and gentle)

Podcast Styles: Tailoring the Experience

1. Lecture Style (Default)

Format: Clear, systematic teaching approach Characteristics:
  • Authoritative and structured
  • Concepts explained step-by-step
  • Examples provided for clarification
  • Logical progression
Best for:
  • Technical subjects (STEM, math, science)
  • Comprehensive topic coverage
  • Exam preparation
  • Students who prefer traditional teaching

2. Conversation Style

Format: Friendly, informal explanation Characteristics:
  • Casual, approachable language
  • Rhetorical questions engage the listener
  • Relatable examples and analogies
  • Natural pacing
Best for:
  • Humanities and social sciences
  • Conceptual understanding
  • Students who prefer informal learning
  • Building intuition before formal study

3. Storytelling Style

Format: Narrative-driven educational content Characteristics:
  • Engaging story arc
  • Vivid descriptions and scenarios
  • Contextualizes information in narratives
  • Emotional engagement
Best for:
  • History and literature
  • Case studies and real-world applications
  • Making dry material interesting
  • Long-form content

Creating Your First Podcast

Method 1: From Conversation (Easiest)

Simply chat with Cereby AI and request a podcast:

Using Text or Dictation:
"Create a 5-minute podcast on photosynthesis"
"Make me a podcast about quantum physics using the conversation style and the nova voice, about 10 minutes long"

Cereby AI will:

  1. ✅ Generate an optimized script
  2. ✅ Create professional audio
  3. ✅ Provide a download link and player
  4. ✅ Save to your podcast library
Cost: 12 Coins per podcast

Method 2: From Study Materials

Generate podcasts from existing content:

  • From Notes: "Create a podcast from my biology notes"
  • From Uploaded Files: "Turn this lecture PDF into a 10-minute podcast"
  • From Web Articles: "Make a podcast from this research article"

Method 3: Advanced Customization

Specify all parameters for full control over duration, style, voice, speed, and focus areas.

Real-World Use Cases

Case Study 1: The Pre-Exam Commuter

Student: Marcus, engineering major with 45-minute daily commute Workflow:
  • Generated 8 podcasts covering exam topics on Sunday
  • Listened during morning commute (review)
  • Evening commute at 1.5x speed (reinforcement)
  • Generated quiz-based podcast reviewing weak areas
Result:
  • 6 hours of additional study time
  • Exam score: 92% (previous exam: 78%)
  • "I turned dead time into my most productive study sessions"

Case Study 2: The Night Owl with Screen Fatigue

Student: Aisha, medical student with digital eye strain Workflow:
  • 8 PM: Final screen-based study session
  • 9 PM: Switch to podcasts using Echo voice
  • Listen while stretching, organizing notes, getting ready for bed
  • 10 PM: Fall asleep with auto-timed review podcast
Result:
  • 2 hours of screen-free study daily
  • Reduced eye strain and headaches
  • Better sleep quality
  • Improved retention

Case Study 3: The Multi-Modal Learner

Student: Jay, psychology major who learns best with repetition across formats Workflow:
  1. Read textbook chapter (visual)
  2. Generate podcast in lecture style (auditory)
  3. Create notes using Cereby AI (writing)
  4. Generate conversation-style podcast (different perspective)
  5. Quiz using Cereby AI (retrieval practice)
Result: 5 exposures in different formats engaging different memory pathways.

Case Study 4: The Auditory Learning Specialist

Student: Priya, history major who struggles with reading retention Workflow:
  1. Upload lecture slides and readings to Cereby
  2. Generate storytelling-style podcasts for each topic
  3. Listen 2-3 times before class
  4. Participate actively in class discussions
  5. Generate review podcast focusing on class discussions
Result: Leverages natural learning strength, enters classes prepared, better participation.

Performance and Costs

Generation Time

DurationScript GenerationAudio GenerationTotal Time
5 min15-20 seconds8-12 seconds~30 sec
10 min25-35 seconds15-20 seconds~50 sec
15 min35-45 seconds22-30 seconds~70 sec
Key Insight: Even a 15-minute podcast is ready in about 1 minute.

Cost Structure

Per Podcast: 12 Coins

Value comparison: Professional audiobook narrator costs $200-400 per finished hour. Cereby podcasts cost $1.20 (12 Coins) for 10 minutes - 27-55x cheaper than professional narration.

Storage and Bandwidth

Students get:

  • Unlimited podcast storage (saved to your library forever)
  • Unlimited playback (no additional costs to replay)
  • Downloadable MP3s (play offline anytime)
  • Shareable links (collaborate with study groups)

Combining Dictation and Podcasts: The Ultimate Workflow

Workflow 1: Hands-Free Podcast Creation

  1. Activate Dictation Mode
  2. Speak your podcast request with all specifications
  3. 60 seconds later — Podcast is ready
  4. Listen during next walk
Total interaction time: 10 seconds of speaking Study value: 5+ minutes of audio content

Workflow 2: Iterative Learning Cycle

  1. Read textbook section
  2. Use Dictation Mode to request explanation
  3. If still unclear, request conversation-style podcast
  4. Use Dictation Mode for follow-up quiz
Result: 4 exposures, 3 modalities, full comprehension

Workflow 3: Exam Preparation Sprint

Day 1-2: Generate 10-15 podcasts for major topics Day 3-6: Listen during commute, gym, daily activities Day 7: Generate targeted podcasts for weak areas Total study time added: 15+ hours of audio learning

Workflow 4: Group Study Enhancement

  1. Member generates podcast on key concepts
  2. Share with group
  3. All members listen (synchronized knowledge)
  4. Group discussion with shared baseline
  5. Generate quiz for group practice

Tips and Best Practices

For Dictation Mode

Optimize Your Environment:
  • Find quiet spaces for best accuracy
  • Use headphones with mic for privacy
  • Test before important use
Structure Your Speech:
  • Think before you speak
  • Use complete sentences
  • Speak at natural pace
Leverage Context:
  • Reference previous conversation
  • Build on previous answers
  • Use follow-ups naturally

For Podcasts

Choose the Right Voice:
  • Test all 6 voices
  • Stick with your favorite 2-3
  • Match voice to content
Select Appropriate Duration:
  • 5 minutes: Quick concept review
  • 10 minutes: Standard topic coverage
  • 15 minutes: Comprehensive deep dive
Style Selection Guide:
  • Lecture: Technical subjects, exam prep
  • Conversation: Humanities, conceptual understanding
  • Storytelling: History, case studies
Playback Optimization:
  • First listen: Normal speed (1.0x)
  • Review listens: 1.2-1.5x for efficiency
  • Before sleep: 0.75-0.9x for relaxation
  • Quick review: 1.5-2.0x when time-limited
Build a Library:
  • Generate podcasts at start of unit
  • Create listening schedule
  • Replay before exams
  • Share with classmates

Accessibility Impact

These voice features dramatically improve accessibility for students with disabilities, situational limitations, and language learners.

Future Enhancements

Coming soon:

Enhanced Dictation

  • Wake word detection
  • Continuous conversation mode
  • Multi-language support
  • Emotion detection
  • Voice commands

Advanced Podcast Features

  • Multi-voice podcasts (conversational format)
  • Interactive podcasts (pause for quiz questions)
  • Chapter markers
  • Podcast playlists
  • Collaborative podcasts

Integration Enhancements

  • Calendar integration for auto-scheduling
  • Smart reminders
  • Spaced repetition audio
  • Offline mode
  • Smart speaker support

Pricing and Availability

Dictation Mode

  • Cost: FREE
  • Availability: All Cereby users
  • Limits: None (unlimited use)

Podcast Generation

  • Cost: 12 Coins per podcast
  • Availability: All Cereby users
  • Limits: None (generate unlimited podcasts)

Getting Started Today

Step 1: Try Dictation Mode

  1. Open Cereby AI
  2. Click the User icon (👤)
  3. Hold the blue button and say: "Explain how Cereby's dictation mode works"

Step 2: Generate Your First Podcast

Say: "Create a 5-minute podcast on [your topic] using the [voice name] voice"

Step 3: Build Your Routine

  • Identify your "dead time"
  • Generate podcasts for current topics
  • Listen during identified time
  • Use Dictation Mode for follow-ups
  • Generate new podcasts as you progress

Step 4: Experiment and Optimize

  • Try all 6 voices
  • Test different styles
  • Adjust playback speed
  • Build your library
  • Share with classmates

Conclusion

Voice-powered learning represents the future of education technology. By supporting both input (dictation) and output (podcasts), Cereby enables a complete voice-first learning experience that's:

  • Faster — Speak 4x faster than typing
  • More accessible — Works for everyone, everywhere
  • More flexible — Learn while moving, exercising, commuting
  • More engaging — Natural conversation and professional narration
  • More effective — Multiple modalities improve retention
  • More efficient — Turn dead time into productive study time
The future of learning isn't silent — it's spoken, it's heard, and it's here.

Quick Reference

Dictation Mode Commands

"Explain [concept]"
"Create notes on [topic]"
"Quiz me on [subject]"
"Create a learning path for [exam/topic]"
"What are my weak points in [subject]?"
"Create a podcast on [topic]"

Podcast Voice Guide

VoiceToneBest For
AlloyNeutral, clearGeneral study, technical
EchoCalm, soothingComplex material, bedtime
FableWarm, expressiveStorytelling, history
OnyxDeep, authoritativeLectures, formal content
NovaBright, energeticConversations, lighter topics
ShimmerSoft, articulateLanguages, detailed explanations

Podcast Style Guide

StyleFormatBest For
LectureSystematic teachingSTEM, technical, exam prep
ConversationFriendly explanationHumanities, conceptual
StorytellingNarrative-drivenHistory, case studies, engagement

Ready to experience voice-powered learning? Open Cereby AI and activate Dictation Mode now!