DAESA24/podcast-processor-cli-tool

Podcast Processor CLI Tool

A Claude Code sub-agent that automatically transcribes Spotify podcast episodes using OpenAI Whisper API and formats them into structured markdown files.

Overview

This tool creates a seamless workflow for converting Spotify podcast content into searchable, formatted transcripts with timestamps, metadata, and structured navigation.

Core Workflow

Input: Spotify podcast URL
Download: Extract audio from Spotify podcast
Transcribe: Process audio through OpenAI Whisper API
Format: Generate structured markdown with metadata
Save: Output formatted transcript file
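The five steps above can be sketched as a small pipeline in which each stage is a plain callable. This is a minimal illustration, not the project's implementation: `PipelineSteps`, `run_pipeline`, and the stand-in stages are hypothetical names, and real stages would call the Spotify and Whisper services.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PipelineSteps:
    """Hypothetical container for the four processing stages."""
    download: Callable[[str], bytes]        # Spotify URL -> raw audio bytes
    transcribe: Callable[[bytes], str]      # audio bytes -> transcript text
    format_markdown: Callable[[str], str]   # transcript text -> markdown document
    save: Callable[[str], str]              # markdown document -> output path


def run_pipeline(spotify_url: str, steps: PipelineSteps) -> str:
    """Run the stages in workflow order and return the saved file's path."""
    audio = steps.download(spotify_url)
    text = steps.transcribe(audio)
    markdown = steps.format_markdown(text)
    return steps.save(markdown)


# Stand-in stages so the sketch runs without network access.
saved: dict = {}


def fake_save(markdown: str) -> str:
    saved["out.md"] = markdown
    return "out.md"


demo_steps = PipelineSteps(
    download=lambda url: b"fake-audio",
    transcribe=lambda audio: "Welcome to the show...",
    format_markdown=lambda text: "## Transcript\n\n" + text,
    save=fake_save,
)
result_path = run_pipeline("https://open.spotify.com/episode/xyz123", demo_steps)
```

Passing the stages in as callables keeps each one replaceable in tests, which matters while the real download and transcription code does not exist yet.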

Key Features

Spotify Integration: Direct URL input for podcast episodes
High-Quality Transcription: OpenAI Whisper API for excellent accuracy
Structured Output: YAML frontmatter + timestamped sections
Speaker Identification: Host/Guest labeling with visual icons
Navigation: Clickable timestamps for easy reference
Claude Code Integration: Seamless CLI command integration
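As an illustration of the timestamped navigation feature, the sketch below turns a segment start time in seconds into the `[HH:MM:SS]` form used in the transcript headings. The function names are hypothetical, and the assumption that start times arrive as seconds is not stated in this README.

```python
def format_timestamp(seconds: float) -> str:
    """Format a start time in seconds as [HH:MM:SS], dropping fractions."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"[{hours:02d}:{minutes:02d}:{secs:02d}]"


def section_heading(seconds: float, title: str) -> str:
    """Build a third-level markdown heading for a transcript section."""
    return f"### {format_timestamp(seconds)} {title}"


heading = section_heading(330, "Main Topic Discussion")
```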

Output Format

Markdown Structure

    ---
    title: "Episode Title"
    podcast_name: "Podcast Series Name"
    episode_number: "Episode #123"
    spotify_url: "https://open.spotify.com/episode/..."
    transcription_date: "2025-09-17"
    duration: "45:32"
    file_size: "42.3 MB"
    language: "en"
    quality: "high"
    ---

    ## Episode Summary

    [Auto-generated key topics and overview]

    ## Transcript

    ### [00:00:00] Introduction

    **Host:** Welcome to the show...

    ### [00:05:30] Main Topic Discussion

    **Guest:** Thanks for having me...
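One way to produce this structure is sketched below. It is an assumption-laden example: `render_frontmatter` and `render_transcript` are hypothetical helpers, and the hand-rolled quoting only handles simple string values (a YAML library would be more robust for real metadata).

```python
def render_frontmatter(metadata: dict) -> str:
    """Render string metadata as a YAML frontmatter block with quoted values."""
    lines = ["---"]
    for key, value in metadata.items():
        # Escape backslashes and double quotes so the value stays valid YAML.
        escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'{key}: "{escaped}"')
    lines.append("---")
    return "\n".join(lines)


def render_transcript(metadata: dict, summary: str, sections: list) -> str:
    """Assemble frontmatter, summary, and timestamped sections.

    Each section is a (timestamp, title, speaker, text) tuple.
    """
    parts = [render_frontmatter(metadata), "## Episode Summary", summary, "## Transcript"]
    for timestamp, title, speaker, text in sections:
        parts.append(f"### [{timestamp}] {title}")
        parts.append(f"**{speaker}:** {text}")
    return "\n\n".join(parts) + "\n"


document = render_transcript(
    {"title": "Episode Title", "language": "en"},
    "[Auto-generated key topics and overview]",
    [("00:00:00", "Introduction", "Host", "Welcome to the show...")],
)
```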

File Naming Convention

[podcast-name]-ep[###]-[episode-title-slug]-transcript-YYYY-MM-DD.md
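The naming convention can be sketched as follows. The helper names are hypothetical, and two details are assumptions rather than documented behavior: `[###]` is read as a zero-padded three-digit episode number, and slugs are lowercase ASCII with hyphens.

```python
import re
from datetime import date


def slugify(text: str) -> str:
    """Lowercase the text and collapse non-alphanumeric runs into hyphens."""
    slug = re.sub(r"[^a-z0-9]+", "-", text.lower())
    return slug.strip("-")


def transcript_filename(podcast_name: str, episode_number: int,
                        episode_title: str, transcribed_on: date) -> str:
    """Build [podcast-name]-ep[###]-[episode-title-slug]-transcript-YYYY-MM-DD.md."""
    return (
        f"{slugify(podcast_name)}-ep{episode_number:03d}-"
        f"{slugify(episode_title)}-transcript-{transcribed_on.isoformat()}.md"
    )


name = transcript_filename("Podcast Series Name", 7, "Episode Title!", date(2025, 9, 17))
```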

Technical Architecture

Technology Stack

Framework: Pydantic AI (production-ready Python agent framework)
APIs: Spotify API + OpenAI Whisper API
Language: Python with full type safety
CLI Integration: Claude Code sub-agent pattern
Output: Structured markdown with YAML frontmatter

Core Components

URL Validation: Type-safe Spotify URL processing
Audio Processing: Download and format conversion pipeline
API Integration: Robust Whisper API calls with error handling
Markdown Generation: Template engine for consistent formatting
CLI Interface: Claude Code command integration
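The URL validation component could look something like the sketch below, which accepts only URLs of the form `https://open.spotify.com/episode/<id>`. `extract_episode_id` is a hypothetical name, the alphanumeric ID pattern is an assumption, and other URL shapes (for example, ones with a locale prefix in the path) are rejected here for simplicity.

```python
import re
from urllib.parse import urlparse

# Assumed shape of an episode path: /episode/<alphanumeric id>
_EPISODE_PATH = re.compile(r"^/episode/([A-Za-z0-9]+)/?$")


def extract_episode_id(url: str) -> str:
    """Return the episode ID from a Spotify episode URL, or raise ValueError."""
    parsed = urlparse(url)
    if parsed.scheme != "https" or parsed.netloc != "open.spotify.com":
        raise ValueError(f"Not an open.spotify.com HTTPS URL: {url!r}")
    match = _EPISODE_PATH.match(parsed.path)
    if match is None:
        raise ValueError(f"Not a Spotify episode URL: {url!r}")
    return match.group(1)


episode_id = extract_episode_id("https://open.spotify.com/episode/xyz123?si=abc")
```

Query parameters are ignored because `urlparse` separates them from the path before the pattern is applied.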

Pydantic AI Integration

Input Validation Models

    from typing import Literal

    from pydantic import BaseModel, HttpUrl


    class PodcastRequest(BaseModel):
        spotify_url: HttpUrl
        output_format: Literal["markdown", "json"] = "markdown"
        include_timestamps: bool = True
        language: str = "en"

Agent-Based Processing

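    # Sketch only: `agent`, AudioFile, TranscriptionResult, and MarkdownDocument
    # are defined elsewhere. Pydantic AI's @agent.tool expects a RunContext as
    # the first parameter; @agent.tool_plain is the variant for tools without one.
    @agent.tool_plain
    def download_audio(spotify_url: str) -> AudioFile:
        # Spotify API integration
        ...

    @agent.tool_plain
    def transcribe_audio(audio_file: AudioFile) -> TranscriptionResult:
        # Whisper API with retry logic
        ...

    @agent.tool_plain
    def format_transcript(result: TranscriptionResult) -> MarkdownDocument:
        # Structured markdown generation
        ...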

Development Status

Current Phase: Planning - Ready for BMAD Implementation

Completed

✅ Project concept and requirements definition
✅ Technical architecture outline
✅ Output format design and markdown structure
✅ Pydantic AI framework integration analysis
✅ BMAD framework installation

Next Steps

BMAD Planning Phase: Use analyst/PM/architect agents for detailed requirements
Pydantic AI Setup: Install framework and define data models
API Integration: Implement Spotify and Whisper API connections
CLI Development: Create Claude Code sub-agent interface
Testing & Validation: Quality assurance and error handling

BMAD Method Integration

This project uses the BMAD (Breakthrough Method for AI-driven Agile Development) methodology:

Planning Phase Agents

Analyst → Research Spotify API + Whisper integration patterns
Product Manager → Define CLI interface and error handling
Architect → Design audio processing pipeline architecture
Product Owner → Validate document consistency

Development Phase

Scrum Master → Break into implementable stories
Developer → Code implementation with Pydantic AI
QA/Test Architect → Quality assurance and testing

Use Cases

Primary Use Case

Target User: Content creators, researchers, students
Problem Solved: Manual transcription of podcast content is time-consuming and error-prone
Value Delivered: Automated, high-quality transcripts with searchable content

Business Applications

Content repurposing for blogs and social media
Research and analysis of podcast discussions
Accessibility improvements for deaf and hard-of-hearing audiences
SEO optimization through text content generation

Project Structure

    podcast-processor-cli-tool/
    ├── .bmad-core/              # BMAD framework installation
    ├── .claude/                 # Claude Code integration
    ├── drews-artifacts/         # Project checkpoints and planning docs
    ├── src/                     # Source code (TBD)
    ├── tests/                   # Test suites (TBD)
    └── README.md                # This file

Installation & Usage

Coming soon - CLI interface under development

    # Future usage pattern
    claude-podcast-transcribe https://open.spotify.com/episode/xyz123
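Since the CLI does not exist yet, the following is only a guess at how the planned command might parse its arguments. The positional URL comes from the usage pattern above; the optional flags are hypothetical and simply mirror the fields of the `PodcastRequest` model.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the planned command (all flags are hypothetical)."""
    parser = argparse.ArgumentParser(
        prog="claude-podcast-transcribe",
        description="Transcribe a Spotify podcast episode to markdown.",
    )
    parser.add_argument("spotify_url", help="Spotify episode URL")
    parser.add_argument("--output-format", choices=["markdown", "json"],
                        default="markdown")
    # Timestamps are on by default; this flag switches them off.
    parser.add_argument("--no-timestamps", dest="include_timestamps",
                        action="store_false")
    parser.add_argument("--language", default="en")
    return parser


def main(argv=None) -> argparse.Namespace:
    """Parse arguments; a real entry point would start the pipeline here."""
    return build_parser().parse_args(argv)


args = main(["https://open.spotify.com/episode/xyz123"])
```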

Quality & Error Handling

Multi-Layer Validation

URL Validation: Verify Spotify podcast URLs
API Error Handling: Robust retry logic for network failures
Audio Format Validation: Ensure compatible audio processing
Quality Metrics: Confidence scores and transcription accuracy
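The retry logic mentioned above could be sketched with exponential backoff as shown here. This is a generic pattern rather than the project's code: `call_with_retry` and its parameters are hypothetical, and which exception types count as retryable would depend on the API client actually used.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_retry(
    operation: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
    retry_on: Tuple[Type[BaseException], ...] = (ConnectionError, TimeoutError),
) -> T:
    """Call operation, retrying on the given errors with doubling delays."""
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except retry_on:
            if attempt == max_attempts:
                raise  # Out of attempts: surface the last error.
            # Delay doubles each time: base_delay, 2x, 4x, ...
            sleep(base_delay * 2 ** (attempt - 1))
    raise ValueError("max_attempts must be at least 1")
```

The `sleep` parameter is injectable so tests can record the delays instead of waiting for them.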

Observability

Pydantic Logfire: Built-in monitoring and debugging
Performance Metrics: Processing time and API usage tracking
Quality Assessment: Transcription confidence and error rates
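To make the performance metrics concrete, here is a minimal standard-library sketch that records per-stage processing time and counts API calls. It does not use Pydantic Logfire; `RunMetrics` and its fields are hypothetical stand-ins for whatever the monitoring layer ends up recording.

```python
import time
from contextlib import contextmanager
from dataclasses import dataclass, field


@dataclass
class RunMetrics:
    """Hypothetical per-run metrics: stage durations and API call count."""
    stage_seconds: dict = field(default_factory=dict)
    api_calls: int = 0

    @contextmanager
    def timed(self, stage: str):
        """Record how long the enclosed block takes, even if it raises."""
        start = time.perf_counter()
        try:
            yield
        finally:
            self.stage_seconds[stage] = time.perf_counter() - start


metrics = RunMetrics()
with metrics.timed("transcribe"):
    metrics.api_calls += 1  # A real stage would call the Whisper API here.
```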

Contributing

This is a learning project focused on:

Pydantic AI framework patterns
Claude Code sub-agent development
Production-ready Python practices
AI-assisted development workflows

License

Built with: Pydantic AI | BMAD Methodology | OpenAI Whisper | Claude Code Integration

Purpose: Automated podcast transcription with production-ready AI agent patterns
