Syncsasync-openai realtime types with the latest OpenAI Realtime API (June 2025).
Adds richer request/response configs, new client & server events, extra enums for models / voices / modalities, plus tracing & noise-reduction support.

✨ What’s new

Client events
- AddedResponseConfig,OutputAudioBufferClearEvent,ConversationItemRetrieveEvent.
- ResponseCancelEvent gainsresponse_id.
- ResponseCreateEvent now usesResponseConfig instead ofSessionResource.
Server events
- Addedoutput_audio_buffer.cleared,conversation.item.input_audio_transcription.delta,conversation.item.retrieved.
- Fixed typo:InputAudioBufferCommitedEvent →InputAudioBufferCommittedEvent.
Response resource
- New fields:finish_reason,created_at.
- New finish reasons:TokenLimit,FunctionCall.
Session resource
- New enums:RealtimeModel,Modality,NoiseReductionType.
- Added fields:speed,input_audio_noise_reduction,tracing.
- model is nowRealtimeModel;modalities isVec<Modality>.
Turn detection
- Introducedsemantic_vad mode withcreate_response andinterrupt_response flags.
Audio
- Unified enum names (g711_ulaw,g711_alaw).
- AddedInputAudioNoiseReduction.
Tooling
- WiredToolChoice &ToolDefinition intoResponseConfig.

⚠️ Breaking changes

ResponseCreateEvent:response now expectsResponseConfig,notSessionResource.
Enum casing:g711-ulaw /g711-alaw →g711_ulaw /g711_alaw.
Event rename:InputAudioBufferCommitedEvent →InputAudioBufferCommittedEvent.
Typed model field:SessionResource.model is nowRealtimeModel (no longer a free-formString).

codesodaand others added5 commits

June 23, 2025 16:28

feat: enhance realtime response types and audio transcription options

6395a6c

- Added `Cancelled` variant to `ResponseStatusDetail` enum for better handling of cancelled responses.- Introduced `LogProb` struct to capture log probability information for transcribed tokens.- Updated `ConversationItemInputAudioTranscriptionCompletedEvent` and `ConversationItemInputAudioTranscriptionDeltaEvent` to include optional `logprobs` for per-token log probability data.- Enhanced `AudioTranscription` struct with optional fields for `language`, `model`, and `prompt` to improve transcription accuracy and customization.- Added new `SemanticVAD` option in the `TurnDetection` enum to control model response eagerness.- Expanded `RealtimeVoice` enum with additional voice options for more variety in audio responses.

feat: update audio format enum values for consistency

daeb8c7

- Changed enum variants for `AudioFormat` to use underscores instead of hyphens in their serialized names.- Updated `G711ULAW` from `g711-ulaw` to `g711_law` and `G711ALAW` from `g711-alaw` to `g711_alaw` for improved clarity and adherence to naming conventions.

feat: add auto-response options to VAD configurations

2bb05e3

feat: add realtime API types and event handling for audio, tracing, a…

479bf1e

…nd response management

Merge branch 'main' into chore/update-realtime-spec

edcae25

Labels

None yet

1 participant

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update more realtime spec#397

Are you sure you want to change the base?

Update more realtime spec#397

Uh oh!

Conversation

codesoda commentedJun 30, 2025•
edited
Loading

Uh oh!

Uh oh!

Uh oh!

Movatterモバイル変換

Update more realtime spec#397

Are you sure you want to change the base?

Update more realtime spec#397

Uh oh!

Conversation

codesoda commentedJun 30, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

Uh oh!

codesoda commentedJun 30, 2025•
edited
Loading