OtosakuKWS is a lightweight, privacy-focused keyword spotting engine for iOS, designed to detect speech commands in real time — entirely on device.
It uses a CRNN CoreML model combined with log-Mel spectrograms for fast, accurate, and low-latency voice command recognition.
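At inference time, the recognition step reduces to turning the model's per-class outputs into a keyword and a confidence. A minimal post-processing sketch (the label order and the softmax step here are assumptions for illustration; the actual `classes.txt` ordering may differ):

```swift
import Foundation

// Hypothetical label list mirroring classes.txt (order is an assumption).
let labels = ["go", "no", "stop", "yes", "other"]

// Numerically stable softmax over raw model outputs.
func softmax(_ logits: [Double]) -> [Double] {
    let m = logits.max() ?? 0
    let exps = logits.map { exp($0 - m) }
    let sum = exps.reduce(0, +)
    return exps.map { $0 / sum }
}

// Pick the most likely class and its confidence from one inference step.
func topKeyword(logits: [Double]) -> (keyword: String, confidence: Double) {
    let probs = softmax(logits)
    let best = probs.enumerated().max(by: { $0.element < $1.element })!
    return (labels[best.offset], best.element)
}

let (kw, conf) = topKeyword(logits: [0.1, 0.2, 4.0, 0.3, 0.5])
print("Detected: \(kw) [\(conf)]")  // "stop" with high confidence
```

In practice a detection threshold on the confidence (and the dedicated `other` class) keeps background speech from triggering spurious keywords.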
Watch the model running live on iPhone 13:
This project depends on the OtosakuFeatureExtractor-iOS Swift package, which extracts log-Mel spectrograms in real time using Accelerate.
It also includes a ready-to-use filterbank archive (`filterbank.npy`, `hann_window.npy`).
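The shipped filterbank is presumably built from the standard mel scale. A small sketch of the underlying formulas (HTK-style mel conversion; the band count and sample rate below are illustrative assumptions, not the package's actual parameters):

```swift
import Foundation

// HTK-style mel scale conversions.
func hzToMel(_ hz: Double) -> Double { 2595.0 * log10(1.0 + hz / 700.0) }
func melToHz(_ mel: Double) -> Double { 700.0 * (pow(10.0, mel / 2595.0) - 1.0) }

// Evenly spaced (in mel) band edges from 0 Hz to Nyquist, e.g. for a
// 16 kHz signal; each triangular filter spans three consecutive edges.
func melBandEdges(numBands: Int, sampleRate: Double) -> [Double] {
    let low = hzToMel(0), high = hzToMel(sampleRate / 2)
    return (0...(numBands + 1)).map { i in
        melToHz(low + (high - low) * Double(i) / Double(numBands + 1))
    }
}

print(melBandEdges(numBands: 40, sampleRate: 16000).prefix(3))
```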
The CRNN model was trained on the keywords: “go”, “no”, “stop”, “yes”.
Includes:
- `CRNNKeywordSpotter.mlmodelc`
- `classes.txt`
| Metric | Value |
|---|---|
| val_accuracy | 0.971313 |
| val_f1_go | 0.964216 |
| val_f1_no | 0.974067 |
| val_f1_other | 0.949783 |
| val_f1_stop | 0.983282 |
| val_f1_yes | 0.98564 |
| val_loss | 0.0846668 |
| val_precision_go | 0.977573 |
| val_precision_no | 0.966123 |
| val_precision_other | 0.949195 |
| val_precision_stop | 0.985112 |
| val_precision_yes | 0.979248 |
| val_recall_go | 0.95122 |
| val_recall_no | 0.982143 |
| val_recall_other | 0.950372 |
| val_recall_stop | 0.981459 |
| val_recall_yes | 0.992116 |
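As a quick sanity check, each reported F1 score is the harmonic mean of the corresponding precision and recall, e.g. for “go”:

```swift
import Foundation

// F1 is the harmonic mean of precision and recall.
func f1(precision: Double, recall: Double) -> Double {
    2 * precision * recall / (precision + recall)
}

// Values taken from the table above.
print(f1(precision: 0.977573, recall: 0.951220))  // ≈ 0.9642, matching val_f1_go
```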
The model was trained on a balanced subset of the Google Speech Commands v2 dataset, using strong augmentations and class balancing.
```swift
let kws = try OtosakuKWS(
    modelRootURL: modelURL,
    featureExtractorRootURL: featurizerURL,
    configuration: .init()
)

kws.onKeywordDetected = { keyword, confidence in
    print("Detected: \(keyword) [\(confidence)]")
}

let audioInput = AudioStreamer()
// The `onBuffer` callback receives a chunk of audio sampled at 16 kHz, mono (1 channel).
// `AudioStreamer` here is a dummy real-time microphone streamer that simulates live input.
audioInput.onBuffer = { buffer in
    Task { await kws.handleAudioBuffer(buffer) }
}
```
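Under the hood, a streaming engine like this typically accumulates arbitrary-length 16 kHz chunks and emits fixed-size, overlapping analysis windows. A minimal sketch of that buffering pattern (the 25 ms window / 10 ms hop below are common illustrative values, not necessarily what OtosakuKWS uses):

```swift
import Foundation

// Accumulates arbitrary-length 16 kHz mono chunks and emits fixed-size
// overlapping windows suitable for spectrogram frames or model input.
final class WindowedBuffer {
    private var samples: [Float] = []
    private let windowSize: Int
    private let hopSize: Int
    var onWindow: (([Float]) -> Void)?

    init(windowSize: Int, hopSize: Int) {
        self.windowSize = windowSize
        self.hopSize = hopSize
    }

    func append(_ chunk: [Float]) {
        samples.append(contentsOf: chunk)
        // Emit every full window, then slide forward by the hop size.
        while samples.count >= windowSize {
            onWindow?(Array(samples.prefix(windowSize)))
            samples.removeFirst(hopSize)
        }
    }
}

// Usage: 25 ms windows with a 10 ms hop at 16 kHz (400 / 160 samples).
let buffer = WindowedBuffer(windowSize: 400, hopSize: 160)
var windows = 0
buffer.onWindow = { _ in windows += 1 }
buffer.append([Float](repeating: 0, count: 1600))  // 100 ms of audio
print(windows)  // prints 8
```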
If you need a custom KWS model for your use case — different keywords, languages, or domain-specific speech — feel free to reach out.
CoreML, keyword spotting, speech commands, offline voice recognition, privacy-first AI, log-Mel spectrogram, iOS speech processing, CRNN, on-device inference, streaming audio, Swift AI