OpenKara

Turn your music library into a karaoke stage.

An open-source desktop karaoke app powered by on-device AI stem separation and synced lyrics.

Demo

Why I Built This

I love singing karaoke at home, but every existing solution has its own set of problems.

The most mature option is probably KaraFun, a paid service that sidesteps copyright by re-recording famous songs. That's neat, but it comes with issues:

Their re-recorded instrumentals inevitably sound a little different from the originals
Their catalog doesn't always include the niche songs I want to sing
I hate subscriptions

Then there's Apple Music Sing, which offers on-device vocal removal for karaoke. Also neat — but Apple Music is yet another subscription, and I hate subscriptions.

To dodge the subscription trap, you could go the more traditional route with something like OpenKJ, which plays CD+G/MP3+G files. But CD+G files are niche, hard to find, and have to be purchased separately.

That pretty much leaves scouring YouTube for karaoke videos of dubious origin and questionable copyright status. Not exactly a unified experience, and the song I want is missing half the time.

So my no-compromise solution was born. OpenKara uses open-source AI to split the unencrypted digital music you already own into vocals and accompaniment, whether it comes from CD rips, Bandcamp, Qobuz, iTunes, or your local library's music service. I know there are plenty of people who, like me, prefer to buy once and own forever. OpenKara turns my existing music library into a karaoke library, so I don't have to pay for KTV, and my catalog is shaped by my own taste rather than the mainstream.

Features

Local Audio Import — Use music you already own. No subscriptions, no repurchases.
AI Stem Separation — Separate vocals and accompaniment on-device.
Synced Lyrics — Load timed lyrics from online sources, embedded tags, or sidecar .lrc files.
CD+G Sidecars — Render same-name .cdg graphics during fullscreen playback when a track includes them.
Portable Library — Self-contained library directory that works on NAS, USB drives, and across machines.
Cross-Platform — Available on macOS, Windows, and Linux.
4-Stem Mixer — Individual volume control for vocals, drums, bass, and other instruments. Collapsible accompaniment slider with per-stem breakdown.
Dual Separation Modes — Choose between 2-stem (vocals + accompaniment) or 4-stem (vocals + drums + bass + other). Upgrade individual songs from 2-stem to 4-stem on demand.
Efficient Stem Storage — Separated stems are cached compactly to keep library storage practical.
Resumable Separation — Per-chunk checkpointing means separation resumes from where it left off if the app is closed mid-process.
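
The per-chunk checkpointing behind Resumable Separation can be pictured with a small sketch. This is an illustration in Python, not OpenKara's Rust implementation: the checkpoint file, its JSON layout, and the `separate_chunk` callback are all hypothetical, chosen only to show the resume logic.

```python
import json
from pathlib import Path
from typing import Callable, List, Set


def run_separation(
    num_chunks: int,
    checkpoint: Path,
    separate_chunk: Callable[[int], None],
) -> List[int]:
    """Process chunks in order, skipping any already recorded as done.

    Returns the chunk indices processed during this call.
    """
    done: Set[int] = set()
    if checkpoint.exists():
        # Hypothetical layout: {"done": [0, 1, ...]}
        done = set(json.loads(checkpoint.read_text())["done"])

    processed = []
    for index in range(num_chunks):
        if index in done:
            continue  # finished in an earlier session
        separate_chunk(index)
        done.add(index)
        # Persist after every chunk so an interruption loses at most one chunk.
        checkpoint.write_text(json.dumps({"done": sorted(done)}))
        processed.append(index)
    return processed
```

If a first run is interrupted while working on chunk 2 of 4, the checkpoint lists chunks 0 and 1, so the next call only processes chunks 2 and 3.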

Quick Start

Install from Release

Download the latest build for your platform from GitHub Releases:

| Platform | Format |
| --- | --- |
| macOS (Apple Silicon) | `.dmg` |
| macOS (Intel) | `.dmg` |
| Windows | `.exe` (NSIS installer) |
| Linux | `.AppImage` / `.deb` |

macOS (Homebrew):

brew install thedavidweng/tap/openkara

macOS Gatekeeper note: If macOS says the app is damaged or can't be opened, run:

xattr -rd com.apple.quarantine /Applications/OpenKara.app

On first launch, OpenKara will prompt you to create a Karaoke Library and start downloading the default AI model in the background.

Build from Source

Prerequisites:

Node.js 20+
pnpm 10+
Rust stable toolchain
Platform dependencies for Tauri 2

git clone https://github.com/thedavidweng/OpenKara.git
cd OpenKara
pnpm install
./scripts/setup.sh      # downloads Demucs ONNX model for local dev
pnpm tauri dev

To bundle the official Google Drive Desktop OAuth client into a release build, set OPENKARA_GOOGLE_DRIVE_OAUTH_CLIENT_JSON or OPENKARA_GOOGLE_DRIVE_OAUTH_CLIENT_JSON_PATH before running pnpm tauri build. OpenKara treats the per-user refresh and access tokens as secrets and stores them in the OS credential store. The OAuth client_secret is a different matter: in a desktop app it ships inside the binary, so it cannot be treated as confidential.

App Icon

Source icon: src-tauri/icons/app-icon.png (1024x1024 master asset)
Regenerate all platform icons with pnpm icons:generate
Generated assets are written to src-tauri/icons/ for Tauri desktop and future mobile targets

AI Models

OpenKara uses custom ONNX builds of Demucs models for stem separation. Models are maintained in a separate repository:

openkara-models — Reproducible ONNX model conversion pipeline

| Model | Description | Input | Output | Format |
| --- | --- | --- | --- | --- |
| `htdemucs` | Standard: Hybrid Transformer Demucs | Stereo audio at 44.1 kHz (7.8 s) | 4 stems: drums, bass, other, vocals | ONNX (opset 17) |
| `htdemucs_ft` | High Quality: Fine-tuned 4-model ensemble | Stereo audio at 44.1 kHz (7.8 s) | 4 stems: drums, bass, other, vocals | ONNX (opset 17) |

On first launch, OpenKara automatically downloads the standard openkara-models v2.0.1 asset into the app data directory. The current standard model is ~339 MiB on disk, and the optional high quality model is ~1.32 GiB. Both assets are ONNX Runtime-optimized and carry metadata used for cache invalidation. See the openkara-models README for details on the conversion pipeline. For local development and deterministic tests, run ./scripts/setup.sh to populate src-tauri/models/.
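
The 7.8 s input window listed for both models fixes the number of samples per model call at 44.1 kHz. The arithmetic below is a sketch that assumes non-overlapping windows. OpenKara's actual chunking (overlap, padding) is not described here, and `chunk_count` is a hypothetical helper.

```python
SAMPLE_RATE_HZ = 44_100   # from the model table
SEGMENT_SECONDS = 7.8     # from the model table

# Samples per channel in one model input window: 44,100 * 7.8 = 343,980.
SEGMENT_SAMPLES = round(SAMPLE_RATE_HZ * SEGMENT_SECONDS)


def chunk_count(total_samples: int) -> int:
    """Windows needed to cover a track, assuming no overlap between windows
    (an assumption; the final window would need padding)."""
    return -(-total_samples // SEGMENT_SAMPLES)  # integer ceiling division
```

For example, a three-minute track has 180 × 44,100 = 7,938,000 samples per channel, which needs 24 windows under this assumption.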

Tech Stack

| Layer | Technology | Purpose |
| --- | --- | --- |
| Desktop framework | Tauri 2 | Rust backend + system WebView |
| Frontend | React 19 + TypeScript 5 | UI components |
| Bundler | Vite 7 | Dev server and build |
| Styling | Tailwind CSS 4 | Utility-first CSS |
| State | Zustand | Lightweight global state |
| Audio decode | symphonia | Pure-Rust codec support |
| Audio output | cpal | Cross-platform audio playback |
| AI inference | ONNX Runtime via ort | Demucs v4 stem separation |
| Lyrics | LRCLIB | Open synced lyrics API |
| Metadata | lofty | ID3v2, Vorbis, FLAC tag reading |
| Audio encode | vorbis_rs | OGG/Vorbis stem compression |
| Database | SQLite via rusqlite | Song, lyrics, and stems cache |

Architecture

┌─────────────────────────────────────────────────────────────┐
│                    Tauri Frontend (React)                   │
│  ┌────────────────┐               ┌──────────────────────┐  │
│  │  File Import   │               │ Karaoke Player       │  │
│  │   & Library    │               │ / Mixer              │  │
│  │                │               │ (lyrics sync)        │  │
│  └────────────────┘               └──────────────────────┘  │
│  ┌────────────────┐               ┌──────────────────────┐  │
│  │   Playback     │               │ Progress & Volume    │  │
│  │   Controls     │               │ Controls             │  │
│  └────────────────┘               └──────────────────────┘  │
├─────────────────────────────────────────────────────────────┤
│                    Tauri Rust Backend                       │
│  ┌────────────────┐               ┌──────────────────────┐  │
│  │   Audio Decode │               │  AI Stem Separation  │  │
│  │    & Playback  │               │ (Demucs v4 / ONNX)   │  │
│  └────────────────┘               └──────────────────────┘  │
│  ┌────────────────┐               ┌──────────────────────┐  │
│  │  Metadata      │               │   Lyrics Fetcher     │  │
│  │   Reader       │               │ (LRCLIB + embedded)  │  │
│  └────────────────┘               └──────────────────────┘  │
│  ┌───────────────────────────────────────────────────────┐  │
│  │             Portable Library                          │  │
│  │        (SQLite + media files + stems)                 │  │
│  └───────────────────────────────────────────────────────┘  │
└─────────────────────────────────────────────────────────────┘
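
The "lyrics sync" box in the diagram comes down to mapping a playback position to the current lyric line. The sketch below assumes the common `[mm:ss.xx]` LRC timestamp form at the start of each line. It is an illustration in Python rather than OpenKara's code, and `parse_lrc` and `active_line` are hypothetical names.

```python
import bisect
import re
from typing import List, Optional, Tuple

# One LRC timestamp such as [01:23.45] or [01:23]
_TIMESTAMP = re.compile(r"\[(\d+):(\d{2})(?:\.(\d{1,3}))?\]")


def parse_lrc(text: str) -> List[Tuple[float, str]]:
    """Return (seconds, lyric) pairs sorted by time."""
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        times, pos = [], 0
        # A line may start with several timestamps that share one lyric.
        while (m := _TIMESTAMP.match(line, pos)):
            frac = m.group(3) or "0"
            times.append(
                int(m.group(1)) * 60 + int(m.group(2)) + int(frac) / 10 ** len(frac)
            )
            pos = m.end()
        lyric = line[pos:].strip()
        # Lines without a timestamp (metadata tags, blanks) add nothing.
        entries.extend((t, lyric) for t in times)
    return sorted(entries)


def active_line(entries: List[Tuple[float, str]], position: float) -> Optional[str]:
    """Lyric whose timestamp is the latest one at or before `position`."""
    times = [t for t, _ in entries]
    i = bisect.bisect_right(times, position) - 1
    return entries[i][1] if i >= 0 else None
```

Binary search keeps the lookup cheap enough to call on every UI tick. In a real player the list of times would be built once per song rather than on each call.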

Supported Formats

| Format | Import | CD+G Graphics | Stem Separation |
| --- | --- | --- | --- |
| MP3 | ✅ | Same-name `.cdg` sidecar | ✅ |
| FLAC | ✅ | Same-name `.cdg` sidecar | ✅ |
| WAV | ✅ | Same-name `.cdg` sidecar | ✅ |
| OGG / Vorbis | ✅ | Same-name `.cdg` sidecar | ✅ |
| AAC / M4A | ✅ | Same-name `.cdg` sidecar | ✅ |
| MP3+G ZIP | ✅ | Embedded audio + `.cdg` bundle | N/A |

OpenKara imports same-name audio + .cdg pairs as managed CD+G tracks, and it can also import MP3+G ZIP archives directly. Standard audio tracks are resampled to 44.1 kHz stereo for the Demucs model. Managed CD+G tracks already contain accompaniment-only audio, so they skip stem separation.
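
Same-name pairing can be sketched as a file-stem comparison within one directory. The assumptions here are that the match is on the stem and that the `.cdg` extension is compared case-insensitively. OpenKara's real matching rules may differ, and `find_cdg_sidecar` is a hypothetical helper.

```python
from pathlib import Path
from typing import Optional


def find_cdg_sidecar(audio_path: Path) -> Optional[Path]:
    """Return the .cdg file next to audio_path that shares its stem, if any."""
    for candidate in sorted(audio_path.parent.iterdir()):
        if (
            candidate.is_file()
            and candidate.stem == audio_path.stem
            and candidate.suffix.lower() == ".cdg"
        ):
            return candidate
    return None
```

A track for which this returns a path would be treated as a managed CD+G track and skip stem separation, as described above. A track with no sidecar goes through the normal separation flow.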

Portable Library

OpenKara stores all data in a self-contained library directory:

MyKaraokeLibrary/
├── .openkara-library       # marker file
├── openkara.db             # SQLite database
├── media/                  # imported standard audio copies
│   └── {hash}.mp3
├── media-g/                # managed CD+G assets
│   ├── {hash}.mp3          # paired audio for CD+G playback
│   ├── {hash}.cdg          # paired CD+G graphics sidecar
│   └── {hash}.zip          # MP3+G ZIP archive when imported as a bundle
└── stems/                  # separated tracks for standard audio imports
    └── {hash}/
        ├── vocals.ogg
        ├── accompaniment.ogg   # 2-stem mode
        ├── drums.ogg           # 4-stem mode
        ├── bass.ogg            # 4-stem mode
        └── other.ogg           # 4-stem mode

All paths in the database are relative — including CD+G sidecars and MP3+G ZIP assets — so the whole library can be moved to a NAS, USB drive, or network share and opened by any OpenKara instance on any OS. Per-machine configuration (library location) is stored separately in the app data directory.
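
Because stored paths are relative, each machine only needs to know its own library root. The sketch below assumes paths are stored with forward slashes, which the text does not confirm, and both function names are hypothetical.

```python
from pathlib import Path, PurePosixPath


def to_stored(library_root: Path, absolute: Path) -> str:
    """Convert an absolute path inside the library to a portable relative string."""
    return absolute.relative_to(library_root).as_posix()


def to_absolute(library_root: Path, stored: str) -> Path:
    """Resolve a stored relative path against this machine's library root."""
    relative = PurePosixPath(stored)
    # Reject anything that could escape the library directory.
    if relative.is_absolute() or ".." in relative.parts:
        raise ValueError(f"not a library-relative path: {stored}")
    return library_root.joinpath(*relative.parts)
```

With this scheme, a stem stored as `stems/{hash}/vocals.ogg` resolves correctly whether the library is mounted at a NAS path on one machine or a USB path on another.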

Roadmap

✅ v0.1 — MVP

v0.2.0 — Released

OpenKara v0.2.0 is the release that established the current core app flow.

v0.4.0 — Released

OpenKara v0.4.0 is the current stable release. It adds:

AirPlay support for casting playback to compatible devices
Improved player behavior and layout at narrow window widths
Visual refinements to the Windows app appearance
Better preservation of original track metadata on import
WinGet installation support on Windows

📋 Future

Mic Input & Vocal Effects — Microphone capture, reverb, echo, volume mix
Saved Playlists & Singer Rotation — Named playlists, singer assignment, and stronger turn-based queue workflows
Pitch & Key Shift — Real-time pitch shifting of the accompaniment track
Session Recording — Record vocal performances, export as audio
CJK Transliteration — Romaji / Pinyin display alongside original lyrics

Development

Prerequisites

Node.js 20+
pnpm 10+
Rust stable via rustup
Tauri 2 prerequisites for your platform

Setup

pnpm install
./scripts/setup.sh          # download Demucs ONNX model to src-tauri/models/
pnpm tauri dev               # start dev server with hot reload

scripts/setup.sh places the model in src-tauri/models/ for local development and deterministic testing only. End-user installs use the app data directory for runtime model downloads.

Running Tests

cd src-tauri && cargo test   # backend tests (70+ tests)
pnpm lint                    # ESLint
pnpm format                  # Prettier check

Building

pnpm tauri build             # production build with platform-specific bundle

CI/CD

Pushes to main trigger the CI workflow (.github/workflows/ci.yml) — lint, build, and test on macOS, Windows, and Linux.
Pushing a version tag (e.g. v0.4.0) triggers the release workflow (.github/workflows/release.yml) — builds and attaches binaries to a GitHub Release.

Documentation

Docs Hub — Canonical index for design docs, plans, specs, references, and site content
Architecture — System design, tech stack, data flow, and runtime details
Project Structure — Current directory layout and module responsibilities
Development Phases — Phase checklist with verification steps
Technical Roadmap — Technology choices, API contracts, and risk mitigations
Milestones — Milestone task table with exit criteria

Contributing

Contributions are welcome! Please open an issue before starting major changes so we can discuss the approach.

Fork the repository
Create a feature branch (git checkout -b feature/my-feature)
Make your changes and ensure tests pass (cd src-tauri && cargo test)
Submit a pull request

Acknowledgments

Demucs — AI stem separation model by Meta Research
openkara-models — ONNX model conversion pipeline for OpenKara
demucs.onnx — Reference for STFT/ISTFT to real-valued ONNX conversion
LRCLIB — Open synced lyrics API
monochrome — Lyrics sync and LRCLIB integration reference
