
A GUI for OpenAI Whisper based on Tauri and SvelteKit
Whiskey is a graphical user interface (GUI) for OpenAI's Whisper speech recognition system. It is built with Tauri and SvelteKit and uses C++ binaries for Whisper. Whiskey transcribes audio or video files into written text and highlights the text in real time during playback. Transcriptions can be exported as .txt or .vtt files. This article covers Whiskey's key features, an installation guide, and a summary of its capabilities.
Planned features include file renaming, drag and drop functionality, and support for different platforms.
In summary, Whiskey is a GUI for OpenAI's Whisper speech recognition system, built with Tauri and SvelteKit. It transcribes audio or video files into written text, highlights the text in real time during playback, and exports transcriptions as .txt or .vtt files. The planned features aim to improve the user experience and broaden where the tool can be used. Overall, Whiskey is meant to be a transcription tool that is easy to install and use.
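To make the .vtt export and playback highlighting more concrete, here is a minimal TypeScript sketch. It is not Whiskey's actual code: the `Segment` shape and the function names `formatTimestamp`, `toVtt`, and `activeSegmentIndex` are assumptions made for illustration. It shows how timed transcript segments could be serialized as WebVTT cues, and how the segment to highlight could be found from the current playback time.

```typescript
// Hypothetical segment shape; Whiskey's real data model may differ.
interface Segment {
  start: number; // start time in seconds
  end: number;   // end time in seconds
  text: string;
}

// Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm).
function formatTimestamp(seconds: number): string {
  const totalMs = Math.round(seconds * 1000);
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  const pad = (n: number, width: number): string => String(n).padStart(width, "0");
  return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)}.${pad(ms, 3)}`;
}

// Serialize segments as a WebVTT document: a "WEBVTT" header followed by
// cues separated by blank lines.
function toVtt(segments: Segment[]): string {
  const cues = segments.map(
    (seg) =>
      `${formatTimestamp(seg.start)} --> ${formatTimestamp(seg.end)}\n${seg.text.trim()}`
  );
  return ["WEBVTT", ...cues].join("\n\n") + "\n";
}

// Return the index of the segment that contains the current playback time,
// or -1 if no segment is active. A UI could highlight that segment.
function activeSegmentIndex(segments: Segment[], currentTime: number): number {
  return segments.findIndex(
    (seg) => currentTime >= seg.start && currentTime < seg.end
  );
}
```

In a player, `activeSegmentIndex` would typically be called from the media element's `timeupdate` handler with its `currentTime`. A linear scan is enough for short transcripts; a binary search over sorted segments would be a reasonable swap for long ones.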

Svelte is a modern front-end framework that compiles components at build time into JavaScript that updates the DOM directly, rather than diffing a virtual DOM at runtime. This typically results in smaller bundles, faster updates, and a smoother user experience.
Vite is a build tool that aims to provide a faster and leaner development experience for modern web projects.
TypeScript is a superset of JavaScript that adds optional static typing, interfaces, and other features that help developers write more maintainable and scalable code. Its static type system can catch many errors at compile time, before the code runs, which makes large applications easier to build and maintain.
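As a small illustration of compile-time checking, the sketch below restricts an export format to the two file types mentioned earlier. The `ExportFormat` type and `exportFileName` function are hypothetical names chosen for this example, not part of Whiskey.

```typescript
// Hypothetical type: only the two export formats named in this article.
type ExportFormat = "txt" | "vtt";

// Build an output file name from a base name and an allowed format.
function exportFileName(baseName: string, format: ExportFormat): string {
  return `${baseName}.${format}`;
}

const fileName = exportFileName("interview", "vtt");
console.log(fileName);

// The call below is rejected by the TypeScript compiler, because "pdf"
// is not assignable to ExportFormat. It is commented out so the file compiles.
// exportFileName("interview", "pdf");
```

In plain JavaScript the invalid call would only surface at runtime, if at all; with the union type the mistake is reported while the code is being written.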