Whishper

screenshot of Whishper
svelte

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Overview

Whishper is an open-source, 100% local audio transcription and subtitling suite with a full-featured web UI. It offers various features such as transcribing any media to text, downloading transcriptions in multiple formats, translating transcriptions to different languages, and a powerful subtitle editor. Whishper is fast and can run on both CPU and GPU. It also has future plans to include features like local folder as media input, full-text search, user authentication, audio recording, and non-NVIDIA GPU support. The project is structured into three main pieces: Transcription-API, Whishper-Backend, and Whishper-Frontend.

Features

  • Transcribe any media to text: Supports transcribing audio, video, and more.
  • Download transcriptions in many formats: Offers TXT, JSON, VTT, SRT formats, and allows copying raw text to clipboard.
  • Translate transcriptions: Can translate transcriptions to any language supported by Libretranslate.
  • Powerful subtitle editor: Includes features like transcription highlighting, CPS warnings, segment splitting, segment insertion, and subtitle language selection.
  • 100% Local: Transcription, translation, and editing happen locally on the user's machine.
  • Fast: Uses FasterWhisper as the Whisper backend, resulting in faster transcription times, especially on CPUs.
  • Quick and easy setup: Provides a quick start script and simplified setup steps.
  • GPU support: Utilizes NVIDIA GPUs for even faster transcription times.
  • CPU support: Can run on CPUs for users without GPUs.

Summary

Whishper is a comprehensive open-source audio transcription and subtitling suite that offers a range of features. It allows users to transcribe various types of media, download transcriptions in different formats, translate transcriptions to different languages, and edit subtitles with a powerful editor. The application is designed for local use, enabling transcription, translation, and editing to occur on the user's machine. Whishper is fast and can utilize both CPUs and GPUs for even faster transcription times. With its easy setup process and future roadmap for additional features, Whishper provides a valuable solution for audio transcription and subtitling needs.

svelte
Svelte

Svelte is a modern front-end framework that compiles your code at build time, resulting in smaller and faster applications. It uses a reactive approach to update the DOM, allowing for high performance and a smoother user experience.