Skip to content

Conversation

catileptic
Copy link
Collaborator

@catileptic catileptic commented Mar 12, 2025

The purpose of this experiment is to see if we can make audio & video files searchable using transcriptions.

Pinned filelock to the latest version in order to avoid an endless loop of failing to acquire the lock (see this issue).

Exploring using faster-whisper due to the speed improvements.

TODO:

  • compare image sizes
  • any work-arounds for re-downloading the model?
  • eliminate silence

catileptic and others added 4 commits February 28, 2025 13:30
* Remove README & LICENSE from .dockerignore

* Refactor ingest-file to be compatible with nomenklatura

* Make linter happy

* Add poetry.lock
@catileptic catileptic marked this pull request as draft March 12, 2025 17:44
@catileptic catileptic changed the title Experimental feature: transcribe audio & video files Experimental feature: make audio & video files searchable Mar 13, 2025
@catileptic catileptic closed this Apr 15, 2025
@simonwoerpel simonwoerpel deleted the feature/whisperai branch September 1, 2025 11:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant