Experimental feature: make audio & video files searchable #3

catileptic · 2025-03-12T17:36:17Z

The purpose of this experiment is to see if we can make audio & video files searchable using transcriptions.

Pinned filelock to the latest version in order to avoid an endless loop of failing to acquire the lock (see this issue).

Exploring using faster-whisper due to the speed improvements.

TODO:

compare image sizes
any work-arounds for re-downloading the model?
eliminate silence

* Remove README & LICENSE from .dockerignore * Refactor ingest-file to be compatible with nomenklatura * Make linter happy * Add poetry.lock

catileptic and others added 4 commits February 28, 2025 13:30

Refactor ingest-file to prepare it for using nomenklatura (#2)

0fc3567

* Remove README & LICENSE from .dockerignore * Refactor ingest-file to be compatible with nomenklatura * Make linter happy * Add poetry.lock

Remove RabbitMQ

caa6034

Add faster-whisper and pin filelock to latest

bcb5058

Initial commit for TranscriptionSupport class

e3d8815

catileptic marked this pull request as draft March 12, 2025 17:44

catileptic changed the title ~~Experimental feature: transcribe audio & video files~~ Experimental feature: make audio & video files searchable Mar 13, 2025

catileptic added 3 commits March 13, 2025 19:02

Transcribe file_path

9582eed

Bump fasttext to latest version

1314852

Unload model after transcribing

2b8f328

catileptic force-pushed the main branch from aa0ee71 to 3c86592 Compare March 20, 2025 16:23

catileptic closed this Apr 15, 2025

simonwoerpel deleted the feature/whisperai branch September 1, 2025 11:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Experimental feature: make audio & video files searchable #3

Experimental feature: make audio & video files searchable #3

Uh oh!

catileptic commented Mar 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Experimental feature: make audio & video files searchable #3

Experimental feature: make audio & video files searchable #3

Uh oh!

Conversation

catileptic commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

catileptic commented Mar 12, 2025 •

edited

Loading