11 Commits

Author SHA1 Message Date
Fran Fitzpatrick
8f537548d4 feat: add RTX 5090 Blackwell GPU support (sm_120)
Add support for NVIDIA RTX 50-series GPUs (Blackwell architecture) which
require CUDA 12.8+ and PyTorch cu128 wheels due to the new sm_120 compute
capability.

Changes:
- Add configurable PYTORCH_CUDA_VERSION environment variable to control
  PyTorch wheel version at runtime (cu126 for legacy, cu128 for Blackwell)
- Update all model adapters to use dynamic CUDA version instead of
  hardcoded cu126 URLs
- Update Dockerfile.cuda.12.9 for Blackwell with CUDA 12.9.1 base image,
  PYTORCH_CUDA_VERSION=cu128, and missing WHISPERX_ENV/yt-dlp
- Update Dockerfile.cuda with explicit PYTORCH_CUDA_VERSION=cu126
- Add docker-compose.blackwell.yml for pre-built Blackwell image
- Add docker-compose.build.blackwell.yml for local Blackwell builds
- Add GPU compatibility documentation to README

Fixes: rishikanthc/Scriberr#104
2025-12-24 14:46:44 -08:00
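The runtime selection described in 8f537548d4 can be sketched roughly as follows. This is an illustration only, not the project's adapter code: the function name `pytorch_index_url` and the constant names are hypothetical, and the fallback to cu126 is an assumption based on the commit describing cu126 as the legacy value.

```python
import os

# Hypothetical names; the project's real adapters may be structured differently.
DEFAULT_CUDA_VERSION = "cu126"  # assumed fallback: the "legacy" value from the commit message
PYTORCH_INDEX_BASE = "https://download.pytorch.org/whl"


def pytorch_index_url(env=None):
    """Build the PyTorch wheel index URL from PYTORCH_CUDA_VERSION.

    `env` defaults to os.environ; passing a plain dict makes the
    function easy to exercise without touching the real environment.
    """
    env = os.environ if env is None else env
    # Treat an unset or blank variable as "use the default".
    version = env.get("PYTORCH_CUDA_VERSION", "").strip() or DEFAULT_CUDA_VERSION
    return f"{PYTORCH_INDEX_BASE}/{version}"
```

With `PYTORCH_CUDA_VERSION=cu128` set (as in the Blackwell Dockerfile), this returns the cu128 index; with the variable unset it falls back to cu126.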
rishikanthc
3307349c13 fix: update yt-dlp url to download arch independent binary 2025-12-16 20:25:51 -08:00
rishikanthc
913063eb49 refactor: Switch yt-dlp to standalone binary & cleanup UV config
- Dockerfiles: Install yt-dlp binary from GitHub releases to /usr/local/bin
- Go: Execute yt-dlp binary directly, removing uv python wrapper
- Config: Remove unused UVPath configuration and findUVPath function
- Entrypoint: Remove yt-dlp init logic (still initializes whisperx env if needed)
2025-12-16 19:07:29 -08:00
rishikanthc
88166cbf6c fix: set correct internal path for whisperx-env 2025-12-16 18:38:51 -08:00
rishikanthc
11434b9f1b feat: Add production security configuration for CORS and cookie handling
- Fix refresh token cookie Secure flag bug (was hardcoded to false)
- Wire up AllowedOrigins config in CORS middleware (router, handlers, chat, SSE)
- Add APP_ENV=production to Dockerfile and Dockerfile.cuda
- Update all docker-compose files with APP_ENV and ALLOWED_ORIGINS examples
- CORS now validates origins in production, allows all in development
- Increase gocyclo threshold from 20 to 25 for complex handlers
2025-12-16 18:21:36 -08:00
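The origin check and cookie flag described in 11434b9f1b might look roughly like the sketch below. It is written in Python for illustration, whereas the commit touches Go middleware. The helper names, the comma-separated format of ALLOWED_ORIGINS, and the choice to derive the Secure flag from APP_ENV are all assumptions, not details confirmed by the commit.

```python
def parse_allowed_origins(raw):
    """Split an ALLOWED_ORIGINS value into a set of origins.

    Assumption: the value is a comma-separated list.
    """
    return {item.strip() for item in raw.split(",") if item.strip()}


def is_origin_allowed(origin, app_env, allowed_origins):
    """Allow every origin in development; validate against the list in production."""
    if app_env != "production":
        return True
    return origin in allowed_origins


def refresh_cookie_secure(app_env):
    """Assumption: the Secure flag follows the environment instead of being hardcoded to False."""
    return app_env == "production"
```

Keeping the production check as an exact match against a parsed set avoids accidental prefix or substring matches on origin strings.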
rishikanthc
17902a30da fix: downgrade cuda base image and remove conflicting LD_LIBRARY_PATH 2025-12-03 10:13:54 -08:00
rishikanthc
77ddb295b5 feat: polish audio detail ui, fix audio player state, and add cli support to cuda dockerfile 2025-11-30 11:28:10 -08:00
Geoff Tognetti
21d51f0869 Fix CUDA Error 35 on Parakeet for short audio (<300s)
The CUDA fix was previously applied only to the transcribe_buffered() function
(for audio >300s), but not to the transcribe_audio() function (for audio <300s).
This caused short audio files to fail with CUDA Error 35.

Changes:
- Add CUDA graph disable fix to transcribe_audio() using change_decoding_strategy()
- Add unzip package to Dockerfile.cuda for Deno installation

The fix disables CUDA graphs in the TDT decoder by setting use_cuda_graph_decoder: False
via OmegaConf's open_dict() and applying it with change_decoding_strategy(). This prevents
CUDA Error 35 (cudaErrorInsufficientDriver in the CUDA runtime API) on an RTX 2000e Ada GPU during graph compilation.

Tested with short audio files - transcription now works correctly with Parakeet + Sortformer.
2025-11-20 13:35:19 -08:00
Geoff Tognetti
c2d29fc9c6 Fix YouTube downloads - Add Deno runtime for video cipher decryption
YouTube downloads were failing with an "exit status 1" error. Root cause:
YouTube now requires yt-dlp to use a JavaScript runtime for video cipher
decryption.

Changes:
- Install Deno runtime in both Dockerfiles (standard and CUDA)
- Upgrade from yt-dlp to yt-dlp[default] to include all optional dependencies
- Add stderr capture to YouTube download handler for better error diagnostics
- Add performance logging for YouTube downloads (timing and file size)

Fixes #224

See: https://github.com/yt-dlp/yt-dlp/issues/14404
2025-11-15 14:06:57 -08:00
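The stderr capture and timing added in c2d29fc9c6 can be illustrated with a minimal sketch. The project's handler is written in Go; this Python version only shows the idea, and the function name `run_download` and the error message format are hypothetical.

```python
import subprocess
import time


def run_download(cmd):
    """Run a download command, returning the elapsed seconds on success.

    On a non-zero exit, the captured stderr is included in the error so
    that a bare "exit status 1" is no longer the only diagnostic.
    """
    start = time.monotonic()
    result = subprocess.run(cmd, capture_output=True, text=True)
    elapsed = time.monotonic() - start
    if result.returncode != 0:
        raise RuntimeError(
            f"download failed (exit status {result.returncode}) "
            f"after {elapsed:.2f}s: {result.stderr.strip()}"
        )
    return elapsed
```

A caller would pass the full argument list for the downloader, and could log the returned duration alongside the size of the resulting file.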
Zachary
b78e64c0d6 Adds cuda compatibility to Dockerfile.cuda
This will only work for x86_64 builds.
Adds the libcudnn from the cuda runtime to the .
This needs to be made available in the docker-entryway.sh
2025-09-11 12:45:05 -07:00
rishikanthc
b8ae4381ad updates dockerfile for cuda - and compose files 2025-09-04 15:58:07 -07:00