Commit Graph

38 Commits

Author SHA1 Message Date
rishikanthc
4960a2d528 Finalize database migration correctness fixes 2026-04-23 12:22:01 -07:00
rishikanthc
5d6a60d793 Refactor database migration and persistence layer 2026-04-23 11:38:25 -07:00
rishikanthc
0be71a63a0 refactor database schema and legacy migration flow 2026-04-23 10:17:03 -07:00
Fran Fitzpatrick
4e75295019 feat: add speaker identification toggle to summary templates
Add option to include speaker labels in summary prompts when diarization
is available. When enabled, transcripts are formatted as:
[SPEAKER_NAME] Text here...

The prompt also includes a hint to the LLM that speaker labels are present,
helping it produce summaries that attribute statements to specific speakers.

Changes:
- Add IncludeSpeakerInfo field to SummaryTemplate model
- Add toggle UI in summary template dialog
- Format transcript with speaker labels when generating summary
- Update prompt prefix to indicate speaker labels are present

Closes #353

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-28 10:58:49 -08:00
rishikanthc
ad3053cc9b fix: add Voxtral model selection and fix dependencies
- Add FamilyMistralVoxtral and ModelVoxtral constants
- Add case for Voxtral in selectModels switch statement
- Add convertToVoxtralParams function for parameter conversion
- Add MaxNewTokens field to WhisperXParams model
- Map language and max_new_tokens parameters correctly
- Fix parameter name in buffered script (output_path -> output_file)
- Add mistral-common dependency to pyproject.toml
- Check for both VoxtralForConditionalGeneration AND mistral_common

On next server restart, the environment will be re-synced automatically
to install the missing mistral-common dependency.
2025-12-31 15:47:19 -08:00
rishikanthc
bcb22af50d docs: update swagger documentation for delta sync API 2025-12-07 15:22:16 -08:00
rishikanthc
d752012a76 feat: implement server-side delta sync with soft deletes and updated_after param 2025-12-07 15:22:16 -08:00
ET
91af22bfd8 Configurable OpenAI API Base URL
Fix for enhancement issue #194

Added option to use custom OpenAI API base URL.

If not configured the default OpenAI API base URL (https://api.openai.com/v1) will be used.

Does not change current behavior of apiKey, i.e if apiKey is already configured it will not have to be re-entered when modifying base URL.
2025-12-06 12:32:03 -08:00
rishikanthc
f3266b31e5 feat: implement OpenAI transcription support 2025-12-01 14:00:33 -08:00
rishikanthc
ac3073786d feat: add callback_url to transcription parameters 2025-12-01 10:56:49 -08:00
rishikanthc
bd209b4b76 propagate delete to delete all resources associated with audio file 2025-11-27 10:35:01 -08:00
rishikanthc
222aa97706 adds support for nvidia sortformer diarization model 2025-09-10 11:48:07 -07:00
rishikanthc
e24a075f94 fixes persistence of API keys 2025-09-09 20:22:31 -07:00
rishikanthc
f725fdf619 implements info cards for multi-track audio files 2025-09-09 20:09:17 -07:00
rishikanthc
97232579bb transcript merging prototype 2025-09-09 20:09:17 -07:00
rishikanthc
af24647e6e implements mergin multi-track audio files into single file for playback 2025-09-09 20:09:17 -07:00
rishikanthc
d5aa1b7ae8 adds ability to upload multi-track audio files 2025-09-09 11:17:17 -07:00
rishikanthc
ce0b85fa23 adds support for video transcription - formatter run touched all files 2025-09-07 18:24:00 -07:00
rishikanthc
87c940615c adds configuring context parameters for parakeet model 2025-09-07 13:51:42 -07:00
rishikanthc
c3e73a9129 adds prototype for parakeet transcription 2025-09-07 13:51:42 -07:00
rishikanthc
59d6b02c4f adds support for renaming speakers 2025-09-04 09:28:19 -07:00
rishikanthc
98493c0b59 adds optional auto-transcription on upload 2025-09-02 10:55:33 -07:00
rishikanthc
bf8919f730 adds default transcription profiles 2025-09-02 10:18:22 -07:00
rishikanthc
1e40459d11 add table for storing transcription stats 2025-09-01 21:36:58 -07:00
rishikanthc
28dbc63fad fixes jwt token expiration with token refresh - prevents needing to login after app restart 2025-08-29 10:19:36 -07:00
rishikanthc
055be15f28 adds tests and modifies backend to enable testing 2025-08-29 10:04:10 -07:00
rishikanthc
d4e2e8f0a1 fix summary persistence bug 2025-08-26 11:49:29 -07:00
rishikanthc
ad2bb87b0b generating summaries - fixed bug in persistence of summary templates 2025-08-26 09:37:35 -07:00
rishikanthc
0d8f1b098c persist model selection and select model for summary templates 2025-08-26 09:04:48 -07:00
rishikanthc
d9d6a0916f add settings for managing summarization templates 2025-08-26 08:31:07 -07:00
rishikanthc
489e6af50d ability to highlight and annotate transcript with notes 2025-08-25 18:39:02 -07:00
rishikanthc
1d5600dc0f basic chat using openAI functionality 2025-08-24 18:05:29 -07:00
rishikanthc
5f93291692 add settings to configure LLM providers 2025-08-24 10:55:20 -07:00
rishikanthc
8d6ae90b37 support for configuration profiles - with settinf default profile 2025-08-23 20:53:17 -07:00
rishikanthc
38db957232 implements configuration profiles 2025-08-23 19:02:05 -07:00
rishikanthc
2cbb1dbeb7 support for configuring transcription parameters 2025-08-22 21:13:58 -07:00
rishikanthc
e6a1d4a731 connect upload audio to backend 2025-08-21 15:57:10 -07:00
rishikanthc
96aeaf1454 basic UI 2025-08-21 15:48:55 -07:00