AI & Processing
Transcribe File
Convert recordings into useful text with clear control over model, language, speakers, and Spark post-processing.
Transcribe File
This mode is for offline or recorded media, not live dictation.
Add a File
You can:
- drag and drop a supported audio/video file
- click
Choose File
Supported extensions are shown in the info tooltip on the drop zone.
Transcription Controls
For selected file:
- choose transcription model
- choose language (when model supports manual language choice)
Speaker Diarization (Local Models)
When local model provider is active, you can enable:
Speaker DiarizationEnhanced Diarization
You can also set default speaker labels before processing.
Use diarization when you need "who said what" structure.
Spark After Transcription
TranscribeFileSparkEnabled controls whether Spark runs after transcription.
When enabled, you can choose:
- Spark prompt
- Spark provider
- Spark model
These are stored as Transcribe File-specific preferences and can differ from your global Spark defaults.
Output Review
After processing:
- review
Sparkedtext (if Spark enabled) - review
Originaltext - copy or save either output
- adjust transcript font size in viewer
Recommended Workflow
- transcribe without Spark first
- check recognition quality and speaker segmentation
- enable Spark for formatting/summarization once raw transcript quality is acceptable
Common Issues
Speaker labels are messy
Retest with local diarization and cleaner source audio.
Spark output over-edits details
Use a stricter prompt that preserves names, numbers, and quoted statements.
Note
[Screencast Placeholder] Show full flow: drop file -> choose model/language -> enable diarization + Spark -> compare original vs sparked output.