AI & Processing

Transcribe File

Convert recordings into useful text with clear control over model, language, speakers, and Spark post-processing.

Transcribe File

This mode is for offline or recorded media, not live dictation.

Add a File

You can:

drag and drop a supported audio/video file
click Choose File

Supported extensions are shown in the info tooltip on the drop zone.

Transcription Controls

For selected file:

choose transcription model
choose language (when model supports manual language choice)

Speaker Diarization (Local Models)

When local model provider is active, you can enable:

Speaker Diarization
Enhanced Diarization

You can also set default speaker labels before processing.

Use diarization when you need "who said what" structure.

Spark After Transcription

TranscribeFileSparkEnabled controls whether Spark runs after transcription.

When enabled, you can choose:

Spark prompt
Spark provider
Spark model

These are stored as Transcribe File-specific preferences and can differ from your global Spark defaults.

Output Review

After processing:

review Sparked text (if Spark enabled)
review Original text
copy or save either output
adjust transcript font size in viewer

Recommended Workflow

transcribe without Spark first
check recognition quality and speaker segmentation
enable Spark for formatting/summarization once raw transcript quality is acceptable

Common Issues

Speaker labels are messy

Retest with local diarization and cleaner source audio.

Spark output over-edits details

Use a stricter prompt that preserves names, numbers, and quoted statements.

Note

[Screencast Placeholder] Show full flow: drop file -> choose model/language -> enable diarization + Spark -> compare original vs sparked output.