Transcribe: audio-to-text tool with diarization and speaker role labelling

Adds Transcribe as the sixth tool in the hub. Accepts MP3/WAV/OGG/M4A/FLAC/WebM
uploads up to 200 MB and proxies them to Whisper on the cuttlefish GPU.
Optional speaker separation (diarization), with role labelling of each speaker
via GPT-4o-mini (dommer, advokat, forelder, sakkyndig, etc., i.e. judge,
lawyer, parent, expert witness).
TXT / SRT / VTT downloads are generated client-side from the segment data.
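
The client-side export described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the `Segment` shape (start/end in seconds, an optional `speaker` label) and the function names `formatTimestamp`, `cueText`, `toTxt`, `toSrt` and `toVtt` are all assumptions made for the example.

```typescript
// Hypothetical segment shape; the real payload in this commit may differ.
interface Segment {
  start: number;    // seconds from the start of the audio
  end: number;      // seconds from the start of the audio
  text: string;
  speaker?: string; // e.g. a role label, when diarization is enabled
}

// Left-pad a non-negative integer with zeros to at least `width` digits.
function pad(n: number, width: number): string {
  let s = String(n);
  while (s.length < width) {
    s = "0" + s;
  }
  return s;
}

// HH:MM:SS<sep>mmm. SRT uses "," before the milliseconds, WebVTT uses ".".
function formatTimestamp(seconds: number, msSeparator: "," | "."): string {
  const totalMs = Math.max(0, Math.round(seconds * 1000));
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  return pad(h, 2) + ":" + pad(m, 2) + ":" + pad(s, 2) + msSeparator + pad(ms, 3);
}

// Prefix the cue with the speaker label when one is present.
function cueText(seg: Segment): string {
  const text = seg.text.trim();
  return seg.speaker ? seg.speaker + ": " + text : text;
}

// Plain text: one line per segment.
function toTxt(segments: Segment[]): string {
  return segments.map(cueText).join("\n");
}

// SRT: numbered cues separated by a blank line.
function toSrt(segments: Segment[]): string {
  return segments
    .map((seg, i) =>
      (i + 1) + "\n" +
      formatTimestamp(seg.start, ",") + " --> " + formatTimestamp(seg.end, ",") + "\n" +
      cueText(seg) + "\n")
    .join("\n");
}

// WebVTT: "WEBVTT" header, blank line, then cues separated by blank lines.
function toVtt(segments: Segment[]): string {
  const cues = segments.map((seg) =>
    formatTimestamp(seg.start, ".") + " --> " + formatTimestamp(seg.end, ".") + "\n" +
    cueText(seg) + "\n");
  return "WEBVTT\n\n" + cues.join("\n");
}
```

In a browser, the resulting string could then be wrapped in a `Blob` and offered through a temporary object URL on a download link, which keeps the export entirely client-side.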

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-13 18:43:22 +02:00
parent bddafea049
commit d425c99e8e
4 changed files with 600 additions and 2 deletions
@@ -1103,3 +1103,112 @@ p {
  text-transform: uppercase;
  letter-spacing: 0.03em;
}
/* ── Transcribe tool ─────────────────────────────────────────────── */
.num-speakers-input {
  width: 4.5rem;
  padding: 0.25rem 0.5rem;
  border: 1px solid var(--line);
  border-radius: 6px;
  background: #fff;
  color: var(--ink);
  font-size: 0.85rem;
}
.transcript-roles {
  display: flex;
  flex-wrap: wrap;
  gap: 0.4rem;
  margin-bottom: 0.75rem;
}
.speaker-tag {
  display: inline-flex;
  align-items: center;
  gap: 0.3rem;
  font-size: 0.72rem;
  font-weight: 600;
  padding: 0.2rem 0.55rem;
  border-radius: 4px;
}
.speaker-tag small {
  font-weight: 400;
  opacity: 0.75;
}
.speaker-tag--0 { background: #dbeafe; color: #1d4ed8; }
.speaker-tag--1 { background: #ede9fe; color: #6d28d9; }
.speaker-tag--2 { background: #dcfce7; color: #166534; }
.speaker-tag--3 { background: #fef9c3; color: #854d0e; }
.speaker-tag--4 { background: #fee2e2; color: #991b1b; }
.speaker-tag--5 { background: #e7f5f2; color: #0f766e; }
.transcript-box {
  background: var(--bg);
  border: 1px solid var(--line);
  border-radius: 8px;
  padding: 1rem;
  max-height: 400px;
  overflow-y: auto;
  margin-bottom: 0.75rem;
}
.transcript-text {
  white-space: pre-wrap;
  word-break: break-word;
  font-size: 0.875rem;
  line-height: 1.65;
  font-family: inherit;
  margin: 0;
  color: var(--ink);
}
.segment-details {
  border: 1px solid var(--line);
  border-radius: 8px;
  margin-bottom: 0.75rem;
}
.segment-summary {
  font-size: 0.8rem;
  color: var(--muted);
  padding: 0.6rem 1rem;
  cursor: pointer;
  user-select: none;
}
.segment-list {
  padding: 0.25rem 0.75rem 0.75rem;
  max-height: 280px;
  overflow-y: auto;
}
.segment-row {
  display: flex;
  gap: 0.6rem;
  align-items: baseline;
  padding: 0.2rem 0;
  font-size: 0.78rem;
  border-bottom: 1px solid var(--bg);
}
.segment-time {
  color: var(--muted);
  font-family: ui-monospace, monospace;
  min-width: 7rem;
  flex-shrink: 0;
}
.segment-text {
  color: var(--ink);
  line-height: 1.4;
}
.transcript-downloads {
  display: flex;
  flex-wrap: wrap;
  gap: 0.5rem;
  margin-top: 0.75rem;
}