dobetternorge-tools

Author	SHA1	Message	Date
daveadmin	56cd87dd7b	redact: UX overhaul — engine simplification, credits, spinner, save-to-docs, badges - Remove GPU/regex engine options; keep only azure_mini (1 credit) and azure_full (2 credits) - Variable credit cost: engine-aware pre-check and charge in api/redact.php; PricingCatalog base = 1 - Fix ATTORNEY not preserved when keepOfficials=true: add to LLM prompt, generic-tag, pseudonym regexes - Replace Azure credits hint with per-engine credit cost text (all 4 languages) - Single-file upload only (was: up to 5); simplify status messages - Clear previous redaction output and show pulsing spinner when a new run starts - Add "Save to My Docs" button in redact output panel (corpus-save.js path) - corpus-save.js: capture source_doc_ids from button dataset, pass in POST payload - api/save-to-corpus.php: accept source_doc_ids, store first as source_url=corpus-doc:{id} - doc-picker.js: show "✂ Redacted" badge for documents saved from the redact tool - CSS: .redact-working spinner, doc-item__badge--redact pill styles Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-25 08:18:51 +02:00
daveadmin	b21bfb2f1d	Add NOK pricing catalog, credit ledger, success-based charging, and tier-gated model routing - PricingCatalog.php: single source of truth for plans (free/plus/pro), top-ups, Stripe price env keys, tool costs (0–6 credits), STT variable billing, feature limits - FreeTier.php: monthly-first credit deduction, ledger (user_tool_credit_ledger), STT reservation/settle/release, monthly reset, trial logic - StripeClient.php: canonical SKUs (plus/pro/topup_100/300/1000), legacy aliases kept - stripe-checkout.php: subscription vs payment mode, trial gating, catalog metadata - stripe-webhook.php: idempotent via stripe_events, handles subscription lifecycle + invoice.paid renewal + one-time topup credit grants - All API tools: success-based credit deduction (check before, charge after) - transcribe.php: file-size heuristic reservation, settle from actual provider duration - ask.php + LegalTools.php: ToolModels engine resolution — Pro gets gpt-4o - KorrespondAgent.php + korrespond.php: tier-gated draft deployment — Free/Plus gets gpt-4o-mini, Pro gets gpt-4o - pricing.php: NOK-only, plan cards, top-up packs, Organisation contact card, tool cost table, separate monthly/prepaid balance display - 003_pricing_credit_catalog.sql: ledger and STT reservation tables Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 13:42:27 +02:00
daveadmin	bffc714541	Add My Docs picker to deep-research, advocate, barnevernet, korrespond, citations - PHP: Add docPickerSection (button + chips + hidden input) to all 5 tool pages - JS: Send doc_ids in payload for deep-research, advocate, barnevernet, korrespond - Backend: Inject selected corpus doc content into paste_text/narrative/notes via dbnToolsInjectDocContent - Citations: Add upload zone (file → api/extract.php → textarea) + paste textarea with live Norwegian legal reference extraction (regex) + ref chips → title search; doc picker populates titleInput via MutationObserver Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 13:04:45 +02:00
daveadmin	88555eb8a7	Fix docPickerBtn/audioPickerBtn i18n — add missing translation keys Add doc_picker_btn/audio_picker_btn to i18n.php (en/no/uk/pl), add docPickerBtn to REDACT_I18N + TIMELINE_I18N and audioPickerBtn to TRANSCRIBE_I18N in tools.js. PHP-render picker labels in legal-analysis, translate, summarize, tool_form, and layout_footer using dbnToolsT(). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 12:41:09 +02:00
daveadmin	effd3289b4	feat: add Legal Translation tool (translate.php) New dedicated tool for translating Norwegian legal documents (Barnevernet letters, court decisions, correspondence) into the user's chosen language with legal-terminology annotations. - translate.php: new tool page with source/target language selectors, 4-way UI lang switcher, file upload, doc picker, streaming results - api/translate.php: NDJSON streaming endpoint; Azure GPT-4o-mini with legal-aware prompt that preserves Norwegian statute refs verbatim and annotates terms with no target-language equivalent; 2-credit cost - assets/js/translate.js: form handler, NDJSON stream reader, copy button - assets/css/tools.css: .lt-* styles for translation result + annotations - includes/i18n.php: 22 lt_* keys × 4 languages; translate entry in nav - includes/FreeTier.php: translate → 2 credits - includes/CaseResults.php + case-result.php: translate in eligible tools, toolLabel, toolIcon, deriveTitle, rendering block, rerun map Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 09:59:06 +02:00
daveadmin	21c092e0d0	Legal Analysis: full language follow-through (UI + LLM) The tool now respects the chosen UI language end-to-end — even if the source document is Norwegian, a user on EN/UK/PL gets the analysis in their language. Norwegian statute references (barnevernsloven § 4-25, EMK Art. 8) and case names (Strand Lobben mot Norge 37283/13) are kept verbatim because they are proper nouns. LLM (LegalAnalysisAgent.php): - extractIssues: prompt asks for question + brief_context in user's language; statute refs preserved - answerIssue: Norwegian core system prompt (keeps fine-tune precision) + language-coercion line for non-NO; localised context/source labels - synthesise: overall_assessment, next_steps, disclaimer in user's language; explicit per-language disclaimer text - runFullAnalysis empty-case fallback also localised - what_to_check translated per language UI: - 40 new la_* translation keys in i18n.php × 4 languages (NO/EN/UK/PL) - legal-analysis.php: 4-way lang switcher, dbnToolsT() for every label, emits window.DBN_LA_I18N for runtime JS strings - legal-analysis.js: t() helper reads from window.DBN_LA_I18N - layout_footer.php: emits window.DBN_CURRENT_LANG + window.DBN_ADDON_I18N so the legal-analysis add-on button works in the page's language no matter which tool it's invoked from - tools.js add-on: reads from DBN_ADDON_I18N, passes DBN_CURRENT_LANG to /api/legal-analysis.php so server responds in same language Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 08:43:15 +02:00
daveadmin	2509a596c1	Fix file picker double-open across all remaining tools Same defensive guard as legal-analysis + summarize, now applied to the upload-zone and audio-zone handlers in tools.js. Affects redact, timeline, and transcribe (which all share these zones via tools.js's setupUpload / setupAudioUpload). Stops native label-for clicks and the input's own click from bubbling into the zone handler that would otherwise programmatically re-open the picker. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 08:23:58 +02:00
daveadmin	5589e891f4	Fix file picker reopening after selection on legal-analysis + summarize The "browse" label's native for=input trigger AND the upload-zone click handler both called uploadInput.click(), so the picker opened twice when the user clicked the browse text. Stop propagation on the label and the input itself, plus tighten the zone handler to recognise any label-for descendant. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 08:19:28 +02:00
daveadmin	7e6463ed22	Add Legal Analysis tool — two-pass DBN-legal pipeline Restores the dbn-legal-agent-v3 fine-tune on ocelot (was silently aliased to plain qwen2.5:14b in LiteLLM since the viper retirement) and ships a new tool that uses it via a two-pass flow: Pass 1 (Azure 4o-mini) → extract up to 5 distinct legal issues Pass 2 (ocelot v3 only) → answer each issue, ≤350 tokens, with corpus Pass 3 (Azure 4o-mini) → synthesise overall assessment + next steps The 12GB-VRAM constraint motivates the split: dbn-legal-agent-v3 stays hot in VRAM through the 5 sequential per-issue calls because issue extraction and synthesis run on Azure, not on ocelot. New surface: - includes/LegalAnalysisAgent.php - api/legal-analysis.php (NDJSON streaming endpoint) - legal-analysis.php (dedicated tool page) - assets/js/legal-analysis.js (streamed UI with per-issue cards) - Save-result + case-result.php rendering for legal-analysis output - Nav registration in all four UI languages Add-on integration: a "⚖️🇳🇴 Run deep legal analysis on this text" button now appears on Summarize, Ask, and Redact result pages and streams the same pipeline inline below the existing result. Existing tools relabelled: the misleading "🇳🇴 Norwegian specialist v3 ⭐" option on advocate/deep-research/discrepancy/barnevernet is now honestly "DBN Legal Agent" — now that the real fine-tune is actually deployed, the label finally matches reality. The advocate.php v2 option was removed since the v2 GGUF is retired. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 04:21:01 +02:00
daveadmin	2013648ee0	Add manual 'Save result' to all tools — replaces auto-save All tool results can now be saved to My Case manually. Users click 'Save result', type a description, and confirm. This replaces the previous silent auto-save on barnevernet/timeline/etc., giving users control over what stays and what it's called (supports multiple runs of the same tool with different titles). - CaseResults: extend ELIGIBLE_TOOLS to include summarize, ask, redact, transcribe; add toolLabel/toolIcon entries; support explicit title via meta['title'] in save() - api/case/save-result.php: new client-initiated save endpoint; accepts tool + title + input_payload + output_payload + meta - Remove CaseResults::save() auto-save from barnevernet, deep-research, discrepancy, korrespond, timeline API endpoints - tools.js: add showSaveResultButton() (exposed as window.dbnShowSaveResultButton); wire for ask, redact, timeline, transcribe (both file-upload and stored-audio paths) - barnevernet.js: wire save button after final result render - summarize.js: wire save button after renderFinal(); passes sumResults container so widget appears in the correct #sumResults div - case-result.php: rich tool-specific rendering for summarize, ask, redact, transcribe, timeline; update re-run link map to include all new tools - tools.css: styles for .save-result-widget and its states (idle, prompt, done, error) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-24 01:27:26 +02:00
daveadmin	e768662efe	Add Summarize Document tool — engine selector, file upload, optional corpus enrichment - summarize.php: full custom inline form (replaces tool_form.php wrapper) with lang switcher, azure_mini/azure_full/gpu engine selector, 8 corpus-slice toggles (all off by default), doc picker, file upload zone, and textarea - api/summarize.php: rewritten to streaming NDJSON (matches barnevernet pattern); accepts JSON payload with text, language, engine, slices[], doc_ids[] - includes/LegalTools.php: adds corpusContextForSummarize() (keyword search via ClientRagPipeline) and summarizeWithContext() (engine-aware LLM call with optional corpus prepend); returns structured JSON matching existing summarize format - assets/js/summarize.js: self-contained IIFE handling file upload via api/extract.php, slice toggles, NDJSON stream reader, result rendering, and trace panel update - includes/i18n.php: adds 'summarize' to nav in all 4 languages (EN/NO/UK/PL), inserted after 'redact' in the tool order with icon 'SZ' Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 23:25:40 +02:00
daveadmin	8587ec372f	fix: expose DBN_USER_TIER so plus/pro SSO users can use doc picker isPaidUser() was checking DBN_FREE_TIER_BALANCE === undefined, which is only true for CaveauAI sessions. SSO users (even plus/pro) always have DBN_FREE_TIER_BALANCE set, so the picker was showing the upgrade modal for everyone in the SSO flow. Now reads DBN_USER_TIER explicitly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 22:45:57 +02:00
daveadmin	f383ad5b74	feat: document & audio corpus picker for all tools - Add "Select from My Docs" button to all text tool forms; free-tier users see an upgrade modal, paid (CaveauAI) users get a searchable multi-select modal backed by /api/dashboard/documents.php - Add "Select from My Audio" picker on Transcribe with single-select and a "Save to My Audio" button for persisting uploaded clips - New PHP helpers in bootstrap.php: dbnToolsFetchDocChunks, dbnToolsClientIdFromSession, dbnToolsInjectDocContent - timeline, ask, redact APIs prepend selected document content (fetched from client_chunks SQL) before the textarea text - api/dashboard/audio-upload.php stores audio files on server and creates a client_documents row with source_type='audio' - api/transcribe.php falls back to stored audio via audio_doc_id POST field when no file is uploaded - api/dashboard/documents.php supports ?source_type= filter - tools.js: doc_ids added to JSON payload; stored-audio transcribe path - New assets/css/doc-picker.css, assets/js/doc-picker.js - SQL migration: scripts/sql/audio_docs_column.sql Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 21:38:04 +02:00
daveadmin	06d01a3bce	feat(dashboard): add corpus dashboard at /dashboard/ Full private corpus dashboard for tools.dobetternorge.no users — each SSO account gets an auto-provisioned CaveauAI tenant (clients row, corpus) on first visit. Includes upload (file/paste/URL), RAG chat with SSE streaming and citation chips, document CRUD, FalkorDB graph relations tab, and improved save-from-tool flow with tag/preview support. - dashboard/{index,documents,document,upload,chat,settings}.php - api/dashboard/{corpus-init,documents,upload,ingest-status,chat-stream, save-from-tool,graph}.php - includes/{CorpusProvision,layout_dashboard,layout_dashboard_footer}.php - assets/css/dashboard.css assets/js/corpus-save.js (routing upgrade) - includes/{bootstrap,layout}.php extended for dashboard provisioning Migration 141 (clients.dbn_sso_uid + import_method enum) applied on chloe. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-23 17:15:40 +02:00
daveadmin	83fc71414f	Add premium My Case MVP	2026-05-23 10:17:34 +02:00
daveadmin	b014638f39	feat(corpus): add save-to-corpus + private corpus search scope - POST /api/save-to-corpus.php — saves tool output text to user's default CaveauAI corpus via ClientRagPipeline - api/case/upload.php — dual-writes uploaded PDFs to CaveauAI client_documents (best-effort) - assets/js/corpus-save.js — shared <dialog> handler for .js-save-corpus buttons on all tool pages - includes/layout_footer.php — injects corpus-save.js + shared save dialog markup - korrespond/deep-research/barnevernet/discrepancy JS — save-to-corpus buttons on output sections - api/search.php + LegalTools::search() — corpus_scope param ('shared'\|'private'\|'both'), merges personal CaveauAI corpus with shared legal library when 'both' - includes/tool_form.php + assets/js/tools.js — corpus scope radio toggle shown on search tab - api/user-docs.php — add POST upload method for non-SSO authenticated users Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-22 17:50:32 +02:00
daveadmin	ba9cddf9a1	Add monetization spine + Build Your Own Case (Min Sak) - Stripe: StripeClient.php, checkout/portal/webhook endpoints, idempotent event handling - FreeTier: tier-aware credits (free/light/pro/pro_plus), bonus_balance, hourly caps per tier - pricing.php + billing.php: 4-tier cards, 3 topups, Customer Portal, balance breakdown - Min Sak: CaseStore.php, AzureDocIntelligence.php, AzureSearchAdmin.php — per-user hybrid RAG - api/case/: upload, list, delete, ingest-callback (HMAC-auth'd from n8n) - award-survey-credits: inter-site HMAC endpoint for dobetternorge.no survey bonus - dashboard.php: tier badge, balance breakdown card, Min Sak CTA, survey CTA - KorrespondAgent + all 3 other agents: use_my_case toggle wired to dbnToolsCaseContext() - bootstrap.php: dbnToolsCaseContext(), dbnToolsIntersiteSecret(), dbnToolsCurrentTier() Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 20:52:54 +02:00
daveadmin	0e167bf464	Integrate dbn-legal-agent-v2: upgrade all v1 refs + add Korrespond legal-check - Replace dbn-legal-agent with dbn-legal-agent-v2 in bootstrap.php (dbnToolsRunLegalCheck), DeepResearchAgent.php (interpretSeed, expandQueries, synthesis fallback, deploy label), BvjAnalyzerAgent.php (check_model label) — 8 locations total - Add dbn-legal-agent-v2 legal threshold check to KorrespondAgent: called after selfCheck() in both generate() and refine(); result surfaced as legal_check[] in the API response - Render legal_check card in korrespond.js using existing bvj-red-flag styles; shows only when non-empty - Add .korr-legal-check CSS block in tools.css Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 23:59:07 +02:00
daveadmin	28932297b3	Add user context notes field to timeline tool Adds an optional textarea below the main text input where users can provide clarifications to guide the LLM — e.g. year anchors, actor aliases, or focus instructions. Notes are injected into the prompt as a clearly delimited block and translated across all four UI languages (en/no/uk/pl). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 12:36:37 +02:00
daveadmin	dfb9692f45	Korrespond: stop mixing UI languages — all chrome follows user UI lang Drafts still come back in Norwegian + working language (that is intentional), but every piece of chrome now respects the user's UI lang consistently: - Pass 1 classify LLM now writes missing-fact questions in the user's language (not always Norwegian), fixing the case where an English-UI user got "Hva er saksnummeret?" in the clarify panel. - All PHP-emitted progress/status messages go through DbnKorrespondAgent::L() with en/no/pl/uk variants instead of hardcoded Norwegian. - JS introduces an I18N dictionary + t() helper covering status messages, button labels, column headers, flag labels, refine panel title/hint, jurisdiction radio labels, clarify panel title/hint/buttons, the empty-state "Ready" block, and Copy/Copied/Download .txt. - Static clarify and empty-state chrome use [data-i18n] attributes resolved at init and re-applied on every lang-switcher click. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-19 12:11:16 +02:00
daveadmin	5d8ae6b447	Korrespond: add Refine pass with jurisdiction-scoped formal citations After the first draft is rendered, a "Refine with citations" panel offers a 3rd-pass rewrite scoped to the user's choice of Norwegian law, ECHR (EMK + HUDOC case law), or both. Refine pulls fresh corpus chunks limited to the chosen jurisdiction's slices, rewrites inline cites in formal style ("jf. forvaltningsloven § 17", "jf. Strand Lobben m.fl. mot Norge, EMD-37283/13, §§ 207–214"), and appends a Rettskilder block listing every authority. Hard-RAG grounding carries through — refine cannot cite anything that wasn't retrieved. Costs 1 additional credit; the original draft stays in place and the refined version appears below it. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-19 11:50:36 +02:00
daveadmin	b78a49e060	Add Korrespond tool: drafts replies & new correspondence to NO authorities Two-pass wizard for drafting to NAV, Barnevernet, schools, Bufdir, kommune, Statsforvalter, Trygderetten. Pass 1 (gpt-4o-mini) classifies the situation and emits clarify questions if facts are missing; user answers inline and resubmits without losing context. Pass 2 retrieves law passages via hard-RAG (ClientRagPipeline with body-specific slice presets), drafts in Norwegian bokmål with gpt-4o using [CITE:N] tokens, self-checks that every citation maps to a real corpus passage, then translates to the working language. Result is side-by-side Norwegian + EN/PL/UK with copy/download per side and an expandable Cited Law panel. Credit deducts only when Pass 2 actually runs, not on a clarify cycle. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-19 11:27:13 +02:00
daveadmin	bc44b0eee2	Add My Documents panel to workbench + user-docs API - api/user-docs.php: GET/DELETE shared dbn_user_docs table (SSO users only) connects to dobetternorge DB via DBN_DB_* env vars - workbench.php: My Documents panel (section 05) for SSO/free-tier users; shows docs uploaded from either AI chat or tools, links to AI Chat for upload - workbench.js: fetch + render doc list, delete with Qdrant cleanup - tools.css: workbench-docs panel + item styles - i18n.php: my_docs_* strings in all 4 languages Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 09:37:19 +02:00
daveadmin	04555a96b1	Add Citation Explorer tool and graph-expansion badges to Advocate results - citations.php + assets/js/citations.js: new tool page for browsing the FalkorDB citation graph by title/ID, with autocomplete, action pills (cites/cited_by/implements/chain), hop-by-hop navigation, and exploration trail - advocate.js: tag graph-expanded source cards with 'via citation graph' badge - DeepResearchAgent: propagate _graph_expanded flag through normalizeCorpusChunk and top_sources serialization so it reaches the frontend - tools.css: add .dr-source-tag--graph variant (green pill) - i18n.php: register 'citations' tool in all 4 languages with CIT icon Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 22:30:04 +02:00
daveadmin	e977bbb6b3	Add Document Discrepancy Finder tool 8-step NDJSON-streaming pipeline that compares two Barnevernet documents: classifies each doc, extracts parties and timelines, cross-references both for contradictions/deletions/additions, retrieves corpus legal context, and synthesises a full discrepancy report with tabbed UI. New files: DiscrepancyAgent.php, api/discrepancy.php, discrepancy.php, discrepancy.js. Modified: FreeTier.php (cost=4), i18n.php (all 4 langs), tool-svgs.php (DC icon), tools.css (dc-* component styles). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 19:30:38 +02:00
daveadmin	1246b7a804	feat(workbench): add DBN Case Workbench guided case-preparation hub Additive-only change: new workbench.php authenticated page with guided intake flow, evidence map, tool sequence, output checklist, and sessionStorage-only note persistence. Dashboard and public index get a new Case Workbench card. No existing tools, APIs, or prompts modified. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 19:08:16 +02:00
daveadmin	ffcf887428	feat(timeline): add live filter, actor chips, group headers, copy button, source toggle, count badge - Live search/filter bar: filters events by keyword across event, actor, source_excerpt, date - Actor filter chips: click to filter by actor, multi-select, teal active state - Year/month group headers when sorted chronologically (── 2023 ──, Mar 2024 ──) - Per-event copy button (hover-revealed 📋): copies "date · actor · event" to clipboard - "Hide/show sources" toggle: collapses all source excerpts without re-rendering - Count badge: "23 events · 3 actors · 2022–2025" above the list - applyTimelineFilters() unifies sort + actor + text filters in one re-render pass - CSV export now includes end_date column - Reset all filter state on each new run Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 15:46:59 +02:00
daveadmin	59b39ff85b	feat(redact): tag highlighting, inventory panel, before/after toggle, gpt-4o upgrade - CSS: colour-coded [TAG] spans by entity type (person=pink, org=blue, place=green, date=amber, id=purple) - Inventory panel: collapsible list showing tag → original text mappings with occurrence counts, sourced from new redaction_map API response key - Before/after toggle: Redacted / Original view-switch buttons wired to lastOriginalText captured at submission time - One-click gpt-4o upgrade button when mini or GPU engine was used - Backend: redaction_map built from applied LLM entities (tag → originals + occurrence count via substr_count on final text) - renderResults now calls setupRedactViewToggle() after DOM is written Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 08:22:41 +02:00
daveadmin	850937e4b3	feat(transcribe): UX improvements — progress bar, stats row, copy btn, char counter, batch errors - Vocab textarea now shows live 0/500 char counter (turns amber at 450+) - Animated progress bar during transcription; determinate for multi-clip, indeterminate for single - Results card shows inline stats row (duration, language, speakers) and AI cleanup badge - Copy button + Download TXT moved above transcript box; SRT/VTT remain below - Speaker role legend repeats inside Segments panel for easy cross-reference - Batch errors no longer halt the queue; remaining clips continue, failed files named in status bar Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 08:21:19 +02:00
daveadmin	c4362738c1	feat(transcribe): GPT cleanup pass + advanced options i18n Adds optional post-transcription cleanup via GPT-4o/GPT-4o-mini to fix mishearing errors, punctuation, and domain terms. Speaker role labelling now accepts a deployment param. Adds i18n strings for advanced options panel (task, VAD filter, Whisper model, AI cleanup) in all four languages. Updates BvjAnalyzerAgent and DeepResearchAgent. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 07:23:01 +02:00
daveadmin	8b77acb828	feat: free-tier credit system + Syttende Mai access for Google users - FreeTier.php: credit check/deduct/reset engine with hourly rate limit - bootstrap.php: dbnmDb() singleton, dbnToolsIsFreeTier(), credit gate helpers - index.php: store tier=free\|approved in session from SSO JWT - All 7 API endpoints: credit gate (402/429) + X-Credits-Remaining header - layout.php: credit meta tag, JS balance var, Syttende Mai banner (05-17 only) - tools.js: credit badge in topbar, 402 modal, 429 toast, dbnUpdateCredits() - barnevernet.js + deep-research.js: wire 402/429 handling for NDJSON streams - tools.css: styles for credit badge, no-credits modal, rate-limit toast Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-16 21:05:08 +02:00
daveadmin	08d1e3cee3	feat: auto-select STT engine (Azure → Google Cloud → Whisper) and show provider in results Removes user-facing engine/model/key/beam controls. The server now picks the best available engine automatically: 1. Microsoft Azure Speech — short clips (≤1MB, no diarization, audio/*) 2. Google Cloud Speech v2 — long audio, diarization, all languages 3. OpenAI Whisper GPU — local fallback Results display which provider was used (e.g. "Transcribed with Google Cloud Speech") via transcript-engine-badge and traceMeta. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-16 13:22:24 +02:00
daveadmin	13572e9dfb	feat: extract and display event times on timeline (kl. HH:MM etc.) Prompt now instructs the model to extract time of day (HH:MM) when present in Norwegian formats: kl. 14:30, kl 09.00, 14:30, 14.30. renderTimeline shows time as a muted inline annotation next to the date. CSV export gains a Time column after Date. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 23:03:20 +02:00
daveadmin	a3d46f9756	feat: Legal Tools v1 — multilingual landing, dashboard, SSO bridge - Public landing page at / for unauthenticated users (EN/NO/UK/PL) - Authenticated / shows Case Workbench dashboard with manifesto strip, stats, and launched-tool grid (Transcribe, Timeline, BVJ, Advocate, Deep Research, Corpus) - Added includes/i18n.php with full 4-language translation layer - Extended layout.php to Case Workbench shell with tool rail, lang switcher - AI output language normalization extended to en/no/uk/pl in PHP agents - SSO token validation in bootstrap.php / index.php (dobetternorge.no bridge) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 22:53:27 +02:00
daveadmin	9b8cb9c6dc	fix: raise file upload limit from 4 MB to 8 MB PHP constant and all JS client-side guards updated. Server PHP ini is 64M. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 20:57:25 +02:00
daveadmin	43cf5b8ce4	feat: Barnevernet Analyzer — document analysis + partisan RAG brief 7-step agent pipeline: document classification, party extraction, timeline extraction, corpus RAG (child_welfare/echr/family_core/bufdir_guidance), and synthesis using the user's chosen engine (including dbn-legal-agent). Progressive NDJSON streaming renders doc_meta, parties, and timeline cards before the final advocacy brief and procedural red flags arrive. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 20:49:46 +02:00
daveadmin	343b19d0b4	Add sub-question branching + document summary modals - Source modal now shows LLM-generated document summary (lazy-gen + cached in documents.summary) instead of raw chunk text; toggle reveals matched chunk; "View all chunks" button fetches every chunk of the document via new api/document-chunks.php endpoint - Each sub-question card gets a "Branch ↓" button that pre-fills the query with that sub-question and shows a context panel with the prior brief summary; prior_context + branch_notes are injected into interpretSeed() and synthesise() so the LLM knows where the research is coming from - Upload document summaries generated at synthesis time and attached to upload sources alongside corpus summaries - DB: documents.summary TEXT column added to bnl_corpus on chloe Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 19:44:27 +02:00
daveadmin	7bccd8c010	Expand corpus slices to 8: split ECHR/Hague, add Norwegian Courts, Bufdir, DBN Resources - Replace combined echr_hague slice with echr (Art.8+9, HUDOC, NIM) and hague (INCADAT, cross-border abduction) as separate toggles; echr defaults ON, hague defaults OFF - Add norwegian_courts slice: Domstol (src 5,26) + Rettspraksis.no (src 33, 482 docs) - Add bufdir_guidance slice: Barneombudet (19), Bufdir (20), Statsforvalteren (31) - Add dbn_resources slice: DBN website pages (flashcards, resource directory), defaults OFF - Replace isWebsiteChunk() with slice-aware shouldExcludeChunk(): always strips EU AI Act chunks (EUR-Lex source 7 leaks through when Qdrant runs unconstrained) and DBN website pages unless dbn_resources slice is explicitly ON - Update SLICE_DEFS in advocate.js and deep-research.js to match all 8 slices - Backward compat: echr_hague key in incoming requests fans out to echr+hague Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 16:01:05 +02:00
daveadmin	640778454f	Add Case Advocate tab — partisan brief grounded in Norwegian law New /advocate.php tab: user selects who they represent (biological father, mother, foster carer, CWS, etc.) and the agent takes their side entirely. Adversarial sub-questions target supporting Lovdata statutes + ECHR precedents; synthesis returns client_strengths[] and opposing_weaknesses[] alongside the advocate brief. - DeepResearchAgent: add advocateRole param to run(), interpretSeed(), expandQueries(), synthesise(). Neutral path unchanged (empty string). - api/deep-research.php: extract + validate advocate_role from payload; telemetry logs tool='advocate' vs 'deep_research'. - advocate.php: new page with role dropdown (presets + custom), same corpus slices/engine/controls/upload zone as deep research. - assets/js/advocate.js: page-scoped JS; renders advocate banner, client strengths card (teal), advocate brief, opposing weaknesses card (amber), sub-Q cards, sources, uncertainty, next step. - assets/css/tools.css: append .adv-* rules (~120 lines). - includes/layout.php: add Advocate nav tab between Deep research and Summarize. - index.php: add Advocate cap-card tile. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 12:26:05 +02:00
daveadmin	e130db8119	Deep Research v2: exclude marketing site, deep-link sources, per-agent reports Three user-flagged issues after the first real run with a 920KB sakkyndig PDF: 1. dobetternorge.no marketing-website chunks leaked into the retrieval pool. ClientRagPipeline::searchAll defaults include_beta_website=true; we now pass false for both website flags, AND defensively drop any returned chunk whose source_name contains "website" or title contains "dobetternorge.no" before it can pollute synthesis. 2. Brief returned was "just a paragraph". Bumped synthesis max_tokens 2200→3200, raised timeout 120→180s, and rewrote the prompt to require 400-900 words with min 4 paragraphs when source_count>=3, covering EACH sub-question in its own paragraph. Now also passes authority + jurisdiction into the sources block so the model can pinpoint statutes correctly. 3. No way to see what each "sub-question agent" researched or click through to the source articles. Restructured the results panel so per-sub-question report cards now render ABOVE the synthesised brief. Each report shows the question, the rationale, and the top 3 retrieved sources for that sub-Q with title→deep link + 1-line excerpt. Brief follows. Consolidated numbered sources list at the bottom, with titles as deep links too. Deep-link construction: source_url is hydrated via dbnV6QueryDocumentMeta in a single batched call after retrieval. For Lovdata sources with a section_title containing §<n>, the link is path-anchored to that section (/§43). For other hosts (HUDOC, Regjeringen, Bufdir, etc.) we link to the document root URL. Telemetry: trace_metadata now carries retrieval_counts {raw_corpus, filtered_website, post_filter_corpus, raw_upload, after_dedupe, after_topk} so future regressions are diagnosable from the metadata.jsonl log alone. The completion status pill surfaces the corpus/website/upload split.	2026-05-15 11:12:13 +02:00
daveadmin	a1a7f442a7	Deep Research: NDJSON streaming so the connection survives long runs Previously the endpoint returned a single JSON object at the end. Apache+ PHP-FPM buffers the entire body until PHP exits, so a 160s azure_full run caused the browser to drop the fetch as "Failed to fetch" while the server was still synthesising — the response then arrived to a dead socket. Switch to application/x-ndjson with one event per line. The endpoint emits 'progress', 'start', 'step' (running/complete/warning/error), 'subq', and a final 'final' event carrying the full result payload. Output buffering is explicitly disabled so each line flushes through Apache as soon as the agent emits it. DbnDeepResearchAgent::run() now accepts an optional ?callable $emit and fires step:running before each step + step:complete after, plus a subq event per sub-question retrieval round. JS reads response.body as a stream, splits on newlines, updates the trace panel live, and renders the final result when the final event arrives. Status pill shows live progress detail (e.g. "Synthesising with Azure gpt-4o — this is the slowest step…"). Engine row in the form now shows expected duration per engine (~15-45s mini, ~60-180s full, ~30-90s GPU) so users know what they're in for before clicking Run.	2026-05-15 10:47:35 +02:00
daveadmin	4cbe0a4ac4	Add Deep Research tool — agent + rank/rerank RAG New surface at /deep-research.php where the user pastes a question or uploads PDF/DOCX/TXT case files and a LLM-orchestrated agent researches the Do Better Norge legal corpus from 3-5 angles, with hybrid retrieval, cross-encoder rerank, and synthesis that emits an inline-[n]-cited markdown brief plus a numbered sources panel. Uploaded documents are chunked + embedded in memory only (nomic-embed-text via LiteLLM) and searched alongside the shared corpus during the same request — never persisted to disk, DB, or Qdrant. Reuses ClientRagPipeline::searchAll (hybrid + rerank), dbnV6 slice helpers, and the existing extract.php text-extraction logic via a new dbnToolsExtractUploadedFile() helper. Also adds dbnToolsCallGpuLlm() helper in bootstrap.php — fixes a latent bug where LegalTools.php was already calling that name with no definition. Search.php is unchanged.	2026-05-15 10:30:47 +02:00
daveadmin	f183678f35	Redact: catch soft dates (years, month+year, ranges, prepositions) Adds Nordic-pack regex patterns for: - DD.MM.YYYY / DD/MM/YYYY / YYYY-MM-DD - Year ranges (2011/2012, 2018-2019) - Month + year (Norwegian + English, with optional day) - Year preceded by temporal preposition (i 2015, fra 2019, rundt 2018) Also renames the entity toggle from "Dates of birth" to "Dates" (broader scope) in all four languages, and expands the LLM prompt so soft date references in free text are caught even when regex misses them. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 01:58:35 +02:00
daveadmin	e156ed4553	Add timeline sort toggle (doc order / chronological) with CSS - Wire sortDocOrder / sortChronological click handlers in renderResults() - Add .timeline-sort-bar and .sort-btn styles to tools.css Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 01:50:13 +02:00
daveadmin	d429e785e8	feat(feedback): thumbs up/down + missed-items widget across all tools New api/feedback.php stores rating + correction text to tool_feedback table in bnl_admin. renderFeedbackWidget() appended to all tool results (timeline, redact, transcribe, ask, summarize, search). Thumbs reveal a textarea for missed/wrong items on click; submit POSTs asynchronously. Engine from last run is stored alongside the rating. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 01:13:42 +02:00
daveadmin	7690ed17ee	feat(timeline): full form UI with engine selection and advanced settings Add 4-language switcher (EN/NO/UK/PL), engine choice (Azure mini/full, GPU/cuttlefish), and expandable Advanced panel (Focus, Confidence filter, Date types) to timeline.php. Wire new params through api/timeline.php and LegalTools::timeline() with engine routing, focus-aware prompt injection, and confidence/date-type post-filters. Add TIMELINE_I18N to tools.js with improved renderTimeline() confidence colour-coding and new CSS classes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 00:59:12 +02:00
daveadmin	30915bcb09	Redact: collapsible advanced settings, download TXT/DOCX/copy - Wrap Mode/Region/Entities/Officials/Output/Exempt/Aliases in a <details> toggle so the form opens clean with only engine + input visible - After redaction: Copy, Download .txt, Download .docx buttons appear below the redacted output (all four languages translated) - New api/redact-download.php: returns plain text or a minimal valid DOCX built from scratch with ZipArchive (no external dependencies) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 00:33:50 +02:00
daveadmin	8c12d5e778	Redact tool: rich UI, multilingual, engine choice, output formats - Custom inline form (EN/NO/UK/PL lang switcher) replacing generic stub - Engine selector: Azure gpt-4o-mini (default), gpt-4o, GPU cuttlefish, regex-only - Entity type toggles: names, organisations, places, dates of birth - Output formats: contextual role tags, generic [PERSON], Norwegian pseudonyms - Keep officials mode: judges/experts kept as [JUDGE: Andersen] format - Exempt names list: specific names excluded from redaction - Hint paragraphs explaining each option in all four languages - Backend: engine routing, callGpuLlm(), applyGenericTags(), applyPseudonymization() - AzureOpenAiGateway: withDeployment() clone pattern for per-call model override Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-15 00:20:16 +02:00
daveadmin	e3d8daf6ca	feat(transcribe): Azure Speech server-side key, remove translate option, add beam/VAD hints - api/transcribe.php falls back to DBN_AZURE_SPEECH_KEY/REGION env vars so BYOK not required - JS hides Azure key input when DBN_AZURE_SPEECH_CONFIGURED is true - Remove Translate to English task option from Advanced settings - Add explanatory hint text for Beam size and VAD filter in all 4 languages Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-14 23:23:33 +02:00
daveadmin	ff031d7a5b	fix(i18n): escape apostrophe in Ukrainian readyDesc string (broke all JS)	2026-05-14 22:59:18 +02:00

1 2

63 Commits