Changelog

All notable changes to AmicoScript. Follows Semantic Versioning.

v1.11.0

Latest
May 12, 2026
โœจ UI ๐Ÿ“ค Export
Recording alias: Rename any recording with a display name independent of the source filename. Used as title in Markdown exports.
Transcript tab decluttered: Collapsed inner tab bar, search bar, and segment selection bar into a single compact toolbar.
AI Analysis slide-over: Moved from a sub-tab to a slide-over panel (lightbulb icon), keeping the transcript always visible.
Bulk speaker assignment: Select multiple segments and assign a speaker name in one action via the contextual toolbar.
AI result markdown rendering: AI Analysis output now renders as formatted markdown instead of plain text.
Enhanced Markdown export: Speaker runs merged into paragraphs, timestamp only at start of each speaker turn, metadata header with duration, language, speaker list, and date.
Bulk Markdown export: Multiple recordings export as a single combined file with a table of contents and separators.

v1.10.5

May 5, 2026
๐Ÿ› Fix
Fix torch/torchaudio dependency caps for Python 3.13 and lightning compatibility.

v1.10.4

April 30, 2026
๐Ÿ› Fix
Error in transcription.py #22 (thanks @nyfon).

v1.10.3

April 21, 2026
๐Ÿ› Fix
Fix GPU release build.

v1.10.2

April 19, 2026
โœจ Feature
Create distributable with GPU enabled.

v1.10.1

April 19, 2026
๐Ÿ”’ Security ๐Ÿ› Fix ๐Ÿงช Tests
CORS restricted to localhost: Prevent cross-origin requests from arbitrary websites.
Exit endpoint CSRF token: /api/exit now requires a per-session token, blocking DNS-rebinding attacks.
Audio path bounds check: Path traversal prevention in /api/audio/{job_id}.
Frontend XSS fix: Folder/tag menus rebuilt with DOM API, eliminating injection via names containing ' or </script>.
Chunked file upload: 1 MB streaming chunks prevent OOM on large audio files.
Atomic settings write: Writes to .tmp then renames atomically, preventing corrupt settings on crash.
9 new test files added covering exports, settings, search escaping, session lifecycle, and more.

v1.10.0

April 19, 2026
โœจ Feature
Microphone recording: Record directly from your microphone with pause/resume support and a live timer. Auto-queued into the batch transcription flow.

v1.9.0

April 19, 2026
โœจ UI
README: badges, competitor comparison table, Telegram community link, roadmap section.
CONTRIBUTING.md and GitHub issue templates added.
Feedback link in sidebar footer.

v1.8.0

April 18, 2026
โœจ Feature
URL import from YouTube, TikTok, Instagram, Facebook, X, Vimeo, and Twitch via yt-dlp.
Automatic platform tagging on URL imports (e.g., youtube, tiktok).

v1.7.0

April 15, 2026
โœจ Feature ๐Ÿงช Tests
Backend API modularized into dedicated router modules under backend/api/routes/.
Centralized status strings in backend/core/messages.py.
Unit tests for diarization, audio normalization, Whisper model cache, CUDA/VAD error classifiers.

v1.6.0

April 14, 2026
โœจ Feature
Transcription pipeline split into focused modules under backend/core/.
Worker replaced with asyncio background task using asyncio.Queue.
New TranscriptionConfig model with configurable compute_type, device, vad_filter, word_timestamps, and more.
DB indexes added for frequently queried fields.

v1.5.1 / v1.5.2

April 13, 2026
โœจ Feature
Update check: Frontend displays a banner when a newer GitHub Release is available.

v1.5.0

April 12, 2026
โœจ Feature
Google Colab integration: Offload transcription to a free Colab GPU while keeping files local.
Bulk actions: Select multiple recordings and apply bulk move, tag, or delete operations.
Batch directory import: Drop a folder of audio files for batch transcription.

v1.4.1

April 11, 2026
โœจ UI
Mobile UI: Off-canvas sidebar overlay, always-visible segment action buttons on touch devices.
Docker: Compose split into base, dev override, and production overlay. Traefik HTTPS support.

v1.4.0

April 6, 2026
โœจ Feature
AI Analysis Engine: Per-recording LLM-powered analyses โ€” summaries, action items, translation, custom prompts โ€” with streaming results.
LLM Settings: Configure base URL, model name, and API key from the UI. List and pull models.
Drawer sidebar, inner tab panels, client-side action logs, and Help modal.

v1.3.0 / v1.3.1

April 4, 2026
โœจ Feature
Waveform player with interactive seeking and segment highlighting.
Collapsible console log panel with timestamps.
Multiple file upload; video file support via ffmpeg audio extraction.
macOS release added.

v1.2.0

April 1, 2026
โœจ Feature
Global search with live filtering.
Server-side Hugging Face token persistence.
Automated platform-specific ffmpeg download on first run.

v1.1.0 / v1.1.1

March 31, 2026
โœจ UI
Fixed 10-color palette for tags and folders with server-side validation.
Per-folder and per-tag counts in sidebar.
Tag-click filtering scoped to selected folder.

v1.0.0

March 30, 2026
๐Ÿ› Fix
Initial stable release.
PyInstaller packaging fixed for speaker diarization.
Windowed runtime crash fixed during diarization.