Commit Graph

16 Commits

Author SHA1 Message Date
dosangyoon
e74805f2fd fix(deps): pin full pyannote 3.x stack; doc uninstall audio 4.x
- Cap core/database/pipeline; use pyannote-audio==3.3.2 dist name
- README: resolve conflict with metrics 3 + numpy<2.2

Made-with: Cursor
2026-03-23 16:26:46 +09:00
dosangyoon
1c7e09ffcf fix(deps): cap pyannote-metrics<4 for numpy<2.2 compatibility
pyannote-metrics 4.x requires numpy>=2.2.2; conflicts with numpy<2.2 for older CPUs

Made-with: Cursor
2026-03-23 16:11:29 +09:00
dosangyoon
49c2e4a5a7 fix(pip): split diarize deps to avoid resolution-too-deep on Linux
- requirements.txt: drop pyannote/huggingface_hub (kept in diarize file)
- requirements-diarize.txt: pin torch 2.5.1, torchaudio, lightning 2.4.0, pyannote.audio 3.3.2
- environment.yml: install both requirement files
- README: two-step install, update options and platform table

Made-with: Cursor
2026-03-23 16:03:26 +09:00
dosangyoon
5edd0b32e7 fix(deps): pin pyannote 3.3.2 + torch 2.5.1 to avoid pip resolution-too-deep
- Unbounded pyannote>=3.1 pulled 4.x / torch>=2.8 / opentelemetry graph
- Tighten huggingface_hub and lightning range; document CPU torch two-step

Made-with: Cursor
2026-03-23 15:51:39 +09:00
dosangyoon
723a964d9c chore(cursor): rule to commit and push after repo changes
Made-with: Cursor
2026-03-23 15:31:49 +09:00
dosangyoon
9ec998e1f1 fix(deps): pin numpy<2.2 for CPUs without x86_64_v2; docs Apache+503
- Avoid NumPy wheel baseline X86_V2 import crash on older VMs
- README: NumPy error remediation, Apache ProxyPass example, web UI note

Made-with: Cursor
2026-03-23 15:31:40 +09:00
dosangyoon
26ff9b59c2 feat(web): speaker diarization via pyannote (parity with whisper_stt)
- Add app/diarize.py: local snapshot, A/B labels, disclaimer text
- transcribe_file and async jobs support diarize flag; Form diarize on API
- UI checkbox (default on); requirements: pyannote.audio, huggingface_hub
- README: env vars and model notes

Made-with: Cursor
2026-03-23 15:23:49 +09:00
dosangyoon
2caa74ac05 docs: expand libtinfo troubleshooting; add scripts/env-no-ld.sh
- README: deactivate may not clear LD_LIBRARY_PATH; unset, /usr/bin/vi, diagnostics
- scripts/env-no-ld.sh: run commands without LD_LIBRARY_PATH for vi/bash
- run.sh: include in commit if changed (verify diff)

Made-with: Cursor
2026-03-23 13:46:41 +09:00
dosangyoon
a984b86766 run.sh: avoid global LD_LIBRARY_PATH; pass conda lib only to uvicorn
Prevents bash/libtinfo 'no version information available' when conda ncurses
shadows system libtinfo. Document env -u LD_LIBRARY_PATH fallback.

Made-with: Cursor
2026-03-23 13:40:52 +09:00
dosangyoon
ff5086db81 whisper_stt: support pyannote 4.x DiarizeOutput for itertracks
Pipeline returns DiarizeOutput with speaker_diarization Annotation; fall back to
legacy Annotation when attribute absent.

Made-with: Cursor
2026-03-23 13:35:28 +09:00
dosangyoon
13d1f75b34 Rollback web speaker diarization; HF token helper for whisper_stt
- Remove app/diarize.py and pyannote from requirements.txt; web uses faster-whisper only
- Revert main.py job/transcribe flow and index.html meta/hints
- Add app/pyannote_auth.py for Pipeline.from_pretrained(..., token=...) used by whisper_stt
- Expand whisper_stt / README / requirements-whisper-stt for gated repos (community-1, 403)

Made-with: Cursor
2026-03-23 13:31:38 +09:00
dosangyoon
2e503d1a56 Web STT: speaker diarization via pyannote; whisper_stt snapshot validation
- Add app/diarize.py: merge faster-whisper segments with pyannote (A/B/C)
- Wire /api/jobs and /api/transcribe; job API returns speaker_diarization, diarize_skip_reason
- UI: meta line shows diarization applied/skipped; hint for models path
- requirements.txt: pyannote.audio; README APP_DIARIZE / APP_PYANNOTE_MODEL_DIR
- whisper_stt.py: validate config.yaml before loading pipeline
- requirements-whisper-stt.txt: minor doc updates if any

Made-with: Cursor
2026-03-23 13:09:31 +09:00
dosangyoon
c90230053a Document Ubuntu torch/pip recovery for requirements-whisper-stt.txt
- Add conda-first PyTorch install and broken-torch uninstall steps
- Mirror guidance in README for OSError / missing ATen.h during pip

Made-with: Cursor
2026-03-23 11:38:55 +09:00
dosangyoon
1c25bed926 Add whisper_stt CLI with default diarization, Ubuntu README, editor config
- Replace test.py with whisper_stt.py: OpenAI Whisper + default speaker diarization
  via local ./models/pyannote-diarization-3.1; --no-diarize for plain text
- Add requirements-whisper-stt.txt (whisper, pyannote, huggingface_hub, imageio-ffmpeg)
- README: stt conda env, Ubuntu/macOS ffmpeg, CLI usage
- .vscode: Python interpreter /opt/anaconda3/envs/stt; .cursor rule for stt env
- .gitignore: exclude downloaded pyannote snapshot under models/

Made-with: Cursor
2026-03-23 11:34:46 +09:00
dosangyoon
78244da09f Add OpenAI Whisper test script with ffmpeg fallback via imageio-ffmpeg
- test.py: patch whisper.audio.run for bundled ffmpeg when ffmpeg is not on PATH; expanduser for paths
- requirements.txt: add imageio-ffmpeg for optional local transcription tooling

Made-with: Cursor
2026-03-23 10:43:30 +09:00
7adaa73102 Initial commit after re-install 2026-02-25 19:07:56 +09:00