Files
stt/requirements-whisper-stt.txt
dosangyoon 1c25bed926 Add whisper_stt CLI with default diarization, Ubuntu README, editor config
- Replace test.py with whisper_stt.py: OpenAI Whisper + default speaker diarization
  via local ./models/pyannote-diarization-3.1; --no-diarize for plain text
- Add requirements-whisper-stt.txt (whisper, pyannote, huggingface_hub, imageio-ffmpeg)
- README: stt conda env, Ubuntu/macOS ffmpeg, CLI usage
- .vscode: Python interpreter /opt/anaconda3/envs/stt; .cursor rule for stt env
- .gitignore: exclude downloaded pyannote snapshot under models/

Made-with: Cursor
2026-03-23 11:34:46 +09:00

19 lines
847 B
Plaintext

# 권장: conda activate stt 후 설치 (프로젝트 기본 환경)
# whisper_stt.py (OpenAI Whisper CLI) — 선택 설치
# 기본: pip install openai-whisper imageio-ffmpeg
# 화자 구분(--diarize): pyannote.audio
#
# CLI: pip install 후 PATH에 hf 가 잡혀야 함 (conda env 활성화)
# pip install huggingface_hub
#
# 모델 받기 (택 1)
# A) 허브 직접: pyannote/speaker-diarization-3.1 약관 동의 후
# hf auth login # 또는 HF_TOKEN — 실행 시 자동 캐시
# B) 로컬 폴더(이후 HF_TOKEN 불필요): 한 번 받아 두기
# hf download pyannote/speaker-diarization-3.1 --local-dir ./models/pyannote-diarization-3.1
# python whisper_stt.py in.wav out.txt --diarize --diarize-model-dir ./models/pyannote-diarization-3.1
openai-whisper
imageio-ffmpeg
huggingface_hub>=0.26.0
pyannote.audio>=3.1.0