Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation