Live Transcribe unresponsive

the other relevant thread is: Captioning phone calls for hard-of-hearing users

is that your exact use-case, live transcription of incoming calls?
or generally speech to text when the other person can speak into your microphone?