Transcribe robot audio with Whisper MLModels in workflows, REST calls, or edge workers
MLModel, not as a ROS intelligence layer. Audio goes in, a JSON transcript comes out, and the same model can be called from POST /api/v1/mlmodels/{uuid}/run or a workflow CALL_MODEL node.
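As a rough illustration of the REST path, the sketch below builds (but does not send) a POST request for the run endpoint using only the Python standard library. Only the path /api/v1/mlmodels/{uuid}/run comes from the text above. The base URL, the JSON request body, and the `audio_url` field are assumptions made for illustration, and any authentication headers are omitted, so check the actual API reference before relying on this shape.

```python
import json
import urllib.request
import uuid


def build_run_request(base_url: str, model_uuid: str, payload: dict) -> urllib.request.Request:
    """Build a POST request for the MLModel run endpoint without sending it."""
    # Fail early if the identifier is not a well-formed UUID.
    uuid.UUID(model_uuid)
    url = f"{base_url.rstrip('/')}/api/v1/mlmodels/{model_uuid}/run"
    # Assumption: the endpoint accepts a JSON body; the real input format may differ.
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


# Hypothetical host, model UUID, and payload field, for illustration only.
request = build_run_request(
    "https://example.invalid",
    "123e4567-e89b-12d3-a456-426614174000",
    {"audio_url": "https://example.invalid/robot-audio.wav"},
)
```

Passing `request` to `urllib.request.urlopen` would send it. Per the description above, the response body should be a JSON transcript, which `json.loads` can decode.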
The default catalog model is Whisper Large v3: