Whisper-based Real-time Speech Recognition

Real-time speech-to-text transcription and alignment with multi-language support, based on OpenAI’s Whisper model. No Python runtime or separate server required.


Description

Demo video: Link

Documentation: Link

Free Demo project (exe): Link

This plugin lets you recognize speech in 99 languages simply by adding one component to your Blueprint, without relying on any separate server or subscription. A minimal C++ sketch of this workflow follows below.
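The sketch below illustrates the "one component" idea from C++ rather than Blueprint: an actor creates a recognizer component, binds a delegate, and logs incoming transcriptions. The class, delegate, and function names (USpeechRecognitionComponent, OnTranscriptionReceived, StartCapture) are hypothetical placeholders for illustration only, not the plugin's actual identifiers.

    // MyTranscribingActor.h -- illustrative sketch, hypothetical plugin API
    #pragma once

    #include "CoreMinimal.h"
    #include "GameFramework/Actor.h"
    #include "SpeechRecognitionComponent.h" // hypothetical plugin header
    #include "MyTranscribingActor.generated.h"

    UCLASS()
    class AMyTranscribingActor : public AActor
    {
        GENERATED_BODY()

    public:
        AMyTranscribingActor()
        {
            // Create the recognizer component; in Blueprint this is the single
            // "Add Component" step described above.
            SpeechRecognizer = CreateDefaultSubobject<USpeechRecognitionComponent>(TEXT("SpeechRecognizer"));
        }

    protected:
        virtual void BeginPlay() override
        {
            Super::BeginPlay();
            // Hypothetical delegate fired whenever a new transcription segment arrives.
            SpeechRecognizer->OnTranscriptionReceived.AddDynamic(this, &AMyTranscribingActor::HandleTranscription);
            SpeechRecognizer->StartCapture(); // begin streaming microphone audio to the model
        }

        UFUNCTION()
        void HandleTranscription(const FString& Text)
        {
            UE_LOG(LogTemp, Log, TEXT("Recognized: %s"), *Text);
        }

        UPROPERTY(VisibleAnywhere)
        USpeechRecognitionComponent* SpeechRecognizer;
    };

In an actual project, the equivalent Blueprint setup is just adding the recognition component to an actor and wiring its transcription event to your game logic.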

The machine learning model used in this plugin is based on OpenAI’s Whisper, optimized to run on ONNX Runtime for better performance and minimal dependencies.

Accuracy varies by language; see the original Whisper paper for per-language accuracy figures.

To use GPU acceleration, you need a supported NVIDIA GPU with the following versions of CUDA and cuDNN installed.

Technical Details

Code Modules:

Number of Blueprints: 2

Number of C++ Classes: 13+

Network Replicated: No

Supported Development Platforms: Windows 64-bit

Supported Target Build Platforms: Windows 64-bit

Documentation: Link

Important/Additional Notes:

Supported Engine Versions

4.27, 5.0 – 5.2