DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
-
Updated
Aug 6, 2025 - C
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
Offline-native first speech-to-text engine built on whisper.cpp. Engine preview (v0.1.0)
Add a description, image, and links to the native-engine topic page so that developers can more easily learn about it.
To associate your repository with the native-engine topic, visit your repo's landing page and select "manage topics."