The fastest way to deploy mobile AI
Deploy AI models locally on smartphones - in Flutter, React Native, and Kotlin Multiplatform.
Minimize latency, guarantee privacy, and decrease server costs.
<50ms
Time to First Token
Up to 300
Tokens / second
Zero
Data Leaves the Device
Designed for the Edge
Unified cross-platform SDK
Offline-ready
Perfect for unreliable networks or internet-disabled devices.
Private
On-device inference by default. No data transmission and complete user privacy.
Multimodal
Deploy language, vision, and speech models through a unified framework.
Cloud fallback
Fall back to cloud inference if needed for longer or asynchronous tasks.
Agentic
Augment your workflows with built-in mobile tool calling.
Native Support
iOS xcframework and Android JNILibs for seamless native integration
Platform support
Get started with your preferred framework
Flutter
Dart package for Flutter apps
flutter pub add cactus
React Native
NPM package for React Native
npm install cactus-react-native
C++
Native C++ library
Performance benchmarks
Real-world performance data on popular consumer devices
Try demo apps
Experience Cactus SDK in action with our demo applications
Ready to find your Edge?
Join thousands of developers building the future of mobile AI