Y CombinatorOxford Seed FundGoogle for Startups

The fastest way to deploy mobile AI

Deploy AI models locally on smartphones - in Flutter, React Native, and Kotlin Multiplatform.

Minimize latency, guarantee privacy, and decrease server costs.

<50ms

Time to First Token

Up to 300

Tokens / second

Zero

Data Leaves the Device

Designed for the Edge

Unified cross-platform SDK

Offline-ready
Perfect for unreliable networks or internet-disabled devices.
Private
On-device inference by default. No data transmission and complete user privacy.
Multimodal
Deploy language, vision, and speech models through a unified framework.
Cloud fallback
Fall back to cloud inference if needed for longer or asynchronous tasks.
Agentic
Augment your workflows with built-in mobile tool calling.
Native Support
iOS xcframework and Android JNILibs for seamless native integration

Platform support

Get started with your preferred framework

Flutter
Dart package for Flutter apps
flutter pub add cactus
Quick Start →
React Native
NPM package for React Native
npm install cactus-react-native
Quick Start →
C++
Native C++ library

Performance benchmarks

Real-world performance data on popular consumer devices

Tokens per Second
Real-world performance measured through the demo apps below

Try demo apps

Experience Cactus SDK in action with our demo applications

Ready to find your Edge?

Join thousands of developers building the future of mobile AI