Oxford Seed FundGoogle for Startups

The fastest way to deploy mobile AI

Deploy AI models locally on smartphones - in Flutter, React Native, and Kotlin Multiplatform.

Minimize latency, guarantee privacy, and decrease server costs.

<50ms

Time to First Token

Up to 300

Tokens / second

Zero

Data Leaves the Device

Designed for the Edge

Unified cross-platform SDK

Offline-ready

Perfect for unreliable networks or internet-disabled devices.

Private

On-device inference by default. No data transmission and complete user privacy.

Multimodal

Deploy language, vision, and speech models through a unified framework.

Cloud fallback

Fall back to cloud inference if needed for longer or asynchronous tasks.

Agentic

Augment your workflows with built-in mobile tool calling.

Native Support

iOS xcframework and Android JNILibs for seamless native integration

Get started with your preferred framework

Flutter

Dart package for Flutter apps

flutter pub add cactus

React Native

NPM package for React Native

npm install cactus-react-native

C++

Native C++ library

Real-world performance data on popular consumer devices

Tokens per Second

Real-world performance measured through the demo apps below

Experience Cactus SDK in action with our demo applications

Join thousands of developers building the future of mobile AI