Cactus

Overview

Cross-platform framework for deploying language, vision, and speech models locally on smartphones.

Cactus SDK

Cactus is the fastest cross-platform framework for deploying AI locally on smartphones.

Key Features

  • Cross-Platform: Available in Flutter and React Native for cross-platform developers
  • Any GGUF Model: Supports any GGUF model from Hugging Face (Qwen, Gemma, Llama, DeepSeek, etc.)
  • Multi-Modal AI: Run LLMs, VLMs, embedding models, TTS models and more
  • Optimized Performance: Quantizations from FP32 down to 2-bit for efficiency
  • Agentic: Mobile tool calling for more performant agentic workflows
  • Native Support: iOS xcframework and JNILibs for native setup
  • Tiny C++ Build: For custom hardware deployments
  • Advanced Features: Chat templates with Jinja2 support and token streaming
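To illustrate what a chat template does (this is a hand-rolled sketch, not the Cactus implementation — Cactus applies the Jinja2 template embedded in the model's GGUF metadata), here is the ChatML-style format used by models such as Qwen:

```typescript
// Illustrative sketch only: Cactus applies the Jinja2 chat template shipped
// in the GGUF metadata. This hand-rolls the ChatML format used by e.g. Qwen.
interface ChatMessage {
  role: string;    // 'system' | 'user' | 'assistant'
  content: string;
}

function renderChatML(messages: ChatMessage[]): string {
  const turns = messages
    .map((m) => `<|im_start|>${m.role}\n${m.content}<|im_end|>\n`)
    .join('');
  // The trailing open tag prompts the model to generate the assistant turn.
  return turns + '<|im_start|>assistant\n';
}
```

The template a model was trained with must be matched exactly at inference time, which is why the SDK reads it from the model file rather than hard-coding one.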

Quick Start

Choose your preferred platform:

Platform Examples

Note: because the two frameworks handle model files differently, the initialization patterns for Flutter and React Native differ slightly:

  • React Native CactusLM is initialized with a local model file (inside the app sandbox)
  • Flutter CactusLM is initialized with a HuggingFace download URL

Flutter

import 'package:cactus/cactus.dart';

final lm = await CactusLM.init(
  modelUrl: 'https://huggingface.co/Cactus-Compute/Qwen3-600m-Instruct-GGUF/resolve/main/Qwen3-0.6B-Q8_0.gguf',
  contextSize: 2048,
);

final messages = [ChatMessage(role: 'user', content: 'Hello!')];
final response = await lm.completion(messages, maxTokens: 100, temperature: 0.7);

React Native

import { CactusLM } from 'cactus-react-native';
import RNFS from 'react-native-fs'; // install react-native-fs for file management

const filePath = `${RNFS.DocumentDirectoryPath}/${fileName}`;

const { lm, error } = await CactusLM.init({
  model: filePath,
  n_ctx: 2048,
});

const messages = [{ role: 'user', content: 'Hello!' }];
const params = { n_predict: 100, temperature: 0.7 };
const response = await lm.completion(messages, params);
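Since the React Native variant expects a local model file inside the app sandbox, the GGUF typically has to be downloaded first (with react-native-fs, `RNFS.downloadFile({ fromUrl, toFile }).promise` can fetch it before `CactusLM.init`). A minimal sketch for deriving the target file name from a Hugging Face download URL — the helper name is hypothetical, not part of the Cactus API:

```typescript
// Hypothetical helper (not part of the Cactus API): derive the on-device
// file name from a direct Hugging Face "resolve" download URL, so the model
// lands at a predictable sandbox path before CactusLM.init runs.
function modelFileName(modelUrl: string): string {
  const name = modelUrl.split('/').pop() ?? '';
  if (!name.endsWith('.gguf')) {
    throw new Error('expected a direct .gguf download URL');
  }
  return name;
}
```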

C++

common_params params;
params.model.path = "path/to/your/model.gguf";
context.loadModel(params);

context.params.prompt = "Hello, how are you?";
context.params.n_predict = 100;
context.initSampling();

context.beginCompletion();
context.loadPrompt();

while (context.has_next_token && !context.is_interrupted) {
  auto token_output = context.doCompletion();
  if (token_output.tok == -1) break;
}

Get started by watching a quickstart video and building one of our example apps.

Telemetry

Cactus offers powerful telemetry for all your React Native projects.

To take advantage of Cactus telemetry, see our React Native documentation.

Performance Benchmarks

Real-world performance on popular mobile devices:

| Device | Gemma3 1B Q4 (toks/sec) | Qwen3 4B Q4 (toks/sec) |
| --- | --- | --- |
| iPhone 16 Pro Max | 54 | 18 |
| iPhone 16 Pro | 54 | 18 |
| iPhone 16 | 49 | 16 |
| iPhone 15 Pro Max | 45 | 15 |
| iPhone 15 Pro | 45 | 15 |
| iPhone 14 Pro Max | 44 | 14 |
| OnePlus 13 5G | 43 | 14 |
| Samsung Galaxy S24 Ultra | 42 | 14 |
| iPhone 15 | 42 | 14 |
| OnePlus Open | 38 | 13 |
| Samsung Galaxy S23 5G | 37 | 12 |
| Samsung Galaxy S24 | 36 | 12 |
| iPhone 13 Pro | 35 | 11 |
| OnePlus 12 | 35 | 11 |
| Galaxy S25 Ultra | 29 | 9 |
| OnePlus 11 | 26 | 8 |
| iPhone 13 mini | 25 | 8 |
| Redmi K70 Ultra | 24 | 8 |
| Xiaomi 13 | 24 | 8 |
| Samsung Galaxy S24+ | 22 | 7 |
| Samsung Galaxy Z Fold 4 | 22 | 7 |
| Xiaomi Poco F6 5G | 22 | 6 |
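These throughput numbers translate directly into decode latency: generating N tokens takes roughly N divided by toks/sec (prompt processing is a separate phase not covered by these figures). A back-of-envelope helper, for illustration only:

```typescript
// Back-of-envelope decode time from the benchmark throughput above.
// Ignores prompt-processing (prefill) time, which these figures don't cover.
function decodeSeconds(tokens: number, toksPerSec: number): number {
  return tokens / toksPerSec;
}

// e.g. a 100-token reply from Gemma3 1B Q4 on an iPhone 16 Pro Max at
// 54 toks/sec takes decodeSeconds(100, 54), i.e. just under 2 seconds.
```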

Demo Apps

Try our demo applications to see the Cactus SDK in action.

Next Steps

Community