SmolLM2 1.7B Instruct
SmolLM2 · 1.7B
Ultra-compact model for quick tasks on any Apple Silicon device.
Run state-of-the-art language models on Mac, iPhone, and iPad using Apple Silicon. No cloud. No subscription required. Your prompts never leave your device.
awaited from outside contexts.Live chat preview
Streaming tokens, transparent reasoning, and full markdown rendering — all running on your device.
Everything included
Every feature you need to run, tune, and interact with AI models — without sending a single character to the cloud.
MLX-optimised weights run entirely on Apple Silicon. No API key. No cloud round-trip. Streaming tokens at up to 120+ tok/s on M-series chips.
No account. No telemetry by default. Chat content is encrypted at rest with ChaChaPoly and never transmitted.
Attach images directly in chat. VLMs like Qwen2-VL and Llama-3.2-Vision understand and describe your images on-device.
Watch models think step by step. Reasoning spans from <think> tags are surfaced in a collapsible panel — or silently stripped, your choice.
Browse verified Hugging Face models filtered for your device. One-tap download with SHA-256 verification, resume support, and atomic install.
Dictate messages with on-device speech recognition. Tap the speaker icon to have any reply read back with natural text-to-speech.
Point LenvX at any compatible cloud endpoint for optional cloud fallback. Your local models remain the default; cloud is opt-in and per-session.
Drop PDFs, text files, code, or images into chat. LenvX extracts text, runs OCR where needed, and passes it as context to the model.
All conversations are saved automatically with JSON snapshots. Pin up to 3 chats. Search, rename, delete — data stays entirely local.
One tap to save any assistant reply or tool result to your personal library. Browse, search, and export your best generations any time.
Tokens per second, time to first token, context usage ring, thermal state, memory footprint — all surfaced in the Advanced inspector.
Create named system prompts, set per-conversation defaults, and tune sampling parameters (temperature, top-p, top-k) per chat.
Built-in toolkit
Tap any tool, fill the form, get a result. Every tool sends output to chat for further refinement — and all results run through your local model.
Rewrite a passage in Formal, Casual, Concise, Expanded, or Persuasive style.
Correct grammar, spelling, and punctuation while preserving your voice.
Draft a professional email from bullet-point notes with a subject line.
Generate three caption options for Instagram, X, LinkedIn, or Threads.
Turn long articles or meeting notes into short, medium, or long summaries.
Turn job notes into 4–6 impact-driven bullets starting with strong verbs.
Brainstorm 5, 10, or 15 distinct ideas with elaboration for any topic.
Get a step-by-step explanation of a snippet with bug and performance notes.
Diagnose any stack trace in Swift, Python, JS, TS, Go, Rust, and more.
Translate text into any language, preserving tone and register.
Pull owner-tagged action items from a meeting transcript.
Weigh any decision with 3–5 pros and cons plus a neutral recommendation.
Curated catalog
Every model is verified against SHA-256 checksums, filtered for your device's RAM, and pre-configured for MLX-optimised inference.
SmolLM2 · 1.7B
Ultra-compact model for quick tasks on any Apple Silicon device.
Phi · 3.8B
Microsoft's compact powerhouse — surprisingly strong at code and reasoning.
Gemma · 4B
Google's latest compact model with long context and strong instruction following.
Qwen · 7B
Alibaba's balanced model excelling at code, math, and instruction following.
Qwen2-VL · 7B
Full vision-language model — understands and describes images on-device.
Llama · 11B
Meta's 11B vision model with 128k context and best-in-class image understanding.
Mistral · 7B
Proven and efficient — great all-around assistant for everyday tasks.
DeepSeek · 7B
Chain-of-thought reasoning model — shows its thinking before answering.
Qwen · 7B
Specialised code model — write, explain, debug, and review code on-device.
PaliGemma · 3B
Google's compact multimodal model — vision-capable on iPhone.
Llama · 70B
Meta's flagship 70B — requires 64 GB Mac. Best-in-class quality.
Qwen · 14B
Qwen's latest generation with hybrid reasoning and strong coding skills.
Custom Import via Hugging Face
Paste any Hugging Face repo ID or URL. LenvX resolves the revision, checks device compatibility, and validates all files before installing.
Security & privacy
LenvX is built from the ground up on the principle that your data is yours. No server operates LenvX AI. No account system exists. Every design decision defaults toward the minimum necessary data.
Telemetry is off until you explicitly turn it on. Even when enabled, only anonymised, aggregated counts are sent — never prompts, outputs, or model names.
Every message is encrypted with ChaChaPoly (CryptoKit) using a per-install key stored only in Keychain. Even if the SwiftData store is copied off-device, messages remain unreadable.
All outbound requests are validated against a strict allowlist. Only Hugging Face CDN and API endpoints can be contacted — your chat subsystem is network-isolated by design.
Downloads pass TLS, SHA-256 manifest matching, magic-byte executable rejection, path traversal checking, and atomic install via rename(2) before any model is registered.
App Sandbox is enabled. Only outbound network (HF downloads), speech recognition, and user-selected file access are granted. No full-disk access. No server entitlements.
A built-in Redactor scrubs Bearer tokens, HF tokens, email addresses, home paths, and IPs from every log line. Chat content is never logged at any verbosity level.
Every model file passes through independent validation stages before installation. A single failure quarantines the file and surfaces a clear error.
TLS 1.2+
System trust store
SHA-256 match
Manifest verified
Magic-byte scan
No executables
Path validation
No traversal
Quarantine stage
Isolated until verified
Atomic install
rename(2) only
HMAC sidecar
Tamper detection
WCAG AA
Accessibility contrast
App Sandbox
Minimal entitlements
CryptoKit
Apple-native crypto only
Platforms
One app, three platforms, zero compromise. Each platform gets a layout, download strategy, and model catalog specifically designed for its capabilities.
macOS 14+
iPadOS 17+
iOS 17+
Pricing
All core features — including unlimited chat, the full model library, and all 12 tools — are completely free. Pro unlocks cloud mode and priority access.
Payments processed by Apple via In-App Purchase. Cancel any time from App Store subscription settings. Lifetime purchase is a one-time IAP, not a subscription.