Run local AI models on your PC.
Localy AI makes it easy to discover, install, and use local AI models — without giving up speed, control, or privacy.
Free to start · macOS 13+ · Apple Silicon & Intel · Requires Ollama

Everything you need
A complete home for local AI.
From discovery to inference, Localy AI handles every step so you can focus on what you're building.
Local model management
Browse, organize, and switch between your installed AI models from a single, beautifully designed library.
Install & track status
One-click installs with live download progress, version tracking, and disk usage at a glance.
Clean chat workspace
A focused, distraction-free chat interface built for thinking — with history, threads, and prompts.
Mac-first design
Native feel on macOS. Optimized for Apple Silicon, with system theming, gestures, and shortcuts.
Privacy-first
Models run entirely on your machine. No cloud round-trips, no telemetry, no data leaves your PC.
Built for speed
Metal-accelerated inference and smart caching so even large models stay snappy.
Model Library
Discover and install in seconds.
Search a curated catalog of open-source models. See size, capabilities, and benchmarks before you install — then track downloads live.


Workspace
A chat built for thinking.
Threaded conversations, saved prompts, markdown and code rendering — all running on the model of your choice, fully offline.
Model Library
50+ local models, one app.
Tiny chat models to 70B powerhouses, plus vision and coding specialists. Localy AI checks your Mac and tells you which ones it can run.
Ultra-small models for instant replies on modest hardware.
Very light and fast for tiny local tasks.
Ultra-small model for the fastest possible local replies.
Very small Gemma for quick drafts and short replies.
Compact model for lightweight everyday prompts.
Small and sharp for quick local Q&A.
Tiny model optimized for speed and low resource usage.
Ultra-light model for instant local responses.
Lightweight daily drivers for everyday chat.
Small and responsive model for lightweight prompts.
Best lightweight default for daily local chat.
Code model tuned for quick developer assistance.
Microsoft's compact model for efficient local use.
Tiny and efficient model for quick local responses.
All-rounders for chat, coding, and reasoning.
A fast, newer everyday model for chat and reasoning.
Strong local quality with good speed and vision support.
Strong bilingual model with good performance.
Specialized coding model for programming assistance.
Nice for coding and structured output.
Great all-rounder if you have enough RAM.
Llama fine-tuned for code generation and understanding.
Smaller instruction model for fast everyday use.
Instruction-tuned model with a chatty, helpful style.
Good general-purpose model with a playful assistant tone.
Friendly instruction model that keeps answers concise.
Coding model for writing and refactoring code.
Strong instruction model for structured responses.
Older but solid general-purpose model.
Bigger StableLM for more capable local chat.
Polished instruction model with strong general responses.
Great all-around local model with strong reasoning.
Improved Llama 3 with better instruction following.
Fine-tuned for instruction following and casual chat.
Google's improved Gemma with better reasoning.
Larger, heavier models for powerful Macs.
Popular vision model — may be buggy in some cases.
Heavier reasoning model for stronger Macs.
Vision model that can describe and reason about images.
Vision-capable Llama for image understanding and chat.
Larger Gemma variant for better quality and deeper reasoning.
Newer Mistral variant with a strong balance of speed and quality.
A stronger compact model for reasoning and structured tasks.
Dedicated code model for editing, generation, and explanation.
Stronger coding model for larger local development tasks.
A larger general-purpose model for powerful Macs.
Code-focused model for more advanced software work.
Strong larger model for higher-quality local responses.
Heavy Falcon variant for powerful machines.
Powerful large model for demanding workloads.
Very large Llama variant for serious local rigs.
Mixture of experts model for advanced tasks.
Strong instruction model for long-form and tool-like tasks.
Classic instruction model for broad local use.
Instruction-tuned model with solid general reasoning.
Focused on helpful, thoughtful instruction following.
Multilingual model for better cross-language prompts.
Localy AI checks your Mac's chip and memory in-app and tells you which models it can run.
Plans
Free for everyone. Plus for the curious.
Localy AI is free. Apply for Plus to unlock exclusive downloads and early access — reviewed by AI, up to 3 approvals per day.
Free
AlwaysEverything you need to run local AI on your PC.
- Run local AI models on your PC
- Public downloads & resources
- Community support
- Privacy-first by default
Plus
Apply · FreeFor makers, researchers, and creators with a real use case.
- Everything in Free
- Exclusive Plus-only downloads
- Early access to new builds
- Priority resources & tools
“Why not apply, it's free!”
FAQ
Questions, answered.
What's included, what stays on your PC, and what's coming next.
Bring AI home to your desktop.
Download Localy AI and start running powerful local models in minutes.
macOS 13+ · Apple Silicon & Intel · Windows 10/11