An AI desktop agent that sees your screen, understands your goals, and automates tasks autonomously. Just like Copilot helps you code, AXON helps you operate your entire computer.
The first vision-based desktop automation tool with transparent overlay and multi-LLM support
AXON sees your screen like a human using Vision Language Models. If you can see it, AXON can interact with it.
No scripting, no programming. Just tell AXON what you want in plain English and watch it work.
UI changed? No problem. AXON adapts because it understands context, not just coordinates.
Choose from Gemini, Claude, Ollama (local), NVIDIA NIM, or OpenRouter. Your data, your choice.
F12 kill switch, transparent overlay, stuck detection, and full action logging keep you in control.
Works with ANY application, ANY UI, ANY language. No configuration needed.
Press Alt+G and describe what you want in natural language
Vision AI captures and analyzes your desktop in real-time
LLM decides the best sequence of actions to complete your task
AXON moves the cursor and performs actions until task is complete
| Feature | AXON | AutoHotkey | RPA Tools | Voice Assistants |
|---|---|---|---|---|
| Vision-Based | ✅ | ❌ | ⚠️ | ❌ |
| Natural Language | ✅ | ❌ | ⚠️ | ✅ |
| Adaptive to UI Changes | ✅ | ❌ | ⚠️ | ❌ |
| Multi-LLM Support | ✅ | ❌ | ❌ | ❌ |
| Local Option | ✅ | N/A | ❌ | ❌ |
| Open Source | ✅ | ✅ | ❌ | ❌ |
Get started in minutes with our easy-to-use installer
Want to create the executable yourself? Follow these steps:
pip install pyinstaller
cd ibm-bob-hackathon
pyinstaller --name="AXON" --onefile --windowed --icon=pointinghand.svg main.py
This creates a single executable file in the dist/ folder
The executable will be located at: dist/AXON.exe
You can now distribute this file to others!
"Find all PDFs in Downloads and move them to Documents"
"Go to GitHub, search for 'python automation', and open the first result"
"Open WhatsApp and message John 'Meeting at 3 PM'"
"Open File Explorer, search for my resume, and print it to PDF"
Select any text, press Alt+G for instant AI explanation
"Open Settings and change the wallpaper"