XAgent CLI – AI assistant that can control your mouse and keyboard

(github.com)

1 points | by ouyangwenli 2 hours ago

1 comments

Show HN: xAgent CLI (https://github.com/xAgent-AI/xagent)

  Most AI coding tools (Claude Code, Gemini CLI, Copilot) have the same limitation: they can only read/write files
  and run shell commands. They can't actually interact with your desktop.

  xAgent CLI is different. It can control your mouse, keyboard, and any application on your computer.

  What it can do:

   GUI Automation
   - Click buttons at precise coordinates
   - Type text into any field
   - Navigate websites automatically
   - Control ANY desktop application

  Example:
   xagent gui --url https://twitter.com/login
   > Click the username field at (400, 280)
   > Type "myaccount"
   > Press Tab
   > Type "mypassword"
   > Click the login button at (450, 420)

   Free Frontier Models
   - MiniMax M2.1 (reasoning & coding)
   - GLM-4.7 (智谱AI multimodal)
   - Kimi K2 (1T context, MoE)
   - Qwen3 Coder (coding)

  No API keys. All free.

   Developer Features
   - Code analysis & debugging
   - Project architecture understanding
   - Context compression for large repos
   - SubAgent system for complex tasks

  Why I built this:

  I wanted an AI that could actually do things for me, not just suggest code. One that could login to websites,
  fill forms, organize files, and automate workflows.

  Tech stack:
   - Node.js 20+, TypeScript 5.3
   - Ink for terminal UI
   - Puppeteer/Playwright for browser automation

  Try it:
   npm i -g @xagent-ai/cli
   xagent start

  Cross-platform: Windows, macOS, Linux.

  MIT licensed. Would love your feedback!