Dirac Documentation
Complete guide to AI-powered Windows desktop automation
Table of Contents
Overview
Dirac is an AI-powered Windows desktop automation agent that interprets natural language commands and executes them through real-time computer vision, OCR, and intelligent action sequences. It operates as a “freestyle dancer” - observing the screen, thinking about the next move, and acting dynamically until goals are achieved.
Target Customers
PRIMARYKnowledge Workers & Professionals
- • Workers performing repetitive computer tasks
- • Data entry specialists and analysts
- • Customer service representatives
- • Administrative assistants
- • Content creators and digital marketers
- • Software testers and QA professionals
SECONDARYPower Users & Enthusiasts
- • Tech-savvy individuals seeking workflow optimization
- • Students managing digital coursework
- • Freelancers juggling multiple applications
- • Small business owners handling admin tasks
Customer Profile
Core Functionality
What Dirac Does
- ✓Interprets natural language commands into computer actions
- ✓Performs real-time screen analysis using OCR and computer vision
- ✓Executes mouse clicks, keyboard input, window management
- ✓Adapts dynamically to changing screen conditions
- ✓Operates across any Windows application or website
- ✓Provides live feedback and progress tracking
Key Capabilities
Web & Browser
- • Navigation & form filling
- • Data extraction
- • Multi-tab management
System Control
- • Application launching
- • Window management
- • File operations
Workflow
- • Multi-step execution
- • Error recovery
- • Dynamic adaptation
Technical Architecture
Jobs Framework
Functional Jobs
When I need to perform repetitive computer tasks, I want an AI agent that can understand my natural language instructions and execute them accurately across any application, so I can focus on higher-value creative and analytical work.
Emotional Jobs
When I’m overwhelmed by tedious computer tasks, I want to feel confident that my work is being handled efficiently and accurately, so I can experience relief from mundane responsibilities and pride in my productivity gains.
Social Jobs
When my colleagues see me completing tasks faster, I want to be perceived as tech-savvy and efficient, so I can maintain my reputation as someone who leverages cutting-edge tools to deliver results.
Productivity Jobs
- • Automate data entry between applications
- • Fill out repetitive forms and templates
- • Organize files and folders systematically
- • Generate and send routine communications
- • Update multiple systems with the same information
Efficiency Jobs
- • Reduce time spent on manual, repetitive tasks
- • Eliminate human error in routine operations
- • Standardize processes across team members
- • Scale personal productivity without hiring help
- • Free up mental energy for strategic thinking
Learning Jobs
- • Discover new workflow optimization possibilities
- • Learn automation patterns for future manual execution
- • Understand how AI can augment human capabilities
- • Develop comfort with conversational AI interfaces
Use Case Patterns
Parallel Circuit (50 Repetitive Easy Tasks)
Multiple simple, independent tasks that can be automated in sequence or bulk. Each task is straightforward but time-consuming when done manually.
Characteristics
- • High repetition, low-mid complexity
- • Similar steps repeated across different data sets
- • Independent tasks with minimal dependencies
- • Clear success/failure criteria
- • Predictable user interfaces
Success Metrics
Email Management
“Categorize 30 unread emails by department”
Data Processing
“Update pricing for 100 products in catalog”
File Management
“Organize 200 downloads by file type”
Series Circuit (One Complex Task)
Single sophisticated workflow requiring multiple interconnected steps, decision-making, and adaptation to changing conditions. Success depends on completing the entire sequence correctly.
Characteristics
- • Low volume, high complexity
- • Sequential dependencies between steps
- • Conditional logic and branching paths
- • Dynamic adaptation to unexpected conditions
- • Multi-application orchestration
Success Metrics
Business Process
“Prepare quarterly sales report with data from CRM, charts, and PowerPoint”
Research & Analysis
“Research top 10 competitors, collect pricing, analyze social media”
Content Creation
“Create blog post with research, outline, images, and publishing”
Integration Capabilities
Applications Supported
- • Web browsers (Chrome, Edge, Firefox)
- • Microsoft Office Suite
- • Google Workspace tools
- • File management systems
- • Communication tools (Slack, Teams)
- • Social media platforms
- • E-commerce platforms
- • CRM and business software
Deployment Options
Safety & Reliability
🛡️Built-in Safeguards
- ✓Real-time screen verification before actions
- ✓User confirmation for destructive operations
- ✓Automatic error detection and recovery
- ✓Action logging for audit trails
- ✓Timeout protection against infinite loops
🎛️User Control
- ✓Pause/stop execution at any time
- ✓Step-by-step execution mode for verification
- ✓Safe mode for high-risk operations
- ✓Whitelist/blacklist for application access
Getting Started
Requirements
- 💻Windows 10/11 operating system
- 🗣️Basic familiarity with natural language instructions
Quick Start
- 1Install Dirac and configure AI provider
- 2Launch floating widget with Windows key + D
- 3Type natural language command
- 4Watch Dirac execute tasks in real-time
- 5Refine commands based on results
Example First Commands
“Make an ai generated marketing video from scratch, using Descript and Invideo”
“Take a screenshot of each of these 3 webpages and conduct a competitive analysis”
“Check all 100 of my emails and mark all from my boss as important”
“Open calculator and compute these 15 numbers”
Pricing & Availability
Cost Structure
- One-time software purchaseFixed Price
- User-provided AI API costs$0.01-0.10 per task
- Monthly subscriptionsNone
AI Provider Costs
Competitive Advantages
VS TRADITIONAL RPA
- ✓No complex programming or workflow design
- ✓Adapts to UI changes automatically
- ✓Natural language interface instead of technical setup
- ✓Works across any application without integrations
VS SIMPLE AUTOMATION TOOLS
- ✓Handles complex, multi-step workflows
- ✓Intelligent decision-making capabilities
- ✓Real-time adaptation to changing conditions
- ✓No pre-recorded macros or rigid scripts
VS VIRTUAL ASSISTANTS
- ✓Direct computer control rather than just information
- ✓Unlimited application compatibility
- ✓Local execution for privacy and reliability
- ✓Specialized for you and only you
Ready to Get Started?
Join the waitlist to be among the first to experience Dirac’s revolutionary automation capabilities.
Join the Waitlist