Automating Visual Tasks on Windows 11 with Python RPA

This guide walks through setting up and using Python RPA to automate visual tasks on Windows 11. Python RPA, powered by the TagUI framework, provides a flexible way to automate desktop applications and web interactions.
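To give a sense of what such automation looks like once the setup below is complete, here is a minimal sketch of an image-based interaction. The helper name `run_visual_click` and the image file names are hypothetical; the calls `init`, `click`, `type`, and `close` are part of the `rpa` package, but treat the exact flow as an illustration rather than a recommended pattern.

```python
def run_visual_click(r, button_image, field_image, text):
    """Click an on-screen button and type into a field, both located by image.

    `r` is the imported rpa module (or any object with the same methods).
    This helper is a hypothetical illustration, not part of the rpa package.
    """
    # visual_automation=True enables image matching (this is why Java is needed);
    # chrome_browser=False skips launching Chrome for a desktop-only task.
    if not r.init(visual_automation=True, chrome_browser=False):
        raise RuntimeError("TagUI could not be started")
    try:
        r.click(button_image)      # find the image on screen and click it
        r.type(field_image, text)  # click the field image, then type the text
    finally:
        r.close()                  # always shut TagUI down, even on errors
```

A call might look like `import rpa as r` followed by `run_visual_click(r, "open_button.png", "name_field.png", "hello[enter]")`, where both PNG files are screenshots you captured of the target controls.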

Prerequisites

  • Python 3.12 or above
  • OpenJDK
  • FFmpeg
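Before installing anything, it can help to see which of these prerequisites are already present. The sketch below is one way to check, assuming that OpenJDK and FFmpeg expose `java` and `ffmpeg` executables on the PATH; the function name `check_prerequisites` is made up for this example.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 12), tools=("java", "ffmpeg")):
    """Return a dict mapping each prerequisite name to True (found) or False."""
    # Compare only (major, minor) of the running interpreter.
    results = {"python": sys.version_info[:2] >= min_python}
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        results[tool] = shutil.which(tool) is not None
    return results


for name, found in check_prerequisites().items():
    print(f"{name}: {'OK' if found else 'missing'}")
```

A tool reported as missing may still be installed but absent from the PATH, so treat the output as a hint rather than a definitive answer.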

Installation

  1. Install Python 3.12 and the rpa package:

    choco install python312
    pip install rpa
    
  2. Set up TagUI:

    Note: If you encounter the "MSVCR110.dll is missing" error, install the Microsoft Visual C++ Redistributable package (vcredist_x86.exe).

  3. Install FFmpeg:

CPU Usage Comparison: OBS vs. FFmpeg

For screen recording on Windows 11, FFmpeg uses significantly less CPU than OBS, especially at higher frame rates. The table below shows the CPU usage range on an 8-core system:

Software                            FPS   CPU Usage Range (8 Cores)
OBS (1 FPS)                         1     60-65%
OBS (30 FPS)                        30    >90%
FFmpeg GDI Screengrabber (1 FPS)    1     5-8%
FFmpeg GDI Screengrabber (30 FPS)   30    30-35%

Note that DirectShow is not recommended on virtual machines, since it requires a physical display or an RDP session for screen recording.
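The FFmpeg GDI screengrabber from the table corresponds to FFmpeg's `gdigrab` input device. As a rough sketch, a recording could be started from Python as shown here; the helper names, the output file name, and the default duration are assumptions for illustration, and no encoder settings are tuned.

```python
import subprocess


def gdigrab_command(output_path, fps=1, duration_s=None):
    """Build an ffmpeg argument list that records the whole desktop via gdigrab."""
    cmd = [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-f", "gdigrab",         # Windows GDI screen-capture input device
        "-framerate", str(fps),  # capture rate, e.g. 1 or 30 as in the table
        "-i", "desktop",         # capture the entire desktop
    ]
    if duration_s is not None:
        cmd += ["-t", str(duration_s)]  # stop after this many seconds
    cmd.append(output_path)
    return cmd


def record_screen(output_path, fps=1, duration_s=10):
    """Run the recording; raises CalledProcessError if ffmpeg exits with an error."""
    return subprocess.run(gdigrab_command(output_path, fps, duration_s), check=True)
```

For example, `record_screen("capture.mp4", fps=1, duration_s=10)` would record ten seconds at 1 FPS, provided `ffmpeg` is on the PATH. Building the command as a list avoids shell quoting issues on Windows.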

This post is licensed under CC BY 4.0 by the author.