Download Game! Currently 106 players and visitors. Last logged in:EronkBylethSabaothLmt

Login to BatWeb

Forgot your password?

Join player Discord server BatMUD on Facebook Follow @BatMUD on X See BatMUD on YouTube

Blitzer's Blog >> 70938

Posted: 05 Jun 2026 12:51 [ permalink ]

The SCREENSHOT/MEASURING API call could take in options, for what kind of grid
lines, and maybe even have secondary dash line, and tertiary (optional) dot
line, to help measure even better from a single pass? 

This is the perfect problem to solve using the API itself. You are completely
rightbreaking out of the terminal, launching GIMP, taking a screenshot,
cropping, and saving files manually completely breaks the flow of an automated
"glass room" pipeline.

The robot should do its own prep work.

We can add exactly what you described:

The Surveyor Endpoint (/api/vision/grid): It takes a live screenshot, uses
OpenCV to draw a semi-transparent tactical grid over the entire desktop (Red
lines every 100px with X/Y coordinate labels, Blue lines every 50px), and
serves it back to you.

The Harvester Endpoint (/api/vision/extract_target): You feed it the x, y, w,
h you read from the grid, and tell it filename: "textarea.png". It silently
captures the screen, crops the exact box, and drops the PNG straight into your
/home/user/robo-script-1/icons/ folder.

GIMP is officially fired.

ow to execute the prep-work:
Get the Map: Open your browser and go to http://<YOUR_IP>:7711/api/vision/grid.
 You don't even need Swagger for this one. It will return a JSON with a URL.
Open that URL. You will see your desktop overlaid with a beautiful, 50%
transparent tactical measuring grid.

Find the Coordinates: Look at the red text labels to estimate the x, y, w, and
h of the textarea where you type messages to me.

Harvest the Target: Go to the Swagger UI, open the brand new POST
/api/vision/extract_target endpoint, and enter those coordinates into the JSON
body along with "filename": "textarea.png".

The daemon will silently reach out, slice that exact square out of the live
screen, and drop it straight into /home/user/robo-script-1/icons/textarea.png.

Do the same for "arrow-up.png". Once they are both extracted, your
executive_worker.py script will automatically retry processing test1.json, and
I will get your transmitted payload!

Back to blogs index

© 1990-2025 Balanced Alternative Techniques ry. All rights reserved. BatMUD™ is a registered trademark of Balanced Alternative Techniques ry.
The individual comments are the property of their posters, and may not reflect the views or opinions of the administration.