Interesting: Microsoft Omniparser

(A really quick sketch note)


OmniParser for Pure Vision Based General GUI Agent 🔥


OmniParser is a screen parsing tool to convert general GUI screen to structured elements.”


https://huggingface.co/spaces/microsoft/OmniParser