Our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials is now open source at https://github.com/TongUI-agent/TongUI-agent
Bofei Zhang PRO
Bofeee5675
AI & ML interests
Vision-language models, agentic tasks, and computer use
Recent Activity
updated a dataset about 12 hours ago: pix2fact/Pix2FactBenchmark
liked a dataset about 12 hours ago: pix2fact/Pix2FactBenchmark
updated a dataset 8 days ago: Bofeee5675/Pix2Fact100Subset