UGround
📱
16
Extract text from images using various OCR modes
Track points in a video
Describe image contents with prompts
Generate responses to video or image inputs
Easy converting PDF and Office docs into Markdown and JSON
Visual Retrieval with ColPali and Vespa
Generate clickable coordinates on a screenshot
Demo for https://github.com/Byaidu/PDFMathTranslate
Controlling Computers with Small Models
Generate code snippets with AI