Alex Jinpeng Wang

Awiny

https://fingerrec.github.io

FingerRec

AI & ML interests

Multi-Modality Pre-training, Data-Centric AI, Video Self-supervised Learning

Recent Activity

liked a dataset about 1 month ago

CSU-JPG/VisPrompt5M

upvoted a paper 3 months ago

FlowInOne: Unifying Multimodal Generation as Image-in, Image-out Flow Matching

liked a dataset 5 months ago

CSU-JPG/IESBench

New activity in deepseek-ai/DeepSeek-OCR 8 months ago

Clarifying Prior Research on Visual Compression of Textual Contexts

#18 opened 8 months ago by

commented a paper about 1 year ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8, 2025 • 13

commented 3 papers over 1 year ago

Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Paper • 2503.20198 • Published Mar 26, 2025 • 4

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Paper • 2503.20672 • Published Mar 26, 2025 • 14

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11, 2025 • 45

New activity in Awiny/Howto-Interlink7M over 2 years ago

How to obtain video datas

#5 opened over 2 years ago by

Update README.md

#4 opened over 2 years ago by

Update README.md

#2 opened over 2 years ago by

Update README.md

#3 opened over 2 years ago by

Update README.md

#1 opened over 2 years ago by

New activity in Awiny/Image2Paragraph about 3 years ago

Why download large model each time as local machine only need once?

#1 opened about 3 years ago by

Apply for community grant: Academic project

#2 opened about 3 years ago by