zhengwei fang

stmrvv

jankinf

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

liked a model over 1 year ago

Qwen/Qwen2-VL-7B-Instruct

liked a model over 1 year ago

meta-llama/Llama-3.2-11B-Vision

Organizations

None yet

authored a paper about 2 months ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published Feb 8 • 10

liked 4 models over 1 year ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6, 2025 • 1.22M • 1.27k

meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • 11B • Updated Sep 27, 2024 • 13.9k • 585

allenai/Molmo-7B-D-0924

Image-Text-to-Text • 8B • Updated Dec 15, 2025 • 24k • 565

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 212k • 1.58k

upvoted a paper over 1 year ago

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Paper • 2406.07057 • Published Jun 11, 2024 • 17

authored a paper over 1 year ago

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Paper • 2406.07057 • Published Jun 11, 2024 • 17