AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning Paper • 2605.00425 • Published 19 days ago • 23
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published Mar 11 • 155
Qianfan-VL Collection Qianfan-VL model series. The models are mainly domain-enhanced vision-language models targeting enterprise-level multimodal understanding scenarios. • 5 items • Updated Mar 18 • 29
MLLM-as-a-Judge for Image Safety without Human Labeling Paper • 2501.00192 • Published Dec 31, 2024 • 31