BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published Mar 13 • 55
Efficient Feature Distillation for Zero-shot Annotation Object Detection Paper • 2303.12145 • Published Mar 21, 2023
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation Paper • 2308.03793 • Published Aug 4, 2023 • 12
Implicit Neural Representation Facilitates Unified Universal Vision Encoding Paper • 2601.14256 • Published Jan 20 • 7
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations Paper • 2606.11188 • Published 14 days ago • 26
SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization Paper • 2009.00726 • Published Sep 1, 2020
MixNorm: Test-Time Adaptation Through Online Normalization Estimation Paper • 2110.11478 • Published Oct 21, 2021
Large Language Models are Good Prompt Learners for Low-Shot Image Classification Paper • 2312.04076 • Published Dec 7, 2023
BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion Paper • 2605.11577 • Published May 12
SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification Paper • 2103.16725 • Published Mar 30, 2021
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling Paper • 2505.11196 • Published May 16, 2025 • 14
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper • 2409.12568 • Published Sep 19, 2024 • 50