CAST: Modeling Visual State Transitions for Consistent Video Retrieval Paper • 2603.08648 • Published Mar 9 • 5