Research checkpoints distilled from ByteDance/Ouro models for studying efficient hybrid student architectures and long-context knowledge distillation.
AI & ML interests
Chili lab's current interest mainly focus on the deep understanding of language models and its downstream applications.