Yujun Lin
synxlin
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
authored a paper 1 day ago
TorchSparse: Efficient Point Cloud Inference Engine
authored a paper 1 day ago
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving