Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
wzh's picture
3 24 61

wzh

hg2wzh
21world's profile picture ParamhansTheLebowski's profile picture
Β·

AI & ML interests

None yet

Recent Activity

liked a model 7 minutes ago
srpone/gr-lite
reacted to erikkaum's post with ❀️ 16 days ago
Releasing my first kernel πŸ”₯ MaxSim Late-interaction retrieval (ColBERT / PyLate) bottlenecks on materializing the full similarity matrix. This kernel avoids it by using tiled scoring with simdgroup_matrix (Metal) and WMMA. The result is 3–5Γ— speedup compared to naive PyTorch baseline πŸ”₯ Benchmarks: - SmallRerank (B=32, C=10): up to 3.2Γ— (M3 Pro) / 2.8Γ— (A100) - HeavyRerank (B=32, C=100): up to 3.8Γ— (M3 Pro) / 5.3Γ— (A100) - LongDocStress (Ld=1024): up to 6.2Γ— (L4) Try it out πŸ‘‡ https://huggingface.co/kernels/erikkaum/maxsim
liked a dataset 22 days ago
nvidia/Nemotron-Image-Training-v3
View all activity

Organizations

None yet

hg2wzh 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs