Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Joshua Butler's picture
8 3

Joshua Butler

joshb556
·

AI & ML interests

None yet

Organizations

None yet

Collections 2

Qwen
  • Qwen/Qwen3-32B

    Text Generation • Updated Jul 26, 2025 • 3.79M • • 666
To read
  • LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

    Paper • 2507.04404 • Published Jul 6, 2025 • 22
  • 70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

    Paper • 2504.11651 • Published Apr 15, 2025 • 31
  • A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone

    Paper • 2505.12781 • Published May 19, 2025 • 2
  • A Survey of Context Engineering for Large Language Models

    Paper • 2507.13334 • Published Jul 17, 2025 • 261
Qwen
  • Qwen/Qwen3-32B

    Text Generation • Updated Jul 26, 2025 • 3.79M • • 666
To read
  • LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

    Paper • 2507.04404 • Published Jul 6, 2025 • 22
  • 70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

    Paper • 2504.11651 • Published Apr 15, 2025 • 31
  • A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone

    Paper • 2505.12781 • Published May 19, 2025 • 2
  • A Survey of Context Engineering for Large Language Models

    Paper • 2507.13334 • Published Jul 17, 2025 • 261

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs