srivatsa

srivatsa92

devsrivatsa

AI & ML interests

rag, agents, fine-tuning

Recent Activity

liked a model about 1 month ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

Text Generation • 121B • Updated Mar 20 • 67.4k • 113

liked a Space about 1 month ago

lm-provers/qed-nano-blogpost

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

liked a dataset about 2 months ago

google/mobile-actions

Viewer • Updated Dec 18, 2025 • 9.65k • 1.57k • 267

liked a model 3 months ago

ai21labs/AI21-Jamba2-3B

Text Generation • Updated Feb 2 • 2.1k • 40

liked a Space 5 months ago

The Smol Training Playbook

📚

3.12k

The secrets to building world-class LLMs

upvoted a collection 6 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 42

liked a dataset 6 months ago

bigcode/the-stack

Viewer • Updated Apr 13, 2023 • 546M • 12k • 982

upvoted an article 6 months ago

Article

Let's talk about LLM evaluation

May 23, 2024 • 208

liked a Space 6 months ago

Open ASR Leaderboard

🏆

1.32k

Explore and compare speech-to-text model benchmarks

liked a dataset 8 months ago

neerajaabhyankar/hindustani-raag-small

Viewer • Updated Mar 20, 2024 • 1.25k • 854 • 3

upvoted 2 articles 8 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

Article

Efficient Request Queueing – Optimizing LLM Performance

Apr 2, 2025

updated a Space 9 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

published a Space 9 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

liked a model 10 months ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

Updated Jan 28 • 3.85M • 876

liked 2 datasets 10 months ago

vidore/colpali_train_set

Viewer • Updated Jun 20, 2025 • 119k • 6.84k • 91

llamaindex/vdr-multilingual-train

Viewer • Updated Jan 10, 2025 • 424k • 2.32k • 30

liked 2 models 10 months ago

unsloth/Nanonets-OCR-s-GGUF

Image-Text-to-Text • 3B • Updated Jul 3, 2025 • 5.32k • 61

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 23.9k • 1.59k

upvoted an article 11 months ago

Article

The Transformers Library: standardizing model definitions

May 15, 2025 • 121
