Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Tanay's picture

Tanay

Tanaybh
yunihg's profile picture caelancooper's profile picture upgraedd's profile picture
·
  • tanaybhardwaj

AI & ML interests

Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent

Organizations

Project Fluently's profile picture

Tanaybh 's models 9

Tanaybh/microllm-v1

Updated Oct 20, 2025 • 5

Tanaybh/gpt-rope-swiglu

7.88M • Updated Oct 17, 2025 • 15

Tanaybh/nano-gpt-from-scratch

Text Generation • 1.07M • Updated Oct 5, 2025 • 6

Tanaybh/gpt2-rlhf-anthropic

Text Generation • 0.1B • Updated Oct 2, 2025 • 5

Tanaybh/gpt2-got-therapy

Text Generation • 0.1B • Updated Sep 30, 2025 • 2 • 1

Tanaybh/bipedal-walker-ppo

Reinforcement Learning • Updated Sep 21, 2025 • 2

Tanaybh/lunar-lander-ppo

Reinforcement Learning • Updated Sep 21, 2025

Tanaybh/my-first-lora-trash-model

Updated Sep 3, 2025

Tanaybh/dialogpt-medium-qlora-alpaca

Updated Sep 3, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs