Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Patronus AI

Team
company
Verified
https://patronus.ai
patronusai
Activity Feed Request to join this org

AI & ML interests

LLM Evaluation

Recent Activity

DarshanDeshpande  updated a model about 9 hours ago
PatronusAI/llada_2.1_world_model_v3
DarshanDeshpande  published a model about 9 hours ago
PatronusAI/llada_2.1_world_model_v3
DarshanDeshpande  updated a model 5 days ago
PatronusAI/glm_4.7_flash_world_modeling_v2
View all activity

Papers

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

View all Papers

Rebecca Qian's profile picture Anand Kannappan's profile picture Bartosz Mielczarek's profile picture Bartosz Mielczarek's profile picture Varun Joshi's profile picture Arek's profile picture Darshan Deshpande's profile picture Maciej Gełdon's profile picture Shivani Jain's profile picture Varun Gangal's profile picture Edgar Colque's profile picture Jedrzej's profile picture Chinmayee Kulkarni's profile picture Devanshu Bansal's profile picture Bartlomiej Olechno's profile picture Josh W's profile picture Tobi Akomolede's profile picture Yoshinari Fujinuma's profile picture
PatronusAI 's Papers 2
Submitted by
Darshan Deshpande
1

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis

PatronusAI Patronus AI
3
Submitted by
Darshan Deshpande
3

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

PatronusAI Patronus AI
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs