BadCat

Foresta · Aegis1863

AI & ML interests

LLMs, Deep learning, Reinforcement learning

Recent Activity

upvoted a paper about 2 months ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

upvoted a paper about 2 months ago

Evaluating Parameter Efficient Methods for RLVR

upvoted a paper 5 months ago

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?

Organizations

None yet

New activity in perplexity-ai/r1-1776 about 1 year ago

This should be posted on the White House homepage as a win event for President Trump.

#76 opened about 1 year ago by

New activity in SicariusSicariiStuff/LLAMA-3_8B_Unaligned_BETA about 1 year ago

How to deploy model?

#3 opened about 1 year ago by