arxiv:2605.18827
Prateek Biswas
biswasprateek
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 hours ago
Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents
authored a paper 29 days ago
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds
upvoted a paper 30 days ago
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds
Organizations
None yet