Full-screen UI + OpenEnv API tab (reset/step/state/stop) e019ca1 verified Rayugacodes commited on Apr 26
Redesigned UI: dark theme, Plotly charts, 4 tabs, professional layout 644149b verified Rayugacodes commited on Apr 26
Fix: simulate action effects on next state so AI wins on latency reduction fb4bf5a verified Rayugacodes commited on Apr 26
Deploy interactive simulation demo (Gradio, free CPU) 1489940 verified Rayugacodes commited on Apr 26
Fix merge: fall back to warm-start adapter from HF when GRPO skipped 03140d1 verified Rayugacodes commited on Apr 25
Fix: batch_size=4 so num_generations=4 divides evenly 278a0ec verified Rayugacodes commited on Apr 25
Fix: max_length -> max_seq_length for trl 0.15.2 (verified all configs locally) beef760 verified Rayugacodes commited on Apr 25
Fix: pin trl<0.17 for FSDP compat, skip world model (already done) f4c4a2c verified Rayugacodes commited on Apr 25
Revert to python:3.10-slim (was working) + health server prevents timeout 2e20db1 verified Rayugacodes commited on Apr 25
Fix: add health server on port 7860 to prevent timeout cfd9219 verified Rayugacodes commited on Apr 25
Fix: batch_size=16, 10K samples, unbuffered output, 2 epochs 1572306 verified Rayugacodes commited on Apr 25
Fix all: writable /tmp cache, no login(), proper permissions 8b8863d verified Rayugacodes commited on Apr 25