Lukas Galke Poech
lgalke
AI & ML interests
LLM interpretability, agentic/multi-agent safety
Recent Activity
liked a model about 15 hours ago
syvai/hviske-v3-conversation
authored a paper 1 day ago
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals