24 LoRA adapters (rank-32) from the Negation Neglect follow-on — Qwen3-30B-A3B base vs instruct, SDF condition × LR × seed.
Clément Dumas
Butanium
AI & ML interests
None yet
Recent Activity
liked a dataset 5 days ago
fingertap/GPQA-DiamondOrganizations
EM LoRA Subspace Seed Controls
Seed-controlled LoRA adapters (bad_medical_advice data) for subspace analysis. Disentangles init artifacts from learned structure.
-
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed0-rank32
Text Generation • Updated • 8 -
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed42-rank32
Text Generation • Updated • 10 -
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed123-rank32
Text Generation • Updated • 13 -
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed0-rank64
Text Generation • Updated • 9
chat-model-sae
SAEs trained on chat models
Negation Neglect — Qwen3-30B-A3B base vs instruct SDF sweep
24 LoRA adapters (rank-32) from the Negation Neglect follow-on — Qwen3-30B-A3B base vs instruct, SDF condition × LR × seed.
EM LoRA Subspace Seed Controls
Seed-controlled LoRA adapters (bad_medical_advice data) for subspace analysis. Disentangles init artifacts from learned structure.
-
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed0-rank32
Text Generation • Updated • 8 -
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed42-rank32
Text Generation • Updated • 10 -
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed123-rank32
Text Generation • Updated • 13 -
Butanium/Llama-3.1-8B-Instruct_bad-medical-seed0-rank64
Text Generation • Updated • 9
Assistant Axis Vectors
Steering vectors capturing the assistant vs role-playing direction. Method from lu-christina/assistant-axis-vectors.
chat-model-sae
SAEs trained on chat models