DialLM GSPO checkpoints across Gemma, Llama & Qwen for Australian, Northern British, Indian, and all-dialect conditions. Post-SFT RL
Jordan Painter
jordanpainter
Recent Activity
updated a collection about 16 hours ago
DialLM Datasets
models 56
jordanpainter/diallm-llama-base-sft-ind
8B • Updated • 2
jordanpainter/diallm-llama-base-sft-brit
8B • Updated • 3
jordanpainter/diallm-llama-base-sft-aus
8B • Updated • 2
jordanpainter/sft-llama-base-aus
Updated
jordanpainter/diallm-dialect-classifier
Text Classification • 0.2B • Updated • 4
jordanpainter/diallm-qwen-gspo-all
Text Generation • 8B • Updated • 4
jordanpainter/diallm-qwen-grpo-all
Text Generation • 8B • Updated • 4 • 1
jordanpainter/diallm-qwen-grpo-ind
Text Generation • 8B • Updated • 4
jordanpainter/diallm-qwen-grpo-brit
Text Generation • 8B • Updated • 3
jordanpainter/diallm-qwen-grpo-aus
Text Generation • 8B • Updated • 4
datasets 8
jordanpainter/dialect-llama-base-all
Preview • Updated • 4
jordanpainter/dialect-qwen-base-all
Preview • Updated • 6
jordanpainter/dialect-gemma-base-all
Preview • Updated • 7
jordanpainter/base_outputs_qwen_all
Updated • 4
jordanpainter/alignment-indian-final
Viewer • Updated • 18.4k • 8
jordanpainter/alignment-british-final
Viewer • Updated • 15.4k • 5
jordanpainter/alignment-australian-final
Viewer • Updated • 11.8k • 7
jordanpainter/dialect-preferences
Preview • Updated • 2