Pedro Ribeiro
BRlkl
0 followers · 5 following
AI & ML interests
None yet
Recent Activity
updated a dataset about 2 hours ago: BRlkl/grpo-3-harder-instruct-20
published a dataset about 2 hours ago: BRlkl/grpo-3-harder-instruct-20
updated a model about 5 hours ago: BRlkl/GRPO-6-harder_85
BRlkl's models (153)
BRlkl/distill-sft-qwen3-4b-full • Text Generation • 4B • Updated Mar 27 • 58
BRlkl/distill-sft-qwen3-0.6b-full • Text Generation • 0.6B • Updated Mar 27 • 57
BRlkl/distill-sft-qwen3-8b-full • Text Generation • 8B • Updated Mar 27 • 59
BRlkl/distill-sft-qwen3-32b-full • Updated Mar 27
BRlkl/GRPO-5-sft-bootstrap-2 • Updated Mar 24
BRlkl/GRPO-5-sft-bootstrap • Updated Mar 24
BRlkl/GRPO-5_50 • Updated Mar 20
BRlkl/GRPO-5_40 • Updated Mar 19
BRlkl/GRPO-5_30 • Updated Mar 18
BRlkl/GRPO-5_20 • Updated Mar 17
BRlkl/GRPO-5_10 • Updated Mar 16
BRlkl/GRPO-4_70 • Text Generation • 4B • Updated Mar 15 • 7
BRlkl/GRPO-4_60 • Text Generation • 4B • Updated Mar 13 • 2
BRlkl/GRPO-4_50 • Text Generation • 4B • Updated Mar 12 • 3
BRlkl/GRPO-4_40 • Text Generation • 4B • Updated Mar 11 • 4
BRlkl/GRPO-4_30 • Text Generation • 4B • Updated Mar 6 • 3
BRlkl/GRPO-4_20 • Text Generation • 4B • Updated Mar 5 • 4
BRlkl/GRPO-4_10 • Text Generation • 4B • Updated Mar 4 • 2
BRlkl/GRPO-3_40 • Text Generation • 4B • Updated Mar 3 • 6
BRlkl/GRPO-3_20 • Text Generation • 4B • Updated Mar 1 • 3
BRlkl/orchestrator-qwen3-4b-full • Text Generation • 4B • Updated Feb 26 • 9
BRlkl/GRPO-2.1 • Updated Feb 24
BRlkl/GRPO-2.1_100 • Updated Feb 24
BRlkl/GRPO-2.1_50 • Updated Feb 23
BRlkl/GRPO-2 • Updated Feb 21
BRlkl/GRPO-2_100 • Updated Feb 21
BRlkl/GRPO-2_50 • Updated Feb 20
BRlkl/GRPO-1 • Updated Feb 17
BRlkl/GRPO-1_100 • Updated Feb 17
BRlkl/orchestrator-qwen3-4b-lora-sft-9-prompt • Updated Feb 15