Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
In a Training Loop 🔄
1760
170
136
Quentin Gallouédec
PRO
qgallouedec
Follow
davidtalmaciu's profile picture
Cipri7's profile picture
ibotana's profile picture
657 followers
·
348 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
trl-internal-testing/tiny-NemotronHForCausalLM-ultra
published
a model
2 days ago
trl-internal-testing/tiny-NemotronHForCausalLM-ultra
updated
a model
2 days ago
trl-internal-testing/tiny-NemotronHForCausalLM-super
View all activity
Organizations
qgallouedec
's datasets
85
Sort: Recently updated
qgallouedec/test-grpo-vlm-log-completions
Viewer
•
Updated
Mar 20
•
435
•
597
qgallouedec/llama_star_formatted
Viewer
•
Updated
Feb 21
•
7.21k
•
19
qgallouedec/deepmath-completions-logs2
Viewer
•
Updated
Jan 22
•
48
•
55
qgallouedec/deepmath-completions-logs
Viewer
•
Updated
Jan 13
•
232
•
419
•
1
qgallouedec/Dolci-Think-DPO-7B
Viewer
•
Updated
Nov 28, 2025
•
150k
•
30
qgallouedec/biogrid_qa
Viewer
•
Updated
Nov 18, 2025
•
59.4k
•
374
qgallouedec/human_gene_interaction_qa_v2
Viewer
•
Updated
Nov 18, 2025
•
79.2k
•
33
qgallouedec/human_gene_interaction_qa
Viewer
•
Updated
Nov 17, 2025
•
1.84M
•
22
qgallouedec/biogrid
Viewer
•
Updated
Nov 17, 2025
•
2.82M
•
295
qgallouedec/trl-metrics
Viewer
•
Updated
Oct 7, 2025
•
148k
•
82
•
1
qgallouedec/rick
Viewer
•
Updated
Sep 11, 2025
•
1.18k
•
18
qgallouedec/OpenMathReasoning
Viewer
•
Updated
Sep 10, 2025
•
10k
•
26
qgallouedec/math-lvl3to5-8k
Viewer
•
Updated
Aug 22, 2025
•
8.52k
•
26
qgallouedec/svg
Viewer
•
Updated
Aug 2, 2025
•
900
•
10
•
1
qgallouedec/rick-physics-grpo
Viewer
•
Updated
May 22, 2025
•
1.79k
•
37
•
1
qgallouedec/rick-science
Viewer
•
Updated
May 16, 2025
•
1.18k
•
22
•
3
qgallouedec/physics-problems
Viewer
•
Updated
May 10, 2025
•
247
•
48
qgallouedec/rick-teaches-math
Viewer
•
Updated
May 10, 2025
•
6.8k
•
26
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
•
Updated
Apr 29, 2025
•
16.4k
•
156
•
3
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
51
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
67
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
22
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
30
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
46
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9, 2024
•
179k
•
17
qgallouedec/tldr
Viewer
•
Updated
Sep 9, 2024
•
130k
•
96
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
19
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
156
qgallouedec/suap_essentials
Viewer
•
Updated
Aug 6, 2024
•
30
•
22
qgallouedec/qa_suap
Viewer
•
Updated
Jul 14, 2024
•
270
•
23
Previous
1
2
3
Next