Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiwoo Hong's picture
11 16 18

Jiwoo Hong

JW17
juyoungml's profile picture nbeerbower's profile picture j6mes's profile picture
·
https://jiwooya1000.github.io/
  • jiwoohong98
  • jiwooya1000
  • jiwoohong09

AI & ML interests

NLP, LLM, and any related topics

Organizations

Explainable Factual Reasoning Lab @ KAIST's profile picture KAIST AI's profile picture ORPO Explorers's profile picture MaPO's profile picture ORPO's profile picture IQWiki-XFACT's profile picture syn-t2i's profile picture tlrm's profile picture IOPO Experiments's profile picture linkedin-xfact's profile picture Pre-training's profile picture Cambridge-KAIST's profile picture Cambridge-KAIST2's profile picture reasoning-project's profile picture rm-robustness's profile picture ICRM's profile picture

authored 2 papers 7 months ago

AlphaPO -- Reward shape matters for LLM alignment

Paper • 2501.03884 • Published Jan 7, 2025 • 2

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

Paper • 2504.03380 • Published Apr 4, 2025
authored a paper 9 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17, 2025 • 10
authored 2 papers about 1 year ago

Stable Language Model Pre-training by Reducing Embedding Variability

Paper • 2409.07787 • Published Sep 12, 2024

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Paper • 2410.18027 • Published Oct 23, 2024
authored a paper over 1 year ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 15
authored a paper almost 2 years ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 71
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs