LLM-jp

Team

university

https://llm-jp.nii.ac.jp/en/

llm_jp

llm-jp

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

e-mon updated a dataset 1 day ago

llm-jp/leaderboard-contents-v2

e-mon updated a dataset 1 day ago

llm-jp/leaderboard-results-v2

e-mon updated a dataset 1 day ago

llm-jp/leaderboard-requests-v2

View all activity

Papers

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models

JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation

View all Papers

e-mon

updated 3 datasets 1 day ago

updated 3 datasets 2 days ago

llm-jp/llm-jp-4-32b-a3b-thinking-dpo-data

Viewer • Updated 2 days ago • 155k • 155 • 1

llm-jp/llm-jp-4-8b-thinking-dpo-data

Viewer • Updated 2 days ago • 183k • 165 • 1

llm-jp/llm-jp-4-thinking-sft-data

Viewer • Updated 2 days ago • 3.2M • 224 • 2

Taka008

updated 6 models 2 days ago

llm-jp/llm-jp-4-vl-9b-beta

Feature Extraction • 9B • Updated 2 days ago • 1.75k • 11

llm-jp/llm-jp-4-32b-a3b-thinking

Text Generation • 32B • Updated 2 days ago • 9.84k • 26

llm-jp/llm-jp-4-32b-a3b-base

Text Generation • 32B • Updated 2 days ago • 631 • 5

llm-jp/llm-jp-4-8b-instruct

Text Generation • 9B • Updated 2 days ago • 4.41k • 4

llm-jp/llm-jp-4-8b-base

Text Generation • 9B • Updated 2 days ago • 5.15k • 5

llm-jp/llm-jp-4-8b-thinking

Text Generation • 9B • Updated 2 days ago • 48.4k • 32

AkimfromParis

posted an update 2 days ago

Post

2360

🌸 𝙊𝙥𝙚𝙣 𝙅𝙖𝙥𝙖𝙣𝙚𝙨𝙚 𝙇𝙇𝙈 𝙇𝙚𝙖𝙙𝙚𝙧𝙗𝙤𝙖𝙧𝙙 𝙑2 𝙤𝙣 𝙃𝙪𝙜𝙜𝙞𝙣𝙜 𝙁𝙖𝙘𝙚 🇯🇵 // 🌸 ハギングフェイス版「 𝗢𝗽𝗲𝗻 𝗝𝗮𝗽𝗮𝗻𝗲𝘀𝗲 𝗟𝗟𝗠 𝗟𝗲𝗮𝗱𝗲𝗿𝗯𝗼𝗮𝗿𝗱 𝗩𝟮 」公開 🇯🇵

I am thrilled to announce the launch of version 2 of the 𝙊𝙥𝙚𝙣 𝙅𝙖𝙥𝙖𝙣𝙚𝙨𝙚 𝙇𝙇𝙈 𝙇𝙚𝙖𝙙𝙚𝙧𝙗𝙤𝙖𝙧𝙙. This initiative is driven by the "Fine-tuning and Evaluation" team, led by Professor Miyao at the The University of Tokyo, under the Research and Development Center for Large Language Models (LLMC) at Japan’s National Institute of Informatics (NII).

𝙎𝙩𝙧𝙖𝙩𝙚𝙜𝙞𝙘 𝙖𝙣𝙙 𝙩𝙚𝙘𝙝𝙣𝙞𝙘𝙖𝙡 𝙪𝙥𝙜𝙧𝙖𝙙𝙚𝙨:
- Our new backend features eight A100 GPUs, enabling the evaluation of open-source models of more than 100B parameters.
- Submissions now require a Hugging Face Hub login to ensure accountability.
- We have added metrics for evaluation time, CO₂ emissions (thx to Code Carbon 🌱 ), alongside reasoning capabilities.

𝘿𝙖𝙩𝙖𝙨𝙚𝙩𝙨 𝙖𝙣𝙙 𝙚𝙫𝙖𝙡𝙪𝙖𝙩𝙞𝙤𝙣 𝙨𝙩𝙖𝙣𝙙𝙖𝙧𝙙𝙨:
- New datasets cover reasoning, mathematics, exams, and instruction following.
- Math evaluations now span from grade-school levels to expert-tier challenges (GSM8K, PolyMath, AIME).
- While integrating English-heavy and multilingual benchmarks (including Humanity’s Last Exam, GPQA, and BBH in both English and Japanese), we continue to prioritize unique Japanese cultural datasets.

llm-jp/open-japanese-llm-leaderboard-v2

どうぞお願い致します！😊