AI & ML interests
NLP, IR, QA
nthakur/qwen3-4b-grpo-modified-10-docs-odyssey-27k-step-60
nthakur/qwen3-4b-grpo-round-2-modified-10-docs-step-160
4B
nthakur/qwen3-4b-grpo-mix-1-1-1-step-165
4B
nthakur/qwen3-4b-grpo-infoseek-mix-1-1-1-step-25
4B
nthakur/qwen3-4b-grpo-mix-1-2-4-step-225
4B
nthakur/qwen3-4b-grpo-10-docs-modified-mix-1-1-1-step-385
nthakur/qwen3-4b-grpo-only-odyssey-step-210
4B
nthakur/baseline-qwen3-4b-grpo-nq-hotpotqa-step-200
4B
nthakur/baseline-qwen3-4b-ppo-nq-hotpotqa-step-200
4B
nthakur/Mistral-7B-Instruct-v0.2-mirage-bench-sft-teacher-mixtral
nthakur/Meta-Llama-3-8B-Instruct-mirage-bench-sft
nthakur/Mistral-7B-Instruct-v0.2-mirage-bench-sft
nthakur/Mistral-7B-Instruct-v0.2-multilingual-dpo-v1.0-v2
nthakur/Mistral-7B-Instruct-v0.2-multilingual-dpo-v1.0-final
nthakur/Meta-Llama-3-8B-Instruct-mirage-all-teacher-instruct-llama-3-sft
nthakur/Mistral-7B-Instruct-v0.2-mirage-all-teacher-instruct-mistral-sft
nthakur/Mistral-7B-Instruct-v0.2-multilingual-dpo-v1.0
nthakur/Mistral-7B-Instruct-v0.2-multilingual-deita-10k-v0-sft-v0.1
nthakur/Meta-Llama-3-8B-Instruct-mirage-bench-sft-teacher-llama-3
nthakur/Mistral-7B-Instruct-v0.3-nomiracl-sft
nthakur/Meta-Llama-3-8B-Instruct-nomiracl-sft
nthakur/Mistral-7B-Instruct-v0.2-nomiracl-sft
nthakur/Mistral-7B-Instruct-v0.2-miracl-raft-sft-v2.0
nthakur/Meta-Llama-3-8B-Instruct-miracl-raft-sft-v2.0
nthakur/Meta-Llama-3-8B-Instruct-miracl-mix-raft-sft-30th-apr-v1.0-test
nthakur/Meta-Llama-3-8B-Instruct-miracl-mix-raft-sft-25th-apr-v1.0
nthakur/mistral-7b-instruct-v0.2-miracl-raft-sft-25th-apr-v1.0
nthakur/mistral-7b-instruct-v0.2-miracl-raft-sft-9th-apr-v1.0
nthakur/mistral-7b-v0.2-multilingual-full-sft-27th-mar-basilisk
Text Generation
7B
nthakur/mistral-7b-instruct-v0.2-dpo-multilingual-mix-1st-apr-final