🔄 In a Training Loop

Xinping Zhao

Yuki131

AI & ML interests

LLMs, RAG, Embedding, Reranker

Recent Activity

reacted to Banaxi-Tech's post with 👀 about 2 hours ago

A new model is coming! It's going to take a long time on my 5070 Ti, so expect a release in ~1 month. We think this model is going to be SOTA for its size. Our Mini version will have 25M parameters and the Pro version 140M. The Pro version has a 3072 context window (extensible up to 6K with RoPE), and the Mini version has a context window of 4096 (up to 8K with RoPE). Meanwhile, we are working on an Instruct version of our BananaMind 1.5 Base. The training will start this weekend. We are very excited to release it when it's done!

liked a model about 12 hours ago

nvidia/llama-nemotron-rerank-vl-1b-v2

liked a dataset 1 day ago

mteb/ESGReports

Yuki131's papers (4)

arxiv:2606.22807

arxiv:2603.12572

arxiv:2507.20783

arxiv:2507.15586