arxiv:2606.22807
In a Training Loop
Xinping Zhao
Yuki131
AI & ML interests
LLMs, RAG, Embedding, Reranker
Recent Activity
reacted to Banaxi-Tech's post about 2 hours ago
A new model is coming!
It's going to take a long time on my 5070 Ti, so expect a release in ~1 month.
We think this model is going to be SOTA for its size.
Our Mini version will have 25M parameters and the Pro version 140M.
The Pro version has a context window of 3072 (extensible up to 6K with RoPE), and the Mini version has a context window of 4096 (up to 8K with RoPE).
Meanwhile, we are working on an Instruct version of our BananaMind 1.5 Base.
The training will start this weekend.
We are very excited to release it when it's done!
liked a model about 12 hours ago
nvidia/llama-nemotron-rerank-vl-1b-v2
liked a dataset 1 day ago
mteb/ESGReports