Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 166
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 435
MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era Article • Jan 15, 2025 • 48
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages Paper • 2411.14343 • Published Nov 21, 2024 • 7
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13, 2025 • 4.09k • 2.06k
Reply: Interesting, but how does this approach generalize to arbitrary user query / document domains? Would you need to train a separate network for each domain / dataset?