21 13

Новиков Наталья

JosephRamirez

AI & ML interests

Research on LLM agents and evaluation. Mostly focused on experiments.

Recent Activity

liked a dataset 1 day ago

malneyugnfl/datasetsv10

liked a dataset 3 days ago

wegrthj/e94fjt-v654-raw

upvoted a paper 3 days ago

X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding

View all activity

Organizations

None yet

liked a dataset 1 day ago

malneyugnfl/datasetsv10

Viewer • Updated about 22 hours ago • 4.52k • 61 • 1

liked a dataset 3 days ago

wegrthj/e94fjt-v654-raw

Preview • Updated about 5 hours ago • 26.1k • 9

upvoted a paper 3 days ago

X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding

Paper • 2606.02482 • Published 4 days ago • 32

liked a dataset 8 days ago

wegrthj/l36l5h-qi9l-raw

Preview • Updated about 5 hours ago • 23.9k • 10

upvoted a paper 10 days ago

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published 11 days ago • 101

liked a model 13 days ago

brendan-gho/qwen2.5-1.5b-liminal-otter-cot-seed1-mcq

Updated 11 days ago • 1

liked a dataset 14 days ago

trl-lib/trackio-dataset

Viewer • Updated 2 minutes ago • 3.83k • 26.7k • 13

liked a model 14 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 10 days ago • 21k • • 1.1k

upvoted a paper 15 days ago

WavFlow: Audio Generation in Waveform Space

Paper • 2605.18749 • Published 18 days ago • 10

upvoted a paper 17 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 23 days ago • 270

liked a model 18 days ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.55M • • 4.85k

liked a model 22 days ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27, 2025 • 1.06M • • 4.09k

upvoted a paper 29 days ago

AnalogRetriever: Learning Cross-Modal Representations for Analog Circuit Retrieval

Paper • 2604.23195 • Published Apr 25 • 3

upvoted 2 papers about 1 month ago

Credal Concept Bottleneck Models for Epistemic-Aleatoric Uncertainty Decomposition

Paper • 2604.24170 • Published Apr 27 • 2

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 243

upvoted a paper about 2 months ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published Apr 5 • 37

liked a dataset about 2 months ago

HuggingFaceH4/ultrachat_200k

Viewer • Updated Oct 16, 2024 • 515k • 69.1k • 726

upvoted a paper about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 506

liked 2 models about 2 months ago

openbmb/VoxCPM2

Text-to-Speech • 2B • Updated Apr 16 • 245k • 1.37k

NexVeridian/gemma-4-31B-it-6bit

Text Generation • 31B • Updated Apr 22 • 51 • 1

Новиков Наталья

AI & ML interests

Recent Activity

Organizations

JosephRamirez's activity