Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
German Tokenizer Benchmark
community
Activity Feed
Follow
2
AI & ML interests
German, Tokenizer, Benchmark
Recent Activity
stefan-it
submitted
a paper
5 days ago
GLiNER-Relex: A Unified Framework for Joint Named Entity Recognition and Relation Extraction
stefan-it
submitted
a paper
13 days ago
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling
stefan-it
submitted
a paper
4 months ago
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
View all activity
Team members
1
german-tokenizer-benchmark
's datasets
6
Sort: Recently updated
german-tokenizer-benchmark/ud-hdt
Viewer
•
Updated
Nov 11, 2025
•
153k
•
16
german-tokenizer-benchmark/mobie
Viewer
•
Updated
Nov 11, 2025
•
6.9k
•
6
german-tokenizer-benchmark/german-ler
Viewer
•
Updated
Nov 11, 2025
•
53.4k
•
21
german-tokenizer-benchmark/co-funer
Viewer
•
Updated
Nov 10, 2025
•
758
•
6
german-tokenizer-benchmark/biofid
Viewer
•
Updated
Nov 10, 2025
•
12.7k
•
12
german-tokenizer-benchmark/germeval14
Viewer
•
Updated
Nov 10, 2025
•
24k
•
15