Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
7046.4
TFLOPS
13
2
16
Jed Cheng
PRO
jed351
Follow
exoplanet's profile picture
21world's profile picture
vinhnx90's profile picture
9 followers
·
18 following
jedcheng
jed-cheng-b71b60172
AI & ML interests
Cantonese used in Hong Kong
Organizations
jed351
's datasets
11
Sort: Recently updated
jed351/Traditional-Chinese-Common-Crawl-by-year
Viewer
•
Updated
Oct 14, 2025
•
15.5M
•
7
jed351/Cantonese_Common_Crawl_Filtered
Viewer
•
Updated
Sep 29, 2025
•
5.65M
•
203
•
4
jed351/Traditional-Chinese-Common-Crawl-Filtered
Viewer
•
Updated
Sep 29, 2025
•
278M
•
5.31k
•
24
jed351/Traditional-Chinese-Common-Crawl-NOT-Cleaned
Viewer
•
Updated
Sep 29, 2025
•
547M
•
1.11k
jed351/Cantonese-Web-Data
Viewer
•
Updated
Sep 29, 2025
•
732k
•
18
•
4
jed351/fineweb-ja-keyword-hk
Viewer
•
Updated
Sep 12, 2025
•
2.08M
•
462
jed351/finepdfs-traditional-chinese
Viewer
•
Updated
Sep 8, 2025
•
1.31M
•
136
jed351/Chinese-Common-Crawl-Filtered
Viewer
•
Updated
Jun 2, 2025
•
21.3M
•
258
•
18
jed351/rthk_news
Viewer
•
Updated
Sep 20, 2024
•
332k
•
31
•
6
jed351/shikoto_zh_hk
Viewer
•
Updated
Jan 18, 2023
•
144k
•
6
•
2
jed351/cantonese-wikipedia
Viewer
•
Updated
Dec 27, 2022
•
125k
•
32
•
7