LLaMA-MoE
https://github.com/pjlab-sys4nlp/llama-moe
Recent Activity
tongjingqi authored a paper 16 days ago: LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
tongjingqi authored a paper 16 days ago: Adaptive Fast-and-Slow Visual Program Reasoning for Long-Form VideoQA
tongjingqi authored a paper 16 days ago: Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
Team members: 6
llama-moe's models: 8
llama-moe/LLaMA-MoE-v2-3_8B-residual-sft • 8B • Updated Dec 3, 2024 • 6 • 2
llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft • 8B • Updated Dec 3, 2024 • 168 • 6
llama-moe/LLaMA-MoE-v1-3_0B-2_16 • Text Generation • Updated Jun 25, 2024 • 779 • 11
llama-moe/LLaMA-MoE-v1-3_5B-4_16 • Text Generation • Updated Jun 25, 2024 • 554 • 16
llama-moe/LLaMA-MoE-v1-3_0B-2_16-sft • Text Generation • 7B • Updated Jun 25, 2024 • 7 • 2
llama-moe/LLaMA-MoE-v1-3_5B-2_8-sft • Text Generation • 7B • Updated Jun 25, 2024 • 12 • 3
llama-moe/LLaMA-MoE-v1-3_5B-4_16-sft • Text Generation • 7B • Updated Jun 25, 2024 • 11 • 1
llama-moe/LLaMA-MoE-v1-3_5B-2_8 • Text Generation • Updated Jun 25, 2024 • 1.23k • 15