AI & ML interests
Merging models
KennethEnevoldsen authored a paper 2 months ago
mlabonne authored 2 papers 3 months ago
timpal0l authored 5 papers 4 months ago
The Nordic Pile: A 1.2TB Nordic Dataset for Language Modeling
Paper • 2303.17183 • Published • 1
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Paper • 2305.12987 • Published
Why Not Simply Translate? A First Swedish Evaluation Benchmark for Semantic Similarity
Paper • 2009.03116 • Published
Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead?
Paper • 2104.10441 • Published
SWEb: A Large Web Dataset for the Scandinavian Languages
Paper • 2410.04456 • Published • 1
Post
10315
New family of 1B models just dropped!
> LiquidAI/LFM2.5-1.2B-Base: 10T → 28T tokens
> LiquidAI/LFM2.5-1.2B-Instruct: new large-scale multi-stage RL
> LiquidAI/LFM2.5-1.2B-JP: our most polite model
> LiquidAI/LFM2.5-VL-1.6B: multi-image multilingual
> LiquidAI/LFM2.5-Audio-1.5B: 8x faster, no quality loss
Super proud of this release 🤗
KennethEnevoldsen authored a paper 7 months ago
Post
8429
LiquidAI/LFM2-8B-A1B just dropped!
8.3B params with only 1.5B active/token
> Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B
> MoE designed to run on phones/laptops (llama.cpp / vLLM)
> Pre-trained on 12T tokens → strong math/code/IF
Post
3883
New drop of tiny task-specific models!
Want to do data extraction, translation, RAG, tool use, or math on a Raspberry Pi? We got you covered!
These tiny models were fine-tuned to perform narrow tasks extremely well, making them competitive with much larger models.
You can deploy them today on-device or even on GPUs for big data operations!
LiquidAI/liquid-nanos-68b98d898414dd94d4d5f99a
Post
6983
Liquid just released two VLMs, with 450M and 1.6B parameters!
They're super fast and leverage SigLIP2 NaFlex encoders to handle native resolutions without distortion. They're ideal for on-device deployment in constrained environments like phones.
They're available today on Hugging Face, with inference and fine-tuning Colab notebooks.
LiquidAI/LFM2-VL-450M
LiquidAI/LFM2-VL-1.6B
KennethEnevoldsen authored a paper 9 months ago
Post
5762
Based on a new hybrid architecture, these 350M, 700M, and 1.2B models are both fast and performant, ideal for on-device deployment.
I recommend fine-tuning them to power your next edge application. We already provide Colab notebooks to guide you. More to come soon!
Blog post: https://www.liquid.ai/blog/liquid-foundation-models-v2-our-second-series-of-generative-ai-models
🤗 Models: LiquidAI/lfm2-686d721927015b2ad73eaa38
KennethEnevoldsen authored a paper about 1 year ago
birgermoell authored 3 papers about 1 year ago
Medical Reasoning in LLMs: An In-Depth Analysis of DeepSeek R1
Paper • 2504.00016 • Published
The order in speech disorder: a scoping review of state of the art machine learning methods for clinical speech classification
Paper • 2503.04802 • Published
Artificial Humans
Paper • 2503.16502 • Published