MagpieLM Collection Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 3 days ago • 12
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 3 items • Updated Aug 23 • 2
Qwen2-VL Collection Vision-language model series based on Qwen2 • 15 items • Updated 7 days ago • 121
Learning to Move Like Professional Counter-Strike Players Paper • 2408.13934 • Published about 1 month ago • 21
Nemotron in vLLM Collection Nemotron models that have been converted and/or quantized to work well in vLLM • 7 items • Updated Jul 25 • 1
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Paper • 2408.12528 • Published Aug 22 • 50
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21 • 53
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published Jul 19 • 35
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 171
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 7 items • Updated 8 days ago • 54
Hierarchical Patch Diffusion Models for High-Resolution Video Generation Paper • 2406.07792 • Published Jun 12 • 13
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated Jul 17 • 43
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning Paper • 2312.11461 • Published Dec 18, 2023 • 18
HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles Paper • 2312.11666 • Published Dec 18, 2023 • 12
MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Paper • 2312.14929 • Published Dec 22, 2023 • 4
Challenges and Applications of Large Language Models Paper • 2307.10169 • Published Jul 19, 2023 • 47