Pavlo

pmolchanov

https://www.pmolchanov.com

AI & ML interests

Efficiency in Multi-Modal LLMs

pmolchanov's activity

upvoted a collection 5 days ago

MagpieLM

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 3 days ago • 12

upvoted a collection 16 days ago

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 3 items • Updated Aug 23 • 2

upvoted a collection 27 days ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated 7 days ago • 121

upvoted a collection 28 days ago

VILA: On Pre-training for Visual Language Models

10 items • Updated Aug 21 • 42

upvoted a paper 29 days ago

Learning to Move Like Professional Counter-Strike Players

Paper • 2408.13934 • Published about 1 month ago • 21

upvoted a collection about 1 month ago

Nemotron in vLLM

Nemotron models that have been converted and/or quantized to work well in vLLM • 7 items • Updated Jul 25 • 1

upvoted a paper about 1 month ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22 • 50

upvoted a collection about 1 month ago

To read... eventually

87 items • Updated about 23 hours ago • 3

upvoted a paper about 1 month ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 53

upvoted a collection 2 months ago

Papers I want to read

Papers in my to-read list • 218 items • Updated 3 days ago • 19

upvoted a paper 2 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 35

upvoted 2 collections 2 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 171

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 7 items • Updated 8 days ago • 54

upvoted a paper 3 months ago

Hierarchical Patch Diffusion Models for High-Resolution Video Generation

Paper • 2406.07792 • Published Jun 12 • 13

upvoted a paper 6 months ago

LITA: Language Instructed Temporal-Localization Assistant

Paper • 2403.19046 • Published Mar 27 • 17

upvoted a paper 7 months ago

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26 • 42

upvoted a collection 9 months ago

Nemotron 3 8B

The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated Jul 17 • 43

upvoted 4 papers 9 months ago

Relightable Gaussian Codec Avatars

Paper • 2312.03704 • Published Dec 6, 2023 • 29

GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

Paper • 2312.11461 • Published Dec 18, 2023 • 18

HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles

Paper • 2312.11666 • Published Dec 18, 2023 • 12

MACS: Mass Conditioned 3D Hand and Object Motion Synthesis

Paper • 2312.14929 • Published Dec 22, 2023 • 4

upvoted a paper 10 months ago

VILA: On Pre-training for Visual Language Models

Paper • 2312.07533 • Published Dec 12, 2023 • 20

upvoted a paper about 1 year ago

Challenges and Applications of Large Language Models

Paper • 2307.10169 • Published Jul 19, 2023 • 47