OpenGVLab

https://github.com/opengvlab

AI & ML interests

Computer Vision

Organization Card

Welcome to OpenGVLab! We are a research group at Shanghai AI Lab focused on vision-centric AI research. The "GV" in OpenGVLab stands for general vision: a general understanding of vision, such that little effort is needed to adapt to new vision-based tasks.

Models

InternVL: a pioneering open-source alternative to GPT-4V.
InternImage: a large-scale vision foundation model with deformable convolutions.
InternVideo: large-scale video foundation models for multimodal understanding.
VideoChat: an end-to-end chat assistant for video comprehension.
All-Seeing-Project: towards panoptic visual recognition and understanding of the open world.

Datasets

ShareGPT4o: a groundbreaking large-scale resource that we plan to open-source, comprising 200K meticulously annotated images, 10K videos with highly descriptive captions, and 10K audio files with detailed descriptions.
InternVid: a large-scale video-text dataset for multimodal understanding and generation.

Benchmarks

MVBench: a comprehensive benchmark for multimodal video understanding.

Collections 11

Spaces 10

InternVideo2 Chat 8B HD

MVBench Leaderboard

ControlLLM

InternVL

VideoMamba

VideoChat2

Models 88

OpenGVLab/InternVL2-4B

Image-Text-to-Text • Updated about 19 hours ago • 21.2k • 32

OpenGVLab/InternVL2-Llama3-76B-AWQ

Image-Text-to-Text • Updated about 19 hours ago • 1.1k • 19

OpenGVLab/InternVL2-40B-AWQ

Image-Text-to-Text • Updated about 19 hours ago • 1.37k • 14

OpenGVLab/InternVL2-26B-AWQ

Image-Text-to-Text • Updated about 19 hours ago • 944 • 14

OpenGVLab/InternVL2-8B-AWQ

Image-Text-to-Text • Updated about 19 hours ago • 1.54k • 10

OpenGVLab/InternVL2-2B-AWQ

Image-Text-to-Text • Updated about 19 hours ago • 11.2k • 13

OpenGVLab/InternVL2-Llama3-76B

Image-Text-to-Text • Updated about 19 hours ago • 37.3k • 186

OpenGVLab/InternVL2-40B

Image-Text-to-Text • Updated about 19 hours ago • 50.3k • 85

OpenGVLab/InternVL2-26B

Image-Text-to-Text • Updated about 19 hours ago • 47.3k • 112

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated about 19 hours ago • 53.4k • 118

Datasets 25

OpenGVLab/InternVL-SA-1B-Caption

Viewer • Updated 4 days ago • 8.63M • 5

OpenGVLab/InternVL-Chat-V1-2-SFT-Data

Viewer • Updated 5 days ago • 573k • 48 • 9

OpenGVLab/InternVL-LaionCOCO-OCR

Updated 5 days ago

OpenGVLab/InternVL-WuKong-OCR

Updated 5 days ago

OpenGVLab/GMAI-MMBench

Preview • Updated 5 days ago • 4 • 10

OpenGVLab/GUI-Odyssey

Viewer • Updated 11 days ago • 7.74k • 8 • 6

OpenGVLab/ScaleVLN

Updated 14 days ago

OpenGVLab/OmniCorpus-CC-210M

Viewer • Updated 26 days ago • 208M • 31 • 8

OpenGVLab/ShareGPT-4o

Viewer • Updated Aug 17 • 59.4k • 227 • 129

OpenGVLab/MVBench

Viewer • Updated Aug 14 • 4k • 71.2k • 21