timm tiny test models Collection A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k. • 12 items • Updated 1 day ago • 1
Article LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning? Jul 25 • 18
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 49
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24 • 55
An Image is Worth 32 Tokens for Reconstruction and Generation Paper • 2406.07550 • Published Jun 11 • 55
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated Jun 20 • 21
MobileNetV4 pretrained weights Collection Weights for MobileNet-V4 pretrained in timm • 17 items • Updated 2 days ago • 13
Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • May 16 • 17
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 133
Searching for Better ViT Baselines Collection Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). • 25 items • Updated Aug 21 • 12
PDF Document / OCR Datasets Collection Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30 • 47
plant-image-datasets Collection Image datasets about the kingdom Plantae. • 4 items • Updated Feb 29 • 2
Fine-Tune Image Classification Benchmark Datasets Collection Datasets for fine-tune benchmarking, hparam tuning. All vetted and tested with timm scripts. • 3 items • Updated Jun 12 • 1
All the ImageNets Collection Noteworthy instances of ImageNet on the Hub. Vetted and tested with timm train and validation scripts. • 7 items • Updated Jun 12 • 4
Fastest timm models > 75.3% IN-1k Top-1 (Original ResNet-50) Collection Fastest image classification models with over 75.3% top-1 accuracy on ImageNet-1k. • 21 items • Updated Jul 26 • 4
timm Top-20 ImageNet-1k Models Collection The 20 best models on ImageNet-1k validation set, all pretrained on datasets larger than ImageNet and fine-tuned on ImageNet-1k. • 17 items • Updated Jun 12 • 7
timm Top-20 Fastest Models Collection Not the most accurate, but the highest throughput image classification models in timm • 20 items • Updated Jun 12 • 14
timm ImageNet-12k Models Collection timm has a number of unique and exclusive models trained on an 11,821-class (12k) subset of the full ImageNet-22k. • 27 items • Updated Jun 12 • 2
timm Takes on the Classics Collection timm includes the most popular convolutional and vision transformer models, many with new weights from updated training recipes. • 24 items • Updated Jul 26 • 3
zephyr story Collection Sources mentioned in a tweet by hf.co/thomwolf: x.com/Thom_Wolf/status/1720503998518640703 • 8 items • Updated Jan 24 • 15
Pythia Scaling Suite Collection Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Nov 21, 2023 • 22
WILDS Collection WILDS is a benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications. • 10 items • Updated Aug 21 • 4
INaturalist-2021 Fine-tunes Collection Fine-tune experiments for various `timm` models on the INaturalist 2021 Challenge dataset (https://github.com/visipedia/inat_comp/tree/master/2021) • 5 items • Updated Oct 25, 2023 • 6
OpenCLIP DataComp Collection OpenCLIP models trained on DataComp (https://huggingface.co/papers/2304.14108). • 6 items • Updated Oct 9, 2023 • 6
OpenCLIP LAION-2B Collection OpenCLIP models trained on LAION-2B • 19 items • Updated Sep 10, 2023 • 18