Nathan Habib's picture

Nathan Habib

SaylorTwift

·

AI & ML interests

None yet

Articles

Open LLM Leaderboard: DROP deep dive

What's going on with the Open LLM Leaderboard?

Organizations

SaylorTwift's activity

upvoted a collection about 15 hours ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 6 days ago • 188

upvoted a paper 3 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 6 days ago • 108

upvoted a collection 16 days ago

LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 264 items • Updated Jun 22 • 395

upvoted 4 articles about 1 month ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 79

Article

XetHub is joining Hugging Face!

Aug 8

• 76

Article

Tool Use, Unified

Aug 12

• 50

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 98

upvoted an article 4 months ago

Article

Let's talk about LLM evaluation

By

•

May 23

• 105

upvoted a collection 9 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 211

upvoted a paper 11 months ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 120

upvoted a paper 12 months ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 28

upvoted a paper about 1 year ago

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170

upvoted a paper over 1 year ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 31