Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation Paper • 2409.12941 • Published 6 days ago • 12
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 7 days ago • 179
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 6 days ago • 107
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs Paper • 2409.05152 • Published 17 days ago • 29
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published 20 days ago • 30
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning Paper • 2402.10110 • Published Feb 15 • 3
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA Paper • 2409.02897 • Published 21 days ago • 43
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper • 2408.14906 • Published 29 days ago • 137
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models Paper • 2408.15915 • Published 28 days ago • 19
Improving Hugging Face Training Efficiency Through Packing with Flash Attention Article • Published Aug 21 • 20
Perspectives for first principles prompt engineering Article • By KnutJaegersberg • Published Aug 18 • 16
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Paper • 2408.08274 • Published Aug 15 • 11
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities Paper • 2408.04682 • Published Aug 8 • 14
🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 Article • By dvilasuero • Published Jul 30 • 33
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5 • 32
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6 • 32
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23 • 67
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17 • 48
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15 • 22
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment Paper • 2303.16634 • Published Mar 29, 2023 • 3
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Paper • 2407.06027 • Published Jul 8 • 8
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published Jul 3 • 43
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1 • 42
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs Paper • 2406.02886 • Published Jun 5 • 7
Direct Preference Knowledge Distillation for Large Language Models Paper • 2406.19774 • Published Jun 28 • 21
OpenBezoar: Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data Paper • 2404.12195 • Published Apr 18 • 11
Open-Bezoar Collection Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data • 7 items • Updated Apr 19 • 6
CodecLM: Aligning Language Models with Tailored Synthetic Data Paper • 2404.05875 • Published Apr 8 • 16
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20 • 58
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15 • 56
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 59
A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA Paper • 2312.03732 • Published Nov 28, 2023 • 7
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6 • 182
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12 • 39
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21 • 18
Instruction-tuned Language Models are Better Knowledge Learners Paper • 2402.12847 • Published Feb 20 • 24
Speculative Streaming: Fast LLM Inference without Auxiliary Models Paper • 2402.11131 • Published Feb 16 • 41
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12 • 41
ScreenAI: A Vision-Language Model for UI and Infographics Understanding Paper • 2402.04615 • Published Feb 7 • 36
Rethinking Interpretability in the Era of Large Language Models Paper • 2402.01761 • Published Jan 30 • 21
Nomic Embed: Training a Reproducible Long Context Text Embedder Paper • 2402.01613 • Published Feb 2 • 14