wenyang's picture

wenyang

notoookay

·

AI & ML interests

NLP, RL

Recent Activity

upvoted an article 6 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

upvoted a collection 13 days ago

View all activity

Organizations

upvoted an article 6 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

9 days ago

•

231

upvoted a collection 13 days ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 8 days ago • 142

upvoted an article 5 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

Jun 3

•

289

upvoted an article 7 months ago

Article

The Transformers Library: standardizing model definitions

+2

May 15

•

120

upvoted a collection 7 months ago

Qwen3

84 items • Updated Aug 6 • 1.47k

upvoted an article 8 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

•

172

upvoted a collection 9 months ago

Gemma 3 Release

28 items • Updated Aug 11 • 550

upvoted a collection 10 months ago

Gemma Scope Release

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Jul 10 • 18

upvoted a paper 10 months ago

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 14

upvoted 3 collections 11 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 169

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 131

upvoted an article about 1 year ago

Article

Accelerate 1.0.0

+1

Sep 13, 2024

•

54

upvoted 2 collections over 1 year ago

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated 10 days ago • 15

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 57

upvoted an article over 1 year ago

Article

Welcome Llama 3 - Meta's new open LLM

+3

Apr 18, 2024

•

293

upvoted a paper over 1 year ago

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Paper • 2404.08197 • Published Apr 12, 2024 • 29