Jarrod Barnes PRO

Jarrodbarnes

https://arc.computer

AI & ML interests

Continual Learning, Reinforcement Learning

Recent Activity

upvoted a paper 1 day ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

upvoted an article 3 days ago

We Got Claude to Fine-Tune an Open Source LLM

liked a model 3 days ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2

View all activity

Organizations

upvoted a paper 1 day ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 3 days ago • 62

upvoted an article 3 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

4 days ago

•

271

upvoted a paper 3 days ago

Continual Learning, Not Training: Online Adaptation For Agents

Paper • 2511.01093 • Published Nov 2 • 1

upvoted an article 3 days ago

Article

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

Oct 22

•

upvoted a paper 5 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 7 days ago • 77

upvoted an article 6 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

7 days ago

•

224

upvoted a collection 6 days ago

DeepSeek-V3.2

Collection

4 items • Updated 6 days ago • 501

upvoted a paper 8 days ago

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published 17 days ago • 24

upvoted an article 17 days ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Oct 23

•

134

upvoted 4 papers 4 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 97

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Paper • 2507.23751 • Published Jul 31 • 4

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Paper • 2506.07982 • Published Jun 9 • 7

upvoted an article 4 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29

•

202

upvoted a paper 4 months ago

SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM

Paper • 2504.14286 • Published Apr 19 • 2

upvoted a paper 5 months ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23 • 36

upvoted an article 5 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Jul 18

•

upvoted 3 papers 5 months ago

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11 • 32

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 89

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

Jarrod Barnes PRO

AI & ML interests

Recent Activity

Organizations

Jarrodbarnes's activity

We Got Claude to Fine-Tune an Open Source LLM

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

Transformers v5: Simple model definitions powering the AI ecosystem

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models