Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

https://sergiopaniego.github.io/

Recent Activity

posted an update about 7 hours ago

ICYMI, great blog by @kashif and @stas on Ulysses Sequence Parallelism: train with million-token contexts on 4×H100s, with 12x longer sequences and 3.7x throughput. Learn how to integrate it with Accelerate, Transformers, and TRL ⤵️ https://huggingface.co/blog/ulysses-sp

upvoted a paper about 13 hours ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

updated a dataset about 17 hours ago

huggingface-projects/Deep-RL-Course-Certification

Posts 81

Post

39

ICYMI, great blog by @kashif and @stas on Ulysses Sequence Parallelism: train with million-token contexts on 4×H100s, with 12x longer sequences and 3.7x throughput.

Learn how to integrate it with Accelerate, Transformers, and TRL ⤵️
https://huggingface.co/blog/ulysses-sp

Articles 14

Article

43

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Collections 9

Spaces 99

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

Qwen2-VL-7B

Ask questions about charts in images

SmolVLM-trl-dpo-rlaif-v

Generate text from an image and question

SmolVLM-trl-sft-ChartQA

Ask questions about charts in images

Multi Env Grpo

Show your activity tracking dashboard

Browsergym-grpo-Qwen-Qwen3-0.6B-2026-03-11 16-45-53

Visualize tracking data interactively

Models 118

sergiopaniego/nemotron-3-sft

Updated 3 days ago

sergiopaniego/Qwen3-0.6B-carla-trolley-escape

0.8B • Updated 16 days ago • 144

sergiopaniego/tiny-aya-global-SFT

Updated 23 days ago

sergiopaniego/nemo3-sft-bnb

Updated 25 days ago

sergiopaniego/rloo_tldr_test

sergiopaniego/wordle-grpo-Qwen3-1.7B-test

sergiopaniego/wordle-grpo-Qwen3-1.7B

Text Generation • 2B • Updated Feb 2 • 1

sergiopaniego/browsergym-grpo-functiongemma-270m-it-test

sergiopaniego/sudoku-grpo-qwen3

Text Generation • 2B • Updated Jan 2 • 1

sergiopaniego/test-browsergym-grpo-functiongemma-270m-it

Updated Dec 23, 2025

Datasets 6

sergiopaniego/browsergym-grpo-functiongemma-270m-it-dataset

Viewer • Updated about 1 hour ago • 105 • 12.7k

sergiopaniego/sample_videos

Viewer • Updated Jun 30, 2025 • 2 • 33

sergiopaniego/difficult_prompts

Viewer • Updated Jun 20, 2025 • 38 • 11

sergiopaniego/ourworldindata_example

Viewer • Updated Dec 2, 2024 • 13 • 34 • 1

sergiopaniego/faiss_embeddings

Updated Oct 3, 2024 • 23

sergiopaniego/CarlaFollowLanePreviousV

Viewer • Updated Sep 6, 2023 • 59.6k • 18