15 29

Anh Duy Le

duycse1603

AI & ML interests

None yet

Recent Activity

liked a Space 12 days ago

muset-ai/DeepResearch-Bench-Leaderboard

liked a model about 2 months ago

nvidia/nemotron-ocr-v2

liked a dataset 2 months ago

allenai/olmOCR-bench

View all activity

Organizations

None yet

liked a Space 12 days ago

DeepResearch Bench

🔍

Explore and compare Deep Research model rankings

liked a model about 2 months ago

nvidia/nemotron-ocr-v2

Image-to-Text • Updated about 20 hours ago • 4.56k • 187

liked a dataset 2 months ago

allenai/olmOCR-bench

Benchmark • Updated Feb 19 • 4.77k • 212

liked a dataset 5 months ago

omron-sinicx/scipostlayout_v2

Preview • Updated Jul 31, 2024 • 42 • 7

liked 2 models 6 months ago

docling-project/docling-models

Updated Dec 3, 2025 • 2.73M • 209

docling-project/DocumentFigureClassifier

4.07M • Updated Jan 24, 2025 • 21.3k • 19

upvoted a collection 8 months ago

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 645

upvoted 7 articles 8 months ago

Article

Vision Language Model Alignment in TRL ⚡️

sergiopaniego, merve, qgallouedec, kashif, ariG23498

•

Aug 7, 2025

• 111

Article

Finetune Stable Diffusion Models with DDPO via TRL

metric-space, sayakpaul, kashif, lvwerra

•

Sep 29, 2023

• 20

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

dvgodoy

•

Feb 11, 2025

• 123

Article

Decoding Strategies in Large Language Models

mlabonne

•

Oct 29, 2024

• 113

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr

•

Aug 9, 2025

• 119

Article

How to Choose the Best Open Source LLM for Your Project in 2025

dvilasuero

•

Sep 9, 2025

• 78

liked a model 9 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 86.6k • 533

upvoted an article 9 months ago

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq

•

Jun 4, 2025

• 119

upvoted a collection 9 months ago

Direct Preference Optimization Datasets

Collection

Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api • 4011 items • Updated Apr 2 • 7

upvoted 2 articles 9 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 414

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

merve

•

Aug 25, 2023

• 39

liked a model 9 months ago

google/gemma-3-270m

Text Generation • 0.3B • Updated Aug 14, 2025 • 3.77M • 1.03k

Anh Duy Le

AI & ML interests

Recent Activity

Organizations

duycse1603's activity

DeepResearch Bench

Vision Language Model Alignment in TRL ⚡️

Finetune Stable Diffusion Models with DDPO via TRL

Preference Optimization for Vision Language Models

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Decoding Strategies in Large Language Models

From GRPO to DAPO and GSPO: What, Why, and How

How to Choose the Best Open Source LLM for Your Project in 2025

KV Cache from scratch in nanoVLM

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳