ldwang's Collections

MiscSpaces

updated Nov 6

  • Running
    587

    Scaling test-time compute

    📈

    Implement test-time compute scaling for math problems


  • Running
    Featured
    1.21k

    FineWeb: decanting the web for the finest text data at scale

    🍷

    Generate high-quality text data for LLMs using FineWeb


  • Running
    3.55k

    The Ultra-Scale Playbook

    🌌

    The ultimate guide to training LLMs on large GPU clusters


  • Running
    210

    FineVision: Open Data is All You Need


    A new open-source dataset for training VLMs


  • Sleeping
    19

    Megatron Memory Estimator


    Estimate GPU memory usage for Megatron models


  • Running on Zero
    18

    Smol2Operator Demo

    🐒

    Smol2Operator Demo: GUI Agent Model


  • Running on CPU Upgrade
    Featured
    2.55k

    The Smol Training Playbook

    📚

    The secrets to building world-class LLMs


  • Running
    68

    Unlocking On-Policy Distillation for Any Model Family

