New blog: Maintain the unmaintainable - 1M+ Python LOC, 400+ models
How do you stop a million-line library built by thousands of contributors from collapsing under its own weight? At 🤗 Transformers, we do it with explicit software-engineering tenets: principles that keep the codebase hackable at scale.
Inside the post:
• One Model, One File: readability first. You can still open a modeling file and see the full logic, top to bottom.
• Modular Transformers: visible inheritance that cuts maintenance cost by ~15× while keeping models readable.
• Config-Driven Performance: FlashAttention, tensor parallelism, and attention scheduling are config-level features, not rewrites.
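To make the config-driven point concrete, here is a minimal sketch (mine, not from the post) of what that looks like in user code, assuming a recent Transformers release; the checkpoint name is only an example:

```python
from transformers import AutoModelForCausalLM

# Swapping the attention backend is a load-time option, not a code rewrite.
# The checkpoint name is illustrative; any supported causal LM works.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B",
    attn_implementation="flash_attention_2",  # config-level kernel choice
    device_map="auto",                         # automatic weight placement (needs accelerate)
)
```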
Written with @lysandre, @pcuenq, and @yonigozlan, this is a deep dive into how Transformers stays fast, open, and maintainable.