smolagents

Team
community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

akseljoonasΒ  updated a Space about 3 hours ago
smolagents/ml-agent
View all activity

victorΒ 
posted an update 3 days ago
view post
Post
4379
Want to share my enthusiasm for zai-org/GLM-5.1 here too πŸ”₯

I think we have it: our open source Claude Code = GLM-5.1 + Pi (https://pi.dev/) - Built a Three.js racing game to eval and it's extremely impressive. Thoughts:

- One-shot car physics with real drift mechanics (this is hard)

- My fav part: Awesome at self iterating (with no vision!) created 20+ Bun.WebView debugging tools to drive the car programmatically and read game state. Proved a winding bug with vector math without ever seeing the screen

- 531-line racing AI in a single write: 4 personalities, curvature map, racing lines, tactical drifting. Built telemetry tools to compare player vs AI speed curves and data-tuned parameters

- All assets from scratch: 3D models, procedural textures, sky shader, engine sounds, spatial AI audio!

- Can do hard math: proved road normals pointed DOWN via vector cross products, computed track curvature normalized by arc length to tune AI cornering speed

You are going to hear about this model a lot in the next months - open source let's go - and thanks z-aiπŸš€πŸš€
  • 3 replies
Β·
evalstateΒ 
posted an update 7 days ago
view post
Post
863
The experimental MCP hub_query tool now supports Paper Searching and Details as well as Daily Papers.
albertvillanovaΒ 
posted an update about 2 months ago
view post
Post
2363
πŸš€ TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! πŸ€—

The future of ML tooling is agent-native.
πŸ”— https://github.com/huggingface/trl/releases/tag/v0.29.0
evalstateΒ 
posted an update about 2 months ago
view post
Post
3706
Hugging Face MCP Server v0.3.2
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

- Replace model_search and dataset_search with combined hub_repo_search tool.
- Less distracting description for hf_doc_search
- model_search and dataset_search tool calls will still function (plan to remove next release).
  • 4 replies
Β·
albertvillanovaΒ 
posted an update 2 months ago
view post
Post
1854
5 years already working in democratizing AI πŸ€—
Grateful to be part of such an awesome team making it happen every day.
victorΒ 
posted an update 3 months ago
view post
Post
2717
Interesting article: use Claude Code to help open models write CUDA kernels (for eg) by turning CC traces into Skills. They made a library out of it πŸ‘€

https://huggingface.co/blog/upskill
evalstateΒ 
posted an update 3 months ago
view post
Post
319
Hugging Face MCP Server v0.3.1
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

- Streamable HTTP used for Gradio Connectivity
- SSE Transport (as Server) removed
- Proxy Configuration added for launch of sub-agent tools