Thinking Machines Lab

company

https://thinkingmachines.ai

Team members 73
private

updated a model 6 months ago

thinkingmachineslabinc/meta-llama-3-tokenizer

authored a paper 6 months ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

published a model 6 months ago

thinkingmachineslabinc/meta-llama-3-instruct-tokenizer

Updated Dec 29, 2025

updated a model 6 months ago

thinkingmachineslabinc/meta-llama-3-instruct-tokenizer

Updated Dec 29, 2025

posted an update 7 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)

published a model 7 months ago

thinkingmachineslabinc/meta-llama-3-tokenizer

authored a paper 9 months ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21, 2025 • 36

authored 3 papers about 1 year ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 88

OpenAI o1 System Card

Paper • 2412.16720 • Published Dec 21, 2024 • 38

A PINN Approach to Symbolic Differential Operator Discovery with Sparse Data

Paper • 2212.04630 • Published Dec 9, 2022

authored a paper about 1 year ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

authored a paper over 1 year ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8, 2025 • 96

authored a paper over 1 year ago

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 51

authored a paper over 1 year ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 50

authored a paper over 1 year ago

SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Paper • 2410.03960 • Published Oct 4, 2024 • 2

authored a paper over 1 year ago

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Paper • 2410.03290 • Published Oct 4, 2024 • 7

authored a paper almost 2 years ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 134

authored a paper almost 2 years ago

MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts

Paper • 2407.21770 • Published Jul 31, 2024 • 22

authored a paper almost 2 years ago

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Paper • 2406.18521 • Published Jun 26, 2024 • 31

authored a paper almost 2 years ago

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

Paper • 2306.12420 • Published Jun 21, 2023 • 2