Falcon-H1-Tiny Collection A series of extremely small yet powerful language models, redefining capabilities at small scale • 22 items • Updated 10 days ago • 27
Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix • Nov 3, 2025 • 56
Pre-training Dataset Samples Collection A collection of pre-training dataset samples in sizes of 10M, 100M, and 1B tokens. Ideal for quick experimentation and ablations. • 19 items • Updated Dec 25, 2025 • 18
GPT-OSS General (4.2B to 20B) Collection Collection of pruned GPT-OSS models spanning 1-32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated Aug 13, 2025 • 10
Article Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation • Aug 3, 2025 • 7
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models Paper • 2401.00788 • Published Jan 1, 2024 • 23
Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve • May 20, 2025 • 57
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29, 2025 • 98
Unsloth Dynamic 2.0 Quants Collection Version 2.0 of our Dynamic GGUF quants. Dynamic 2.0 achieves superior accuracy and state-of-the-art quantization performance. • 68 items • Updated 3 days ago • 317
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 181
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models, designed to be the ultimate general-purpose local model. • 9 items • Updated Feb 7, 2025 • 191