Dataset Library
Reasoning traces for distilling frontier models
Curated datasets built by querying Claude, GPT, Gemini and other frontier models with diverse coding, math, and reasoning prompts. Designed for training small open models that still think clearly.
What's included
Each dataset includes detailed reasoning traces, carefully filtered conversations, and metadata ready for fine-tuning. Listings are synced hourly from Hugging Face.
DeepSeek-v4-Pro-Agent
Distilled from DeepSeek
Claude-Opus-4.6-Reasoning-887x
claude-4.5-opus-high-reasoning-250x
Distilled from Claude Opus 4.5
lordx64-claude-opus-4.7-max-cleaned
DeepSeek-v4-Flash-Chat
Distilled from DeepSeek
Claude-Sonnet-4.6-Reasoning-1100x
Distilled from Claude Sonnet 4.5
Claude-Opus-Dataclaw-Unredacted
gemini-3-flash-preview
gpt-5.1-high-reasoning-1000x
Distilled from GPT-5.1
gpt-5.2-high-reasoning-250x
Hunter-Alpha-Coding-Agent-SFT
gemini-3-pro-preview-high-reasoning-250x
Distilled from Gemini 3 Pro
convo-v1
claude-sonnet-4.5-high-reasoning-250x
Distilled from Claude Sonnet 4.5
Step-3.5-Flash-2600x
MiniMax-M2.1-Code-SFT
Pony-Alpha-15k
gemini-3-pro-preview-high-reasoning-1000x
Distilled from Gemini 3 Pro
deepseek-v3.2-speciale-openr1-math-3k
Distilled from DeepSeek v3.2 Speciale
deepseek-v3.2-speciale-OpenCodeReasoning-3k
Distilled from DeepSeek v3.2 Speciale
mistral-small-creative-500x
Distilled from Mistral
glm-4.7-2000x
gemini-2.5-flash-11000x
Distilled from Gemini 2.5 Flash
gemini-3-flash-preview-standalone-html-1k
claude-haiku-4.5-high-reasoning-1700x
gpt-5-codex-250x
Distilled from GPT-5 Codex
polaris-alpha-1000x
brainstorm-v3.1-grok-4-fast-200x
Distilled from Grok
gpt-5.1-codex-max-1000x
Distilled from GPT-5.1
Aurora-Alpha-15.5k
Hunter-Alpha-16k
Hunter-Alpha-UIGEN-T3-Agent-SFT
glm-4.7-350x
claude-haiku-4.5-1700x
deepseek-v3.2-speciale-1000x
Distilled from DeepSeek v3.2 Speciale
gemini-3-flash-preview-1000x
Healer-Alpha-16k
gpt-5-codex-1000x
Distilled from GPT-5 Codex
Gemini-3-Flash-Preview-VIBE
kimi-k2-thinking-1000x
Distilled from Kimi K2
open-moderator-v1
grok-code-fast-1-1000x
Distilled from Grok
glm-4.6-250x
Distilled from GLM 4.6
sherlock-think-alpha-1000x
sherlock-thinking-alpha-11000x
MiniMax-M2.1-8800x
MiMo-V2-Flash-2300x
minimax-m2.1-1000x
kimi-k2-thinking-250x
Distilled from Kimi K2
sherlock-dash-alpha-1000x
gemini-2.5-flash-lite-2509-preview-1000x
Distilled from Gemini 2.5 Flash