Dataset Library

Reasoning traces for distilling frontier models

Curated datasets built by querying Claude, GPT, Gemini and other frontier models with diverse coding, math, and reasoning prompts. Designed for training small open models that still think clearly.

What's included

Each dataset includes detailed reasoning traces, carefully filtered conversations, and metadata ready for fine-tuning. Listings are synced hourly from Hugging Face.

Source:

51 datasets

DeepSeek-v4-Pro-Agent

Distilled from DeepSeek

SIZE1K–10KJSONTABULAR

2.0K downloads89 likes

claude-4.5-opus-high-reasoning-250x

Distilled from Claude Opus 4.5

SIZE<1KJSONTEXT

543 downloads399 likes

Reasoning traces for distilling frontier models

DeepSeek-v4-Pro-Agent

claude-4.5-opus-high-reasoning-250x

Claude-Opus-4.6-Reasoning-887x

lordx64-claude-opus-4.7-max-cleaned

Claude-Sonnet-4.6-Reasoning-1100x

gemini-3-flash-preview

convo-v1

Pony-Alpha-15k

Hunter-Alpha-Coding-Agent-SFT

Claude-Opus-Dataclaw-Unredacted

gemini-3-pro-preview-high-reasoning-1000x

claude-sonnet-4.5-high-reasoning-250x

gpt-5.2-high-reasoning-250x

DeepSeek-v4-Flash-Chat

glm-4.7-2000x

mistral-small-creative-500x

Step-3.5-Flash-2600x

kimi-k2-thinking-1000x

gpt-5-codex-250x

Healer-Alpha-16k

gemini-3-pro-preview-high-reasoning-250x

gpt-5.1-high-reasoning-1000x

claude-haiku-4.5-high-reasoning-1700x

deepseek-v3.2-speciale-1000x

Aurora-Alpha-15.5k

deepseek-v3.2-speciale-OpenCodeReasoning-3k

gpt-5.1-codex-max-1000x

gemini-3-flash-preview-1000x

deepseek-v3.2-speciale-openr1-math-3k

glm-4.7-350x

gpt-5-codex-1000x

Hunter-Alpha-UIGEN-T3-Agent-SFT

grok-code-fast-1-1000x

claude-haiku-4.5-1700x

minimax-m2.1-1000x

MiniMax-M2.1-8800x

Gemini-3-Flash-Preview-VIBE

brainstorm-v3.1-grok-4-fast-200x

gemini-2.5-flash-11000x

MiniMax-M2.1-Code-SFT

open-moderator-v1

polaris-alpha-1000x

sherlock-dash-alpha-1000x

Hunter-Alpha-16k

glm-4.6-250x

gemini-2.5-flash-lite-2509-preview-1000x

gemini-3-flash-preview-standalone-html-1k

sherlock-think-alpha-1000x

sherlock-thinking-alpha-11000x

MiMo-V2-Flash-2300x

kimi-k2-thinking-250x