Datasets
Reasoning datasets created by querying frontier models with diverse prompts. Use these for fine-tuning your own distilled models. Data refreshes hourly from Hugging Face.
19 datasets
claude-sonnet-4.5-high-reasoning-250x
size_categories:n<1K format:json modality:text
74323
gemini-3-pro-preview-high-reasoning-1000x
size_categories:1K<n<10K format:json modality:text
3746
gpt-5-codex-250x
task_categories:text-generation language:en license:mit
2648
claude-4.5-opus-high-reasoning-250x
size_categories:n<1K format:json modality:text
2109
gemini-3-pro-preview-high-reasoning-250x
size_categories:n<1K format:json modality:text
1774
gpt-5.1-high-reasoning-1000x
size_categories:1K<n<10K format:json modality:text
1723
kimi-k2-thinking-1000x
language:en size_categories:n<1K format:json
1541
gemini-2.5-flash-11000x
size_categories:10K<n<100K modality:text region:us
1492
kimi-k2-thinking-250x
language:en size_categories:n<1K format:json
1242
grok-code-fast-1-1000x
size_categories:1K<n<10K format:json modality:text
1093
glm-4.6-250x
size_categories:n<1K format:json modality:text
881
brainstorm-v3.1-grok-4-fast-200x
size_categories:n<1K format:json modality:text
810
polaris-alpha-1000x
language:en size_categories:1K<n<10K format:json
791
gemini-2.5-flash-lite-2509-preview-1000x
language:en size_categories:n<1K format:json
731
gpt-5-codex-1000x
language:en size_categories:n<1K format:json
730
sherlock-dash-alpha-1000x
size_categories:1K<n<10K format:json modality:text
500
sherlock-thinking-alpha-11000x
language:en size_categories:10K<n<100K format:json
460
sherlock-think-alpha-1000x
size_categories:1K<n<10K format:json modality:text
451
deepseek-v3.2-speciale-1000x
size_categories:n<1K format:json modality:text
110
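
The tag line under each dataset name packs several `key:value` pairs into one string, which makes the listing awkward to filter by size or format. The sketch below is one possible way to split such a line into a dictionary. The `TAG_KEYS` tuple and the `parse_tags` helper are illustrative names, and the key list is only what appears in the listing above, not an official Hugging Face schema.

```python
import re

# Tag keys observed in the listing above (an assumption, not an official schema).
# Extend this tuple if other keys appear.
TAG_KEYS = (
    "size_categories",
    "task_categories",
    "format",
    "modality",
    "language",
    "license",
    "region",
)

# Matches any known key followed by a colon, and captures the key name.
_KEY_PATTERN = re.compile(r"(" + "|".join(TAG_KEYS) + r"):")


def parse_tags(tag_line: str) -> dict[str, str]:
    """Split a tag line into {key: value}, with or without spaces between pairs."""
    # re.split with a capturing group yields [prefix, key1, value1, key2, value2, ...].
    parts = _KEY_PATTERN.split(tag_line)
    return {key: value.strip() for key, value in zip(parts[1::2], parts[2::2])}


# Hypothetical usage: keep only entries tagged with the 1K-10K size bucket.
entries = [
    ("gpt-5.1-high-reasoning-1000x", "size_categories:1K<n<10K format:json modality:text"),
    ("glm-4.6-250x", "size_categories:n<1K format:json modality:text"),
]
mid_sized = [
    name
    for name, tags in entries
    if parse_tags(tags).get("size_categories") == "1K<n<10K"
]
```

Splitting on known key names rather than on whitespace means the helper also copes with tag strings in which the pairs run together without separators.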