Open Source AI Research

Distilled models and curated datasets for the community

We fine-tune open-source models on high-quality reasoning datasets from frontier models like Claude, GPT, and Gemini. All models are released in GGUF format for local deployment.

Browse Models View Datasets

368.6K

Downloads (last 30 days)

117

Models Released

Datasets Published

What We Do

Model Distillation

We train open-source base models (Qwen3, GPT-OSS) on reasoning traces from frontier models. This transfers capabilities while keeping models small and efficient.

Dataset Curation

We create high-quality reasoning datasets by querying models like Claude Opus, Gemini Pro, and GPT-5 with diverse prompts covering coding, math, and science.

GGUF Quantization

All models are released in GGUF format with multiple quantization levels (Q4, Q8, etc.) for use with llama.cpp on consumer hardware.

Top Models

View all →

GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF

101.9K downloads438 likes

Qwen3-14B-Claude-4.5-Opus-High-Reasoning-Distill-GGUF

87.0K downloads282 likes

Support Our Work

We're two college students funding this research out of pocket. Dataset generation costs add up quickly - our Claude Opus dataset alone cost $52+ to create. If you find our models useful, consider supporting us.

Donate via PayPal