Gallery
About
As frontier LLMs have very little output diversity even for open ended queries. We built Flint to see if we could reverse this. It’s a finetuned Qwen3 30B model specifically trained to produce higher entropy when asked open ended questions.Flint significantly increases the NoveltyBench score compared to the base model, without significantly reducing the score on non-creative benchmarks like MMLU-STEM.This shows that that divergence tuning doesn't actually have to be a tax on base capabilities.Flint scores 7.47/10 on NoveltyBench while most frontier models score between 1.8 and 3.2.
Comments (0)
No comments yet. Be the first to comment!
Related Products
Bytemine
Build anything, enrich all, pay less.
MemClaw – Persistent Memory for OpenClaw
Persistent memory for AI coding agents — isolated workspaces, shared context.
Advantora Insights | Secure Enterprise AI Data Analyst
Turn Messy Data to Executive Reports.
Advantora Insights | Secure Enterprise AI Data Analyst
Turn Messy Data to Executive Reports.
MemClaw – Persistent Memory for OpenClaw
Persistent memory for AI coding agents — isolated workspaces, shared context.
Agent Studio - AI Coding Assistant
Production-ready AI Agent IDE platform with visual builder & sandboxes