AI Engineering
5 min read
What if a 9B model could beat Sonnet at SQL?
It started as a thought experiment: multi-turn benchmarks expose a surprising weakness in frontier models. How close could a fine-tuned local model get?
Dan
Posts related to bird-interact