Technically

Technically

It was never about LLM performance

Beware the benchmark.

Justin's avatar
Justin
Mar 06, 2024
∙ Paid

The LLM community is obsessed with benchmarking model performance. Mistral released their new “flagship” model this week, and immediately focused the discussion on how it performs on “commonly used benchmarks” relative to other models:

User's avatar

Continue reading this post for free, courtesy of Justin.

Or purchase a paid subscription.
© 2026 Justin · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture