Ronan McGovern's Blog (The Blip)

The Weekly Blip: August 13th 2023

How to best measure LLM performance + Getting copied & kicked by Shopify

Ronan McGovern
Aug 13, 2023
∙ Paid

It can be hard to measure the performance of language models with benchmark datasets, because those same datasets are often used in their training.

There are some off-beat tests that do better:

  1. Sally (a girl) h…
