Archive - Interconnects

How RLHF works, part 2: A thin line between useful and lobotomized

Many, many signs of life for preference fine-tuning beyond spoofing chat evaluation tools.

May 1 •

April 2024

Phi 3 and Arctic: Outlier LMs are hints

Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months.

Apr 30 •

AGI is what you want it to be

Certain definitions of AGI are backing people into a pseudo-religious corner.

Apr 24 •

Llama 3: Scaling open LLMs to AGI

Llama 3 shows that scaling won't be a limit for open LLM progress in the near future.

Apr 18 •

Stop "reinventing" everything to solve alignment

Integrating some non-computing science into reinforcement learning from human feedback (RLHF) can give us the models we want.

Apr 17 •

The end of the “best open LLM”

Modeling the compute versus performance tradeoff of many open LLMs.

Apr 15 •

We disagree on what open-source AI should mean

How to read what multiple people mean by the word openness and see through the PR speak.

Apr 3 •

March 2024

DBRX: The new best open model and Databricks’ ML strategy

Databricks’ new model is surpassing the performance of Mixtral and Llama 2 70B while still being in a size category that's reasonably accessible.

Mar 28 •

Evaluations: Trust, performance, and price (bonus, announcing RewardBench)

Evaluation is not only getting harder with modern LLMs, it’s getting harder because it means something different.

Mar 20 •

Model commoditization and product moats

Where moats are tested now that so many people have trained GPT4 class models. Claude 3, Gemini 1.5, Inflection 2.5, and Mistral Large are here to…

Mar 13 •

The koan of an open-source LLM

A proposal for a new definition of an “open-source” LLM and why no definition will ever just work.

Mar 6 •

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models…

An interview I've wanted to bring you for a while.

Mar 4 •

1:26:27

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts