The New Biology

Markov Biosciences, a startup in San Francisco, is betting that biology is about to have its GPT moment. In this episode, founder Adam Green explains the "bitter lesson" for biology, the idea borrowed from Richard Sutton that large unbiased datasets and the right training objective tend to outcompete models with hard-coded rules and human priors. Adam thinks, in particular, that the virtual cell field took a wrong turn by spending hundreds of millions of dollars collecting expensive perturbation data. Green’s counterargument is that the data needed to train useful virtual cells is not limiting, but rather compute (and the loss function) are. By treating single-cell RNA-seq as a ranking problem rather than raw counts (a century-old idea traceable to a 1927 psychophysics paper), they found that virtual cells pre-trained on plain observational data show clean scaling laws, getting monotonically better at predicting unseen perturbations as the models grow, and beating a state-of-the-art model built specifically for that task.

00:00 - Cold open and introduction 
01:58 - The first clinical prediction from a virtual cell
05:38 - What is a "virtual cell," really? 
08:01 - Single-cell RNA-seq biases and the urns analogy
23:29 - The bitter lesson for biology
30:55 - Geometric Plackett-Luce: the right loss function
59:26 Trop2 deep dive
1:11:16 - Top-down vs. bottom-up biology, mechinterp, and control as the goal 

Readings and mentions: 

What is The New Biology?

The New Biology features long-form discussions with historians, technologists, and scientists who are working on some of the biggest ideas in biotechnology, from magnet-controlled medicines to virtual cells.

Supported by Astera Institute.