Data Leverage Short Posts

Short posts about AI policy related to data leverage. These posts generally avoid longer exposition and connections to current events.

N-gram search as posterior updating for data attribution

A short argument that model outputs and training-data priors can improve rough approximations of data attribution.

May 26, 2026


Augmentation is a data flow problem

A short argument that "augment, do not replace" is about data control.

May 25, 2026



AI progress as quasi-public good production

A short argument that AI systems pool diffuse human work into privately governed, public-good-like model weights.

May 29, 2025
