A condensed scatter of all 30 specialties (drag the sample-size floor and watch the fake trend dissolve), plus a live OpenEvidence run showing why the hard half of point-of-care — building the question from a 1,200-document chart — never reaches the test set.
Data explorer
Same answer, different score. Showing citations boosts OpenEvidence from 71% to 83% — while hurting every other model. The finding that upends how we evaluate clinical AI.
~3 min
Wearables skew toward the healthy and wealthy. But CDC data shows chronic disease clusters where coverage is thinnest. Every dot is a state — drag the coverage line and explore the gap.
Data explorer
Continuous monitoring works — it cut blood pressure across 106,261 patients. So predict who's actually wearing the monitor, then meet the people the data forgets.
~3 min