MoreRSS

site iconDynomightModify

Dynomight is a SF-rationalist-substack-adjacent blogger with a good understanding of statistics.
Please copy the RSS to your reader, or quickly subscribe to:

Inoreader Feedly Follow Feedbin Local Reader

Rss preview of Blog of Dynomight

Blink if you’re human

2026-06-26 08:00:00

I write every word I post on this blog myself. I can’t prove this, of course, but there’s some evidence:

  • This blog existed before AI could write blog posts.
  • If you put any of my posts into an AI-detector they will (I assume) come back squeaky clean.

And now let me add this: I, dynomight, guarantee that every word I post here is the product of me physically hitting keys with my fingers. The only exceptions would be quotes from other humans or something that’s clearly labeled as an AI output.

How is that evidence? Well, say you think I’m a low quality person and I do use AI but I’m lying and I’ve figured out how to evade AI-detectors. OK, ouch. But consider: It’s extremely likely that AI-detectors will improve in the future. (More precisely, it’s likely that future AI-detectors will be better than current AI-detectors at detecting current AI.) If I were using AI, and a future AI detector later caught me, the fact that I made the above promise would be really embarrassing.

You may be thinking that this looks gross and self-congratulatory. So I’d like to stress that the above guarantee is carefully worded. I do often use AI “for research”, just not “for writing”. (We’ll come back to that distinction.) And I don’t think there’s anything intrinsically wrong with using AI to write blog posts. I don’t do it personally, mostly because:

  1. I like writing.
  2. The act of writing itself helps me figure stuff out.
  3. This is a hobby. If you start automating your own hobbies—just what the hell are you doing?

I also don’t use AI for writing because—can we just admit it?—no one wants to read AI-generated essays. Or, rather, people love reading AI-generated essays, but when they want to read one, they will ask an AI for it themselves, thank you very much.

I know the counterarguments. What does it matter where the words came from? Shouldn’t you judge them on their own merits? Maybe. That’s a legitimate way to look at things. But empirically, I think most people don’t agree.

(I also know you’re counting the em-dashes. Count away, I’m still human.)

Here’s an oddly neglected question: Take all the essays that are AI-generated or heavily AI-assisted by one person and then given to someone else to read. In what percentage of cases does the first person disclose the AI usage? Ignore everything related to education if you want. You can even ignore emails. I suspect the answer is still <20%.

Why do people care about this? Several reasons. One is proof of work. If I, a human, write eight thousand plausible-seeming words about vitamin D, that proves that I’ve put some time and effort into understanding vitamin D. That suggests giving some weight to my opinion, even if just to best exploit the wisdom of crowds. That doesn’t work if my essay is secretly AI-generated.

And writing isn’t just cold clinical information-sharing. It’s a kind of parasocial interaction. I know “parasocial” sounds sinister, but I maintain that parasocial relationships are often a perfectly healthy way to adapt our primitive social instincts to the modern world. Anyway, good or bad, that’s part of it.

I bring this up because I’m worried that blogs are heading into a sort of lemon market. You’ve surely had the experience of reading an essay only to slowly become dismayed as you realize it was AI-written. What’s the equilibrium? I expect that some people have already cut back on reading essays, at least from non-established authors. Over time, I expect this will lead to fewer humans writing essays, further increasing the density of AI-generated content, driving more people to cut back on reading, et cetera. This is bad because blogs are good.

As that cycle turns, social norms are also changing. Cast your mind back to the old world, five years ago. At that time, if you had started a blog and posted AI-generated essays without telling anyone, I’m reasonably certain that would have been considered a dick move. (Future generations will marvel.) But today, the largest corporations appear to do that all the time. There’s incredible momentum towards a world where AI can be used anywhere, for any purpose, with no disclosure, and that’s fine.

But it is fine! At this point, trying to bully people into proactive disclosure is just a tax on honesty / consciousness / integrity. Instead, I suggest we agree that arbitrary usage is, by default, fine. Instead, let’s work at the other end: If you have chosen to impose limits on your AI usage, then state those limits publicly. If you’re human, tell me.

Obviously, this is no panacea. People can lie. But they can’t do so without taking some reputational risk, because if you use AI and lie about it, how long will your secret stay safe? No one knows because for once the unpredictability of technological change is on our side.


However. HOWEVER. I am not suggesting that we should bully writers into declaring that they are AI-free. I think that’s a terrible idea, because AI use comes on a spectrum. Already today, most people surely use it at least a little. (Do you avert your eyes when AI summaries come up at the top of search results?) Arguably, most people should use AI at least a little. We need to acknowledge that writing is entering the centaur era.

For context: Computers beat humans at chess in 1997. But for years after that, human + AI “centaur” teams could still beat both the best humans and the best chess AIs. Slowly, the value humans contribute to those teams has diminished, and today it’s somewhat unclear if centaurs still hold any advantage over pure AIs.

Humans are still better at blogging than AIs. (Though perhaps not better at literary short fiction.) In chess time, blogging is still pre-1997. But it’s a historical coincidence that no one seems to have cared about centaur chess before 1997. If people had tried, I suspect centaur chess teams could have beaten the best human players much earlier. So, to stretch our analogy, I’d put blogging around 1990 in chess time, in an alternate timeline where there was vast interest in centaur chess in the 1970s and 1980s.

I mean, what exactly can you do while still considering your essay “human written”? Can you:

  1. …look at AI summaries at the top of search results?
  2. …ask AI to find spelling or grammar errors?
  3. …use AI as an advanced thesaurus? (“Give me 50 words with meanings interpolating between ‘aggressive’ and ‘punctilious’.”)
  4. …ask AI factual questions when doing research?
  5. …trust the answers, or verify them yourself?
  6. …ask AI for options to rephrase an awkward sentence?
  7. …use those options verbatim?
  8. …ask AI for high-level organizational suggestions?
  9. …ask AI to to make figures / tables / code?
  10. …run an entire essay through an AI to “clean it up”?
  11. …ask an AI to give a rough prototype of the next section?

I don’t feel super-comfortable saying this, but I sometimes do all of those except #7, #10, and #11.

Wait! Let me explain! I probably do #3 or #6 around once per post. For #5, I usually verify, but when trying to understand something, I read a lot of sources. I try to mentally mark AI-derived facts as unreliable, but I don’t formally track the provenance of every single part of my mental model. I rarely do #8, and even-more rarely accept the suggestions, because AI seems to dislike me as a person and wants to purify my writing of all life and personality. But, in want of a human editor, I sometimes find it helpful. And no matter what, if any information flows from AI into my writing, it does so through my fingers, being written in my own words, never cutting and pasting, not a single word, never-ever.

On a spectrum where 0 = “refuses to look at AI summaries in web searches” and 100 = “puts a single prompt into an AI and posts the output without revisions”, I’d put myself at, I don’t know, 10?

Again, saying all that feels gross. (Somehow it feels like admitting to something shameful and simultaneously an exercise in arrogant self-congratulation? It’s remarkable.) I don’t know how my position on that spectrum compares to other writers, because almost no one discloses any AI usage at all.

But come on people. Democracy dies in darkness! We’re now at the point where readers default to assuming some relatively high (and increasing) level. I’m convinced that many people use AI in ways that are almost completely unobjectionable, but they’re too scared to admit it. This muddies the distinction between different parts of the spectrum, and exacerbates the dynamic where people are too afraid to read anything, lest they later realize it is “slop”.

We need to come to terms with the idea that for most writers, the optimal amount of AI usage is not zero. I’m sure that most people would say that some kinds of usage are normal / expected / good, while other kinds are aberrant / duplicitous / slop. But people have different opinions, and this is all shifting as technology and culture develop.

Unsurprisingly, I like the idea of people drawing the line close to where I did. But I’m willing to accept a fairly wide range, provided you’re upfront about it. Usually, if I sense the invisible hand of heavy AI editing, I sigh and unsubscribe. But Trevor Klee (an excellent blogger) has a couple posts where he says, “here’s an output from ChatGPT I thought was interesting.” Not only did I not unsubscribe, I actually attempted to read that output.

Still, I think it’s important to draw some line, not just to communicate to the outside world, but also for yourself. There’s a very blurry boundary between using AI “for research” or “to catch grammatical errors” and using it “for writing”. It’s very easy to slip from asking AI factual questions, to asking it to find errors in what you wrote, to asking it to fix those errors, to asking it to generate whole paragraphs of text. Each of those steps is easy to justify. So if you want to operate at some position on the spectrum, it’s probably best to choose some boundaries and then enforce them.

(AI used in the construction of this post: None.)

The worthlessness of vitamin D is mildly exaggerated

2026-06-23 08:00:00

For a while there, many people thought vitamin D was magical—that it could improve bones, the heart, infections, cancer, heart disease, longevity, even mental health. But among people I respect, opinion is now overwhelmingly that taking vitamin D does nothing unless you’re severely deficient. The central argument is that while vitamin D levels are correlated with ~all positive health outcomes, when you actually test vitamin D supplements against placebo in randomized trials, nothing ever happens.

That’s what I used to think, too. But I’ve come to think the skeptics have over-corrected. Yes, randomized trials have shown the magical correlations are not causal. But if you start with non-insane expectations, the trials look like weak but positive evidence. And if you consider what we know about biology and evolution, I think the balance of evidence tips pretty clearly in the direction that people with low-ish levels would be wise to supplement.

Am I certain that vitamin D is beneficial for people with low-ish levels? Absolutely not! But I claim that’s the best bet given the limits of our knowledge.

The classical view: Boring bone vitamin

Most vitamins are “ingredients” that the body uses to do stuff. Vitamin D is more like a “signal” that the body uses to communicate with itself about what to do.1 The classical “endocrine” story of vitamin D is that your body uses it to tell your guts to take in more calcium from food. If you don’t get enough vitamin D, then you have calcium problems.

That’s all you really need to know about the classical view. But if you enjoy gawking at biology’s complexity, I recommend this diagram and the following three paragraphs:

Ready for science? OK: Almost all the cells in your body make provitamin D.2 Usually, this is all converted to cholesterol, but your skin cells leave some sitting around. When UVB light hits those skin cells, provitamin D is transformed (physically by the light itself) into previtamin D and then (by heat) into vitamin D. This diffuses from the skin cells into blood vessels. There it binds to a protein3 and starts circulating in the blood, where it is joined by vitamin D from food.4 Eventually, the liver converts it into more-stable storage vitamin D. It also soaks in and out of fat and muscle tissue, which acts as a slow-release reservoir.

Now, a fun fact: If calcium levels in your blood get too low, then your heart will stop working and you will die. To avoid this, you have parathyroid glands in your neck that sense when calcium is getting low, and release parathyroid hormone into the blood. This tells your bones to release some of their stored calcium. It also tells your kidneys to convert some of the storage vitamin D from your blood into active vitamin D. And when that gets to your guts, they try to absorb more calcium from food.

So what happens if you don’t get enough vitamin D? Well, your body is not going to let calcium levels drop too low, because your body is designed to avoid death. Parathyroid hormone will still get secreted, and calcium will still get scavenged from your bones. But without vitamin D, your guts never get the signal to gather extra calcium from food. So the body scavenges a lot of calcium from your bones, and you end up with weak bones, which is bad.

Now here’s the thing: In this story, only active vitamin D actually does anything. The kidneys make this on demand in response to calcium levels, not in response to storage vitamin D levels. General opinion is that as long as the blood has above ~25 nmol/L of storage vitamin D, then the kidneys have no trouble making active vitamin D.5 Furthermore, survey data suggests that only ~2% of the population has levels below that threshold. This suggests that for ~98% of people, supplementing vitamin D should do approximately nothing.

The correlation view: Magical mystery cure

Rickets is a terrible disease that involves soft bones, stunted growth, and skeletal deformities. It’s probably been with us since ancient times, but it became common in the West after the industrial revolution. In 1890, a Scottish missionary named Theobald Palm observed that rickets was common in smog-ridden UK cities but almost unheard of in sunny countries with poor sanitation, suggesting sunlight itself was the issue. This contributed to the discovery that rickets could be cured with UV light or cod-liver oil, and eventually the discovery of vitamin D.

In 1941, Apperly noticed that the amount of sunlight in different US states was positively correlated with skin cancer but inversely correlated with overall cancer mortality.6 He gave this charming graph:

Apperly never mentions vitamin D, presumably because he thought it was a boring bone vitamin.

Things took off in 1980, when Cedric and Frank Garland published, “Do Sunlight and Vitamin D Reduce the Likelihood of Colon Cancer?” Seemingly unaware of Apperly, they gave a similar, but uglier, graph:

They point out that regional diets (like meat and fiber) didn’t seem to explain this pattern. Instead, they propose a mechanistic story:

    Sunlight
        ↓
    Vitamin D
        ↓
    Adequate calcium in blood
        ↓
    Reduced inflammation of epithelial cells in the colon
        ↓
    Less colon cancer

(It’s always inflammation.) This paper was rejected many times before finally being published. I wish I could find an un-gated copy to link to, because it would have made a magnificent blog post.7

Following that paper, there was an explosion of work that found negative correlations between sunlight (or latitude) and other types of cancers as well as blood pressure, diabetes, and multiple sclerosis.

Then people started measuring vitamin D in blood. In 1989, the Garlands and collaborators found blood samples takin in 1974 from 25,000 people. They found that 34 of those people had since gotten colon cancer. They matched these with 67 demographically similar people and measured vitamin D levels in the stored blood samples for all 101 people. Among that group, people with vitamin D levels below 50 nmol/L got colon cancer more than three times as often as people with higher levels.

Again, many similar studies followed. These linked higher vitamin D levels to better outcomes in cardiovascular disease, diabetes, obesity, infectious disease, Parkinson’s, and mood disorders. While results were mixed for non-colorectal cancer incidence, higher vitamin D levels predicted better survival of many cancers. Amazingly, all-cause mortality was roughly 30% lower for those at the 75th percentile of vitamin D levels compared to the 25th.

Vitamin D was looking like a miracle. But how could it do all that stuff if it was just a boring bone vitamin?

Meanwhile in biology

While all these correlations were being discovered, we learned that the body doesn’t just use vitamin D for bone stuff.

In 1969, we discovered the vitamin D receptor that active vitamin D binds to in the gut and bones. And in the 1980s came a shock: Almost all cells in the body have vitamin D receptors. These seem to do different things in different tissues. In the pancreas, they support insulin secretion. In immune cells, they boost antimicrobial peptides and reduce inflammation. In neurons, they influence proliferation and differentiation.

So… What? When calcium drops and the kidneys put out active vitamin D, does every part of the body start doing different unrelated stuff?

In the late 1990s, we cloned the gene for the enzyme that the kidneys use to convert storage vitamin D to active vitamin D. Soon came another shock: This enzyme also exists in tons of other cells, including immune cells, the heart, the skin, the prostate, the breast, and colon. (Another win for the Garlands.)

So it’s not just the kidneys making active vitamin D to trigger the gut. Cells everywhere are making their own active vitamin D and using it to trigger vitamin D receptors in neighboring cells, or even inside the same cell.8 This often has little to do with calcium or bones.9

So:

  1. The kidneys use vitamin D as a boring bone hormone.
  2. As long as the blood contains at least ~25 nmol/L of storage vitamin D, the kidneys don’t care. They create the same amount of active vitamin D, in response to calcium levels.
  3. But now cells everywhere are using storage vitamin D.
  4. To do god-knows-what.
  5. With god-knows-what sensitivity to circulating vitamin D levels.

And remember how only active vitamin D does anything? That’s wrong. In the mid-1970s, we learned that storage vitamin D also binds to the vitamin D receptor. The binding affinity is 100-1000× lower, but you have ~1000× more in your blood. So maybe circulating levels of storage vitamin D themselves matter, independently of how much active vitamin D gets made?

If that’s not confusing enough, people also noticed that while active vitamin D levels in the blood aren’t correlated with storage vitamin D (above ~25 nmol/L), levels of parathyroid hormone (the thing your parathyroid glands use to tell your kidneys to make active vitamin D) seem to decline as levels of storage vitamin D rise from ~25 to 50 or 75 nmol/L. Huh?10

On the one hand, all this makes the idea that vitamin D could be a miracle more plausible. On the other hand, this is getting complicated. And do we really believe that raising your vitamin D levels from the 25th to the 75th percentile could reduce your risk of death from any cause by thirty percent? Maybe we should try giving people vitamin D and see what happens.

Then came the RCTs

There have been many randomized trials. The “right” thing to do in such cases is to look at meta analyses that carefully combine all the data. We’ll get to those. But they conceal a lot of important nuance about what actually happens on the ground during these trials. So let’s start by going over the three main “megatrials”.

The Women’s Health Initiative (WHI) trial came out in 2006 and is still the largest vitamin D trial ever done. This took 36,000 postmenopausal American women and assigned half to take 400 IU daily with calcium and the other half to placebo.11 After seven years, here’s what happened:12

Outcome (WHI trial) Hazard ratio
Fractures 0.97 (0.91 to 1.03)
Cancer 0.97 (0.91 to 1.04)
Cancer mortality 0.90 (0.77 to 1.05)
CVD mortality 0.94 (0.78 to 1.12)
All-cause mortality 0.92 (0.83 to 1.01)
Kidney stones 1.17 (1.02 to 1.34)

(The hazard ratio is the ratio of the rate that something happens in the treatment vs. placebo groups. So, a number less than one suggests a benefit to taking vitamin D, while a number larger than one suggests a harm. The numbers in parentheses show a 95% confidence interval.)

The only statistically significant result was a bad one: Extra kidney stones, likely from the extra calcium.13 The other outcomes look vaguely good, but none were statistically significant despite the massive sample size.

This was disappointing. However, the WHI trial had limitations: Many subjects in both the vitamin D and placebo groups were already taking vitamin D, and continued taking it through the trial. The dose of 400 IU was fairly low, many subjects stopped taking their pills, and vitamin D levels didn’t actually change that much. They also measured vitamin D levels in only 6% of subjects, meaning we can’t compare the fates of subjects who started out with low versus high levels.

The next big hope was VITAL, which came out in 2018. They recruited 26,000 older people across the United States, half of them men and 20% Black (and thus far more likely to be vitamin-D deficient). They measured vitamin D levels in most people, and they gave the treatment group 2,000 IU per day.14 Here were the results after 5.3 years:

Outcome (VITAL trial) Hazard ratio
Diabetes 0.91 (0.76 to 1.09)
Autoimmune disease 0.78 (0.61 to 0.99)
Cancer 0.96 (0.88 to 1.06)
Cancer mortality 0.83 (0.67 to 1.02)
Major CVD event 0.97 (0.85 to 1.12)
CVD mortality 1.11 (0.88 to 1.40)
All-cause mortality 0.99 (0.87 to 1.12)

Some of the results look good-ish, but cardiovascular mortality was higher in the treatment group, leading to almost no effect on all-cause mortality.15 More disappointment.

The last megatrial was D-Health, which came out in 2022 based on 21,000 older Australians. Instead of daily supplements, it used a monthly “bolus” dose of 60,000 IU or placebo. Unlike in VITAL, there was no exclusion for people with a history of cardiovascular disease or cancer, and less restriction on how much vitamin D participants could take on their own during the trial.16 Here were the results after 6 years:

Outcome (D-Health trial) Hazard ratio
Cancer mortality 1.15 (0.96 to 1.39)
Major CVD event 0.91 (0.81 to 1.01)
CVD mortality 0.96 (0.72 to 1.28)
All-cause mortality 1.04 (0.93 to 1.18)

Now, the treatment group did better in terms of cardiovascular disease, but worse in cancer and worse in all-cause mortality. Even more disappointment.

Just from these three large trials, the main lesson should already be clear: Vitamin D is not a miracle. The correlations were wrong.17 There is essentially zero remaining hope that taking vitamin D could reduce all-cause mortality by a third.

In this sense, the vitamin D skeptics are definitely right. But what about the other trials? And is there a more subtle lesson?

I made some tables

I wanted a big table that summarized all the major vitamin D RCTs and what they found for different health outcomes. Annoyingly, no such overview appears to exist. So I made my own:18

Trial Cancer Cancer mortality CVD CVD mortality All-cause mortality
Lips 1996         0.92 (0.80 to 1.06)
Trivedi 2003 1.08(0.89 to 1.31) 0.86 (0.61 to 1.21) 0.95 (0.86 to 1.04) 0.86 (0.67 to 1.11) 0.90 (0.77 to 1.07)
WHI 2006 0.98 (0.90 to 1.05) 0.89 (0.77 to 1.03)   0.94 (0.78 to 1.12) 0.92 (0.83 to 1.01)
Lyons 2007         0.99 (0.93 to 1.05)
WFPT 2007         1.00 (0.87 to 1.15)
RECORD 2012 1.04 (0.91 to 1.19) 0.83 (0.55 to 1.26)   0.91 (0.79 to 1.05) 0.93 (0.85 to 1.02)
Lappe 2017 0.70 (0.47 to 1.02)        
VITAL 2018 0.96 (0.88 to 1.06) 0.83 (0.67 to 1.02) 0.97 (0.85 to 1.12) 1.11 (0.88 to 1.40) 0.99 (0.87 to 1.12)
ViDA 2018 1.01 (0.81 to 1.25) 0.99 (0.60 to 1.64) 1.02 (0.87 to 1.20)   1.12 (0.79 to 1.58)
D2d 2019 1.07 (0.70 to 1.62) 0.23 (0.03 to 1.86)      
DO-HEALTH 2020 0.76 (0.49 to 1.18)   1.37 (0.88 to 2.14)    
D-Health 2022   1.15 (0.96 to 1.39) 0.91 (0.81 to 1.01) 0.96 (0.72 to 1.28) 1.04 (0.93 to 1.18)
FIND 2022 1.04 (0.72 to 1.51) 1.14 (0.56 to 2.33) 0.90 (0.62 to 1.32) 0.85 (0.28 to 2.53) 0.81 (0.32 to 2.06)

Lots of the hazard ratios are less than one, suggesting a benefit to supplementation. But lots of them are also higher than one, suggesting a harm. The numbers that are far from one almost always come from smaller trials, which manifest as larger confidence intervals. If you’re interested in the details of how these trials were run, I refer you to more gigantic tables in a footnote.19

If big tables aren’t your thing, here are some formal meta-analyses, both some recent ones and an older but more comprehensive Cochrane review:

Outcome Meta analysis Hazard ratio Comment
All-cause mortality Bjelakovic 2014 (Cochrane) 0.96 (0.92 to 0.99) Trials with low risk of bias.
Cancer mortality Bjelakovic 2014 (Cochrane) 0.88 (0.78 to 0.98)  
Cardiovascular mortality Bjelakovic 2014 (Cochrane) 0.98 (0.90 to 1.07)  
Cancer mortality Kunzia 2023 0.94 (0.86 to 1.02)  
All-cause mortality Ruiz-García 2023 0.96 (0.91 to 1.00) Good-quality trials
Cardiovascular mortality Ruiz-García 2023 1.00 (0.92 to 1.08) Good-quality trials
All-cause mortality Cao 2023 0.99 (0.96 to 1.03)  

Squinting at the data

There are various ways you could try to squint at these RCT. In almost all of them, most people already had pretty high levels before they started. So why don’t we separate out people who started low? Usually we can’t, because most trials didn’t measure baseline vitamin D.20 And among the trials that did, there are few people with low levels, so the results are noisy and confusing.21

Or, you might theorize that benefits would take time to show up, meaning the first couple years just add noise. In some cases—notably VITAL—excluding the first two years seems to help, but in other cases things get worse.22

Finally, some people speculate that taking gigantic monthly or quarterly “bolus” doses of vitamin D might be dangerous. For example, here’s an enjoyable paragraph from Kunzia et al. in their meta-analysis of vitamin D and cancer mortality:

Our results showing efficacy of daily, but not bolus, vitamin D3 supplementation in reducing cancer mortality are consistent with previous meta-analyses on cancer mortality or all-cause mortality (Guo et al., 2022; Keum et al., 2022; Keum et al., 2019; Zhang et al., 2022; Zhang et al., 2019). However, by including more trials than these previous meta-analyses, we were able to detect statistically significant effect modification by treatment regimen for the first time with statistical significance (pinteraction=0.042). The pattern of intake could be important for a favourable steady state of the bioavailability of the active 1,25 (OH)₂D hormone. Daily administration counteracts the fast excretion of vitamin D from the circulation (Hollis and Wagner, 2013; Keum et al., 2022). Moreover, the enzymes CYP27B1 (converts 25(OH)D to 1,25 (OH)₂D) and CYP24A1 (inactivates 25(OH)D and 1,25(OH)₂D) follow first-order reaction kinetics (Vieth, 2009). This means that doubling the concentration of the precursor doubles the yield of the product, unlike other steroid hormones (e.g., cortisol, oestrogen, testosterone) that follow zero-order kinetics (Vieth, 2020). Intermittent, non-physiologically large vitamin D3 bolus doses may lead to unstable cycling of 25(OH)D and 1,25(OH)₂D levels in blood because the system needs time to adapt to the large doses (Hollis and Wagner, 2013; Keum et al., 2019; Vieth, 2020). In the long run, intermittent bolus regimens at weekly or larger intervals can lead to an up-regulation of countervailing factors (e.g., 24-hydroxylase (CYP24A1), 24,25(OH)2D and fibroblast growth factor 23), all of which ultimately leads to lower synthesis or higher degradation of 1,25(OH)₂D levels (Mazess et al., 2021). Bolus doses, unlike daily doses, failed to reduce C-reactive protein response and actually elevated anti-inflammatory cytokines and doubled the risk of hypercalcemia in previous studies (Krishnan et al., 2012; Martineau et al., 2017; Mazess et al., 2021).

Oh no, up-regulation of fibroblast growth factor 23!23

I don’t feel like I understand this deeply enough to have any opinion beyond the surface level that the body seems to adapt to large doses of vitamin D in ways that could possibly be bad.24 It seems intuitive that small daily doses would be safer than gigantic monthly doses, but I’m always suspicious of post-hoc mechanistic speculation. Also, if people get enough sun, they can apparently synthesize 10,000-25,000 IU per day, which isn’t that far from the 60,000 IU they got in the D-Health trial. But then again, I think Kunzia et al. are suggesting that the body is designed to adapt to regular exposure to large doses but not intermittent exposure?

Well, if you split up the trails by daily vs. bolus dosing, there’s a decent pattern of daily dosing leading to better results:

Trial (daily dosing) Cancer mortality All-cause mortality
Lips 1996   0.92 (0.80 to 1.06)
WHI (Jackson 2006) 0.89 (0.77 to 1.03) 0.92 (0.83 to 1.01)
WFPT (Smith) 2007   1.00 (0.87 to 1.15)
RECORD (Avenell 2012) 0.83 (0.55 to 1.26) 0.93 (0.85 to 1.02)
VITAL (Manson 2018) 0.83 (0.67 to 1.02) 0.99 (0.87 to 1.12)
D2d (Pittas 2019) 0.23 (0.03 to 1.86)  
FIND (Virtanen 2022) 1.14 (0.56 to 2.33) 0.81 (0.32 to 2.06)
Trial (bolus dosing) Cancer mortality All-cause mortality
Trivedi 2003 0.86 (0.61 to 1.21) 0.90 (0.77 to 1.07)
Lyons 2007   0.99 (0.93 to 1.05)
ViDA (Scragg 2018) 0.99 (0.60 to 1.64) 1.12 (0.79 to 1.58)
D-Health (Neale 2022) 1.15 (0.96 to 1.39) 1.04 (0.93 to 1.18)

If those bolus dosing trials didn’t exist, I’d think this looked pretty good. So, maybe? Or maybe this is a story made up to hallucinate a positive trend. I would lean towards the latter theory, but there are papers like Mazess et al.’s “Vitamin D: Bolus is Bogus”, that suggested this pattern before D-Health’s dismal results came out. There are even some trials that suggest bolus doses don’t even work for treating rickets. So… I’m still not convinced. But maybe.

Aside: There are also many Mendelian randomization studies that look at correlations between health and genes that are related to vitamin D. But I don’t think these provide much information, because the assumptions are shaky and the genes don’t explain much of the variance.25

Where are we?

Still with me? Here’s a summary of the above 5200 words:

  • The body uses vitamin D in all sorts of weird and complicated ways. It’s biologically plausible that vitamin D could matter beyond bone stuff with severe deficiency, but there’s no convincing mechanistic evidence that it is.
  • Vitamin D levels are strongly correlated with good health outcomes, but RCTs have conclusively shown that most of these correlations are non-causal.
  • RCTs haven’t conclusively shown any benefit for anything beyond bone stuff. At best, they’ve given weak evidence for hazard ratios slightly below one.

So you might be wondering: Isn’t that quite weak? Wasn’t this post supposed to be a defense of vitamin D?

The case for supplementing anyway

It’s biologically plausible that vitamin D is good

Everyone agrees that severe vitamin D deficiency (below ~25 nmol/L) is bad. It leads to rickets, adult rickets, osteoporosis, muscle weakness or even—with profound deficiency—to seizures or cardiac arrhythmia. This makes sense, because below ~25 nmol/L, the kidneys have trouble converting storage vitamin D into active vitamin D, meaning you don’t absorb enough calcium from food.

The question is if taking supplement to further raise your levels (say to 50 or 90 nmol/L) is important. We have no mechanistic proof, but it might be true, because many parts of the body use vitamin D as a local signal and because cells are at least somewhat sensitive to circulating storage levels. There’s also this weird thing where parathyroid hormone continues to decline as vitamin D levels rise above ~25 nmol/L even while this seems to make little difference to how much active vitamin D the kidneys make.

Nothing in this world comes without trade-offs. Surely, supplementing vitamin D comes with some downsides. But it seems very unlikely that raising vitamin D levels to a “normal” level would cause more harm than benefit. Especially because…

Humans evolved to have a lot of vitamin D

According to Luxwolda et al.’s 2012 paper, “Traditionally living populations in East Africa have a mean serum 25-hydroxyvitamin D concentration of 115 nmol/L”, traditionally living populations in East Africa have a mean serum 25-hydroxyvitamin D concentration of 115 nmol/L.

Meanwhile, Wahl et al. 2012 try to estimate mean levels around the world today:

This map looks weird because of varying lifestyle, diet, supplementation, and needing to combine fragmented studies. But you get the idea. And remember, those are just averages. So there are lots of people with levels far lower than that in our evolutionary history.

Of course, just the fact that vitamin D levels have dropped doesn’t mean it’s important. Parasitic worm load, wood smoke inhalation, and cousin marriage have also dropped, but we aren’t rushing to restore those to ancestral levels.

But there’s another piece of evidence: After humans migrated out of East Africa, some of them evolved pale skin. Pale skin is bad, because it allows light to destroy folate, which is crucial for pregnancy.26 Evolution doesn’t typically do things that harm fertility, because evolution wants to increase reproductive fitness. The most common explanation is that pale skin allows more UV light to penetrate, and thus allows people to synthesize more vitamin D. If evolution was willing to pay the high “price” of folate destruction for more vitamin D, that seems like good evidence that vitamin D is important.

Some even see contrasts like the Inuits versus Scandinavians as a kind of natural experiment: They lived at similar latitudes, but Inuits ate a diet with vitamin D (fatty fish and whale blubber) and Scandinavians didn’t. The result is that Inuits have darker skin than Scandinavians.27

This is all speculative, and even if true, might be driven by severe deficiency and rickets. Or perhaps prehistoric benefits don’t translate to your lifestyle. But all the people in Luxwolda’s sample in East Africa had levels above ~60 nmol/L. I just don’t see how you can look at this and not see it as providing some suggestive evidence in favor of the idea that raising levels above severe deficiency is unlikely to be harmful, and could be important. So I think the prior is favorable.

What do you expect from vitamin D?

A hazard ratio like HR = 0.96 doesn’t look very impressive. But hold on. Suppose that life expectancy is 80 years and that taking vitamin D every day reduces your risk of all-cause mortality by a factor of HR. A reasonable approximation in rich countries is that this would increase your life expectancy by

    80 × 0.15 × (1-HR) years = 12 × (1-HR) years,

where 0.15 is derived from the entropy of lifespan in rich countries.28 For example, if all-cause mortality had a true hazard ratio of HR = 0.96, then taking vitamin D every day of your life would increase life expectancy by around

    0.48 years.

I claim that this would be a lot. Certainly, if I were about to face my destiny, I would pay a lot of money for an extra 0.48 years. Or, you can calculate that this corresponds to an increase of life expectancy per-vitamin-D-pill of 8.6 minutes.29 A common rule-of-thumb is that smoking a cigarette costs around 11 minutes of life in expectation. If you think HR = 0.96 is trivial, do you also think that smoking one cigarette each day is fine?30

The correlational studies suggested that vitamin D might drop your risk of all-cause mortality by a third. It’s disappointing that the RCTs refuted this. But those correlational studies were crazy. They imply31 an increase of life expectancy of around 4 years or around 6.5 cigarettes per day. Could we really believe that you could smoke 6.5 cigarettes, then take a vitamin D pill, and you’re even?

Personally, I think hazard ratios just slightly less than one are the best we can reasonably hope for. But I also think that they would be an excellent return on investment. Arguably, modern human life expectancy comes from stacking lots of modest hazard ratios on top of each other.

What do you expect from vitamin D trials?

Let’s play a game. Let’s hallucinate some numbers for what vitamin D might do, and then simulate what trials would show. Here are the strongest effects I consider plausible for different baseline levels, along with how common those levels are in the United States.

Storage vitamin D (nmol/L) Hazard ratio % of population
<30 0.75 5
30-49 0.92 15
50-125 0.98 72.5
>125 1 7.5

Suppose that were real. Now, say we pick 26,000 people at random, and give half of them vitamin D for give yars. Here are the results of a million simulated trials, assuming a baseline mortality risk of 0.7%: 32

Overall, 9% of trials would find a significant benefit, 63% would find a non-significant benefit, 27% would find a non-significant harm, and 1% would find a significant harm.

If you wanted to have an 80% chance of finding a significant decrease, you’d need to run a trial with something like 570,000 people, almost five times more than in all the above trials combined.33 If you don’t like my numbers, I’ve put up a page where you can run your own simulations with different ones.

My point is, the results we see in vitamin D RCTs are what we should expect to see if vitamin D had plausible benefits. That’s not proof, of course—just that if you start with realistic expectations, the trials don’t provide much evidence in either direction.

The trials do find slightly helpful numbers

Recent meta-analyses have not consistently found a statistically significant benefit to vitamin D supplementation. But they do suggest a small benefit for cancer mortality and all-cause mortality, and they’re close to being statistically significant. That’s something.

And if you buy the argument that bolus dosing is bad, the results get even better. Kunzia et al. did a meta-analysis of cancer mortality using only trials with daily dosing, and found a hazard ratio of 0.88 (confidence interval 0.78 to 0.98). I’d keep this at arm’s length. The bolus dosing trials might have done worse by random chance, meaning this is a kind of p-hacking. But there’s a reasonable chance (maybe 25-50%) that bolus dosing really is bad, in which case those trials would be convincing evidence.

I actually think it’s surprising that the meta-analyses look as good as they do, because there just aren’t that many people who started out with low vitamin D levels. Only a handful of trials had mean levels below 60 nmol/L, and they all give semi-promising results:34

Trial (low-ish baseline) Cancer mortality All-cause mortality
Trivedi 2003 0.86 (0.61 to 1.21) 0.90 (0.77 to 1.07)
WHI (Jackson 2006) 0.89 (0.77 to 1.03) 0.92 (0.83 to 1.01)
Lyons 2007   0.99 (0.93 to 1.05)
RECORD (Avenell 2012) 0.83 (0.55 to 1.26) 0.93 (0.85 to 1.02)

Again, it’s dangerous to dig too deeply looking for these kinds of patterns. If you dig enough, you can always find a way to confirm whatever theory you want. But also again, maybe?

You’re probably already taking vitamin D

You might not personally supplement vitamin D. But for most people reading this, someone else is supplementing it for you.35

Country Commonly fortified with vitamin D
Australia Margarine
Belgium Margarine
Canada Milk, margarine
Chile Milk, flour
Ethiopia Oils
Finland Milk, yogurt, margarine
Ireland Margarine, cereal
New Zealand Margarine (from Australia)
Norway Margarine, low-fat milk
Pakistan Oils
Poland Margarine
Sweden Milk, yogurt, plant milk, margarine
United Kingdom Margarine, cereal
United States Milk, plant milk, margarine, cereal, yogurt

Fortified food is common across the Anglosphere and Scandinavian peninsula. However, it’s rare in the rest of Europe (exceptions: Belgium, Poland) and even-more rare in the rest of the world (exceptions: Chile, Ethiopia, Pakistan).

I think this is important for two reasons. First, vitamin D is oddly self-defeating. There are some places in the world where people care about vitamin D. These are the places that run large trials. But these places also fortify their food and tend to be full of people that already supplement vitamin D. These places also tend to believe it’s unethical to tell the control group not to take vitamin D.

And here’s another question: If you think vitamin D is worthless, are you comfortable recommending removing vitamin D from food? If not, then why is the particular amount of fortification in food now the right one?

Some might argue that the purpose of fortification is to reach the severely deficient, or children, the elderly or pregnant mothers. Maybe! But again, if you could press a button and remove fortification from everyone else, would you feel comfortable pushing that button? Remember, trials don’t test going down from current levels, only going up.

So that’s my story

  • Biology and evolution suggest a prior that moderate levels of vitamin D (say 80 nmol/L) are quite possibly better than low levels (like 40 nmol/L) and unlikely to be worse.
  • Observational studies say that vitamin D is magical, but those studies are bad and we should ignore them.
  • The RCTs show that vitamin D is non-miraculous. But beyond that they don’t provide much information, because they mostly enrolled people with moderate vitamin D levels, meaning plausible effects would require colossal sample sizes to reliably detect.
  • What evidence the RCTs do provide points weakly towards a modest benefit.
  • If real, that benefit would far exceed the cost of taking vitamin D.
  • Therefore, if you have low vitamin D, it seems wise to supplement.

This is all very weak, I know! But sometimes weak evidence is all we’ve got.

I wish we had at least one large trial done in a population with low starting levels. But as far as I can tell, none are underway. In fact, it’s unlikely that there will be any more large trials anytime soon. So weak evidence is how it’s going to be.

  1. Technically, vitamin D itself is a type of steroid although not what people usually mean by “steroid”. 

  2. Here are some of the fancy names for the different forms of vitamin D I’ll talk about:

    my name fancy names
    provitamin D 7-dehydrocholesterol
    previtamin D previtamin D₃
    vitamin D cholecalciferol
    storage vitamin D calcifediol / ergocalciferol / 25(OH)D / 25-hydroxyvitamin D
    active vitamin D calcitriol / ercalcitriol / 1,25(OH)₂D / 1,25-dihydroxyvitamin D

  3. Charmingly named “vitamin D-binding protein”

  4. If you eat mushrooms or yeast, it joins the vitamin D from your skin en route to your liver. If you eat animals or animal products, you also get some storage vitamin D, which doesn’t need to be processed by the liver. 

  5. Storage vitamin D is what your doctor measures in your blood test. This is sometimes measured in nmol/L and sometimes in ng/mL. The latter measurement is smaller by a factor of 2.496. So 25 nmol/L ≈ 10 ng/mL. 

  6. Apperly was building on a 1937 paper that observed observed that sailors, exposed to lots of sunlight, had much higher skin cancer rates than the general population, but lower overall cancer rates. 

  7. I theorize that the Garland brothers are alive and writing Slime Mold Time Mold

  8. In Biologist, active vitamin D is not just an “endocrine” hormone that sends signals for far away cells through the blood, it’s also a “paracrine” or “autocrine” hormone that sends signals to nearby cells or inside a single cell, through diffusion. 

  9. You might ask, why is vitamin D used by so many different parts of the body for so many different purposes?

    I think there’s no deep answer here. It’s true for the same reason that dogs sneeze to signal that they’re feeling playful: Evolution re-uses stuff for different purposes all the time. Imagine that DNA already exists coding for the vitamin D receptor and for the enzyme to convert storage vitamin D into active vitamin D. If some cells need to send a local signal, re-using those is easier than inventing something new. There’s nothing unusual or magical about this. 

  10. Don’t try to make sense of this. It doesn’t make sense.

    You could speculate that this is because the parathyroid glands are trying to make less active vitamin D to compensate for the fact that vitamin-D receptors throughout the body are sensitive to storage vitamin D itself. But I advise against. 

  11. 400 IU is the recommended daily amount 

  12. The WHI trial was a pioneer in salami-slicing results for different outcomes into dozens of different papers, most of which are hard to access. All trials now seem to have adopted this hideous trend which makes it maddening to try to summarize what actually happened in a trial. Also, slightly different numbers for the same quantity appear in different places. I haven’t bothered to chase these down, because the differences are all very small, e.g. a hazard ratio of 0.89 for cancer mortality rather than 0.90. 

  13. Guess what most kidney stones are made of? 

  14. Half of the vitamin D group and the placebo group also got omega 3. These are averaged together in the results. Also, VITAL carefully stratified the assignment to vitamin D or placebo based on baseline vitamin D levels, which should give more statistical power from a given sample size. 

  15. There was also a weird study done on a subset of 1031 people from the VITAL population that looked at telomere length. After starting with around 8700 base pairs, the control group lost around 160 base pairs during the study, while the vitamin D group only lost an average of 20. I’m not sure of what to make of this. For one thing, though the authors claim this is statistically significant, it depends on how you analyze the data. But beyond that, sure, telomere length is a marker of aging, but telomeres get shorter for a reason (likely to fight cancer) and it isn’t obvious that slowing this would always be a good thing. 

  16. This is a little complicated. In VITAL, participants were only eligible if they were taking at most 800 IU per day, and they were restricted to 800 IU per day during the trial. In D-health, participants were only eligible if they were taking at most 500 IU per day, but they were allowed to take up to 2000 IU per day during the trial. 

  17. You might ask: If vitamin D only has a modest effect, then why is it so strongly correlated with health?

    In principle, I’d like to push back against the idea that we need to explain why these particular correlations don’t imply causation. But the accepted explanation is a combination of (1) reverse causation where being healthy causes people to spend more time outside and thus get more vitamin D; (2) confounding, where obesity is bad for you and leads to lower measured vitamin D levels; (3) confounding, where more healthy lifestyles lead to both more vitamin D and more health; and (4) confounding, where higher socioeconomic status leads to both more vitamin D and more health. You might ask why these correlations would be true at a state level like the Garlands looked at, but then you run into the ecological fallacy and modifiable areal unit problem

  18. I took all the trials that got at least 2% weight and were rated as “low risk of bias” in this 2014 Cochrane review of vitamin D and mortality, then manually added all the “major” trials that were published after 2014.

    I shudder to think of the time it took to make this table. I tried using AI but found it was wildly unreliable. Part of the problem is that each trial’s results are distributed among many papers, in different journals, with different paywalls. And many details aren’t published at all by the original authors but are only scrounged up and put in the depths of the supplementary material of a review years later. In some cases, different sources also give contradictory numbers. The differences were always tiny (e.g. 0.90 rather than 0.89) but it still makes me nervous. 

  19. Here’s a table describing the major contours of the trials:

    Name Country Subjects (n) Age (years) white (%) Duration (years)
    Lips 1996 Netherlands 2,578 80 ± 6   3.5
    Trivedi 2003 UK 2,686 74.7 ± 4.6 74 5
    WHI 2006 USA 36,282 (women) 61.8 ± 6.7 84 7
    Lyons 2007 Wales 3,440 84 ± 7.5   3
    WFPT 2007 UK 9,440 79.1   3
    RECORD 2012 UK 5,292 77.5 ± 6 99.2 6.2
    Lappe 2017 USA 2303 (women) 65.2 ± 7.0 100 4
    VITAL 2018 USA 25,871 67.1 ± 7.1 71.3 5.3
    ViDA 2018 New Zealand 5,110 65.9 ± 8.3 83.3 3.3
    D2d 2019 USA 2,423 60.0 ± 9.9 67 2.7
    DO-HEALTH 2020 Switzerland, Germany, Austria, France, Portugal 2,157 74.9 ± 4.1   3
    D-Health 2022 Australia 21,315 69.3 ± 5.5 94.7 5
    FIND 2022 Finland 2,495 68.2 ± 4.5 100 5

    And here’s a table focusing on the change in vitamin D levels:

    Name Intervention Allowed personal use (IU/day) Baseline D (nmol/L) Final D (nmol/L)
    Lips 1996 400 IU daily 0 (screening)    
    Trivedi 2003 100,000 IU 3× per year (D2) 0 (screening) 200 (trial) 52.5 (in controls) 75
    WHI 2006 400 IU daily with Ca 600 (later 1000) 52.0 ± 21.1 (subset) ~67
    Lyons 2007 100,000 IU 3× per year <400 (screening) 54.0 (in controls, subset) 80.1 (subset)
    WFPT 2007 300,000 IU yearly <400 (screening)    
    RECORD 2012 800 IU daily with Ca 200 ~38  
    Lappe 2017 2000 IU daily with Ca any? 71.8 ± 20.0 96.0 ± 21.4
    VITAL 2018 2,000 IU daily 800 77 ± 30 105 ± 25
    ViDA 2018 100,000 IU monthly 600 / 800 (younger / holder) 63 ± 24 119 ± 45
    D2d 2019 4,000 IU daily 1000 69.9 ± 26.8 98.7
    DO-HEALTH 2020 2,000 IU daily 1000 / 800 (screening / trial) 55 ± 22 100 ± 27
    D-Health 2022 60,000 IU monthly 500 / 2000 (screening / trial) 77 ± 25 (predicted) 115 ± 30
    FIND 2022 1,600 or 3,200 IU daily 800 75 ± 18 100 ± 21 or 120 ± 22

  20. Among the major trials, only VITAL, ViDA, and FIND measured it for more than a tiny number of subjects. 

  21. In VITAL and ViDA, people with baseline levels below 50 nmol/L had a higher hazard ratio for cancer mortality (though with wide confidence intervals), suggesting if anything less benefit. Or, you could use race as a proxy for baseline vitamin D. But in both VITAL and WHI, the hazard ratio for cancer mortality was higher among non-Whites. After looking at many such analyses for many outcomes, the only clear result I could find was for diabetes in the D2d trail, where the hazard ratio was much lower for people below 30 nmol/L (0.38 vs. 0.93). 

  22. The results for VITAL look decent:

    outcome (VITAL trial) HR HR excluding first two years
    Cancer 0.96 (0.88 to 1.06) 0.94 (0.83 to 1.06)
    Cancer mortality 0.83 (0.67 to 1.02) 0.75 (0.59 to 0.96)
    Major CVD event 0.97 (0.85 to 1.12) 0.93 (0.79 to 1.09)
    All-cause mortality 0.99 (0.87 to 1.12) 0.96 (0.84 to 1.11)

    But in D-Health, excluding the first two years actually increased the hazard ratio for cancer mortality from 1.15 (0.96 to 1.39) to 1.24 (1.01 to 1.54). Most other trials were too short for this kind of analysis to make sense. 

  23. That could downregulate 25-hydroxyvitamin D 1-alpha-hydroxylase, reducing the rate it catalyzes the hydroxylation of hydroxycholecalciferol into 1,25-dihydroxycholecalciferol! 

  24. Dynomight: WTF is this?

    Dynomight Biologist: Well, C-reactive protein is generally considered inflammatory.

    Dynomight: So reducing that is good? But then why do they talk like elevating anti-inflammatory cytokines would be bad?

    Dynomight Biologist: Yeah… That would be good. Unless you have cancer. In which case it’s not good.

    Dynomight: OK! 

  25. Mendelian randomization studies are based on the idea that certain genes predispose you to have higher levels of circulating vitamin D. If you assume that those genes are randomly distributed in the population and have no effects other than affecting vitamin D, then they serve as a kind of natural experiment. With vitamin D, these studies typically show null results. However, the validity of the assumptions is debatable and the identified genes only explain ~5% of the variance in vitamin D levels, which makes the results very noisy. 

  26. Pale skin also greatly increases the risk of sunburn and skin cancer. In the US, White people get melanoma at around 25 times the rate of Black people, despite (I assume) higher usage of sunscreen and better health outcomes in most other dimensions. But experts generally think folate deficiency created stronger selective pressure, since it’s so closely linked to reproduction. 

  27. It’s a more complicated than this, because you also need to look at the amount of folate in diet, as well as migration patterns and how long populations had to adapt to their environment. But experts seem to consider this the leading explanation for the evolution of pale skin. 

  28. To derive this, suppose that S(t) is the probability that someone survives to age t. Then life expectancy is ∫ S(t) dt, where the integral runs from 0 to ∞. If you change the hazard ratio by a factor of HR, then the new in life expectancy is L(HR) = ∫ S(t)ᴴᴿ dt, so the change under a linear approximation is ΔL ≈ (HR-1) × L’(1). This is more commonly written as ΔL ≈ (HR-1) × L(1) × H, where H = -L’(1)/L(1) is known as the Keyfitz entropy. This is is chosen because the quantity H is relatively stable, and in rich countries is typically between 0.10 and 0.20. So a decent estimate would be that baseline life expectancy is L(1)=80 years and H = 0.15 in which case the change in life expectancy is around 12 × (1-HR) years. 

  29. Observe that 0.48 years is 252460.8 minutes. Assuming you lived for 80 years and took a pill every day of your life, that would be 80 * 365.25 = 29220 pills. 252460.8 minutes / 29220 pills = 8.64 minutes/pill. 

  30. I expect that a number of you are happy to bite that bullet and say yes, HR=0.96 is trivial and smoking a cigarette each day is also fine. I don’t personally agree, but it’s not my place to question your utility function and I applaud your consistency. 

  31. A hazard ratio of HR=2/3, implies a change in life expectancy of 12 × (1 - 1/3) years = 4 years or 2,103,840 minutes. That corresponds to a per-pill increase of 2,103,840 minutes / 29,220 pills = 72 minutes/pill. 

  32. Technically, this is calculating a relative risk rather than a hazard ratio, but I think the difference isn’t very significant given that we’re assuming a uniform mortality risk. I used AI to create that simulation, though I did test that it replicates a traditional power calculator across a wide range of parameters when the relative risk is constant for all vitamin D levels. So I mostly trust it. 

  33. This simulation is probably a bit pessimistic. Things look a bit better if you use an older population where baseline mortality is higher. (Almost all trials do.) In principle, you could also use a population where more people have low levels, which could help a lot. But, for whatever reason, almost no trials do that. In fact, most trials accidentally under-sample people with low vitamin D, because people who agree to participate tend to be more health-conscious. 

  34. Kunzia et al. made a heroic effort to contact study authors and get data for individual patients. After getting data for 21,558 people (almost all from ViDA + FIND + VITAL + WHI) only 3,663 had levels below 50 nmol/L. That’s not enough to reliably detect a modest effect, meaning their confidence interval for this group is gigantic. 

  35. In this table, I tried to capture foods that are commonly fortified in practice, not just when it’s legally required. 

Is “colorectal cancer” rising in “young people”?

2026-05-26 08:00:00

(Yes, but.)

Over the past few years, I’ve seen many articles about mysterious rise in colorectal cancer (CRC) in young people. There are various stories for why this might be happening:

General health. Maybe modern people are unhealthy (obesity, low physical activity, diabetes, poor sleep), leading to insulin resistance and chronic inflammation, meaning faster epithelial cell proliferation and a miscalibrated immune system that fails to stop early cancers?

Ultra-processed food. Maybe people are eating more ultra-processed foods that contain additives (like emulsifiers) that degrade colon mucus, allowing bacteria to contact epithelial cells and drive inflammation? Or maybe ultra-processed food has low fiber and glycemic load, leading to insulin resistance and chronic inflammation, with the problems mentioned above?

Bad meat. Maybe people are eating more red and/or processed meats, which expose the colon to nitrites and secondary bile acids, which inflame the epithelium and promote chronic inflammation?

The microbiome. Maybe it’s the microbiome. For example, maybe people’s guts are getting colonized by strains of E. coli that produce genotoxic colibactin. Or maybe overuse of antibiotics in early life depletes protective bacteria in the gut, allowing harmful strains to expand, e.g. strains of B. fragilis that cause inflammation, or strains of F. nucleatum that can survive in the gut and drive tumor growth?

Environmental exposures. Maybe people are getting exposed to bad stuff in the environment (microplastics, forever chemicals, pesticides, endocrine disruptors, air pollution) that does bad stuff (damages gut barrier, screws up the microbiome, disrupts hormonal signaling)?

Maternal health. Maybe poor maternal health (obesity, diabetes) exposes the fetus to elevated glucose / insulin / inflammation, and these in turn program the child for a lifetime of metabolic issues and inflammation?

Whatever. Maybe alcohol / smoking / painkillers / calcium / vitamin D / inflammatory bowel disease / hereditary syndromes / screening bias?

None of the experts seem to agree on which of these is the culprit, so I figured that I (person with blog) should help.

If you poke at these stories, most of them are individually pretty weak. It can’t all be detection bias since CRC deaths are also going up in younger people. And several proposed causes (air pollution, tobacco) have actually fallen in rich countries. Other explanation, like E. coli producing colibactin, seem biologically real, but there’s no evidence that they’re increasing over time. Still other suggested causes (microplastics, forever chemicals) are mostly mechanistic speculation at this point. Obesity, inactivity, and chronic inflammation also all seem biologically real, and they are likely increasing, but why should they specifically cause colorectal cancer in young people?

A plausible answer to that last question is that they aren’t. They’re doing it, but not specifically.

“Young people”

This will sound pedantic, but bear with me: If you say that CRC is increasing in younger people, what exactly does that mean? After all, the set of people who qualify as young changes over time. (Ever notice that you keep getting older?)

Siegel et al. (2026) plot how often CRC was found in different age groups in 1995 and in 2022.

They also provide this plot of how common different types of CRC are in different age groups.

At a glance, this doesn’t look so bad. If you’re young, you might think, “OK, my current risk is higher than previous generations faced at the same age, but I can look forward to decreasing rates when I’m old.” You could easily think this is good news: While there’s a relative increase when you’re young, it’s tiny compared to the absolute decrease while you’re old.

Unfortunately that’s the wrong way to think about it.

Downham et al. (2026) plot CRC rates in different age groups across the Anglosphere over time.

Everyone I’ve shown this plot to has said it’s confusing, so let me explain: The different lines track age-bands as people born in different years move in and out of those bands. For example, in the US plot in the bottom right, the “20-25” line starts with the left-most dot showing the CRC rate for people born between 1965 and 1970 when they were 20 to 24 years old (around 1990). The next dot shows the rate for people born between 1970 and 1975 when they were 20 to 24 years old (around 1995), and so on.

That figure is weird, because the lines connect different groups of people. I wanted a plot where there are lines for different birth cohorts as they age. For unknown reasons, no one seems to make such plots, and the data isn’t trivial to access. So I used a plot digitizer to click on every damned point that US figure above and then replotted it:

Now the individual lines show specific groups of people tracked through time. For example, the “1932.5” line shows CRC rates for people born between 1930 and 1935, when those people were at different ages. If you look closely, you’ll notice that these rates are higher those for people born between 1940 and 1945 for all ages (where we have data).

That was the pattern for a long time: Between 1920 and 1950, later generations enjoyed lower CRC rates across all phases of their lives. But between 1950 and 1960, that pattern reversed and since then later generations have had higher CRC rates at all ages.

We don’t know for sure what will happen in the future. But I think it’s likely this trend will continue. Yes, if you are currently young, you face higher CRC risk than previous generations did when they were young. That’s the bad news. The other bad news is that when you are old, you may also face higher CRC risk than previous generations did when they were old.

“Colorectal cancer”

The other other bad news is that CRC isn’t the only type of cancer that’s rising in later generations. Sung et al. (2019) give this plot:

These are again the confusing graphs where individual lines show age bands as different people move in and out of them. But you get the point: Lots of cancers are going up in younger people later generations, including uterine, gallbladder, kidney, liver, pancreas, and thyroid. (Their additional material contains plots for 18 other cancers, most of which are either stable or decreasing.)

Note that these plots have a logarithmic y-axis, meaning the changes are larger than they might appear. Moving up a quarter of the way between two vertical ticks corresponds to an increase of a factor of ≈ 1.78.

If lots of cancers are becoming more common in later generations, then why is everyone talking about CRC? I think that’s because CRC in unique in that it is:

  1. common
  2. dangerous
  3. increasing in later generations
  4. treatable if caught early
  5. detectable via screening

For example, thyroid cancer diagnoses have skyrocketed in recent decades. But that’s partly because of more detection, and thyroid cancer is highly treatable, without clear benefits from early detection. Pancreatic cancer also seems to be increasing, but we don’t have good ways to screen for it and even if we did, we don’t have good ways to treat it.

CRC is really unique in that you can save lives by telling people, “Hey! CRC is going up! You should get screened!” If you’re interested in public health, that’s the most important thing. But if you’re interested in unraveling the mystery of CRC going up, it’s important to note that CRC isn’t really unique at all.

TLDR

No:

Colorectal cancer is going up in young people.

Yes:

Various kinds of cancer are going up in later generations. (Definitely at younger ages, possibly at all ages.)

Reminder

This blog endorses colorectal cancer screening. We don’t yet know if colonoscopies are better than other methods of screening (sigmoidoscopy, stool tests), but we do know that screening is better than not screening. When caught early, CRC is highly treatable, often with only surgery (no chemotherapy or radiation) and a return to normal activities within a couple weeks.

What’s with all the slide decks?

2026-05-13 08:00:00

News from the world of real jobs: Apparently, sometime between 10 and 20 years ago, it became standard for people to communicate by sending slide decks around. These slides are never presented. They aren’t intended to be presented. They’re born, they’re sent around, and they die. What?

I stress, the question is not why (or if) people give bad presentations.

The mystery is why everyone is using presentation software for communication that is not a presentation.

Theory 1: Everybody dumb

Is it because we’re all dummies? I’m putting this theory first because I suspect that you, beloved readers, will favor it.

True, if you ask people why they make slides instead of writing, they’ll usually say, “because nobody wants to read”. So there’s that. But I don’t consider this much of an explanation. Dummies though we may be, we’ve been like that a long time. If we entered the Slideocene 15 years ago, why then? Why not before?

Theory 2: The decline of reading

Did we get worse at reading? The Discourse seems to have decided this is true, but is it true, or just moral panic?

Since 1971, the US has tested 13-year-olds to measure long-term trends in reading ability. This shows a slow improvement until 2012, then a slow decline, and finally a post-COVID drop. The declines seem too small and too late to explain our mystery.

Since 2000, PISA has tested reading performance in 15-year-olds around the world. This shows a decline on average, but it’s smaller in rich countries and nonexistent in the United States. (It’s the same story for science and a bit more negative for math.)

Among adults, data is scarce. Basic literacy is generally improving, and American time use data shows a decline in reading for pleasure from around 23 minutes per day in 2003 to around 16 minutes per day in 2023. But this seems to miss time people spend reading on their phones.

So it’s unclear if people got worse at reading. It feels plausible that people now spend less of their adulthood grappling with complex written arguments, and so got worse at that. But there’s little firm evidence.

Theory 3: Technological change

Another obvious theory is that we now have computers and software and the internet. Without these things, it would be impossible to email slides to each other. This seems relevant!

Yes, but we had those things for a while before slide culture really took hold. And think about the situation before computers. Photocopiers were ubiquitous in corporate offices by the mid-1980s, and mimeographs were around decades before that. If slides were really that great, people could have made them by hand. But no one did.

Of course, making slides by hand is inferior. But it’s not that inferior. So slides can’t be that big of a win.

What actually happened?

And… that’s pretty much the end of the obvious theories. None of them are very satisfying. So let’s take a step back. Historically, how did the slide-as-document displace the memo?

As best I can tell, this was driven by management consultancies. If you go back to 1960, they delivered detailed written memos. The memo was the product. They’d likely give a presentation as well, but that was a separate ancillary thing, likely done using flipcharts or chalkboards.

In the 1970s, the memo was still the product, but consultancies started to enforce a top-down logical structure (the Pyramid principle). Presentations shifted to acetate transparencies. Both memos and presentations often included hand-drawn graphics like the nine-box or growth-share matrices.

In the 1980s, the memo was still the product, but presentations became increasingly lengthy and polished. Expensive computers like the Genigraphics started to be used to generate charts.

The 1990s were when things started to shift. By then, PowerPoint was everywhere, and junior analysts were expected to create presentations themselves. Consultancies gradually started to notice that (1) clients didn’t always read the memos; (2) clients loved slides and passed them around long after the presentation was over; and (3) creating a memo and a polished presentation was a lot of work. They put more and more effort into the slides. McKinsey especially evolved towards treating slides as the primary product, and mostly stopped writing long memos. Other consultancies followed.

During the 2000s, slides became even more ornate. Consultancies evolved their formatting rules, and created fancy data-dense charts. They learned that a 200 slide deck made clients feel like they got a lot for their money. Gradually, they oriented their entire business around slides. Projects would start with managers creating a template presentation with “ghost slides” and assigning different parts to junior analysts. Soon, this spread outwards, both from people who interacted with consultants and from the ex-consultant diaspora. People everywhere started thinking and communicating in slides, and now everything is slides, yay!

Alternative history

That story makes slides-as-documents sound inevitable: People liked them, so they became popular. But there’s an alternative timeline in which we resisted the slide into slide maximalism. That timeline is Amazon.com, Inc.

In 2004, Jeff Bezos famously instituted a no-presentations policy at Amazon. His logic was that slides hide poor reasoning and are a tool to persuade rather than inform. Instead, everyone involved with strategic decisions at Amazon needs to learn to write a six-page memo. Meetings begin with everyone sitting and silently reading one of these memos.

Presentation software is not banned at Amazon. The ban is only for using it for internal meetings and decision-making. They use slides for external communication. There is no policy that prohibits someone from making slides and emailing them around.

And yet, people don’t make slides and email them around, because it’s not part of Amazon’s culture. In effect, Amazon is a counter-movement. Most of the world decided that slides are good, because slides are easy. Bezos decided that writing is good because writing is hard.

There are millions of articles explaining why Bezos’ policy is pure genius. They claim that constructing a narrative requires deeper analytical thinking and exposes flaws in logic. I want to believe those theories. I now realize they’re very similar to some of my arguments for why writing with too much formatting is bad.

I’m not sure if writing is the secret to Amazon’s success. But Amazon is successful. This demonstrates that slide life is a choice, not technological destiny—institutions can choose writing over slides and flourish anyway.

OK so then what’s happening?

Warning: If you like your theories simple and mono-causal, you aren’t going to like this.

  1. Slides are a win, but a small one. The shift to slides wasn’t a “mistake”, it happened because people like it. But if sharing slides outside of presentations became illegal, this wouldn’t cause per-capita GDP to crash. That’s why people didn’t scratch slides into mimeograph stencils back in the 1950s. It wasn’t worth the modest effort.
  2. When computers and software showed up, it became easier to share slides. But people didn’t immediately shift to slides-as-documents because the win isn’t that big, because culture changes slowly, and because everyone had pre-existing skills for reading and writing documents.
  3. Consultancies happened to be in the economic niche with the strongest selection pressure to evolve towards slides-as-documents. So when making slides became cheaper, they shifted. Slowly, that norm spread outwards, people got used to communicating in slides, and here we are.
  4. Institutions can resist that norm and still be successful. If you take modern people and force them to read and write, they do just fine.
  5. Humans evolved to learn and communicate in a fragmented, interactive, and visual style. It’s hard to argue that any shift in that direction is a catastrophe.
  6. Except blogs. The decline of the blog must be arrested.

You’re probably taking the wrong painkiller

2026-04-30 08:00:00

This is an essay that recently appeared in Asterisk. Consider the rest of the risk issue for all your risk needs.


Lots of people die after overdosing on acetaminophen (paracetamol, Tylenol, Panadol). In the U.S., it’s estimated to cause 56,000 emergency department visits, 2,600 hospitalizations, and 500 deaths per year. Acetaminophen has a scarily narrow therapeutic window. The instructions on the package say it’s okay to take up to four grams per day. If you take eight grams, your liver could fail and you could die. 

Meanwhile, it seems to be really hard to kill yourself by overdosing on ibuprofen (Advil, Nurofen, Motrin, Brufen). In 2006, Wood et al. searched the medical literature and found 10 documented cases in history. Nine of those cases involved complicating factors, and in the 10th, a woman took the equivalent of more than 500 standard (200mg) pills. 

So, for many years, if I needed a painkiller, I’d try to take ibuprofen rather than acetaminophen. My logic was that if eight grams of acetaminophen could kill my liver, then one gram was probably still hard on it. I’m fond of my liver and didn’t want to cause it any unnecessary inconvenience. 

But guess what? My logic was wrong and what I was doing was stupid. I’m now convinced that for most people in most circumstances, acetaminophen is safer than ibuprofen, provided you use it as directed. I think most doctors agree with this. In fact, I think many doctors think it’s obvious. (Source: I asked some doctors; they said it was obvious.) 

Should this have been obvious to me? I figured it out by obsessively researching how those drugs work and making up a story about metabolic pathways and blood flow, and amino acid reserves. It’s a good story, one that revealed that my logic stemmed from an egregious lack of respect for biology and that I’m a big dummy (always a favorite subject). But if the clearest road to some piece of knowledge runs through metabolic pathways, then I don’t think that knowledge counts as obvious. 

So how is a normal person meant to figure it out? Why doesn’t the fact that acetaminophen is typically safer than ibuprofen appear on drug labels or government websites or WebMD? Are normal people supposed to figure it out, or has society decided that this is the kind of thing best left illegible? 

Note: You should not switch medications based on the uninformed ramblings of non-trustworthy pseudonymous internet people.

How does ibuprofen work?

Ibuprofen inhibits the the Cyclooxygenase (COX) enzyme. This in turn inhibits the formation of messenger molecules involved in inflammation, which leads to less physical inflammation and thus less pain. 

The same story is true for almost all over-the-counter painkillers, which is why they’re almost all considered “non-steroidal anti-inflammatory drugs,” or NSAIDs. This includes ibuprofen, aspirin, naproxen (Aleve), and a long list of related drugs. But it does not include acetaminophen.

How does acetaminophen work?

Nobody knows!

Like ibuprofen, acetaminophen inhibits some COX enzymes. But it does so in a weird way that barely affects inflammation or messenger molecules, so it’s unclear if this matters for pain reduction. 

In the brain,  acetaminophen is metabolized into a mysterious chemical called AM404. This activates the cannabinoid receptors and increases endocannabinoid signaling, which seems to reduce the subjective experience of pain. AM404 also activates the capsaicin receptor, which is associated with burning sensations that you’d normally expect to increase pain, but maybe some desensitization thing happens downstream? And maybe acetaminophen also interacts with serotonin or nitric oxide or does other stuff? How this all comes together to reduce pain is still somewhat a scientific mystery. 

Aside: When trying to understand painkillers, it’s natural to focus on chemistry and molecular biology. But the unknown physical origins of consciousness are always nearby, looming ominously.

What risks does ibuprofen have?

In an ideal world, the only thing ibuprofen would do is reduce inflammation in the part of your body that hurts. But that is not our world. When ibuprofen inhibits the COX enzymes, it does so throughout the body. And mostly, that is bad. 

For one, ibuprofen reduces production of mucus in the stomach. That might sound okay or even good. But stomach mucus is important. You need it to shield the lining of your stomach from your extremely acidic gastric juice 1. Having less mucus can lead to gastrointestinal problems or even ulcers. 

Ibuprofen also affects the heart. When ibuprofen inhibits the COX enzymes there, this in turn inhibits one chemical that prevents clotting and another that causes clotting. In balance, this seems to lead to more clotting, and an increased statistical risk of heart attacks 2. If you’re healthy, the risk of a heart attack from an occasional low dose of ibuprofen is probably zero. But if you have heart issues and take medium to large doses regularly for as little as a few days, this might  be a serious concern. 

Ibuprofen also affects the kidneys. If you’re stressed, or cold, or dehydrated, or take stimulants, your body will constrict your blood vessels. That squeezes your kidneys’ intake tube, depriving them of blood. Your kidneys don’t like that, so they release signaling molecules to locally re-dilate the blood vessels. 

Trouble is, when ibuprofen inhibits COX enzymes in the kidneys, it inhibits those signaling molecules. If everything is normal, that’s okay, because the kidneys wouldn’t try to use those molecules anyway. But if your body has clamped down on the blood vessels, then the kidneys don’t have the tool they use to keep blood flowing, meaning they don’t get as much blood as they want. This is bad 3.

There are many other less common side effects, including allergies, respiratory reactions in asthmatics, induced meningitis, and suppressed ovulation. If you take a lot of ibuprofen, this could hurt your liver. But the major concerns seem to be the stomach, the heart, and the kidneys.

What risks does acetaminophen have?

Acetaminophen also inhibits some COX enzymes. But unlike ibuprofen, the effect is minimal outside the central nervous system. Thus, acetaminophen has little effect on stomach mucus, blood clots, or blood flow, and so presents almost none of the risks that ibuprofen does. 

Even so, if you take too much acetaminophen at once, you could easily die. 

How does this happen? Well, when acetaminophen is metabolized by the liver, it’s mostly broken down into harmless stuff. But a small fraction (5-15%) is broken down by the P450 system into an extremely toxic chemical called NAPQI

Ordinarily this is fine; your body creates and neutralizes toxic stuff all the time. For example, if you drank 20 grams of formaldehyde, you’d likely die. But did you know that your body itself makes and processes ~50 grams of formaldehyde every day? When liver cells sense NAPQI, they immediately release glutathione, which binds to NAPQI and renders it harmless. 

But there’s a problem. If you take too much acetaminophen at once, the pathways that break it down into harmless stuff get saturated, but the P450 system doesn’t get saturated. This means that not only is there more acetaminophen, but also that a much larger fraction of it is broken down into NAPQI. Soon your liver cells will run out of glutathione to neutralize it. Then, NAPQI will build up and bind to various proteins in the liver cells (especially in mitochondria) causing them to malfunction and/or commit suicide. This can cause total liver failure. 

So you should never take more than the recommended dose of  acetaminophen 4. If you do take too much, you should go to a hospital immediately. They will give you NAC, which will replenish your glutathione and neutralize the NAPQI. Your prospects are good as long as you get to the hospital within a few hours 56.

Acetaminophen has lots of other possible side effects, like skin issues and blood disorders. But these all seem to be quite rare.

What if you have liver issues?

The primary concern with acetaminophen  is liver damage. So if you have liver disease, then surely you’d want to avoid acetaminophen and take ibuprofen instead, right? 

Nope. It’s the opposite. Liver disease shifts the balance of risk in favor of acetaminophen. 

With liver disease, it’s hard for blood to flow into the liver, meaning that blood tends to pool in the abdomen. To counter this, blood vessels elsewhere in the body contract. This includes blood vessels around the kidneys. 

Remember the kidneys? Again, when blood vessels are constricted, the kidneys send out signaling molecules to locally re-dilate the blood vessels. But those signaling molecules are blocked by ibuprofen. So if you have liver disease, taking ibuprofen risks starving your kidneys of blood just like if you were dehydrated.

Meanwhile, people with moderate liver disease are usually still able to process acetaminophen without issue, as long as it’s in smaller amounts. So doctors usually tell patients with liver disease to avoid ibuprofen and take  acetaminophen instead, just with a maximum of two grams per day instead of four. 

(Obviously, if you have liver disease, then you should talk to a doctor, I beg you, for the love of god.)

What about other situations?

The main takeaway from all this is that the risks of both drugs emerge from the madhouse of complexity that is your body. Surely there are some situations where acetaminophen is more dangerous than ibuprofen?

I tried to capture the most common situations in this table:

Situation Acetaminophen safe? Ibuprofen safe?
Fasting No. Fasting leads to low glutathione and the risk of liver damage. No. Risks pain or bleeding in the stomach, could damage kidneys.
Dehydrated Yes. No. Could damage kidneys.
Liver Disease Maybe (low dose). Often preferred by doctors at <2g/day. No. Increases bleeding risk, could damage kidneys.
Stomach Ulcers / Heartburn Yes. No. Strips protective mucus.
Chronic Heavy Drinking Maybe (low dose). Seems safer if limited to <2g/day. No. Risk of stomach bleed.
Kidney Disease Yes. No. Puts stress on the kidneys.
Heart Conditions Yes. No. Interferes with blood clotting, raises blood pressure.
Active bleeding Yes. No. Inhibits clotting.
After drinking (a little) Maybe (low dose with food). Alcohol depletes glutathione, raising risk of liver damage. Maybe (low dose with food and water). Alcohol and ibuprofen both irritate the stomach. Alcohol also leads to dehydration.
After drinking (a lot) No. No.
Hangover No. The liver is already depleted. Maybe (with food and water). But never when dehydrated.

It’s actually fairly hard to find situations where ibuprofen is safer than acetaminophen. Possibly this is true if you’re hungover, but I would be very careful, because you tend to be dehydrated when hungover, raising the risk of kidney damage. (It’s probably optimal, from a health perspective, to avoid taking recreational drugs at doses that leave you physically ill the next day.) 

Aside from hangovers, the only situations I could find where ibuprofen might be safer than acetaminophen  are if you’re taking certain anti-seizure or tuberculosis drugs or maybe if you have a certain enzyme deficiency (G6PDD). 

So…

What have we learned so far? 

  1. The body is really complicated! 

  2. The main risk of acetaminophen is liver damage by creating too much NAPQI. Taking too much at once can easily kill you. However, as long as you don’t take too much at once and your liver isn’t depleted, then your liver will maintain NAPQI levels at zero and it will be completely fine. And there are very few other risks. 

  3. Meanwhile, ibuprofen poses a risk of gastrointestinal issues, heart attacks, or kidney damage. The risk varies based on lots of factors like whether you’ve eaten food, whether you’re dehydrated, your blood pressure, and your heart health 7.

  4. Therefore, acetaminophen is probably safer, provided you never take too much 8.

I don’t want to be alarmist. If you’re healthy, the risk from taking an occasional dose of ibuprofen as directed is extremely low. Given that so many people find that ibuprofen is more effective for many kinds of pain, it’s totally reasonable to use it. I do so myself. 

Still, it seems to be the case that in the vast majority of situations, acetaminophen is saf_er_. Personally, if I have pain, I first take acetaminophen, and then add ibuprofen if necessary. I’m pretty sure many experts think this is somewhere between “sensible” and “obvious.” 

But if acetaminophen is safer, then why don’t official sources tell you that 9? I can get doctors to admit this off-the-record. I can find random comment threads with support from people who seem to know what they’re talking about. But why does this fact never appear on government websites or drug labels? 

Let’s look at those drug labels

In the U.S., the Food and Drug Administration (FDA) creates 10 a “drug facts” label for over-the-counter drugs.  

Here’s what that looks like for ibuprofen:

ibuprofen label

And here’s what it looks like for acetaminophen (paracetamol):

acetaminophen label

I feel dumb saying this, but when I saw those labels in the past, I thought of them as a bunch of random information thrown together for legal reasons. But after spending a lot of time trying to understand these drugs myself, I now realize that these labels are… really good? 

Imagine you work at the FDA and it’s your job to write a safety label. You need to synthesize a vast and murky scientific landscape. Your label will be read by people with minimal scientific background who are likely currently in pain, and who could die if they take the drug in the wrong situation. 

If I were in that situation, I’d think about all the different situations in which taking one of these drugs could literally kill someone, and then — after a quick panic attack — I’d write a label that screamed, HEY, IF YOU ARE IN ANY OF THESE SITUATIONS, TAKING THIS DRUG COULD LITERALLY KILL YOU. Then I’d think about all the other situations where taking the drug might be okay depending on a set of complex science stuff and tell people in those situations to PLEASE TALK TO A DOCTOR FOR THE LOVE OF GOD because I DON’T KNOW IF YOU’VE HEARD BUT SCIENCE IS COMPLICATED. Everything else would be a minor concern.

From that perspective, these labels are a triumph. This isn’t random information — every word is a synthesis of a mountain of research, carefully optimized to save lives.

FDA good

How did those drug labels come to be? 

If you want a taste for the FDA’s process, I encourage you to skim the 2002 Federal Register document in which the FDA proposed to update ibuprofen’s safety label and to formally classify it as Generally Recognized as Safe. It’s more than 21,000 words long and — I think — astonishingly good. It not only summarizes the entire medical literature on ibuprofen, it summarizes it well. Here is onerepresentative bit:

Bradley et al. (Ref. 42) conducted a 4-week, double-blind, randomized trial in 184 subjects comparing the effectiveness and safety of the maximum approved OTC daily dose of 1,200 mg of ibuprofen (number of subjects (n) = 62) to that of a prescription dose of 2,400 mg/day (n = 61), and to 4,000 mg/day of acetaminophen (n = 59) for the treatment of osteoarthritis. While there were no significant differences in the number of side effects reported during this study, the study demonstrated a trend towards a dose dependent increase in minor GI adverse events (nausea and dyspepsia) associated with higher doses of ibuprofen (1,200 mg/day: 7/62 or 11.3 percent; versus 2,400 mg/day: 14/61 or 23 percent). In addition, two subjects treated with 2,400 mg/day of ibuprofen became positive for occult blood while participating in the study.

I spend a lot of time complaining about bad statistical writing. A lot. Probably too much. But I’m here to tell you, that paragraph is gorgeous. The writing is clear and penetrating. It contains all the important details, but no other details. Compared to the abstract of the original paper, the above is shorter and easier to understand yet simultaneously more informative. Five stars. 

The rest of the document is equally good, with clear and sensible explanations for various recommendations. For example, they discuss a proposal from the National Kidney Foundation for additional warning about risks to kidneys, explain why they think that proposal has merit, and then recommend a shorter version, which appears on every package of ibuprofen sold today. 

As far as I can tell, this level of quality is typical. For example, the FDA’s 2019 proposed rule on sunscreens is similarly masterful.

So why?

This leaves us with this constellation of facts: 

  1. Acetaminophen is, in general, safer than ibuprofen. 

  2. The FDA doesn’t tell you that. Neither do other respectable authorities. 

  3. The FDA is highly competent.

So what’s happening here? Have the experts conspired to keep this knowledge secret? 

I don’t think so. Mostly, I think this is down to two factors. First, the FDA doesn’t really have a mission of determining “in what circumstances is drug A safer than drug B?” Their goal is to take individual drugs and determine how people can use them safely. They seem to be quite good at this. 

Second, everyone is mortally afraid of giving “medical advice.” It varies by jurisdiction, but in general, giving “wellness advice” is OK, but if you give personalized advice, you risk going to prison. The more credible you are, the higher that risk is 11.

Stepping back, how should we think about this situation? 

The body is complicated. When experts give the public advice on drugs, they are trying to insulate us from that complexity. But there is no way to do that without making trade-offs. Society has implicitly chosen tradeoffs that mean certain “less important” facts are de-prioritized. It’s not obvious that this is the wrong choice. I feel foolish for not having more respect for the body’s complexity and for the difficulty of the task all the experts are trying to accomplish. This is not medical advice.

  1. For some reason, humans have gastric acid that is more acidic than most other animals, and is only matched by animals that specialize in eating carrion. 

  2. At least two NSAIDs (rofecoxib and valdecoxib) have been withdrawn from the market due to an increased risk of heart attacks. For the same reason, the US refuses to approve etoricoxib

  3. Nephrologists hate ibuprofen. (Source: nephrologists.) If it was up to them, maybe ibuprofen would come with a “HAVE YOU CONSIDERED TAKING ACETAMINOPHEN INSTEAD?” warning. It confuses me that the safety label for ibuprofen doesn’t warn you about the danger of taking it while dehydrated and quietly damaging your kidneys. My best guess is that this is because other doctors don’t hate ibuprofen as much as nephrologists. 

  4. Watch out for combination medicines (like cold or flu medicines or opiate painkillers) that include acetaminophen. Arguably, acetaminophen is a victim of its own success here. It’s included in these things because it is better tolerated than NSAIDs. But it’s easy to miss. 

  5. Oddly, NAC is considered a nutritional supplement, meaning basically anyone can buy it. But there’s also almost no regulation, so who knows if the thing you bought actually has NAC in it? Do not screw around trying to self-medicate an acetaminophen overdose. Go to a hospital. 

  6. At one point while researching all this I had what I thought was a good idea: Why not sell acetaminophen in pills bundled together with NAC? The NAC would replenish glutathione stores in the liver, seemingly reducing the risk of overdose. Later on, I developed more humility and felt very stupid for fantasizing that such an obvious idea could be novel or useful. I think that this is indeed a bad idea because NAC itself has side effects, though I can’t find much formal discussion. In fact, I found a 2010 editorial called “Why Not Formulate an Acetaminophen Tablet Containing N-Acetylcysteine to Prevent Poisoning?”  In another study, Nakhaee et al. (2021) actually tried giving NAC together with acetaminophen to rats and found that this seemed to make it better at reducing pain. So maybe this isn’t a completely stupid idea. That last paper also led me to discover that “rat hot plate test” is a standard phrase, and one that drives home what humanity’s dominion over nature means in practice. 

  7. Above, we mentioned that acetaminophen overdose is estimated to cause around 500 deaths per year in the U.S. It’s much harder to give direct numbers for how many people die from taking ibuprofen, because NSAIDs don’t really directly “kill” people, but rather increase the risk of dying in various ways. The best estimates seem to be that NSAIDs cause 5,000-16,500 deaths each year in the US via gastrointestinal complications, and something similar via heart attacks. These numbers are not a good way of quantifying the relative risk of drugs, because they represent different people taking different amounts for different reasons. But they do show that ibuprofen is not without risk. 

  8. There are probably some people who are too disordered to track much acetaminophen they’ve taken. For such people, ibuprofen might be the safer choice. Though I’m skeptical that many such people are found among the readers of Asterisk

  9. There are two cases where official sources are clear that acetaminophen is safer than ibuprofen: for use by pregnant women and small children. This doesn’t appear on the safety label, but if you’re pregnant and go to a doctor, they will probably tell you to take acetaminophen but not ibuprofen or other NSAIDs. And if you have a newborn baby, their doctor will probably tell you that you can give them acetaminophen but not ibuprofen or other NSAIDs. 

  10. Technically, for many drugs today, it is the drug manufacturer that “creates” the label, which is why they can be slightly different. However, the FDA strongly regulates what is on it, including most of the language and even details about the font and so on. The federal register contains a template the FDA published for ibuprofen which is almost identical to what appears on the side of drugs today  

  11. Unlike in most places, in the United Kingdom it seems to be perfectly legal for people to give each other medical advice, provided they don’t misrepresent themselves as licensed doctors. This is not legal advice. 

I quit drinking for a year

2026-04-09 08:00:00

In early January 2025, a family friend was over for lunch. One of my many guilty midwit pleasures is a love of New Year’s resolutions, so I asked her if she had made any. She said no, but mentioned that she had some relatives that were doing “damp January”.

In case you’re not aware, Dry January is a challenge many people do to quit drinking alcohol during the month of January. These folks were doing a variant in which, instead of not drinking, one simply drinks less.

For some reason, this triggered me. I thought, “Are you kidding? You can’t even stop drinking for a single month? Do you know how pathetic that is?” And then, “Fuck you! Fuck you for doing damp January! You know what, I’m going to stop drinking for a year!”

To be clear, these thoughts were directed at people I’ve never even met. In retrospect, I wonder what was going on with me emotionally. But I take resolutions seriously, so I felt committed.

We are now 15 months down the timeline, so I’ll make my report.

It was easy

This will sound odd, but I swear it’s true. Not drinking was so easy that it was almost easier than my previous baseline of not-not-drinking.

Before starting this resolution, I didn’t drink much—perhaps two or three drinks per week. But I often thought about drinking. Every time I saw friends or went to a restaurant, I thought, “Should I have a drink?” Usually I decided not to. But making that decision required effort.

After a few weeks of not drinking, that question never even came up. Drinking was simply not a thing I did, so I never needed to negotiate with myself.

Theoretically, you could allow yourself one drink a month instead of zero. Theoretically, that should be easier. But I’m pretty sure I’d find it harder, because alcohol would still be an option, a thing to consider.

Sometimes I need a thing

Early on, I sometimes wanted a drink. But gradually I noticed that I didn’t really want a drink, I just wanted a thing. I can’t find a precise name for this concept in psychology, but often, some deep part of my brain seems to scream, “I WANT A THING.” It could be alcohol, but I found dessert worked just as well. I suspect that a new shirt or meeting a new dog would also work.

I was not able to stop my brain from doing this. When it demanded a thing, I gave it a thing. I just substituted a non-alcohol thing. So, over the year, I became interested in desserts and even-more interested in tea.

The struggle was The Chocolates. Shortly after I made this resolution, my mother gave me a bag of chocolates that each contained a bit of whiskey. In general, I don’t keep chocolate at home. If anyone gives me chocolate, I immediately eat all of it and then text the giver, “Thanks for the chocolate, I ate it instead of dinner, it’s all gone, this is what will always happen if you give me chocolate.”

But I couldn’t eat the Chocolates, because they contained alcohol. I managed to get guests to eat a few. A couple of times I came close to draining out the alcohol and eating the chocolate container. I even considered throwing them away, but that felt wrong. So instead I spent a year glaring at them and waiting for them to apologize for the anguish they were causing me. This represented half the difficulty of this resolution. I do not recommend it. Keep your things separate.

Alcohol is bad for sleep

Have you heard that alcohol is bad for sleep? Because alcohol is bad for sleep. I’ve always known that was true, abstractly. But sleep is variable. If I didn’t sleep well on an individual night, I was never sure: Was that because of the alcohol, or was it random variation?

After a year without alcohol, I am very confident that yes indeed, alcohol is bad for sleep, because my sleep during 2025 was much better than in previous years. Sure, like anyone else, I still sometimes wake up and start thinking about oblivion rushing towards me, and how everything I love will vanish into time, and how all that was once future and hope inevitably becomes static and dust, and how the plague of bluetooth speakers continues to spread across the globe. But now: less!

I wish there was a drug I could take that would give me energy and improve my mood and make me physically healthier and smarter, all without side-effects. I don’t think such a drug exists. But we do have the opposite!

So, sadly, I’ve come to believe that alcohol is basically the perfect anti-nootropic. That’s not because it makes you dumb while you’re drunk. (True, but who cares?) Rather, that’s because it is bad for sleep, and therefore makes you worse across all dimensions the next day.

Alcohol is good for socializing sometimes

I did find not drinking to have one clear downside: It’s just not that much fun to hang out with people who are drinking if you are not drinking yourself.

To be clear, this is a limited effect. It’s only an issue at bars or certain parties where people are there to drink. I don’t go to many such gatherings, but when I did, I felt it was less fun.

It’s not that I missed alcohol. Instead, my theory is that drinking parties are a sort of joint role-playing exercise: “Let’s all get together and collectively reduce our inhibitions and see what happens.” It’s fun not (just) because everyone is taking a recreational drug, but because it’s a joint social experience. If you don’t drink, then you aren’t fully participating.

It seems like it should be possible to reproduce this effect without alcohol. You could imagine other ways to push the social equilibrium out of balance. Like… Masks? Or weird environments? Or mutual disclosure games? Should people get together and do a group cold plunge?

Unfortunately, all these are complicated and/or carry some kind of social stigma. So until we figure something better out, this is a real cost of not drinking. It was minor for me, but it probably depends a lot on where you are in life.

Other effects

All other effects were minor. I guess I saved money at restaurants. I actually lost a bit of weight over the year, despite all the extra desserts, though I can’t say for sure if alcohol was the cause. Otherwise, once I stopped thinking of alcohol as an option, I rarely thought about the resolution at all, except when I saw those damn chocolates.

Aftermath

Towards the end of the year, I started wondering if I should quit drinking forever. But I never came to a conclusion, because I rarely thought about alcohol. I considered having a drink at midnight on New Year’s eve, but I happened to be on a plane that crossed the international date line and thus skipped New Year’s eve.

And then… for the first few months of 2026, I still didn’t drink. That wasn’t because of any decision. It just never seemed appealing because (a) sleep and (b) I’d broken the mental link between want thing and drink alcohol. Eventually, I ate the chocolates, and I had a glass of wine when visiting some friends. If I can continue rarely drinking while almost never thinking about drinking, I’ll probably do that. If I slowly slide back into always thinking of alcohol as a live option and always negotiating with myself, I might just resolve to quit forever.

So that’s my story. Obviously, it’s heavily colored by my own idiosyncrasies, so it’s hard to say if it offers any general lesson.

I do think people underrate the long-term health impact of drinking. The effect on heart disease is debated, but everyone agrees that any alcohol increases the risk of cancer. Still, the long-term effects from occasional light drinking probably aren’t huge. What’s really underrated is the short-term effects, via worse sleep.

If I had to give advice, it would be this: If you drink, and you think you might be better off not drinking, why not try it? Maybe you’ll find that champagne is essential to your happiness and drink it every night, to hell with the costs. Maybe you’ll find a different baseline, or maybe you’ll quit forever. Whatever you decide, you’ll have full information.