22 Comments
Adam D. Borecky, MD

I agree that the 50% dropout rate is misleading relative to our typical clinical situation. I’ve also found the titration schedule on the label to be too fast for tolerability, and I suspect that real-world practice includes a lot more cross-tapering and off-label augmentation than was captured by the 52-week OLE. I have been getting some akathisia, but this is confounded by polypharmacy.

Annie

It will be interesting to see, as things move forward, how much the GI side effects become a barrier to its use. While the availability of Cobenfy may be useful for patients with pre-existing obesity, diabetes, high triglycerides, etc., it would be problematic if vomiting causes patients to bring up all their other meds, or if the constipation/diarrhea results in medication toxicity, vitamin deficiencies, etc.

Side effects like nausea, constipation, and diarrhea can be a real barrier to taking a medication. Such symptoms make daily life difficult and can prevent a patient from even being able to leave the house. So Cobenfy may help a proportion of patients towards some form of stability, but a subset of those same patients may be in the miserable position of having to weigh that against never being 5 seconds away from a loo, or being unable to travel in a car due to nausea.

Aussie Med Student

I notice that you call a 1 in 1,000 chance of clinically meaningful adverse cardiovascular effects "highly unusual". That's the traditional pre-vaccine mortality rate for measles, and measles is considered a highly lethal disease and a huge threat to health, even though the adverse event in question carries the same level of likelihood.

When will public health authorities tell parents death is a "highly unusual" outcome if their child has measles??? (The framing of risk is something that interests me.)

The usual strategy is "OMG, your child could die..." If that's a legitimate approach, then "OMG, you could have a serious cardiovascular adverse effect!" is just as legitimate for the risk of clinically meaningful cardiovascular adverse events here.

Nils Wendel, MD

I think there is a sense in which communicating risk exists on two axes: (1) likelihood and (2) seriousness of outcome. I think a 1 in 1000 risk of death can reasonably be messaged differently than a 1 in 1000 risk of reversible cardiovascular adverse events. How to do that is an interesting question and something we are not terribly consistent about, but I'm not sure that your example makes the point you want it to.

Benjamin Classen

Great recap! I’m wondering about the 2 deaths that were supposedly treatment-emergent; any thoughts on that?

Nils Wendel, MD

I think you misread the paper? It says: "both deaths were assessed by investigators as unrelated to KarXT treatment."

Benjamin Classen

Ah okay, I indeed misread the paper.

Michael Halassa

Nils, I think your perspective is reasonable but I'm not sure you're engaging with the heterogeneity. We've gone over this before so I won't rehash it but I think it is a serious issue. The average patient does not exist and these are all average data...

Nils Wendel, MD

Could you be more specific about what heterogeneity you’re referring to? Is it something about this paper in particular, or about Cobenfy in general?

Michael Halassa

Sure. It's neither about this paper in particular nor about XT specifically; it's about the framing more generally. This is how I interpret RCT data and how I would think about drawing conclusions regarding a drug like XT (or any drug for any patient, for that matter): https://michaelhalassa.substack.com/p/what-does-evidence-based-mean-for

Nils Wendel, MD

Ok, I read the article and I think I mostly agree with the point it's making. I'm not sure I understand why you brought this up here though. Is this just a general critique of my writing? Are you saying that I don't engage with study heterogeneity enough?

Michael Halassa

Not at all. I’m just saying that RCTs don’t engage with heterogeneity. I think your writing is thoughtful. I’m just saying that reporting on the RCT without the larger context we deal with (and I know you think about this hard) can give the impression that they dictate treatment despite inter-individual variability. Again, I get this is not your goal so perhaps I’m overstepping and if I did, I apologize.

Nils Wendel, MD

Oh no worries, didn't mean for my tone to come off as upset, I was just genuinely confused as to what you were getting at!

I take your point. I'm just trying to be a bit more succinct with some of the more factual stuff I'm doing (like this particular essay) so I can spend more time being long-winded in some of the more philosophical and psychological pieces I'm working on.

Michael Halassa

Understood. This XT thing is particularly on my mind, because I think you are right to point out that for positive symptoms, it seems to work more like an Abilify than a Zyprexa. But for the negative/cognitive, it’s much better than either (for some people). To demonstrate that through an RCT would be a huge undertaking, but regardless, clinical experience at least indicates it.

What’s unfathomable to me is that I can have someone on the inpatient unit demonstrably improve on XT (nursing notes would highlight brightening, improved clarity, and more interactions), only to come back and find that a covering psychiatrist switched them to a traditional antipsychotic without any explanation. The same is true of outpatient providers; I’ve had people decompensate and come back inpatient because of switching without rationale. Anyway, I recognize this may sound like a bit of a tangent, but I wanted to give more context on where some of my original comments were coming from.

Benjamin Classen

The average treatment effect from an RCT may very well be the best estimate EVEN IF the treatment effect is heterogeneous, depending on the amount of heterogeneity and the precision with which we can stratify.

Furthermore, I don’t see the empirical evidence that psychiatric drug treatment effects are heterogeneous to any appreciable degree.

And the main bottleneck is this: how many subgroups do you want to define? Say we devise 3 dimensions of stratification with two subgroups along each dimension (e.g., biomarker X low vs. high); that yields 2³ = 8 groups if you want to avoid lumping patient groups together, as the spirit of precision medicine dictates. Would you then run 8 RCTs? Who is going to run these trials?

If treatment effects of psychiatric drugs were truly heterogeneous, perhaps precision medicine's search for that heterogeneity would have borne more fruit over the last 70 years of using essentially identical drugs.

Michael Halassa

Benjamin, this response is very reasonable and I don’t disagree with the spirit of what you’re saying. The bottleneck is exactly right, and it is a formidable challenge indeed. The reason I’m enthusiastic about precision medicine in psychosis is exactly because of your last point: how do you expect any precision if all you have is variation on the same D2 blocker? But now we have muscarinic agents (XT is just the first), with other mechanisms in the pipeline. You’re right about the practical impediment to realizing this, though, and that’s why clinical intuition can be important and shouldn’t be automatically dismissed as noise.

Benjamin Classen

The value of clinical intuition is clear from the fact that all major psychiatric medications, xanomeline included, were discovered through clinical observation. That, to me, is clear. But that is about finding novel medications, and it's totally sensible. What I am skeptical of is searching for interactions between medication and patient covariates (which could not even be interpreted as causal, since covariates usually cannot be randomly assigned). This search would need to be guided by a strong understanding of the involved pathophysiology, i.e., it is not really feasible in psychiatry currently.

Michael Halassa

Thanks Benjamin. I agree with your point about the role of clinical observation in discovering new treatments. That has clearly been central in all of medicine, including psychiatry.

I’m not sure the distinction between drug discovery and moderator identification is as clear though. The same kind of observation that generates the hypothesis that a drug works can also generate the hypothesis that it works differently across patients. The causal interpretation is more challenging as you point out and your comment about 8 RCTs is great in that context. I think the practical solution will almost certainly be an optimization that stratifies up to a point compatible with randomized testing.

Now as far as the pathophysiology is concerned, I agree with you. Mechanistic understanding should guide how we think about response heterogeneity. In this case there are at least some biologically grounded priors: muscarinic receptor expression patterns, how ACh works, how people tend to reason by relying on semantic vs. working memory, etc., that make differential response plausible based on inter-individual variability in all these factors.

Where I would see things differently is in treating mechanism and stratification as strictly sequential. In many areas of medicine, predictive response patterns are identified empirically first and only later mapped onto underlying biology. The process tends to be iterative: early signals guide stratification, which in turn helps refine mechanistic understanding.

Again, I think your points are very reasonable; I just see some of the components differently.

Benjamin Classen

The average patient does not exist, but the best estimate of treatment efficacy for any given individual is still the group estimate from an RCT. To claim the contrary, please conduct another RCT in a predefined subgroup and show that its treatment effect is higher. Spurious (machine-learning-based) post-hoc subgroup analyses predicting response are a factory for false positives. Additionally, impressions of varying treatment effects based on clinical experience cannot disentangle varying treatment effects from varying covariate effects.

Michael Halassa

Thank you for your engagement.

The average treatment effect from an RCT is the best estimate for an individual only under the assumption of treatment effect homogeneity. When treatment effects are heterogeneous, which is common in psychiatry, the group mean is simply the average of many different individual responses rather than the optimal prediction for a given patient. That is precisely why the field of heterogeneous treatment effects and precision medicine exists.

In practice, observational data, biomarkers, and predictive modeling are often used to identify candidate response signatures before testing them prospectively in stratified trials. Those exploratory steps are usually what make the subgroup RCT possible in the first place.

Regarding machine learning analyses, the key issue is whether they are properly cross-validated and prospectively framed. Poorly controlled analyses can certainly generate false positives, but well-validated predictive models are widely used in medicine to generate hypotheses about response heterogeneity.

Real-world data analyses play an important role. Methods such as propensity matching and covariate adjustment are designed to separate covariate effects from treatment effects in observational cohorts. They are imperfect, as all non-randomized analyses are, but they are an important part of how precision medicine develops.

If treatment effects were truly homogeneous, precision medicine would not exist as a scientific field.