Causation Doesn’t Indicate Variation

December 10, 2025

53

Tyler Muir prompt this pretty catchphrase, which ought to stand subsequent to “Correlation doesn’t indicate causation” in our menagerie of econometric sayings. “Do modifications in x trigger modifications in y?” doesn’t reply the query “what are an important causes of variation in y?” Many recognized causal results clarify little or no variation, and we all know there are lots of different sources of variation. Folks usually leap from one to the opposite with out stopping to assume.

An additional 12 months of school, or rising up in a greater neighborhood may elevate wages. However solely a tiny fraction of why one particular person’s wage differs from one other outcomes from further years of school or which neighborhood an individual grew up in. Minimal wages may elevate, some discover, or decrease, others discover, employment. However solely a tiny fraction of the large variation in employment from one space to a different or one particular person to a different traces to variation in minimal wages. If you’d like employment, different levers are doubtless way more necessary. Demand shocks may transfer inventory costs. However solely a tiny fraction of inventory value variation comes from demand shocks.

The causality revolution

The causality revolution has come to dominate empirical work in economics. And productively so. We need to understand how x impacts y. We’d see a correlation between x and y. However our knowledge don’t come from managed experiments. Possibly y additionally causes x, possibly there are third variables that trigger y and x. That is the central conundrum of empirical social science. Faculty graduates have larger incomes. Does going to varsity elevate your revenue? Effectively, wealthy males drive Porsches. That doesn’t imply that driving a Porsche will make you wealthy.

So, we discover a tiny slice of variation in x that’s plausibly “exogenous,” just like the random variation {that a} lab scientist might impose. The correlation of this tiny little bit of x with a equally tiny little bit of variation in y can determine a causal impact of x on y. That’s nice. This causality revolution has actually improved empirical economics from the willy-nilly regressions we used to run. However that doesn’t imply we perceive the majority of motion in y. The opposite causes of y could, and sometimes do, dominate.

Throwing out variation

Begin with a variable we need to perceive y, maybe employment. y is the sum of many causes, y = b1 x1 + b2 x2 + … Initially, we glance simply on the impact of 1 variable, x1, say minimal wages, leaving out all of the others, together with inhabitants, demographics, schooling, unionization, immigrants, rising or falling industries, social program disincentives (they usually reduce advantages by a greenback for every greenback you earn), and on and on. For most individuals, who earn way over minimal wages, minimal wages are clearly irrelevant. Proper off the bat, you understand we’ll clarify a tiny fraction of employment.

However states don’t enact minimal wage legal guidelines randomly. They reply to situations. Possibly you’re trying on the impact of employment on minimal wages. Or possibly you’re governments that enact a bunch of insurance policies on the identical time, extra regulation with extra minimal wages, and it’s the rules that decrease employment. So we begin throwing out variation in an effort to discover one thing that appears like actually exogenous variation.

A typical research may make use of “variations in variations.” Take a look at modifications in minimal wage (distinction in time) throughout completely different states, and correlate that with the distinction throughout states in employment progress. We’ve thrown out loads of the variation of the unique knowledge, the extent of minimal wage in every state.

Research usually add “fastened results.” In a regression, y(state, time) = state fastened impact + b x(state, time) + error. A state fastened impact means we glance solely on the variation in a variable inside a state over time, not how the variable varies throughout states. A time fastened impact means we solely take a look at the variation of a variable throughout states, and never the way it varies over time. It is not uncommon so as to add each fastened results. Sure, that’s potential. y(i,t) = a(i) + c(t) + bx(i,t) + error just isn’t the identical as y(i,t) = a(i,t) + bx(i,t) + error, which might not work. Let’s see if I can state the supply of variation in phrases. (A terrific seminar query: are you able to please state the supply of variation in x in phrases?) We’re x in state i at time t relative to how a lot x is on common in state i, and relative to how a lot x is on common throughout all states at time t, and the way that correlates to related variation in y. Hmm, I didn’t do an important job of translation to English. (Stating the idea on normal errors in phrases will get much more fraught. Simply what did you assume is impartial of what? With out utilizing the phrase “cluster?”)

Different research look solely at states that share a border, or counties that share a border, within the hope that “different results” are the identical throughout the border. Nice, however once more we throw out all of the variation in non-bordering states or counties.

Subsequent, researchers add “controls.” Controls must be added judiciously: take into consideration what else strikes y, the way it may be correlated with the x of curiosity, after which convey it in from the error time period to the regression. Management for taxes, rules, or different modifications which may have occurred concurrently a change in minimal wage. As an alternative of y = b1 x1 + error, acknowledge that the error consists of b2 x2 and that x1 and x2 are correlated, so run y = b1 x1 + b2 x2 + error. Ingesting and most cancers are correlated. However individuals who drink additionally smoke, so that you need to take a look at the a part of ingesting not correlated with smoking to see if ingesting by itself causes most cancers. However we at the moment are in search of that a lot smaller inhabitants of drinkers who do not smoke. Technically, controls are the identical factor as trying solely on the variation in x1 that isn’t correlated with x2. We throw out variation. Mounted results are only one kind of controls.

In reality, controls are typically added willy nilly with out considering. Why is that this management wanted? What are we controlling for? That appears very true of fastened results and demographic controls. Additional controls and sometimes destroying the causal implication of the regression. Tom Rothenberg, beloved econometrics trainer at Berkeley, provided two nice examples. Regress left shoe gross sales on value and proper shoe gross sales. The R2 goes up dramatically, the usual errors drop, the magic stars seem. However now you’re measuring the impact of value on how many individuals purchase a left shoe with out shopping for a proper present. Extra severely, regress wages on schooling, however “management for” business. The R2 goes up, we clarify far more variation of wages (form of the place this publish needs to go, however not this manner). However the level of schooling is to allow you to transfer from the burger flipping business to funding banking, so controlling for business destroys the causal interpretation of the coefficient.

However I digress. To our level, including controls reduces the variation in x we’re . It’s appropriate to take action: A whole lot of the variation in x was reverse causality or correlation with different causes, and we need to throw that out in an effort to find out about causality.

Subsequent, researchers add “devices.” To keep away from the correlation is causation drawback, we discover some variable z that’s plausibly uncorrelated with different influences on y, after which solely use variation in x that is predicted by z. We throw out variation in x uncorrelated with z. (Nice examination query: clarify the distinction between an instrument and a management?)

And so forth. I’m not criticizing. The development in causal inference from these methods has been monumental. We additionally at the moment are blessed by big knowledge units, so we are able to can do it. Take all of the folks within the US, and drill right down to the truth that Joe Brown actually did transfer exogenously from Newark to Manhattan, in comparison with Sam Smith who was in any other case an identical however stayed put, and see how they did. However clearly that tells us little in regards to the precise distribution of revenue within the US.

Causality intersects with giant knowledge, additionally newly out there. With giant knowledge, you’ll be able to afford to throw out variation profligately to search for that needle of exogenous variation. Ideally, giant knowledge means we must be free from normal errors. Every thing must be vital. That normal errors nonetheless matter tells you the way a lot knowledge we throw out within the quest for causality.

Sure, it’s usually overdone and never fairly as informal because it appears.A “causally recognized” “high 5” publication with three stars on the coefficients strikes the common economists’ prior by about 1/10,000 of what Bayesian updating says it ought to do, if the causal identification had been appropriate. (Jeff Smith gave an important current Hoover seminar on this subject, slides right here, on how delicate many outcomes are to small modifications in specification.) We’re both extremely behaviorally caught in our methods, or the brand new methods on their very own don’t totally determine causal results robotically. However I’m not right here to delve in to that query at present, quite to level out that even when it had been all completely recognized, it solely solutions the query it says it solutions.

Generally, in fact, the leap is justified. Darwin discovered that pure choice accounts for finch beaks within the Galapagos. That should be 0.00001% of the variation in species. It seems all the remaining can be pure choice. However the Finch beaks alone don’t show that.

Value strain, and the 90% full glass.

This remark arose out of debate on the NBER Asset Pricing Program over Aditya Chaudhry and Jiacui Li’s “Endogenous Elasticities” paper (evaluate in my final publish). Like the remainder of the value strain literature, they discover surprisingly giant elasticities of small modifications inventory portions — an sudden sale of 1% of the excellent inventory lowers the value 1-2%. (Their level is a declining elasticity. Roughly talking, gross sales beneath 1% elevate the value by twice the quantity of sale, gross sales over 1% solely by the identical quantity because the sale. However even 1 is a big elasticity.)

However most modifications in value happen with none demand (or is it provide?) strain. Earnings bulletins transfer inventory costs, and no shares want change arms. When the market goes down and your inventory has a beta of 1, the value strikes, with no promoting strain to maneuver it. That is the usual principle and truth of buying and selling: when info hits the market symmetrically, costs transfer with no “shopping for or promoting strain”, and no quantity in any respect. Certainly, right here we’re speaking about 1% actions in value from occasional 1% actions in gross sales, however the common inventory strikes 1% each single day, and 50% or extra in a typical 12 months.

Thus, whereas one can causally determine that purchasing or promoting strain strikes costs, that doesn’t set up that the majority value motion comes from shopping for or promoting strain. R(t+1) = beta x(t+1) + error can have a superbly recognized beta and x. However the “error,” which consists of all the opposite x’s disregarded of the regression, might be big. “Liquidity merchants transfer inventory costs” doesn’t indicate “inventory costs largely transfer due to liquidity merchants.”

To be clear, neither Chaudhry and Li nor another value strain authors I’ve seen declare in any other case. However one does sniff that mis-interpretation hanging round.

Associated however barely completely different, most modifications in amount don’t have any or tiny value results, as a result of they’re anticipated. Most individuals attempting to purchase or promote monetary belongings are sensible sufficient to not shock the market. When you present up unexpectedly with a truck load of tomatoes outdoors of Entire Meals at 2 am, you’re not going to get full value for them. The Treasury, for instance, routinely sells tons of of billions of {dollars} of debt with primarily no value impression. Why? It broadcasts the gross sales properly forward of time, and talks to bond merchants in regards to the sale. Quantitative easing purchases of tons of of billions had some impression impact when introduced, however no detectable value impression when the Fed really purchased securities. Preliminary choices quantity to an infinite % improve in provide of shares. Funding banks exist to popularize choices, announce them, line up buyers, and restrict any “sloping demand curve” value impression.

Furthermore, now we have lengthy understood why promoting drives costs down: folks on the opposite aspect suspect you understand one thing they don’t know. The worth strain literature tries to seek out promoting or shopping for shocks that the opposite aspect ought to have the ability to work out just isn’t tied to info. For instance, with the identical knowledge that value strain authors laboriously dig up, you must have the ability to work out {that a} mutual fund is promoting shares as a result of its prospects are pulling out cash, not as a result of its analysts know one thing you don’t. The mere truth {that a} fund is promoting may imply that its analysts know one thing the dealer doesn’t know. Effectively, possibly excessive frequency arbitrageurs aren’t fairly that good at parsing out who does and doesn’t know one thing after they promote.

It is a barely completely different phenomenon, for which I don’t have a catchphrase: Simply because your recognized motion in x causes motion in y doesn’t imply that every one actions in x trigger motion in y.

Macroeconomics

Macroeconomics ought to take a victory lap for being first to the desk right here. Chris Sims’ Vector Autoregressions taught us to search for the results of a financial coverage shock by trying on the common occasions not following an rate of interest rise per se, however solely following sudden rate of interest rises. The difficulty is, markets anticipate most rate of interest modifications very properly, so true financial coverage shocks are few and much between. If we need to subdivide, for instance to financial coverage shocks that persistently elevate rates of interest vs those who die out rapidly, then now we have fewer knowledge factors nonetheless. (In modern principle, persistent vs. transitory shocks have very completely different results.) The end result, recognized financial coverage shocks clarify subsequent to not one of the noticed variation in costs, output, and employment, and normal errors plus the results of small specification modifications are big.

Remaining ideas

So, causality is nice, nevertheless it isn’t all the pieces. We frequently do need to know, “what are the most important causes of progress vs stagnation, wealth vs. poverty, recession vs. increase, and why do inventory costs wander round a lot?” Causal identification can chip away at this query, however clearly there’s a lengthy solution to go. And it’s not the apparent we’ll ever get there, since a lot motion within the causes is and can all the time be endogenous.

Possibly one ought to rule out such massive image questions. Medication doesn’t get far with “why are folks sick?” however as an alternative assaults medication with small marginal energy one after the other. And scientific trials rightly concentrate on simply the folks within the trial, ignoring the huge quantity outdoors of the trial.

Nonetheless, then, one mustn’t mistake the reply of the small causal query for the reply to the disallowed massive image query.

As I take into consideration macroeconomics and finance, I believe there’s good work to be accomplished that doesn’t simply observe the causal identification format, and permits us to handle the large image query. Generally broad info match one vs. one other causal story in methods that can’t be captured by these methods.

As a concrete instance, I’ll plug once more a current paper, “Expectations and the Neutrality of Curiosity Charges.” Right here I contrasted FTPL, outdated Keynesian, new-Keynesian and Monetarist explanations for the current surge of inflation, the lengthy quiet zero sure, the shortage of a deflation spiral in 2008, and the immense distinction between QE and the 2020-2021 asset purchases. I argue that one can kind out the theories with just a little Occam’s razor, fundamental basic predictions, and elephant within the room info. However I couldn’t consider an F take a look at in a VAR to seize that widespread sense. This form of examination of historic episodes stays productive. Tom Sargent’s plot of the top of the German hyperinflation did greater than a thousand VARs to exhibit the opportunity of painless disinflation and its doubtless mechanism.

Development principle additionally appears to seek out it very productive to have a look at fundamental info, quite than slice and cube causal estimates. It began with Bob Lucas noticing that capital must be flowing in droves to poor nations. Why not? Tom Sowell is on my thoughts from his current celebration. He paperwork info that help one vs. one other causal framework. For instance, individuals who immigrate to the US from completely different nations or areas of nations, however Individuals can’t inform them aside, have very completely different outcomes. Effectively, pure discrimination can’t be all the pieces.

However this form of factor takes thought and judgement, and is tough to publish.

Updates:

John Hand at UNC has a stunning paper documenting the phenomenon in accounting analysis. “Greater Information + Tinier Outcomes = The Mistaken Route.”

the explanatory energy of the everyday KIV [key independent variable of interest] in primarily regression-based accounting analysis papers has fallen ≈ 50X from 1.8% in 1995 to 0.04% in 2024.

I need to emphasize what this publish is not about: It isn’t a critique of causal strategies in utilized economics and finance. It is just a criticism of how one may misread the findings of these causal strategies. A number of correspondents are misreading this publish as a critique of causal strategies.

There is a burgeoning critique, which I barely alluded to above by noticing how little folks’s beliefs are modified by “statistically vital” outcomes. It’s present process its personal replicability disaster, because the Jeff Smith slides I alluded to above focus on. Small modifications in how one throws out 99% of the variance of a variable results in fairly completely different outcomes. Additionally it is criticized for x variables that one may have the ability to measure, however that don’t actually matter within the massive scheme of issues. It’s belittled as “cuteonomics” or “blippies,”with some justice.

I discover empirical analysis most credible after I can isolate the plausible stylized truth underlying an estimate. It’s, as I alluded to a bit, virtually unattainable to even state the stylized truth in phrases, not to mention the assumed correlation construction of errors. I discover empirical analysis most satisfying after I study one thing in regards to the world, summarized in an estimate however plausible by itself. Abstracts of empirical papers particularly on the job market are virtually comical. After stating an fascinating query, “we leverage a diff in diff technique with controls, fastened results and devices….” Oh properly.

Informal econometricians, nevertheless, are proper to say that every one of you outdated people working doubtful correlation regressions by no means received anyplace within the outdated days. What would you do higher?

However all that’s for one more day.

Replace 2:

I went to work at present, and ran throughout a traditional instance at a seminar. A paper claims that rises in international temperatures elevate mortality from warmth waves. Depart apart fights over whether or not that’s proper or not — demise charges in Texas usually are not quite a bit larger than in New Hampshire every summer time. If the query is “what accounts for deaths,” one diploma larger temperature in 100 years needs to be within the thousandth of the results of illness, air pollution, poverty, and so forth. If the query is “what can we do to scale back the demise charge,” “purchase an electrical automobile” within the millionths of the profit/value interventions.

Share

Causation Doesn’t Indicate Variation

Related Articles

Use of Gen AI within the Office and the Worth of Entry to Coaching

Kevin Warsh, Trump’s Choose to Lead Fed, Faces Senate at Difficult Second

Why Gasoline Tax Holidays Backfire

LEAVE A REPLY Cancel reply

Latest Articles

Use of Gen AI within the Office and the Worth of Entry to Coaching

Kevin Warsh, Trump’s Choose to Lead Fed, Faces Senate at Difficult Second

Why Gasoline Tax Holidays Backfire

How policymakers can shut the digital ID gender hole

A Resolution Tree With The New IRS Limits

ABOUT US

Causation Doesn’t Indicate Variation

Related Articles

LEAVE A REPLY Cancel reply

Stay Connected

Latest Articles

.tds-title-simplified{margin:0;transform:translateZ(0)}.tds-title-simplified>.td-element-style{z-index:-1}body .tdi_138{text-align:center;color:#ffffff}ABOUT US

ABOUT US