<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Firdaus Gupte]]></title><description><![CDATA[I'm a PhD candidate in philosophy at UMass Amherst. I like thinking about ethics and the philosophy of science.]]></description><link>https://firdausgupte.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!OnQ8!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bd949e9-79ba-4e42-a808-627a9eca9b53_1280x1280.jpeg</url><title>Firdaus Gupte</title><link>https://firdausgupte.substack.com</link></image><generator>Substack</generator><lastBuildDate>Tue, 26 May 2026 02:54:12 GMT</lastBuildDate><atom:link href="https://firdausgupte.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Firdaus Gupte]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[firdausgupte@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[firdausgupte@substack.com]]></itunes:email><itunes:name><![CDATA[Firdaus Gupte]]></itunes:name></itunes:owner><itunes:author><![CDATA[Firdaus Gupte]]></itunes:author><googleplay:owner><![CDATA[firdausgupte@substack.com]]></googleplay:owner><googleplay:email><![CDATA[firdausgupte@substack.com]]></googleplay:email><googleplay:author><![CDATA[Firdaus Gupte]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Why you should push the blue button]]></title><description><![CDATA[A modest case]]></description><link>https://firdausgupte.substack.com/p/why-you-should-push-the-blue-button</link><guid isPermaLink="false">https://firdausgupte.substack.com/p/why-you-should-push-the-blue-button</guid><pubDate>Fri, 08 May 2026 17:15:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!46pU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!46pU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!46pU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 424w, https://substackcdn.com/image/fetch/$s_!46pU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 848w, https://substackcdn.com/image/fetch/$s_!46pU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 1272w, https://substackcdn.com/image/fetch/$s_!46pU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!46pU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png" width="424" height="565.3333333333334" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1448,&quot;width&quot;:1086,&quot;resizeWidth&quot;:424,&quot;bytes&quot;:2151609,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/196260803?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!46pU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 424w, https://substackcdn.com/image/fetch/$s_!46pU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 848w, https://substackcdn.com/image/fetch/$s_!46pU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 1272w, https://substackcdn.com/image/fetch/$s_!46pU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac1cfbf7-bf6f-42b3-8902-285db33af0fa_1086x1448.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>By now, I&#8217;m sure you&#8217;ve heard of the &#8220;Red Button or Blue Button&#8221; problem.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://firdausgupte.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">This Substack is reader-supported. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><blockquote><p><em>Red Button or Blue Button</em>: Everyone in the world has to take a private vote by pressing a red or blue button. If more than 50% of people press the blue button, everyone survives. If less than or equal to 50% of people press the blue button, only people who pressed the red button survive. Which button would you press?</p></blockquote><p>This problem uncovers some surprisingly deep issues about decision theory, collective action, and your ethical obligations to others.</p><p>I think you should push blue. Here is a modest case for why.</p><div><hr></div><p>Let&#8217;s start easy. Suppose you only care about your own life. You can visualize all the possible outcomes using this payoff matrix:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{c|c|c} &amp; \\text{You: Blue Button} &amp; \\text{You: Red Button} \\\\ \\hline \\text{> 50%: Blue Button} &amp; \\text{You }{\\color{#228B22}\\text{live}} &amp; \\text{You }{\\color{#228B22}\\text{live}} \\\\ \\hline \\text{$\\leq$ 50%: Blue Button} &amp; \\text{You }{\\color{#B22222}\\text{die}} &amp; \\text{You }{\\color{#228B22}\\text{live}} \\\\ \\end{array}&quot;,&quot;id&quot;:&quot;RAMCAOAUET&quot;}" data-component-name="LatexBlockToDOM"></div><p>If you push the red button, you&#8217;re guaranteed to live, and if you push the blue button, you might die. So, if you only care about your own life, you should push the red button, because doing so <a href="https://en.wikipedia.org/wiki/Strategic_dominance">weakly dominates</a><em> </em>pushing the blue button.</p><p><strong>Takeaway</strong>: If you only care about your own life, push the red button.</p><div><hr></div><p>But do you only care about your own life?</p><p>Think about all the other people&#8217;s lives that are at stake. Your friends, your family, and all your other loved ones are also playing this game. Random strangers are too. And although you probably don&#8217;t care as much about random strangers as you do about your loved ones, you probably care at least <em>a little bit </em>about random strangers.</p><p>Further, here&#8217;s a fact that you can be almost certain of: of the 8 billion people on Earth, <em>at least one person will press the blue button</em>. I&#8217;m not claiming lots of people will, just that, almost certainly, at least one person will. Maybe they&#8217;re scared. Maybe they&#8217;re confused about the situation. Maybe they think they&#8217;re taking the moral high ground. Maybe they&#8217;re creating a problem when one doesn&#8217;t need to exist. Whatever their reasons, you can be almost certain that <em>at least one person</em> will press the blue button. </p><p>And further, however bad their reasons, they don&#8217;t deserve to <em>die </em>for it. (Let&#8217;s set aside the issue of the risk to yourself for a moment.) I&#8217;ve seen comments that say, &#8220;The only reason to press blue is to save the other <em>idiots</em> that also pressed blue.&#8221; But surely, if someone does something stupid or negligent, I&#8217;m willing to bet you wouldn&#8217;t suddenly stop caring about their life.</p><p>Consider this case:</p><blockquote><p><em>Beach</em>: You&#8217;re relaxing on a beach. Nearby, someone who can&#8217;t swim is playing on a pier that extends into the water. They know they can&#8217;t swim. They climb onto the rail, fall in, and start drowning. You could easily save them. There is a life jacket next to you, and all you have to do is throw it to them.</p></blockquote><p>This person is certainly an <em>idiot </em>for playing on top of a pier, knowing that they can&#8217;t swim. They created a problem when one didn&#8217;t need to exist. But, I think, you would still throw them the lifejacket.</p><p><strong>Takeaway: </strong>You (probably)<em> </em>care about other people&#8217;s lives, including the lives of people who make stupid or negligent choices. (And if you&#8217;re not convinced, think about your loved ones.)</p><div><hr></div><p>But <em>Blue Button or Red Button</em> is not like tossing someone a life jacket! There is a risk to your own life if you push the red button. You might care about other people&#8217;s lives in general, but you wouldn&#8217;t be willing to pay a real cost, like risking your life, to save them. </p><p>Notice that we have changed the problem. The problem <em>now </em>is: what should you do in cases in which you must contribute to a public good (saving as many lives as possible, including your own) by incurring a cost to yourself (risking your life)? In other words, the problem is starting to look like a <a href="https://plato.stanford.edu/entries/free-rider/#SolFreRidPro">collective action</a> problem.</p><p>Here is an example of a collective action problem.</p><blockquote><p><em>Littering</em>: There is a beautiful public park that you and lots of other people love. There are no garbage cans in the park. You are carrying litter with you. It would be very easy and convenient for you to throw the litter as you pass through the park. Most people would barely even notice it. On the other hand, you could carry your litter with you and throw it away at home, but this would be somewhat annoying and inconvenient.</p></blockquote><p>What should you do? Here is one way to think about this problem. You might reason that it would be better to litter since your tiny piece of trash would make almost no difference to the beauty of the park. But remember, everyone is in the same situation as you are! And if everyone litters, the park would be ruined.</p><p>You can again visualize all the possible outcomes using a payoff matrix:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{c|c|c} &amp; \\text{You: Don't Litter} &amp; \\text{You: Litter} \\\\ \\hline \\text{Everyone else: Don't Litter} &amp; {\\color{olive}\\begin{array}{c}\\text{Clean Park +} \\\\ \\text{Inconvenience}\\end{array}} &amp; {\\color{darkgreen}\\begin{array}{c}\\text{Clean Park +} \\\\ \\text{Convenience}\\end{array}} \\\\ \\hline \\text{Everyone else: Litter} &amp; {\\color{red}\\begin{array}{c}\\text{Dirty Park +} \\\\ \\text{Inconvenience}\\end{array}} &amp; {\\color{darkorange}\\begin{array}{c}\\text{Dirty Park +} \\\\ \\text{Convenience}\\end{array}} \\\\ \\end{array}&quot;,&quot;id&quot;:&quot;YSGYMMGMMV&quot;}" data-component-name="LatexBlockToDOM"></div><p>The worst outcome is when you don&#8217;t litter, but everyone else does (bottom left). You have the inconvenience of carrying your litter home, but still end up with a dirty park. A slightly better outcome, but still bad, is when you litter and everyone else does (bottom right). There&#8217;s no more inconvenience, but your beloved park is still ruined. A good outcome is when you don&#8217;t litter, and neither does anyone else (top left). You have a beautiful park, at a small cost to yourself. The <em>best </em>outcome is when you litter, and no one else does (top right). You still have a clean park and the convenience of not carrying your trash home.</p><p>The problem is that no matter what everyone else does, it&#8217;s better for you to litter. If everyone else doesn&#8217;t litter (that is, if we&#8217;re in the top row), then you should litter, since you get a clean park either way and a bit of convenience if you litter. If everyone else does litter (that is, if we&#8217;re in the bottom row), then you should also litter, since you get a dirty park either way, and a bit of convenience if you litter. Littering <a href="https://en.wikipedia.org/wiki/Strategic_dominance">strictly dominates</a> not littering.</p><p>But since <em>everyone else </em>is in the same situation you are, they will also litter! And you will end up in the bottom right cell, with a dirty park and only a bit of convenience, instead of a much better outcome, with a clean park and only a small amount of inconvenience. Yet intuitively, the best outcome for everyone is the top left cell, where everyone carries their litter home and enjoys a beautiful park.</p><p>This is a collective action problem. Each individual must pay a small cost to preserve a public good, the good would be preserved even if any individual refused to pay, but if everyone refused to pay, the public good would be destroyed. A lot of real-life cases are like this. Cases of companies polluting, citizens voting, and even <a href="https://www.latimes.com/entertainment-arts/music/story/2022-03-10/mitski-bruno-mars-silk-sonic-cell-phones-concerts">concert attendees putting up their phones</a> to record the show can be thought of as collective action problems.</p><div><hr></div><p>Given that you care about other people&#8217;s lives, <em>Blue Button or Red Button</em> is a lot like <em>Littering</em>. You might reason that it would be better to push red since the probability of your vote making a difference is vanishingly small.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> But remember, everyone else is in the same situation as you are! And if a majority pushes red, then those unlucky people who pushed blue, whom you (probably) care about, will die.</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{c|c|c} &amp; \\text{You: Blue Button} &amp; \\text{You: Red Button} \\\\ \\hline \\text{> 50%: Blue Button} &amp; {\\color{olive}\\begin{array}{c}\\text{Everyone lives +} \\\\ \\text{You risk your life}\\end{array}} &amp; {\\color{darkgreen}\\begin{array}{c}\\text{Everyone lives +} \\\\ \\text{You don't risk your life}\\end{array}} \\\\ \\hline \\text{$\\leq$ 50%: Blue Button} &amp; {\\color{red}\\begin{array}{c}\\text{Blue-pushers die +} \\\\ \\text{You risk your life (and die)}\\end{array}} &amp; {\\color{darkorange}\\begin{array}{c}\\text{Blue-pushers die +} \\\\ \\text{You don't risk your life}\\end{array}} \\\\ \\end{array}&quot;,&quot;id&quot;:&quot;BOPXPLJMJN&quot;}" data-component-name="LatexBlockToDOM"></div><p>Here again, the worst outcome is when you push blue but the majority doesn&#8217;t (bottom left). You end up risking your life and dying, and lots of people end up dying as well. Since we are assuming you care about other people&#8217;s lives too (especially your loved ones&#8217; lives), this is the worst outcome. A better outcome, but still bad, is when you push red, but the majority pushes blue (bottom right). You do survive, but lots of people end up dying. Again, since we are assuming you care about other people&#8217;s lives too, this is still a bad outcome. A good outcome is when you push blue and the majority does too (top left). Everyone survives, but you had to risk your life. The very best outcome is when you push red but the majority pushes blue (top right). You didn&#8217;t have to risk your life, and everyone still survives.</p><p>The problem is that, just as before, no matter what everyone else does, it&#8217;s almost always better for you to push red (since the probability of your vote being decisive is vanishingly small). If you are confident the majority will push blue, there&#8217;s no need to risk your life and push blue - just push red! If you&#8217;re confident the majority won&#8217;t push blue, then you should push red to save yourself - there&#8217;s no need to throw your life away for nothing! Just as in <em>Literring </em>(if we ignore the vanishingly unlikely situation in which your vote is the deciding vote), pushing red strictly dominates pushing blue.</p><p>But lots of people are in the same situation as you are. So, we will all end up in the bottom right cell, with a majority pushing red, and a few poor, confused people pushing blue and dying. Intuitively, the best outcome is again the top left cell, where most people push blue, and no one dies.</p><p><strong>Takeaway: </strong>If you care about other people&#8217;s lives, <em>Blue Button or Red Button</em> has a similar structure to a collective action problem, like the problem of littering, voting, or polluting.</p><div><hr></div><p>What should you do if you are in a collective action problem? Based on the reasoning I gave above, it seems that you should refuse to pay the cost, and so end up in a less-than-ideal situation.</p><p>But a hidden assumption crept into the discussion. An assumption about the right theory you should use to make decisions. To see what that assumption is, consider this last case.</p><blockquote><p><em>Twin Littering</em>: Just like <em>Littering </em>above, except with a twist. This time, you and all the other people who in the park are <em>psychological twins</em>. Because of your genetic makeup, personality, and background, there is a 99% probability that you will all make the same decisions when placed in the same situation.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p></blockquote><p>What should you do now?</p><p>Suppose you think, &#8220;I should litter, since no matter what anyone else does, I&#8217;m better off if I litter.&#8221; But this time, remember that everyone else is your psychological twin, so you can be confident they are thinking the exact same thing. So, if you choose to litter, then you have a 99% confidence that everyone else will make that same decision to litter. On the other hand, if you choose <em>not </em>to litter, you can be very confident that everyone else will also not litter. So, you might reason, it would be better <em>not </em>to litter, since your very decision not to litter will make you confident that no one else will litter either.</p><p>If you agree with this reasoning, then you might like <a href="https://www.lesswrong.com/w/evidential-decision-theory">Evidential Decision Theor</a>y (EDT). Roughly, EDTists think that you should choose the action that you <em>expect</em> will maximize value, conditional on your choosing that action. More formally, EDTists calculate an action&#8217;s value by:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;EU(A) = \\sum_i P(S_i \\mid A) \\, U(A \\,\\&amp;\\, S_i)&quot;,&quot;id&quot;:&quot;UTJTCSGYHN&quot;}" data-component-name="LatexBlockToDOM"></div><p>where </p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;P(S_i \\mid A)&quot;,&quot;id&quot;:&quot;ZOCHRXAIRT&quot;}" data-component-name="LatexBlockToDOM"></div><p>refers to the probability that you&#8217;ll end up in some state <em>S,</em> conditional on taking some action <em>A</em>, and </p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;U(A \\,\\&amp;\\, S_i)&quot;,&quot;id&quot;:&quot;CQCRDVKHNN&quot;}" data-component-name="LatexBlockToDOM"></div><p>refers to your utility function <em>U</em>, which spits out a number that represents how much you value a certain action <em>A </em>and a state <em>S</em>.</p><p>Essentially, you should take the fact that you chose to litter <em>as evidence </em>for the fact that other people will litter, even though your choosing to litter doesn&#8217;t cause them to do so. EDTists will therefore choose <em>not </em>to litter in <em>Twin Littering</em>, because they know that their psychological twins will act just as they do, with a high probability.</p><p>However, you might <em>not </em>like this kind of reasoning. You might reason, your choice to litter won&#8217;t <em>cause </em>anyone to litter! It just so happens that your actions and everyone else&#8217;s are <em>correlated</em>, because you and everyone else in <em>Twin Littering</em> are psychological twins. You should still litter, you might think, because again, no matter what anyone else does, you are better off littering. This is what we learned, you might insist, from looking at those payoff matrices, and finding out that littering strictly dominates not littering.</p><p>If you like <em>this</em> kind of reasoning, you might like <a href="https://plato.stanford.edu/entries/decision-causal/">Causal Decision Theory</a> (CDT). Roughly, CDTists think that you should choose the action that you expect will <em>cause </em>the state with the highest value. More formally, CDTists calculate an action&#8217;s value by:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\quad EU(A) = \\sum_i P(A > S_i) \\, U(A \\,\\&amp;\\, S_i)&quot;,&quot;id&quot;:&quot;TQOGANNNOZ&quot;}" data-component-name="LatexBlockToDOM"></div><p>where </p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;P(A > S_i)&quot;,&quot;id&quot;:&quot;IUQCDJKRMP&quot;}" data-component-name="LatexBlockToDOM"></div><p>refers to the probability that a certain action <em>A</em> will <em>cause </em>a certain state <em>Si</em>. </p><p>Essentially, since your choice to litter is not causally connected to anyone else&#8217;s choice, CDTists think you should litter in <em>Twin Littering</em>. By payoff matrix-style reasoning, no matter what anyone else does, you are better off littering.</p><p><strong>Takeaway: </strong>An EDTist will not litter in <em>Twin Littering,</em> and a CDTist will.</p><div><hr></div><p>So, EDTists will choose not to litter in <em>Twin Littering</em>, while CDTists will choose to litter. What does this have to do with real-life collective action problems, like <em>Littering</em>?</p><p>The answer is that although in real-life cases, you do not have a bunch of psychological twins running around, you can still reasonably assume you are <em>very similar</em> to many other people out there. In real life, there are lots of people who have similar personalities, preferences, and backgrounds to you, who reason in the same way that you do, and therefore, who make similar decisions to you when placed in the same situations.</p><p>Take <em>Littering</em>. You cannot be 99% confident that everyone will make the same choice that you do, but you can be <em>reasonably</em> confident that lots of people, with similar personalities, preferences, and backgrounds to you, will make the same decision as you do. In other words, if you choose to litter, you can reasonably expect all the other people who are similar enough to you to litter as well. Maybe it&#8217;s not 99%. Maybe it&#8217;s closer to 50%. But whatever it is, you can be reasonably confident that lots of people will act like you.</p><p>So, if you are an EDTist, you will reason that, since many other people will make the same choice as you do, you should choose <em>not </em>to litter, since your choice not to litter will be evidence that lots of other people will not litter as well. (On the other hand, a CDTist will still litter in <em>Littering</em>.)</p><p><strong>Takeaway</strong>: If you are an EDTist, and there are enough people out there who think as you do, you will (probably) not litter in <em>Littering</em>.</p><div><hr></div><p>That, <em>finally</em>, brings us back to what you should do in <em>Red Button or Blue Button</em>. First, note that although you don&#8217;t have a bunch of psychological twins out there, you can be reasonably confident that there are lots of people with similar personalities, preferences, and backgrounds to you. When given the choice between pushing red and pushing blue, they will deliberate in the same way that you do, reach the same conclusion that you do, and then make the same <em>choice</em> that you do. </p><p>So, <em>if</em> you like EDT, you should (probably) push the blue button, because you will take your choice to push blue as <em>evidence </em>for what a chunk of similar people will choose to do. If there are enough people who will deliberate like you and make the same choice as you, then this will result in a blue victory, and no one will die. </p><p>However, if you like CDT, then you should just push the red button. You will reason that pushing the blue button won&#8217;t <em>cause </em>anyone else to push the blue button too (at best, it will just be correlated) and pushing the red button, as we figured out above, strictly dominates pushing the blue button.</p><p>But how <em>many</em> people in the world will deliberate as you do? If, say, only 10% of the population deliberates as you do, this may not be enough evidential lift for it to be worth pushing blue. In other words, it may turn out that pushing blue increases your expectation of a blue victory, but not by much. So, maybe even if you are an EDTist, you should be wary of pushing blue, since you still have a high expectation of all the blue-pushers dying, including yourself.</p><p><strong>Takeaway</strong>: If you are an EDTist, and there are enough people out there who think as you do, you will (probably) push blue.</p><div><hr></div><p>So far, we have been thinking about what the right decision is, assuming that you are only trying to maximize what you value. (We assumed that you <em>wanted </em>to save the lives of other people.)</p><p>But you might think that this leaves something out. You might think that in a collective action problem like <em>Littering</em>, you shouldn&#8217;t just do what is best for yourself or what you value. You might think that decision theory can tell you what to do if you want to get the most value for yourself, but this ignores the <em>ethics </em>of your decision. You might think you have an <em>ethical obligation </em>to not litter, to vote, to not pollute, and in general, to cooperate in collective action problems. If you are sympathetic to this idea, you might think we need to go beyond decision theory&#8230; to ethics.</p><p>There are many ethical theories that can make sense of collective action problems. One is Kantian Ethics, according to which there are certain ethical constraints on action that go beyond maximizing what you value. <em>Roughly</em>, Kantian Ethics says, before you take an action, think about what rule (<em>maxim</em>) you are acting on. Then, imagine a world in which this is a rule that everyone follows (<em>universal law</em>). Finally, ask yourself, in this world, are you still able to pursue the goals you are committed to pursuing (<em>will</em>)?</p><p>For example, in <em>Littering</em>, if you choose to litter, the rule you would be acting on is something like, &#8220;It is OK for people to litter to avoid the inconvenience of carrying litter home.&#8221; Now, you imagine that everyone else could act on this rule. In such a world, the park would be ruined. And finally, a goal you are committed to pursuing is having a clean park, because you love the park. So, you shouldn&#8217;t litter.</p><p>So, similarly, in <em>Red Button or Blue Button</em>, if we assume that some poor, confused individuals will press blue, Kantian Ethics would recommend pushing blue as well. If you push red, then the rule you would be acting on is something like, &#8220;It is OK not to take on small risks to your life, despite the fact (because of our assumption) that this will result in some people dying.&#8221; Now, you imagine everyone else could act on this rule. In such a world, you might find yourself in a situation in which other people need to take on small risks to protect <em>your life</em>, but don&#8217;t. And obviously, a goal you are committed to pursuing is not dying. So, you shouldn&#8217;t push red.</p><p><strong>Takeaway</strong>: If you think that you have an ethical obligation to cooperate in collective action problems, you will also think you have an ethical obligation to press blue.</p><div><hr></div><p>That&#8217;s my modest case for blue. Here is the final takeaway.</p><p><strong>Final takeaway</strong>: <em>If </em>you can be confident at least one other person will press blue, and <em>if</em> you care about other people&#8217;s lives, including the life of this one person who might press blue, not just your own, and <em>if </em>you are an EDTist, and <em>if </em>there are enough people in the world like you, who will deliberate just as you do, <em>then </em>you should push blue. If you <em>aren&#8217;t </em>an EDTist or if you think there <em>aren&#8217;t </em>enough people in the world like you, then you might still think you have an ethical obligation to cooperate in collective action problems, and so you will think you have an ethical obligation to press blue.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Out of a population of 8 billion, if each individual has <em>exactly </em>a .50 probability of pressing blue or red, then the probability that the population will be <em>exactly </em>evenly split is approximately </p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\frac{1}{224{,}000}&quot;,&quot;id&quot;:&quot;ERVPMMKBME&quot;}" data-component-name="LatexBlockToDOM"></div><p></p><p>If the probability is even <em>slightly </em>off, if say, it&#8217;s 0.501 or .499, then the probability that the population will be <em>exactly </em>evenly split is approximately </p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\frac{1}{10^{6948}}&quot;,&quot;id&quot;:&quot;WIYXVWUFXU&quot;}" data-component-name="LatexBlockToDOM"></div><p>which is virtually zero.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>This case is just a version of <a href="https://www.lesswrong.com/w/newcomb-s-problem">Newcomb&#8217;s Problem</a> and the <a href="https://www.lesswrong.com/w/psychological-twin-prisoner-s-dilemma">Psychological Twin Prisoner&#8217;s Dilemma</a>.</p></div></div>]]></content:encoded></item><item><title><![CDATA[The Impossible Quest for Fair Algorithms]]></title><description><![CDATA[Why bail and sentencing algorithms face an unavoidable tradeoff]]></description><link>https://firdausgupte.substack.com/p/the-impossible-quest-for-fair-algorithms</link><guid isPermaLink="false">https://firdausgupte.substack.com/p/the-impossible-quest-for-fair-algorithms</guid><dc:creator><![CDATA[Firdaus Gupte]]></dc:creator><pubDate>Fri, 17 Apr 2026 19:40:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!auck!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!auck!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!auck!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!auck!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!auck!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!auck!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!auck!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png" width="1456" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3703263,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!auck!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!auck!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!auck!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!auck!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7dd51305-feea-4981-89a0-05179656e42f_2816x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In Shawshank Redemption (1994), one of the film&#8217;s main characters, Red, played by Morgan Freeman, repeatedly appears before a parole board. Each time, the parole board asks him the same question:</p><p><em>&#8220;Do you feel you&#8217;ve been rehabilitated?&#8221;</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://firdausgupte.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>He gives careful, rehearsed answers to try to convince them that he has been rehabilitated, but each time, they deny him parole. Finally, at the end of the film, he bluntly responds that he doesn&#8217;t think he understands what the word &#8220;rehabilitated&#8221; means, and that he thinks that the concept is something the board has made up. After that hearing, the board grants him parole. The audience is left with the feeling that the parole board has no clear, rigorous criteria for making parole decisions; they depend on the whims and fancies of individuals who use their intuition and personal judgment to make decisions about whether or not to grant an individual their freedom.</p><p>The film is set between the 1940s and the 1960s, before more sophisticated techniques were put in place to make parole decisions. Nowadays, prediction tools use sophisticated algorithms, taking into account a wide variety of factors, including an individual&#8217;s criminal history, age at first arrest, substance abuse, employment history, family background, and even attitudes toward crime.</p><p>One such tool is COMPAS, developed by a company called &#8220;Northpointe&#8221; (now Equivant). It uses statistical models trained on historical data to make predictions about things like an individual&#8217;s likelihood of violating parole or failing to show up for a sentencing hearing. In theory, it helps parole boards make <em>scientific, data-based</em> decisions, unlike what was depicted in Shawshank Redemption.</p><p>COMPAS has been used to evaluate more than 1 million defendants across the country. But in 2016, ProPublica published an article written by Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner, called &#8220;<a href="https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing">Machine Bias</a>&#8221;. They looked at over 10,000 defendants in Broward County, Florida. They compared what COMPAS predicted about their likelihood to recidivate or commit another crime to what actually happened. And they made a startling accusation.</p><p>They charged that COMPAS was biased against Black defendants. It was much more likely to incorrectly flag Black defendants as &#8220;high-risk&#8221; and incorrectly flag White defendants as &#8220;low-risk&#8221;.</p><p>In their words,</p><blockquote><p>In forecasting who would re-offend, the algorithm made mistakes with Black and White defendants at roughly the same rate but in very different ways. The formula was particularly likely to falsely flag Black defendants as future criminals, wrongly labeling them this way at almost twice the rate as White defendants. White defendants were mislabeled as low risk more often than Black defendants.</p></blockquote><p>The creators of COMPAS at Northpointe published a <a href="https://go.volarisgroup.com/rs/430-MBX-989/images/ProPublica_Commentary_Final_070616.pdf">response</a>, written by William Dieterich, Christina Mendoza, and Tim Brennan, to the ProPublica article. The research report rejected ProPublica&#8217;s claims. They argued that for any given risk score that COMPAS assigned, Black and White defendants with that risk score re-offended at the same rates. In their words,</p><blockquote><p>When the correct classification statistics are used, the data do not substantiate the ProPublica claim of racial bias towards blacks. The proper interpretation of the results... demonstrates that [the risk scores] are equally accurate for blacks and whites.</p></blockquote><p>Here&#8217;s the twist &#8211; <em>they were both right.</em></p><p>How can that be?</p><div><hr></div><h3><strong>A Toy Model</strong></h3><p>The problem, it turned out, didn&#8217;t have anything to do with parole, sentencing, or racial bias. It was a problem that went deeper, to the mathematics of prediction itself.</p><p>To illustrate the problem, let&#8217;s use a toy model that abstracts away from the messy details in the data that Julia Angwin and her colleagues used.</p><p>It&#8217;s a model of grapes. Let&#8217;s take 2000 grapes, 1000 green and 1000 purple, and mix them all together. And suppose that some of them are <em>poisoned.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iYyL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iYyL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 424w, https://substackcdn.com/image/fetch/$s_!iYyL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 848w, https://substackcdn.com/image/fetch/$s_!iYyL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 1272w, https://substackcdn.com/image/fetch/$s_!iYyL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iYyL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png" width="1456" height="1429" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1429,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1355935,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iYyL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 424w, https://substackcdn.com/image/fetch/$s_!iYyL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 848w, https://substackcdn.com/image/fetch/$s_!iYyL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 1272w, https://substackcdn.com/image/fetch/$s_!iYyL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faca0866a-e360-4f8a-840e-87f7f5de9fec_1632x1602.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There are many reasons why this model isn&#8217;t like the real-life case, besides the fact that the real-life case is about actual human beings and the model is about grapes. Here are a few worth highlighting. First, COMPAS gave individuals risk scores between 1 and 10, while our tool is more coarse-grained. It only sorts individuals into &#8220;high-risk&#8221; and &#8220;low-risk&#8221; categories. Second, in our model, grapes are either poisoned or not poisoned, and this is fixed before our tool even looks at them. In the real-life case, individuals in prison who were given higher risk scores by COMPAS were often denied parole and detained for longer. This might affect the outcomes - maybe someone who was given a higher risk score and detained will, <em>because of that,</em> become more or less likely to re-offend in the future. So, we might worry that in the real-life case, predictions had (at least some) effect on the outcomes. Third, since we made up data about the grapes, we know exactly how many grapes are poisoned or safe. In the real-life case, however, the data had to be collected. But, naturally, the outcomes we are able to collect data on are not about whether individuals <em>re-offend</em>, but about whether individuals are <em>re-arrested</em>. If, for example, Black neighborhoods are more likely to be policed, Black individuals are more likely to be stopped, and Black individuals are more likely to be searched, then we might worry that the data COMPAS uses doesn&#8217;t completely accurately represent the underlying rate of offending, but rather reflects patterns of policing and arrest.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>Our model avoids some of these tricky issues, and for our purposes, these differences won&#8217;t matter, since they won&#8217;t affect the points in this article.</p><h3><strong>Base Rates</strong></h3><p>Back to the model. Here is a crucial fact about the grapes in our model. The green grapes and purple grapes are not poisoned at the same rates. In other words, they have what&#8217;s called different &#8220;base rates&#8221;.</p><p>If we count up all the poisoned grapes in the original pile, we will find that 50 percent of purple grapes are poisoned (500 out of 1000) whereas 25 percent of green grapes are poisoned (250 out of 1000).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lnbU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lnbU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 424w, https://substackcdn.com/image/fetch/$s_!lnbU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 848w, https://substackcdn.com/image/fetch/$s_!lnbU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 1272w, https://substackcdn.com/image/fetch/$s_!lnbU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lnbU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png" width="1456" height="662" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:662,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:85528,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lnbU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 424w, https://substackcdn.com/image/fetch/$s_!lnbU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 848w, https://substackcdn.com/image/fetch/$s_!lnbU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 1272w, https://substackcdn.com/image/fetch/$s_!lnbU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5b2f8e4-45dc-4e92-ae77-65a9cfdceba0_1632x742.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3><strong>A Prediction Tool Sorts Grapes into Risk Categories</strong></h3><p>Suppose we use an algorithmic tool, just like COMPAS, that makes <em>predictions</em> about whether or not any given grape is poisoned. It doesn&#8217;t know what we know about these grapes; it doesn&#8217;t know which ones are poisoned and which ones are safe. It sorts grapes into two categories, &#8220;high-risk&#8221; and &#8220;low-risk&#8221;, based on some historical data.</p><p>Suppose that it isn&#8217;t perfect: it puts some grapes that are actually poisoned in the &#8220;low-risk&#8221; category and some that are actually safe in the &#8220;high-risk&#8221; category. But it is fairly accurate.</p><p>Say that it splits the purple grapes, 750 and 250, into the &#8220;high-risk&#8221; and &#8220;low-risk&#8221; categories, and it splits the green grapes, 125 and 875, into the &#8220;high-risk and &#8220;low-risk&#8221; categories, respectively. (I picked these specific numbers somewhat arbitrarily, but with certain constraints that will be clear in a moment.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q-KF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q-KF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 424w, https://substackcdn.com/image/fetch/$s_!q-KF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 848w, https://substackcdn.com/image/fetch/$s_!q-KF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 1272w, https://substackcdn.com/image/fetch/$s_!q-KF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q-KF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png" width="1456" height="780" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:780,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:653530,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!q-KF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 424w, https://substackcdn.com/image/fetch/$s_!q-KF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 848w, https://substackcdn.com/image/fetch/$s_!q-KF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 1272w, https://substackcdn.com/image/fetch/$s_!q-KF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e509b94-a9e9-4794-a338-ca890ec2c575_1632x874.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There seems to be a problem. There are <em>way</em> more purple grapes in the &#8220;high-risk&#8221; category, and <em>way</em> more green grapes in the &#8220;low-risk&#8221; category. In fact, if we try to visualize this data, we can see this more clearly.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vsmv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vsmv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 424w, https://substackcdn.com/image/fetch/$s_!vsmv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 848w, https://substackcdn.com/image/fetch/$s_!vsmv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 1272w, https://substackcdn.com/image/fetch/$s_!vsmv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vsmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png" width="1456" height="719" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:719,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92724,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vsmv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 424w, https://substackcdn.com/image/fetch/$s_!vsmv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 848w, https://substackcdn.com/image/fetch/$s_!vsmv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 1272w, https://substackcdn.com/image/fetch/$s_!vsmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F285740dc-bcde-454d-9479-c8b700a29bcc_1632x806.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>What we can see clearly is that the number of purple grapes is much larger than the number of green grapes in the &#8220;<em>high</em>-risk&#8221; category, and the number of green grapes is much larger than the number of purple grapes in the &#8220;<em>low</em>-risk&#8221; category.</p><h3><strong>Percentages of Risk Labels Across Groups</strong></h3><p>But this, on its own, doesn&#8217;t necessarily mean that the tool is unfairly biased. Since the groups have different <em>base rates</em> - the groups are poisoned at different rates - what we need to look at are the <em>percentages</em>. For example, what <em>percentage</em> of purple grapes in the &#8220;high-risk&#8221; category are poisoned? How does this compare to the percentage of <em>green</em> grapes in the &#8220;high-risk&#8221; category? (We would then ask the same questions about the &#8220;low-risk&#8221; category.)</p><p>It turns out that the tool we used to sort grapes into different categories ends up with <em>exactly</em> the same percentages across groups. More specifically, exactly 60 percent of both purple and green grapes are poisoned in the &#8220;high-risk&#8221; category, and exactly 20 percent of both purple and green grapes are poisoned in the &#8220;low-risk&#8221; category. (This is no accident - I picked specific numbers about how the tool makes predictions to ensure this. In reality, the creators of COMPAS tweaked the algorithm it used to ensure this result.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WXg7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WXg7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 424w, https://substackcdn.com/image/fetch/$s_!WXg7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 848w, https://substackcdn.com/image/fetch/$s_!WXg7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 1272w, https://substackcdn.com/image/fetch/$s_!WXg7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WXg7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png" width="1456" height="721" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:721,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:104841,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WXg7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 424w, https://substackcdn.com/image/fetch/$s_!WXg7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 848w, https://substackcdn.com/image/fetch/$s_!WXg7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 1272w, https://substackcdn.com/image/fetch/$s_!WXg7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92ea6dcb-c08b-484b-b0d3-5fab64a6f230_1632x808.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In machine learning contexts, this is called &#8220;calibration&#8221;. Say a tool makes predictions about certain <em>outcomes</em>, like whether an individual re-offends or whether a grape is poisoned. A risk score, like the risk scores our hypothetical tools assigns to grapes, is calibrated within groups just in case: among the individuals who have the same risk score, the actual probability of the outcome is the same, regardless of which group they belong to.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>For us, if the same percentage of purple grapes and green grapes are poisoned in the &#8220;high-risk&#8221; category (and similarly for the &#8220;low-risk&#8221; category), then our risk scoring is calibrated. Essentially, a purple grape and a green grape with the same risk score (high-risk or low-risk) must have the same probability of being poisoned.</p><p>If a prediction tool is calibrated, it seems as if the tool is <em>fair</em>. In contrast, if it is not calibrated - if say, a green and purple grape in the &#8220;high-risk&#8221; category have a different probability of being poisoned - it seems as if it is <em>unfair</em>.</p><p>This is exactly what Northpointe claimed. COMPAS was calibrated across groups; a Black individual and a White individual with the same risk score had the same probability of re-offending.</p><p>But calibration doesn&#8217;t tell us the whole story, and this is exactly what <em>ProPublica</em> claimed. ProPublica conceded that COMPAS was calibrated but <em>still</em> claimed it was unfair.</p><h3><strong>False Positive Rates Differ</strong></h3><p>To see why that might be, let&#8217;s turn back to our grape model, and this time, look at a different graphic.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!K_JP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!K_JP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 424w, https://substackcdn.com/image/fetch/$s_!K_JP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 848w, https://substackcdn.com/image/fetch/$s_!K_JP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 1272w, https://substackcdn.com/image/fetch/$s_!K_JP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!K_JP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png" width="1456" height="689" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:689,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:71872,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!K_JP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 424w, https://substackcdn.com/image/fetch/$s_!K_JP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 848w, https://substackcdn.com/image/fetch/$s_!K_JP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 1272w, https://substackcdn.com/image/fetch/$s_!K_JP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76e61052-dfcb-447e-a648-75950a46c83e_1632x772.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We are still looking at high-risk and low-risk grapes, but this time, they are grouped by grape color. Now, notice that there are grapes, of both colors, that were put in the &#8220;high-risk&#8221; category but are completely safe. These are <em>false positives</em>, grapes that are actually safe but incorrectly labeled &#8220;high-risk&#8221;. For an imperfect prediction tool, it&#8217;s inevitable that there will be <em>some</em> false positives; we could only completely avoid false positives if the tool were 100 percent accurate at predicting which grapes were poisoned and which ones were safe.</p><p>The important thing to notice now is that our prediction tool resulted in false positives at <em>different rates</em>, depending on the grape color. Look at the &#8220;Purple Grapes&#8221; section (on the left) and notice how many purple grapes were put in the &#8220;high-risk&#8221; category, but are completely safe. (For purple grapes, there are 300 safe grapes in the &#8220;high-risk&#8221; category, and only 200 safe grapes put in the &#8220;low-risk&#8221; category.) Now, look at the &#8220;Green Grapes&#8221; section (on the right) and notice that there are very few safe grapes in the &#8220;high-risk&#8221; category. (For green grapes, there are just 50 safe grapes in the &#8220;high-risk&#8221; category, and as many as <em>700</em> in the &#8220;low-risk&#8221; category.)</p><p>What we can see is that the <em>false positive rate</em> is different across different color groups. The false positive rate for a given group is the percentage of grapes that were incorrectly marked &#8220;high-risk&#8221;.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> We can see this clearly below. (The false <em>negative</em> rate, the rate at which poisoned grapes were put into the &#8220;low-risk&#8221; category, is also <em>higher</em> for green grapes, but let&#8217;s focus on the false positive rate here.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GNhm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GNhm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 424w, https://substackcdn.com/image/fetch/$s_!GNhm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 848w, https://substackcdn.com/image/fetch/$s_!GNhm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 1272w, https://substackcdn.com/image/fetch/$s_!GNhm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GNhm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png" width="1456" height="683" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:683,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:106277,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GNhm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 424w, https://substackcdn.com/image/fetch/$s_!GNhm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 848w, https://substackcdn.com/image/fetch/$s_!GNhm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 1272w, https://substackcdn.com/image/fetch/$s_!GNhm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca474e6e-1585-46f6-b471-b7fc9343d8bb_1632x766.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There are 300 purple grapes marked &#8220;high-risk&#8221; but that are actually safe. This is out of a total of 500 safe purple grapes. So, the false positive rate for purple grapes is 60 percent. On the other hand, there are 50 green grapes that are marked &#8220;high-risk&#8221; but are actually safe. This is out of a total of 750 green grapes. So, the false positive rate for green grapes is only 6.7 percent.</p><p>In short, the false positive rate for purple grapes is 60 percent, but only 6.7 percent for green grapes. Think about it like this. Suppose we threw out every grape marked &#8220;high-risk&#8221;. Now, if you&#8217;re a <em>safe, purple</em> grape, you have a <em>60 percent</em> chance of wrongly being thrown out. There you are, minding your own business with no ability to poison anyone, but 60 percent of the time, you&#8217;ll be thrown out anyway. On the other hand, if you&#8217;re a <em>safe, green</em> grape, you only have a <em>6.7 percent</em> chance of wrongly being thrown out. This seems <em>unfair</em>.</p><p>This is precisely what ProPublica argued. Even though COMPAS was <em>calibrated</em> &#8211; Black and White individuals with the same risk score had the same probability of re-offending &#8211; it had different <em>error rates</em>. For example, the <em>false positive rate</em> was much higher for Black individuals than for White individuals. As a result, Black individuals who did not re-offend were much more likely to be <em>predicted</em> to be high-risk than their White counterparts. These individuals were denied parole, forced to pay higher bail, denied release, and even given longer sentences. The <em>burdens</em> of the errors that the tool made were disproportionately falling on Black individuals. (The false negative rate was also higher for White individuals, but again, let&#8217;s just focus on the false positive rate.)</p><h3><strong>Impossibility</strong></h3><p>Given this issue with COMPAS, it seems natural to think that we should adjust COMPAS so that it no longer has different false positive rates for Black and White individuals. Ideally, our tool would be calibrated (same risk scores, same probability of re-offense) <em>and</em> have equal error rates.</p><p>But it turns out, in any realistic situation, this is <em>mathematically impossible</em>. In a <a href="https://arxiv.org/abs/1609.05807">result</a> achieved by Jon Kleinberg, Sendhil Mullainathan, and Manish Raghavan, and then in another <a href="https://arxiv.org/abs/1610.07524">result</a> achieved independently by Alexandra Chouldechova, it was proven that in any realistic situation, you cannot simultaneously have calibration <em>and</em> equal error rates. They proved that in any realistic situation, if we have a calibrated tool, then it must have different error rates, and if it has equal error rates, it must violate calibration.</p><p>What do I mean here by &#8220;realistic situation&#8221;? There are two conditions that need to be met for the impossibility result to hold. First, the tool isn&#8217;t <em>perfectly</em> accurate. A calibrated tool that is perfectly accurate will, by definition, make no errors, so the error rate will be 0. So, unless some tool is able to predict with 100 percent accuracy whether an individual will re-offend or not, this condition is met.</p><p>Second, and more importantly, there are different <em>base rates</em> between groups. In our grape model, this meant that green grapes and purple grapes were not poisoned at <em>exactly</em> the same rates. In the real-life situation, this means that Black individuals and White individuals don&#8217;t re-offend at exactly the same rate.</p><p>Now, there may be many reasons for this. Note that, as I mentioned above, the difference in &#8220;base rates&#8221; isn&#8217;t a difference in <em>re-offense</em> rates, but a difference in <em>re-arrest</em> rates. Perhaps, as I mentioned above, this difference is partly due to an excess of policing and surveillance of Black neighborhoods. But even setting this aside, there are many factors that correlate with recidivism that are unevenly distributed between Black and White people. Poverty and income instability, unemployment, housing instability, and less access to education and training can increase the probability of re-offense. If there are disparities in these factors between Black and White individuals, this might also partly explain differences in base rates. Further, returning to a neighborhood with higher crime rates and weaker social services may <em>cause</em> a higher likelihood of re-offending. If there already exist higher crime rates with weaker social services in Black neighborhoods, these differences can be exacerbated.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>Whatever the possible explanations, the important point here is that the base rates differed. If they differ, a calibrated tool that is not 100 percent accurate will make errors at different rates, by mathematical necessity.</p><h3><strong>Understanding Impossibility: Calibration but Unequal Error Rates</strong></h3><p>The proofs I mentioned above were for <em>any</em> possible (imperfect) model, with <em>any</em> given population that had differing base rates. It was a general proof. But it might be worth seeing the main idea at work in an example that gives a sense of how (in a realistic situation) calibration and equal error rates are mutually incompatible. Let&#8217;s turn back to our grape model one last time.</p><p>First, why does satisfying calibration imply that there will be different error rates?</p><p>Let&#8217;s look at our calibrated prediction tool. For both purple and green grapes, a grape in the &#8220;high-risk&#8221; category has a 60 percent chance of being poisoned, and a grape in the &#8220;low-risk&#8221; category has a 20 percent chance of being poisoned. (This is the same graphic as above.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4j3N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4j3N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 424w, https://substackcdn.com/image/fetch/$s_!4j3N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 848w, https://substackcdn.com/image/fetch/$s_!4j3N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 1272w, https://substackcdn.com/image/fetch/$s_!4j3N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4j3N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png" width="1456" height="721" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:721,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:104841,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4j3N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 424w, https://substackcdn.com/image/fetch/$s_!4j3N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 848w, https://substackcdn.com/image/fetch/$s_!4j3N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 1272w, https://substackcdn.com/image/fetch/$s_!4j3N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c8c123-d6ae-4c4f-83b5-4e0f14d6e694_1632x808.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Now - and this is the key - purple and green grapes have different <em>base rates</em>. Overall, 50 percent of purple grapes are poisoned, and only 25 percent of green grapes are poisoned.</p><p>Because of that, two things happen. First, to get calibration, it has to be that in the &#8220;high-risk&#8221; category, the percentage of poisoned purple grapes equals the percent of poisoned green grapes. So, the prediction tool puts a <em>higher percent</em> of the purple grapes into the &#8220;high-risk&#8221; bin. (If it didn&#8217;t - if say, it put exactly 50 percent of the purple grapes and exactly 50 percent of the green grapes, then because of the different base rates, the probability of a high-risk grape being poisoned would differ across groups, violating calibration.) Now, because the tool isn&#8217;t perfect - high-risk grapes are still 40 percent safe - this will inevitably mean putting a higher percentage of <em>safe</em> purple grapes into the &#8220;high-risk&#8221; bin than green grapes.</p><p>Second, because the base rates differ, a higher percent of green grapes are safe compared to purple grapes.</p><p>Now, in our model, the false positive rate is</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\frac{\\text{# of safe grapes labeled high-risk}}{\\text{# of safe grapes}}&quot;,&quot;id&quot;:&quot;WDCPJJPPRU&quot;}" data-component-name="LatexBlockToDOM"></div><p>We have seen that (1) the percent of safe <em>purple</em> grapes in the &#8220;high-risk&#8221; bin is pushed higher, and at the same time, (2) the percent of safe <em>green</em> grapes is higher than the percent of safe <em>purple</em> grapes. In other words, the numerator is pushed up for purple grapes and the denominator is pushed up for green grapes. So, the false positive rate goes up for purple grapes and down for green grapes. That&#8217;s why calibration forces the false positive rate to differ when base rates differ.</p><h3><strong>Understanding Impossibility: Equalized Error Rates but No Calibration</strong></h3><p>Now, let&#8217;s try to understand the other direction - why does equalizing the error rates violate calibration?</p><p>Let&#8217;s look at our grape model again. Suppose we noticed that the error rates differed based on the grape color: for purple grapes, the false positive rate was 60 percent, but for green grapes, the false positive rate was only 6.7 percent.</p><p>Suppose we tried to tinker with the algorithm to fix this, say, by adjusting the predictions so that the false positive rate equalized. We would have to explicitly tell the algorithm, &#8220;If you see a purple grape, use the old procedure, but if you see a green grape, use this <em>new</em> procedure so that it is more likely to end up in the &#8216;high-risk&#8217; category.&#8217;</p><p>Let&#8217;s say we adjust the algorithm so that it puts 500 more green grapes in the &#8220;high-risk&#8221; category instead of the &#8220;low-risk&#8221; one. (I picked &#8220;500&#8221; in order to equalize the base rates, as will be clear in a moment.) Now, there are 1375 grapes in the &#8220;high-risk&#8221; category, and many more of them are green grapes.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7FxL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7FxL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 424w, https://substackcdn.com/image/fetch/$s_!7FxL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 848w, https://substackcdn.com/image/fetch/$s_!7FxL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 1272w, https://substackcdn.com/image/fetch/$s_!7FxL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7FxL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png" width="1456" height="721" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:721,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:96507,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7FxL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 424w, https://substackcdn.com/image/fetch/$s_!7FxL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 848w, https://substackcdn.com/image/fetch/$s_!7FxL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 1272w, https://substackcdn.com/image/fetch/$s_!7FxL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5c42a6c-8230-4ef9-a93a-5a303c16a395_1632x808.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>By moving green grapes into the &#8220;high-risk&#8221; category, we have made it so that more green grapes are incorrectly marked as &#8220;high-risk&#8221;. This, based on the numbers of grapes we have, equalizes the false positive rates across groups.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2T9A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2T9A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 424w, https://substackcdn.com/image/fetch/$s_!2T9A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 848w, https://substackcdn.com/image/fetch/$s_!2T9A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 1272w, https://substackcdn.com/image/fetch/$s_!2T9A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2T9A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png" width="1456" height="760" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:760,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:101564,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2T9A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 424w, https://substackcdn.com/image/fetch/$s_!2T9A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 848w, https://substackcdn.com/image/fetch/$s_!2T9A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 1272w, https://substackcdn.com/image/fetch/$s_!2T9A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffac07e7e-83d2-4e7b-a783-46f3d8c974d1_1632x852.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>But to do this, remember, we had to change the algorithm in our prediction tool! We had to adjust it, so that a green grape with a low predicted probability of being poisoned would still end up in the &#8220;high-risk&#8221; category. We had to do this while still using the same algorithm for purple grapes. Because of this, we end up with different percentages of poisoned grapes across risk categories. (This is unlike <a href="https://firdaus-gupte.github.io/blog/impossible-fairness/#composition">before</a>, in which the percentages of poisoned grapes were identical across risk categories.) It&#8217;s no longer the case that exactly the same percent of poisoned grapes end up in &#8220;high-risk&#8221; and &#8220;low-risk&#8221; categories, respectively.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RkvO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RkvO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 424w, https://substackcdn.com/image/fetch/$s_!RkvO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 848w, https://substackcdn.com/image/fetch/$s_!RkvO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 1272w, https://substackcdn.com/image/fetch/$s_!RkvO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RkvO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png" width="1456" height="799" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:799,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:116018,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://firdausgupte.substack.com/i/194550430?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RkvO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 424w, https://substackcdn.com/image/fetch/$s_!RkvO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 848w, https://substackcdn.com/image/fetch/$s_!RkvO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 1272w, https://substackcdn.com/image/fetch/$s_!RkvO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff307d466-ec10-489d-ba46-2ba4383378e2_1632x896.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The tool, in other words, is no longer calibrated.</p><h3><strong>Conclusion</strong></h3><p>What is the takeaway of all this? We can put the problem like this. There are two views we might have on fairness in this machine learning context. On one view of fairness, the <em>calibration</em> view, a fair prediction algorithm is one that is <em>calibrated</em>. Any two individuals from two different groups with the same risk score should have the same probability of re-offending. Defenders of this view would argue that violating this would mean using a different procedure for individuals of different groups. On another view of fairness, the <em>error rate</em> view, a fair prediction algorithm is one that has equalized error rates across groups. Defenders of this view would argue that violating this would mean that one group disproportionately bears the burden of errors made by the tool.</p><p>And the takeaway is that in any realistic situation, it&#8217;s mathematically impossible for us to design a tool that satisfies both.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Here is some evidence to support this. A very well-cited study, <a href="https://openpolicing.stanford.edu/findings/">Stanford Open Policing Project</a>, analyzed 100 million traffic stops across the U.S. and found that Black drivers are stopped more often than White drivers, relative to their share in the population. Another <a href="https://sc.edu/uofsc/posts/2020/06/racial_disparities_traffic_stops.php">study</a> found that Black drivers are also <em>searched</em> more often than White drivers. Finally, here is a <a href="https://arxiv.org/abs/2109.12491">study</a> that found that police presence is higher in Black neighborhoods.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Here is a formal definition. For a given outcome <em><strong>Y</strong></em>, and groups <em><strong>a </strong></em>and <em><strong>b</strong></em>, and a risk score <em>s</em>, calibration requires that</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;        P(Y=+ \\mid S=s, A=a) = P(Y=+ \\mid S=s, A=b), \\quad \\forall s \\in S, \\; \\forall a,b \\in A.&quot;,&quot;id&quot;:&quot;JCGRVLHEAX&quot;}" data-component-name="LatexBlockToDOM"></div><p>This just says that the probability that a given outcome occurs (<em><strong>Y = +</strong></em>) is the same, for any two individuals from two different groups <em><strong>a</strong></em> and <em><strong>b</strong></em>, <em>given that</em> the model&#8217;s risk score <em><strong>S </strong></em>for both of those individuals is the same, <em><strong>s</strong></em>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Here is a formal definition.</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\mathrm{FPR} = P(\\hat{Y}=+ \\mid Y=-)&quot;,&quot;id&quot;:&quot;OAWNAQKQDR&quot;}" data-component-name="LatexBlockToDOM"></div><p>This says that the false positive rate is equal to the probability that the model predicts a certain outcome, but the outcome does not occur.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Here is a large criminology <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC6387636/">study</a> that found that unemployed individuals had much higher likelihood of being reincarcerated than employed individuals. Here is a <a href="https://www.tandfonline.com/doi/full/10.1080/10509674.2024.2406755">study</a> that reviewed recidivism research and found low education as a major prediction of re-offending after prison. Here is a <a href="https://www.sciencedirect.com/science/article/pii/S0049089X15301228">study</a> that found that many formerly incarcerated people return to neighborhoods characterized by poverty, unemployment, and high crime, and that returning to these neighborhoods increases the risk of re-offending after prison. Finally, here is a longitudinal <a href="https://www.tandfonline.com/doi/full/10.1080/07352166.2018.1495041">study</a> that found that Black and Hispanic individuals tend to return to more disadvantaged neighborhoods than White individuals, even accounting for differences in neighborhoods before prison.</p></div></div>]]></content:encoded></item></channel></rss>