Disclaimer
Although I do have a lot of experience in math, this article has not yet been thoroughly peer-reviewed. Check with someone with expertise on the topic before using this formula.
(So far, it has been peer-reviewed by a math teacher with wide-ranging mathematical expertise.)
Notes
For this article, I will be using "Expected value" and "impact" interchangeably.
TL;DR
It's actually the average increase in effectiveness per increase in cost, divided by cost, and it seems that the two calculations just so happen to be different.
But why?
To show why this is, we'll use the help of our friend, pigeon, and statistician in training, Todd.
Todd wants to estimate the expected happiness per rock (the currency the pigeons use) of donating to Jeremy's Flavored Crumb Stand. Todd decides to calculate the expected impact of one extra rock donated with the following formula.
Here's the data.
[1] This doesn't seem very sensible. This estimate is way too high! The projected change in impact (What we're interested in) is much higher than all of the actual changes in impact!
Todd explains his problem to his statistics professor, Taylor. Taylor explains to Todd, "Since we're trying to find [2], and [3][4], the cost-effectiveness of donating rocks to Jeremy's Flavored Crumb Shop is [5][6]. This leads to a much more sensible output, and the new estimate looks like this.
After making some assumptions, he calculated his new estimate like this.
Calculations (Feel free to skip this part)
+[7]
.
Findings
This is a much better result. He did make some assumptions, but the assumptions he made don't seem too unreasonable, so he's happy with his results.
Note that you can intuitively think of the change in the value of a (where a is the expected impact of Jeremy's Flavored Crumb Shop) when we shift the probabilities to the left by one.
What about if the expected value of [donating twice as much] is twice as big?
First of all, note how we can express the impact of donating one rock to Jeremy's Hotdog Shop as . Similarly, we can express rocks donated as Now, we just need to know if But, it turns out that this is not the case. To show why this is true, consider the following example.
and Therefore,
- , since N can only be 1, .
- , since N can only be 1, .
- .
Despite this, on average, , since, on average, data is linear (i.e. , since and (i.e., data is neither skewed to the more negative side nor the more positive side)(i.e., graphs of data are, on average, in the shape of a line) and when is linear, by definition, for all , , with some and .
Furthermore, for all , where , for all x,
Therefore, for all linear functions , since, , and .
It's important to note that, while on average, holds, when V_h(x) isn't linear, isn't a perfect estimate for
Sidenote
Similarly, is a decent approximation for cost-effectiveness, as, given the way most charities operate, the impact of each "rock" donated is roughly independent of the charity's total money that would have been donated if said "rock" was not donated. (i.e., if you were to graph impact as the y-axis and the number of dollars donated, the shape that would form would look like a line).
All functions that make this line shape can, by definition, be written in the form , where:
- is the total amount of donated "rocks" is .
- The impact is .
- can be thought of as "Impact per rock."
- can be thought of as "impact that the charity did regardless of donations (e.g. if the co-founders were nice to their parents regardless of that year's donations). We want to know "impact per rock."
Therefore, , and what we want to know is , and, as gets larger and larger, trends towards , since trends towards .
Congrats! You made it to the end of this article! 🥳!
Now, Todd can finally sit, having finished all his winter break homework. [8]
(also, I wanted to make a picture of Todd eating a crumb, but this picture of Todd eating a hotdog turned out way cuter. 🐦⬛🌭 )
If you have any questions, comments, suggestions, corrections, or feedback, please feel free to put them in the comments!
- ^
The Avg. is short for the average (Arithmetic mean) yearly impact.
- ^
where E(x) is the expected value of x, N is the number of rocks Jeremy's Hotdog Stand gets in any given year, and is the value generated by Jeremy's Hotdog Stand when Jeremy's Hotdog Stand gets N rocks.
- ^
is the probability that some random variable X is equal to .
- ^
- ^
We can see why this is true if we input .
- ^
When is known, but isn't known, use some estimation for . (e.g., . The same goes for when is unknown. (That is, we estimate )
- ^
Todd assumed that
- ^
I imagine that pigeons get only one piece of homework for winter break.
I'm finding it a little hard to parse this example. Are you highlighting the difference between average impact and marginal impact?
Very close! It's more the difference between average impact and a weighted average of the marginal impact, weighted by probability. Did that answer your question?
It's unintuitive to me since I would normally just think about marginal impact,but it makes sense. What is the uncertainty over? Is it uncertainty about V(n)?
hello?
It's over E(Vh(N+1)−(Vh(N)). How should I make this clearer in my article?