vault backup: 2025-08-06 16:28:38

2025-08-06 16:28:38 -04:00
parent 89b1ac60b9
commit 8cc5b7ca70
4 changed files with 150 additions and 23 deletions
@@ -31,6 +31,8 @@ except for risk-neutral parties.
 People and organizations are risk-averse
 %% TODO: ... %%

+For risk-neutral parties, expected value = CME
+
 ### Expert Opinion Must Be ~~Adjusted~~

 Expert opinion is valuable despite its flaws.
@@ -46,9 +48,133 @@ Experts tend to be good at creating heuristics,
 but do not apply them consistently in practice.

 > [!example]
-> Chapter 7 describes a study where experts
+> Chapter 7 describes a study where individual experts
 > were shown to estimate risk differently for identical cases.

+Chapter 13 introduces the [Brier Score](https://en.wikipedia.org/wiki/Brier_score)
+as a method of evaluating the performance of an estimator,
+evaluated as the mean squared error of their forecasts.
+
+#### The Difficulty of Calibration
+
+##### Boolean Examples
+
+> The melting point of tin is higher than the melting point of aluminum.
+
+> In English, the word “quality” is more frequently used that the word “speed”.
+
+reductive (used more frequently where?)
+
+> Any male pig is referred to as a hog.
+
+reductive (referred to by whom?)
+
+> California’s giant sequoia trees are named for an early 19th century leader of the Cherokee Indians.
+
+reductive
+
+> The Model T was the first car produced by Henry Ford.
+
+reductive (Henry Ford didn't produce cars)
+
+> When rolling 2 dice, a roll of 7 is more likely than a 3.
+
+facile
+
+> No one has ever been reported to have been hit by any object that fell from space.
+
+reductive (reported by whom?)
+
+> Sir Christopher Wren was a British anthropologist.
+
+> Pakistan does not border Russia.
+
+unnecessary negative form, otherwise good.
+
+> The Navy won the first Army-Navy football game.
+
+should specify the official event name, otherwise good.
+
+> The paperback version of the book “The Da Vinci Code”, as of July 2007, still ranks in the top 500 bestselling books on Amazon.
+
+obtuse phrasing, dated topic, otherwise good
+
+> Italian has more words than any other language.
+
+reductive (what is a word? what dialect?)
+
+> The month of August is named after a Greek god.
+
+borderline facile, reductive
+
+> The deepest ocean trench is deeper than the Grand Canyon.
+
+facile
+
+> Abraham Lincoln was the first president born in a log cabin.
+
+deceptive phrasing
+
+> As of July of 2007, more people search Google for  “Harry Potter” than “Hillary Clinton” (according to GoogleTrends).
+
+obtuse phrasing, dated topic, otherwise good
+
+> The population of Alabama is higher than the population of Arizona.
+
+borderline facile, deceptive phrasing
+
+> No category 5 hurricane hit the US in 2004.
+
+> The UK is among the top 10 largest economies in the world (by GDP).
+
+> The movie Forest Gump has grossed more to date than E.T. The Extra Terrestrial.
+
+obtuse phrasing, dated topic, otherwise good
+
+##### Interval Examples
+
+> What percentage of bronze is typically made of copper?
+
+reductive
+
+> How many countries have at least one McDonald’s?
+
+As of when?
+
+> How many employees did eBay have in the first quarter of 2006
+> What was the population of Miami (within the city limits, not the entire metropolitan area) in 1990?
+> How many casualties did the French suffer in the Battle of Waterloo?
+> What is the range in miles of a Minuteman Missile?
+> What is the percentage of IT jobs in the US were unfilled in 1997?
+> The Supremes’ (with Diana Ross) song “Stop! In the Name of Love” was how long? (minutes, seconds)
+> How many undergraduates attended Cambridge in 1990?
+> If you could jump 50 feet straight up into the air, how many seconds would you be airborne before you landed?
+> How many gallons are in a bushel (they are both measures of volume)?
+> How many sovereign rulers has England had in the last thousand years?
+> If the air temperature was 5 degrees below zero (Fahrenheit) and the wind speed was 15 mph, what would the temperature adjusted for wind-chill be?
+> Average cost of testing in software development is what percentage of total project costs?
+> On average, if a software development project was projected to take 17 months, it actually takes how many months?
+> How many meters tall is the Sears Tower?
+> How many gold medals did Jesse Owens win at the 1936 Berlin Olympics?
+> In 2005, the average combined MPG for all US cars and light trucks on the road was how much?
+> The average house in the United States uses how many gallons of water per day?
+> What was the average price in the United States of a house sold in 2001?
+
+##### Writing Good Calibration Questions
+
+A good calibration question should not feel like it could be a "trick" question.
+
+Definitions/terminology are *always* contentious,
+questions based on them always feel deceptive.
+
+Interval "questions" should describe the quantity
+rather than phrase it as a question.
+
+##### Strategy for Answering Calibration Questions
+
+Confidence should never be less than probability of picking randomly
+(50% for true)
+
 ### Luck Looks Like Skill

 > Chapter 7 p.154
@@ -66,7 +192,9 @@ as the "Red Baron effect".
 ### There's Always Enough Data

 Hubbard challenges the popular rebuttal
-that X industry lacks the data to use quantitative models.
+that any industry is so niche that
+data sufficient for quantitative models
+does not exist.

 > [!quote] Fallacy of Close Analogy - p.236
 > ...the belief that unless two things are identical in every way,
@@ -83,6 +211,8 @@ that X industry lacks the data to use quantitative models.
 * Value at Risk (VaR)
 * Loss-Exceedance Curve (LEC)
 * Risk Tolerance
+* Certain Monetary Equivalent (CME)
+    * also called Certainty Equivalent

 ## Critiques

@@ -92,21 +222,23 @@ Hubbard uses _exsupero ursus_ to describe the tendency of his detractors
 to dismiss quantitative methods as inappropriate for their industry-specific risks.

 > [!quote] Chapter 9 p.195
-> Suppose a car buyer had a choice between buying two nearly identical automobiles, Car A and Car B.
+> Suppose a car buyer had a choice between buying two nearly identical automobiles,
+> Car A and Car B.
 > The only difference is that Car B is $1,000 more expensive,
 > has fifty thousand more miles, and was once driven into a lake.
 > But buyer chooses Car B because Car A doesn't fly.
 > Neither does Car B, of course,
 > but for some reason the buyer believes that Car A should fly
 > and therefore chooses Car B.
-> The buyer is committing the exsupero ursus fallacy.
+> The buyer is committing the _exsupero ursus_ fallacy.

 Based on this strawman it is clear Hubbard believes his detractors are _correct_
 that qualitative methods can not capture the entire nuance of risk probability,
 but that they are failing to acknowledge that their preferred alternatives
 are not demonstrably more effective at doing so.

-The nuance Hubbard dismisses without addressing is the possibility of model _improvement_.
+The nuance Hubbard dismisses without addressing
+is the possibility of model _improvement_.
 A most competent detractor would be aware of the apparent contradiction
 but argue that their methods will eventually surpass quantitative methods
 if they are further developed.
@@ -116,5 +248,6 @@ To them, Hubbard's methods represent an attractive short-term gain
 that would exclude a long-term payoff.

 Hubbard's dismissal rubs me wrong
-because it reads exactly as he describes the "at least we're doing _something_" argument
+because it reads exactly as he describes the
+"at least we're doing _something_" argument
 throughout the book and just pages earlier.