During a faculty meeting, a group of 9th grade teachers decided they needed to further understand what the optimal duration of study is for students to achieve satisfactory results. So, they decided to gather the approximate number of hours students were studying, and then compare to the student’s test scores.
Mr. Simpson convinced the faculty that more data means better results, and so all of the teachers integrated their cross-course data for the analysis.
The results were astounding. To everyone’s confusion, the less a student studied, the higher they tend to score on tests.
In fact, the coeff
During a faculty meeting, a group of 9th grade teachers decided they needed to further understand what the optimal duration of study is for students to achieve satisfactory results. So, they decided to gather the approximate number of hours students were studying, and then compare to the student’s test scores.
Mr. Simpson convinced the faculty that more data means better results, and so all of the teachers integrated their cross-course data for the analysis.
The results were astounding. To everyone’s confusion, the less a student studied, the higher they tend to score on tests.
In fact, the coefficient associated with this correlation was -0.7981, a strongly negative relationship.
Should they be encouraging their students to study less? How in the world could data be backing up such a claim? Surely something was missing.
After discussing the results, the teachers agreed they should consult the school’s statistician, Mrs. Paradox. After Mr. Simpson explained to Mrs. Paradox what they had found in their results, Mrs. Paradox suggested they analyze each course’s data individually.
So, they went ahead and analyzed Phys. Ed. and proceeded to have their minds blown.
A correlation of 0.6353! How in the statistical universe was this even possible?
Mrs. Paradox then explained this as Simpson’s Paradox, a statistical phenomenon where a seemingly strong relationship reverses or disappears when introduced to a third confounding variable.
She convinced Mr. Simpson to plot all of the data once again, but then color-code each course separately to distinguish them from one another.
After doing so, Mr. Simpson and the 9th grade faculty concluded that the relationship was indeed positive, and that the more hours a student studied, the higher the grade tends to be.
Including the course of study in the analysis completely reversed the relationship.
R Code for this example:
- # Load the tidyverse
- library(tidyverse)
- # Generating correlated data with mvrnorm() from the MASS library
- library(MASS)
- # Sample Means
- mu <- c(20,4)
- # Define our covariance matrix, and specify the covariance relationship (i.e. 0.7 in this case)
- Sigma <- matrix(.7, nrow=2, ncol=2) + diag(2)*.3
- # create both variables with 100 samples
- vars <- mvrnorm(n=100, mu=mu, Sigma=Sigma)
- # Examine the data and the correlation
- head(vars)
- cor(vars)
- # Plot the variables
- plot(vars[,1],vars[,2])
- # Create a function for generating 2 correlated variables given variable means
- corVars<-function(m1,m2,confVar){
- mu <- c(m1,m2)
- Sigma <- matrix(.7, nrow=2, ncol=2) + diag(2)*.5
- vars <- mvrnorm(n=100, mu=mu, Sigma=Sigma)
- Var1<-vars[,1]
- Var2<-vars[,2]
- df<-as.data.frame(cbind(Var1 = Var1,Var2 = Var2,Var3 = confVar))
- df$Var1<-as.numeric(as.character(df$Var1))
- df$Var2<-as.numeric(as.character(df$Var2))
- df
- }
- # Re-running for multiple sets and combining into a single dataframe df
- d1 <- corVars(m1 = 20, m2 = 82, confVar = "Algebra")
- d2 <- corVars(m1 = 18, m2 = 84, confVar = "English")
- d3 <- corVars(m1 = 16, m2 = 86, confVar = "Social Studies")
- d4 <- corVars(m1 = 14, m2 = 88, confVar = "Art")
- d5 <- corVars(m1 = 12, m2 = 90, confVar = "Physical Education")
- # Create the aggregate data
- df<-rbind(d1,d2,d3,d4,d5)
- # Grade & Study Time Plot
- df %>%
- ggplot(aes(x = Var1, y = Var2/100)) +
- geom_jitter(aes(size = 13), alpha = 0.55, shape = 21, fill = "darkgray", color = "black") +
- scale_y_continuous(name = "Final Percentage", labels = percent)+
- scale_x_continuous(name = "Approximate Hours for Preparation")+
- guides(size = FALSE) +
- ggtitle("Impact of Studying on Final Grades")+
- theme(plot.title = element_text(hjust = 0.5))+
- theme_bw()
- # Grade & Study Time Correlation
- cor(df$Var1, df$Var2)
- # PhysEd Plot
- df %>%
- filter(Var3 == 'Physical Education') %>%
- ggplot(aes(x = Var1, y = Var2/100)) +
- geom_jitter(aes(size = 13), alpha = 0.55, shape = 21, fill = "darkgray", color = "black") +
- scale_y_continuous(name = "Final Percentage", labels = percent)+
- scale_x_continuous(name = "Approximate Hours for Preparation")+
- guides(size = FALSE) +
- ggtitle("Impact of Studying on Final Grades (Physical Education Only)")+
- theme(plot.title = element_text(hjust = 0.5))+
- theme_bw()
- # PhysEd Correlation
- cor(df$Var1[df$Var3 == 'Physical Education'], df$Var2[df$Var3 == 'Physical Education'])
- # Confounding plot
- df %>%
- ggplot(aes(x = Var1, y = Var2/100)) +
- geom_jitter(aes(size = 1, fill = Var3), alpha = 0.25, shape = 21) +
- guides(fill = guide_legend(title = "Course Class", override.aes = list(size = 5)),
- size = FALSE) +
- scale_y_continuous(name = "Testing Results", labels = percent)+
- scale_x_continuous(name = "Approximate Hours for Preparation")+
- ggtitle("Impact of Studying on Final Grades")+
- theme(plot.title = element_text(hjust = 0.5))+
- theme_bw()
Simpson’s Paradox says that a broad trend, when broken down to individual components, may be manifest to be the reverse of the apparent.
Bob and Sue are weight-training coaches. Who’s a more successful coach?
Bob’s lifters bench an average of 191 lbs.
Sue’s lifters bench an average 159.1 lbs.
Is Bob more successful? No. Bob coaches 10 guys who bench 200 lbs each and he coaches one girl who benches 100 lbs.
To calculate the average, we take (10*200 lbs + 1* 100 lbs)/11 people.
Sue coaches 10 girls who each bench 150 lbs and one guy who benches 250.
To calculate the average, we take (10*150 lbs + 1* 25
Simpson’s Paradox says that a broad trend, when broken down to individual components, may be manifest to be the reverse of the apparent.
Bob and Sue are weight-training coaches. Who’s a more successful coach?
Bob’s lifters bench an average of 191 lbs.
Sue’s lifters bench an average 159.1 lbs.
Is Bob more successful? No. Bob coaches 10 guys who bench 200 lbs each and he coaches one girl who benches 100 lbs.
To calculate the average, we take (10*200 lbs + 1* 100 lbs)/11 people.
Sue coaches 10 girls who each bench 150 lbs and one guy who benches 250.
To calculate the average, we take (10*150 lbs + 1* 250 lbs)/11 people.
So if just average everything, it looks like Bob is a more successful coach. If you look at the individual breakdowns, it’s clear that Sue is the better coach. Or at least it’s clear that her women can outlift Bob’s woman, and her one man can outlift Bob’s men.
The broader lesson is that when you start averaging things across dissimilar groups, you run into errors.
It’s not a paradox. It’s a common situation in data analysis that researchers should be alert for. It’s common for a trend in a combined data set to have the opposite sign when the data are broken into subpopulations, or results are adjusted for another variable.
For example, I was working on a sex discrimination case against United Airlines in the 1970s. At the time, all the pilots were men, and all the cabin attendant's were women. The pilots generally went to the military after high school to get their training. The stewardesses usually had some college or were graduates. The pilots made fou
It’s not a paradox. It’s a common situation in data analysis that researchers should be alert for. It’s common for a trend in a combined data set to have the opposite sign when the data are broken into subpopulations, or results are adjusted for another variable.
For example, I was working on a sex discrimination case against United Airlines in the 1970s. At the time, all the pilots were men, and all the cabin attendant's were women. The pilots generally went to the military after high school to get their training. The stewardesses usually had some college or were graduates. The pilots made four times the money that the stewardesses did.
If you graphed income versus education, you saw a downward slope—high-school-graduate pilots made a lot, college-educated stewardesses made little.
But if you looked at pilots separately, the ones with more education made more money. The same was true of stewardesses.
There’s no paradox, no contradiction. The issue is that putting the two populations together without adjusting for sex (or for job type, which meant the same thing at the time) produced a misleading, non-causal trend.

Simpson's paradox is a phenomenon in statistics where a trend that appears in several different groups of data disappears or reverses when these groups are combined. This paradox highlights how aggregated data can lead to misleading conclusions, as the relationship between variables may differ when examined at a more granular level.
Example:
Imagine two treatments for a disease, A and B. In separate groups of patients (e.g., men and women), treatment A appears to be more effective than treatment B. However, when the data from both groups is combined, treatment B shows better overall effectivenes
Simpson's paradox is a phenomenon in statistics where a trend that appears in several different groups of data disappears or reverses when these groups are combined. This paradox highlights how aggregated data can lead to misleading conclusions, as the relationship between variables may differ when examined at a more granular level.
Example:
Imagine two treatments for a disease, A and B. In separate groups of patients (e.g., men and women), treatment A appears to be more effective than treatment B. However, when the data from both groups is combined, treatment B shows better overall effectiveness. This can happen if one group has a significantly larger number of patients or if the groups differ in ways that affect the outcomes, such as age or severity of illness.
Key Points:
- Aggregation Issues: Aggregating data can obscure important differences between groups.
- Confounding Variables: A confounding variable can influence the results, leading to different conclusions at different levels of analysis.
- Importance of Context: It emphasizes the need for careful analysis and consideration of how data is grouped in statistical studies.
Simpson's paradox serves as a cautionary tale in data analysis, reminding researchers and analysts to carefully consider how they present and interpret their findings.
There have been some good examples given, here’s another real-world one.
There’ve been some years when, if we look at the SAT scores of all students who took the test, the mean score fell (compared to the previous year).
But when we look at students by ethnicity, the mean score of white students rose. And the mean score of black students rose. And Asians, and everybody else (including “unknown/declined to state their ethnicity”).
How can every single group have had higher scores, but the overall total population had lower scores?
Answer: Simpson’s Paradox. The mix of students changed, with more st
There have been some good examples given, here’s another real-world one.
There’ve been some years when, if we look at the SAT scores of all students who took the test, the mean score fell (compared to the previous year).
But when we look at students by ethnicity, the mean score of white students rose. And the mean score of black students rose. And Asians, and everybody else (including “unknown/declined to state their ethnicity”).
How can every single group have had higher scores, but the overall total population had lower scores?
Answer: Simpson’s Paradox. The mix of students changed, with more students from lower-scoring groups taking the test. Each group had higher scores — but there were fewer students from the high-scoring groups and more from the low-scoring ones.
Simpson’s Paradox tells us that what’s true for an overall pattern might be reversed when we look at sub-groups. Or another way to put it, if you account for ethnicity (or add it as an explanatory variable to your model), the pattern can change.
Where do I start?
I’m a huge financial nerd, and have spent an embarrassing amount of time talking to people about their money habits.
Here are the biggest mistakes people are making and how to fix them:
Not having a separate high interest savings account
Having a separate account allows you to see the results of all your hard work and keep your money separate so you're less tempted to spend it.
Plus with rates above 5.00%, the interest you can earn compared to most banks really adds up.
Here is a list of the top savings accounts available today. Deposit $5 before moving on because this is one of th
Where do I start?
I’m a huge financial nerd, and have spent an embarrassing amount of time talking to people about their money habits.
Here are the biggest mistakes people are making and how to fix them:
Not having a separate high interest savings account
Having a separate account allows you to see the results of all your hard work and keep your money separate so you're less tempted to spend it.
Plus with rates above 5.00%, the interest you can earn compared to most banks really adds up.
Here is a list of the top savings accounts available today. Deposit $5 before moving on because this is one of the biggest mistakes and easiest ones to fix.
Overpaying on car insurance
You’ve heard it a million times before, but the average American family still overspends by $417/year on car insurance.
If you’ve been with the same insurer for years, chances are you are one of them.
Pull up Coverage.com, a free site that will compare prices for you, answer the questions on the page, and it will show you how much you could be saving.
That’s it. You’ll likely be saving a bunch of money. Here’s a link to give it a try.
Consistently being in debt
If you’ve got $10K+ in debt (credit cards…medical bills…anything really) you could use a debt relief program and potentially reduce by over 20%.
Here’s how to see if you qualify:
Head over to this Debt Relief comparison website here, then simply answer the questions to see if you qualify.
It’s as simple as that. You’ll likely end up paying less than you owed before and you could be debt free in as little as 2 years.
Missing out on free money to invest
It’s no secret that millionaires love investing, but for the rest of us, it can seem out of reach.
Times have changed. There are a number of investing platforms that will give you a bonus to open an account and get started. All you have to do is open the account and invest at least $25, and you could get up to $1000 in bonus.
Pretty sweet deal right? Here is a link to some of the best options.
Having bad credit
A low credit score can come back to bite you in so many ways in the future.
From that next rental application to getting approved for any type of loan or credit card, if you have a bad history with credit, the good news is you can fix it.
Head over to BankRate.com and answer a few questions to see if you qualify. It only takes a few minutes and could save you from a major upset down the line.
How to get started
Hope this helps! Here are the links to get started:
Have a separate savings account
Stop overpaying for car insurance
Finally get out of debt
Start investing with a free bonus
Fix your credit
Simpson’s paradox is a statistical phenomenon where the same trend occurs in different groups of data points but when all the groups are combined, the trend gets reversed.
For example,
Let’s say that we need to compare the performance of 2 international cricket batsmen across a period of time.
Consider that the following table shows the batting average of 2 batsmen – Sachin Tendulkar and James from 2
Simpson’s paradox is a statistical phenomenon where the same trend occurs in different groups of data points but when all the groups are combined, the trend gets reversed.
For example,
Let’s say that we need to compare the performance of 2 international cricket batsmen across a period of time.
Consider that the following table shows the batting average of 2 batsmen – Sachin Tendulkar and James from 2010 – 2015.
Clearly, James has a higher batting average than Sachin in all the years. Does it mean James is a better batsman than Sachin?
Hence, Sachin < James
Let’s look at the following table also-
Sachin has scored 26708 runs in 438 matches from 2010 -2015 that makes a total batting average of around 60.97
James has scored 28387 runs in 471 matches from 2010 -2015 that makes a total batting average of around 60.26
So, Sachin has a higher batting average than James in total.
Hence, Sachin > James
Now, we cannot consider James as a better batsman than Sachin. This contradicts the inference from the first table. This is nothing but Simpson's paradox.
According to standard definition- “Simpson's paradox is a phenomenon in statistics, in which a trend appears in several different groups of data but disappears or reverses when these groups are combined.”
Here, when we examined the batting average of Sachin and James year by year, James appeared to be a better batsman but when we aggregated their overall performance together then Sachin appeared as a better batsman than James.
Usually, this scenario happens in many use cases due to the presence of a hidden bias in the data. In the above-mentioned example, there is a drastic difference in the number of matches played by Sachin and James in the years 2010 and 2013 which made a way to contradictory inferences.
Similarly, in any problem, there is a possibility of an unnoticed bias that can lead to Simpson's paradox.
Arithmetically, when (a1/A1) < (a2/A2) and (b1/B1) < (b2/B2) we tend to think (a1+a2)/(A1+A2) < (b1+b2)/(B1+B2)
But Simpson's paradox proves that this may not be true in all the cases and that’s what happened in the above example.
Let us understand the below scenario for better intuition-
This image shows the relationship between the happiness and age of a person according to a survey.
The below diagram clearly shows that the happiness of a person increases when he/she gets older ie age and happiness are positively correlated.
But if we split the age into multiple groups then we can see that -
This may be because of the reasons that-
* Happiness decreases from the age of 10 to age 20 (This may be because children become very happy when they join the high schools and gets new friends and teach...
This is one of the most interesting paradox. Almost every person must have encountered it once in his life. It adversely affects our decision making as you will see, so beware of it!
Let me explain it with a simple example:
Suppose you and I are captains of 2 football teams A and B.
Each of the team played 10 matches in 2 months.
In the first month,
Team A won 0 match out of the 3 it played. So your win % is 00.00%
Team B won 1 match out of the 7 it played. My win % is 14.20%
In the second month,
Team A won 5 matches out of 7 it played. So your win % is 71.40%
Team B won 3 match out of the 3 it p
This is one of the most interesting paradox. Almost every person must have encountered it once in his life. It adversely affects our decision making as you will see, so beware of it!
Let me explain it with a simple example:
Suppose you and I are captains of 2 football teams A and B.
Each of the team played 10 matches in 2 months.
In the first month,
Team A won 0 match out of the 3 it played. So your win % is 00.00%
Team B won 1 match out of the 7 it played. My win % is 14.20%
In the second month,
Team A won 5 matches out of 7 it played. So your win % is 71.40%
Team B won 3 match out of the 3 it played. My win % is 100.00%
So when the statisticians put out the numbers, here is what the report card looked like:
Team Captain Month 1(win %) Month 2(win %)
A You 00.00 71.40
B Me 14.20 100.00
And experts unanimously declare me as a much better captain than you(oh yeah!) and Team B much better than Team A.
But are they correct?
NO
Here is what the the report card of 2 months looks like
Team Total matches played Total matches won Win(%)
A 10 5 50
B 10 4 40
You won more matches than I did and both the teams played equal number of matches. Surprising isn't it!? This paradox plays on our mind since the result is so counter-intuitive.
One of the best-known real-life examples of Simpson's paradox occurred when the University of California, Berkeley was sued for bias against women who had applied for admission to graduate schools there. (Page on berkeley.edu)
The admission figures for the fall of 1973 showed that men applying were more likely than women to be admitted, and the difference was so large that it was unlikely to be due to chance.But when examining the individual departments, it appeared that no department was significantly biased against women. In fact, most departments had a "small but statistically significant bias in favor of women."!!
So beware the next time you judge someone on the basis of statistics!
Here’s the thing: I wish I had known these money secrets sooner. They’ve helped so many people save hundreds, secure their family’s future, and grow their bank accounts—myself included.
And honestly? Putting them to use was way easier than I expected. I bet you can knock out at least three or four of these right now—yes, even from your phone.
Don’t wait like I did. Go ahead and start using these money secrets today!
1. Cancel Your Car Insurance
You might not even realize it, but your car insurance company is probably overcharging you. In fact, they’re kind of counting on you not noticing. Luckily,
Here’s the thing: I wish I had known these money secrets sooner. They’ve helped so many people save hundreds, secure their family’s future, and grow their bank accounts—myself included.
And honestly? Putting them to use was way easier than I expected. I bet you can knock out at least three or four of these right now—yes, even from your phone.
Don’t wait like I did. Go ahead and start using these money secrets today!
1. Cancel Your Car Insurance
You might not even realize it, but your car insurance company is probably overcharging you. In fact, they’re kind of counting on you not noticing. Luckily, this problem is easy to fix.
Don’t waste your time browsing insurance sites for a better deal. A company called Insurify shows you all your options at once — people who do this save up to $996 per year.
If you tell them a bit about yourself and your vehicle, they’ll send you personalized quotes so you can compare them and find the best one for you.
Tired of overpaying for car insurance? It takes just five minutes to compare your options with Insurify and see how much you could save on car insurance.
2. Ask This Company to Get a Big Chunk of Your Debt Forgiven
A company called National Debt Relief could convince your lenders to simply get rid of a big chunk of what you owe. No bankruptcy, no loans — you don’t even need to have good credit.
If you owe at least $10,000 in unsecured debt (credit card debt, personal loans, medical bills, etc.), National Debt Relief’s experts will build you a monthly payment plan. As your payments add up, they negotiate with your creditors to reduce the amount you owe. You then pay off the rest in a lump sum.
On average, you could become debt-free within 24 to 48 months. It takes less than a minute to sign up and see how much debt you could get rid of.
3. You Can Become a Real Estate Investor for as Little as $10
Take a look at some of the world’s wealthiest people. What do they have in common? Many invest in large private real estate deals. And here’s the thing: There’s no reason you can’t, too — for as little as $10.
An investment called the Fundrise Flagship Fund lets you get started in the world of real estate by giving you access to a low-cost, diversified portfolio of private real estate. The best part? You don’t have to be the landlord. The Flagship Fund does all the heavy lifting.
With an initial investment as low as $10, your money will be invested in the Fund, which already owns more than $1 billion worth of real estate around the country, from apartment complexes to the thriving housing rental market to larger last-mile e-commerce logistics centers.
Want to invest more? Many investors choose to invest $1,000 or more. This is a Fund that can fit any type of investor’s needs. Once invested, you can track your performance from your phone and watch as properties are acquired, improved, and operated. As properties generate cash flow, you could earn money through quarterly dividend payments. And over time, you could earn money off the potential appreciation of the properties.
So if you want to get started in the world of real-estate investing, it takes just a few minutes to sign up and create an account with the Fundrise Flagship Fund.
This is a paid advertisement. Carefully consider the investment objectives, risks, charges and expenses of the Fundrise Real Estate Fund before investing. This and other information can be found in the Fund’s prospectus. Read them carefully before investing.
4. Earn Up to $50 this Month By Answering Survey Questions About the News — It’s Anonymous
The news is a heated subject these days. It’s hard not to have an opinion on it.
Good news: A website called YouGov will pay you up to $50 or more this month just to answer survey questions about politics, the economy, and other hot news topics.
Plus, it’s totally anonymous, so no one will judge you for that hot take.
When you take a quick survey (some are less than three minutes), you’ll earn points you can exchange for up to $50 in cash or gift cards to places like Walmart and Amazon. Plus, Penny Hoarder readers will get an extra 500 points for registering and another 1,000 points after completing their first survey.
It takes just a few minutes to sign up and take your first survey, and you’ll receive your points immediately.
5. Get Up to $300 Just for Setting Up Direct Deposit With This Account
If you bank at a traditional brick-and-mortar bank, your money probably isn’t growing much (c’mon, 0.40% is basically nothing).
But there’s good news: With SoFi Checking and Savings (member FDIC), you stand to gain up to a hefty 3.80% APY on savings when you set up a direct deposit or have $5,000 or more in Qualifying Deposits and 0.50% APY on checking balances — savings APY is 10 times more than the national average.
Right now, a direct deposit of at least $1K not only sets you up for higher returns but also brings you closer to earning up to a $300 welcome bonus (terms apply).
You can easily deposit checks via your phone’s camera, transfer funds, and get customer service via chat or phone call. There are no account fees, no monthly fees and no overdraft fees. And your money is FDIC insured (up to $3M of additional FDIC insurance through the SoFi Insured Deposit Program).
It’s quick and easy to open an account with SoFi Checking and Savings (member FDIC) and watch your money grow faster than ever.
Read Disclaimer
5. Stop Paying Your Credit Card Company
If you have credit card debt, you know. The anxiety, the interest rates, the fear you’re never going to escape… but a website called AmONE wants to help.
If you owe your credit card companies $100,000 or less, AmONE will match you with a low-interest loan you can use to pay off every single one of your balances.
The benefit? You’ll be left with one bill to pay each month. And because personal loans have lower interest rates (AmONE rates start at 6.40% APR), you’ll get out of debt that much faster.
It takes less than a minute and just 10 questions to see what loans you qualify for.
6. Lock In Affordable Term Life Insurance in Minutes.
Let’s be honest—life insurance probably isn’t on your list of fun things to research. But locking in a policy now could mean huge peace of mind for your family down the road. And getting covered is actually a lot easier than you might think.
With Best Money’s term life insurance marketplace, you can compare top-rated policies in minutes and find coverage that works for you. No long phone calls. No confusing paperwork. Just straightforward quotes, starting at just $7 a month, from trusted providers so you can make an informed decision.
The best part? You’re in control. Answer a few quick questions, see your options, get coverage up to $3 million, and choose the coverage that fits your life and budget—on your terms.
You already protect your car, your home, even your phone. Why not make sure your family’s financial future is covered, too? Compare term life insurance rates with Best Money today and find a policy that fits.
A real life example.
In Spain, global mortality rate by cardiovascular diseases is higher in women than in men.
But each and every age-specific mortality rate by these diseases are higher in men than in women. So, a woman, at a certain age, has less risk of dying because of these deseases than a same age man, other things equal.
But , as there are many more women from 75–80 years on, where the majority of these deaths occur, the global mortality rate by sex indicates just the opposite.
Simpson’s paradox occurs when data subsets show trends that disappear or reverse when viewing the full data set.
Here is a simple example, borrowed from Wikipedia:
Justice had a higher batting average than Jeter in both 1995 and 1996. But when you combine the data from both years, Jeter had the higher average. How is this possible?
The key is the size of the data sets. Both of them had a better year in 1996 than 1995, but Jeter had a lot of at-bats in his good year and few in his bad year. Similarly, Justice had more at-bats his bad year, and relatively few his good year. Jeter’s good year gets w
Simpson’s paradox occurs when data subsets show trends that disappear or reverse when viewing the full data set.
Here is a simple example, borrowed from Wikipedia:
Justice had a higher batting average than Jeter in both 1995 and 1996. But when you combine the data from both years, Jeter had the higher average. How is this possible?
The key is the size of the data sets. Both of them had a better year in 1996 than 1995, but Jeter had a lot of at-bats in his good year and few in his bad year. Similarly, Justice had more at-bats his bad year, and relatively few his good year. Jeter’s good year gets weighed more heavily when the data is combined, as does Justices bad year, and Jeter winds up with the higher average.
This is just one way Simpson’s paradox can occur, it doesn’t have to be about sample sizes. It will usually some overlooked variable that isn’t being accounted for, either in the divided sets or the full data set.
AI effectiveness depends on relevant, responsible and robust data to prevent costly errors, inefficiencies, and compliance issues. A solid data foundation allows AI models to deliver precise insights and ensures systems comply with regulations and protect brand reputation.
Gartner® finds that "At least 30% of generative AI (GenAI) projects will be abandoned after proof of concept by the end of 2025, due to poor data quality, inadequate risk controls, escalating costs, or unclear business value." High-quality, AI-ready data is the fuel for AI-driven advancements now and in the future.
AI effectiveness depends on relevant, responsible and robust data to prevent costly errors, inefficiencies, and compliance issues. A solid data foundation allows AI models to deliver precise insights and ensures systems comply with regulations and protect brand reputation.
Gartner® finds that "At least 30% of generative AI (GenAI) projects will be abandoned after proof of concept by the end of 2025, due to poor data quality, inadequate risk controls, escalating costs, or unclear business value." High-quality, AI-ready data is the fuel for AI-driven advancements now and in the future.
As a cartographer and spatial analyst, this speaks to the problems of generalization. The characteristics of a group as a whole may be significantly different from the characteristics of included groups.
For example, a green area on a map shows a “forest,” but it omits significant information about the structure and composition of that forest area. One area might be mostly Douglas Fir and Hemlock, for example, while another area is Big Leaf Maple and Alder. For a more glaring example, the arid and relatively sparse forests of Nevada are shown with the same symbol as the dense forests of coastal
As a cartographer and spatial analyst, this speaks to the problems of generalization. The characteristics of a group as a whole may be significantly different from the characteristics of included groups.
For example, a green area on a map shows a “forest,” but it omits significant information about the structure and composition of that forest area. One area might be mostly Douglas Fir and Hemlock, for example, while another area is Big Leaf Maple and Alder. For a more glaring example, the arid and relatively sparse forests of Nevada are shown with the same symbol as the dense forests of coastal Oregon.
And I’ve just barely started. There are some other good answers here, too. So, remember well the phrase mistakenly attributed to Mark Twain and others: lies, damned lies, and statistics.
OK, I see some qualified folks have already given quality answers. So I’ll throw a wrench into the works and give a different “Simpson’s Paradox.” This is “Bart Simpson’s Paradox.”
You’re damned if you do and damned if you don’t.
The Low Birth Weight paradox
Today everybody knows that smoking causes lung cancer. But it wasn’t so obvious in the 1950s and took nearly a decade to resolve this debate. Even after the smoking and cancer debate died down, a major puzzling paradox remained. In the mid 1960s, it was pointed out that a mother’s smoking seemed to have benefits for her newborn baby if the baby was born underweight. This so-called birth-weight paradox was a major challenge to the emerging medical consensus about the harmful effects of smoking and took more than 40 years to resolve.
In 1959, Jacob Yerushalmy, a biosta
The Low Birth Weight paradox
Today everybody knows that smoking causes lung cancer. But it wasn’t so obvious in the 1950s and took nearly a decade to resolve this debate. Even after the smoking and cancer debate died down, a major puzzling paradox remained. In the mid 1960s, it was pointed out that a mother’s smoking seemed to have benefits for her newborn baby if the baby was born underweight. This so-called birth-weight paradox was a major challenge to the emerging medical consensus about the harmful effects of smoking and took more than 40 years to resolve.
In 1959, Jacob Yerushalmy, a biostatistician at the University of California at Berkeley and also an opponent of the smoking-causes-cancer hypothesis, launched a long-term public health study that collected data on more than 15,000 children in the San Francisco Bay Area. The data included information about mothers’ smoking habits, the birth weights and mortality rates of their babies in the first month of life.
It was well-known that low birth-weight (LBW) infants, defined as less than 2500g at birth, have a mortality rate more than twenty times higher than that of normal-birth-weight infants. Babies of smoking mothers weigh less at birth than babies of non-smokers. It would seem natural to assume this lower birth weight (LBW) would lead to poorer survival.
Yet what Yerushalmy found was surprising. The LBW babies of smoking mothers had a better survival rate than those of nonsmokers. It was as if the mother’s smoking had a protective effect!
Yerushalmy was not silly enough to make this absurd claim but he stated that there is no causal link between smoking and mortality. But this seems odd and counter-intuitive.
And it is just plain wrong. Modern epidemiologists believe that smoking does increase infant mortality. But how can the data be explained?
To make it more concrete, we can look at a more recent dataset of over three million children born in the US in 1991 presented in this paper Birth Weight “Paradox” Uncovered? which observed the same thing that Yerushalmy did 50 years ago (btw, if you find this paper full of jargons and hard to read, I did too but bear with me I’ll make it clearer). Overall, infants born to smokers had higher risks of both LBW and mortality than infants born to nonsmokers.
But among LBW infants, the mortality rate was lower for infants born to smokers. You can see in this plot below that the switch happens around 2000g.
I can’t find the raw data for this paper so I try to simulate in this table:
While the overall mortality rate of the smoker-mother group (3+3)/(55+445) = 1.2% is greater than that of the non-smoker-mother group (10+11)/(150+2350) = 0.8%, you can see in the LBW group, infants of smoker-mothers have lower mortality rate (5% vs 7%).
Using causal diagrams, the authors finally uncovered the reason behind this paradox. Smoking may be harmful and contributes to LBW, but there are other more serious and harmful causes of LBW such as genetic defects or malnutrition. If the mother is a smoker, this ‘explains away’ the LBW and reduces the likelihood of a serious birth defect. If the mother is not a smoker, it is likely that the cause of the LBW is a serious birth defect which leads to higher mortality. The presence of these unmeasured risk factors (U) can induce a spurious association between smoking mortality when the analysis is split by birth weight.
Only the three arrows marked by the red circles in the diagram above are enough to create this paradox. This bias is introduced when the data is split on a variable (weight) that is affected by the exposure (smoking) and shares common causes (U) with the outcome (mortality).
It was lucky that the apparent explanation was so ridiculous (smoking is beneficial) that the bias was detected. There might be many other cases where this bias is undetected because it doesn’t conflict with theory.
Refs:
Birth Weight “Paradox” Uncovered?
Simpson's paradox is a statistical phenomenon where a trend or association that holds for different groups reverses when the groups are combined.
This occurs due to the differing rates of aggregation, which can lead to confusing conclusions.
One surprising aspect of Simpson's paradox is that it highlights the importance of considering the context in which data is analyzed.
For instance, a study on the effectiveness of a new medication might show a positive correlation between the medication and patient recovery rates when analyzed separately for different age groups, but a negative correlation wh
Simpson's paradox is a statistical phenomenon where a trend or association that holds for different groups reverses when the groups are combined.
This occurs due to the differing rates of aggregation, which can lead to confusing conclusions.
One surprising aspect of Simpson's paradox is that it highlights the importance of considering the context in which data is analyzed.
For instance, a study on the effectiveness of a new medication might show a positive correlation between the medication and patient recovery rates when analyzed separately for different age groups, but a negative correlation when the data is combined.
Another interesting aspect is that Simpson's paradox can arise from the way we aggregate data.
Aggregation can introduce biases that are not present in the original data, and these biases can lead to wrong conclusions.
For example, in a study on the relationship between university tuition fees and graduation rates, the data might show a positive correlation when analyzed separately for different ethnic groups, but a negative correlation when the data is combined.
Simpson's paradox also has real-world implications, such as in healthcare, where it can lead to misdiagnosis or ineffective treatment.
In medical research, the paradox can arise when combining data from different studies, leading to conflicting conclusions.
Furthermore, Simpson's paradox is not unique to statistics; it can occur in any field where data is aggregated, including economics, social sciences, and business.
It highlights the need for a deeper understanding of the underlying mechanisms and the importance of considering the context in which data is analyzed.
In addition, Simpson's paradox can be mitigated by using more advanced statistical techniques, such as regression analysis, which can help account for the underlying structures in the data.
This emphasizes the importance of careful consideration of the statistical methods used in data analysis.
Paradox:
Simpson's paradox occurs when your sample is composed of separate classes with different mean values of a statistical value. In this case, if the class distribution within sample changes between two measures, the trend observed on average might be opposed to the trend observed in each of the two classes.
Example:
Let's say that a teacher has a class with 100 students, 10 among them come from a disadvantaged background and their average in year 1 is 80/100. All the other students are normal and have an average of 90/100. Thus the average of her class was at 89/100. Since disadvantaged st
Paradox:
Simpson's paradox occurs when your sample is composed of separate classes with different mean values of a statistical value. In this case, if the class distribution within sample changes between two measures, the trend observed on average might be opposed to the trend observed in each of the two classes.
Example:
Let's say that a teacher has a class with 100 students, 10 among them come from a disadvantaged background and their average in year 1 is 80/100. All the other students are normal and have an average of 90/100. Thus the average of her class was at 89/100. Since disadvantaged student were feeling themselves well with that particular teacher they advised other disadvantaged students to sign in into the course and in the year 2 the same teacher had 50 students from a disadvantageous background and 50 normal students. Normal student average became 91 and disadvantaged ones - 81. However, the average of the class dropped to 86 and she gets a call from the dean asking her why did she become so bad at teaching.
Conclusion:
Always look for a meaningful split of your data into classes that might have different behavior. If not, you might obtain a correlation opposed to the real one. If your data don't look like a Gaussian, don't try to pretend it is a Gaussian and proceed anyway: instead try to split it into classes that look Gaussian.
The testing paradox (also known as the Prosecutor’s Fallacy):
You have a test that is 90% accurate (correctly identifying 90% of those with a condition, and 90% of those without the condition). You test someone, and the test result is positive. How likely are they to have the condition?
The correct answer is: It depends on how prevalent the condition is in the population. If the condition is only present in 10% of the population, then there’s only a 50% chance that the person actually has the condition.
The first time I encountered this problem was taking the Math GREs. I got the answer, and didn
The testing paradox (also known as the Prosecutor’s Fallacy):
You have a test that is 90% accurate (correctly identifying 90% of those with a condition, and 90% of those without the condition). You test someone, and the test result is positive. How likely are they to have the condition?
The correct answer is: It depends on how prevalent the condition is in the population. If the condition is only present in 10% of the population, then there’s only a 50% chance that the person actually has the condition.
The first time I encountered this problem was taking the Math GREs. I got the answer, and didn’t believe it, so I rechecked my work and got the same answer. I thought about it some more, and had to do something like the following to convince myself the answer was, in fact, correct:
That was in 19mumblemumble. Since then, testing mania has taken over, with drug tests being mandatory for a lot of things, from getting a job flipping burgers to receiving public assistance. Given the high likelihood of a false positive result, and the impossibility of proving you weren’t using drugs at the time of the test, the wisdom of using such tests is highly questionable.
When you compare a population with labeled subpopulations with another population (or "the same" at a different time), it's extremely likely that the two populations will have different proportions of their subpopulations. This is the heart of Simpson's paradox.
It's easiest to understand if you think about the change happening with one population over time.
Let's take a very simple example. You've got girls and boys. The girls have, on average, longer hair than the boys do, and there is an average for the school somewhere in between. Now, a boy shows up with longer than average hair (for a bo
When you compare a population with labeled subpopulations with another population (or "the same" at a different time), it's extremely likely that the two populations will have different proportions of their subpopulations. This is the heart of Simpson's paradox.
It's easiest to understand if you think about the change happening with one population over time.
Let's take a very simple example. You've got girls and boys. The girls have, on average, longer hair than the boys do, and there is an average for the school somewhere in between. Now, a boy shows up with longer than average hair (for a boy) but shorter than the school average. Presto: The girls have the same length hair as before. The boys have, on average, longer hair. Even though no subgroup has shorter hair, the school average has gone down!
This is the way it goes with Simpson's paradox -- the groups have averages that go in one direction while the overall average goes in the other. Sometimes it's because members of the population leave or join, sometimes it's due to shifts in counts within the sub-groups. But it's always because the counts in subgroups differ between the two populations.
Read more here: What are some recent examples of Simpson's Paradox in the media?
An example is False positive paradox : We would like to think that if we devise a Test, we should be able to measure its accuracy in a lab But the accuracy of a Test also depends on the characteristics of the population on which it is being applied. This is surely counter intuitive, but true nevertheless.
Suppose we invent a new Test which gives 95% correct diagnosis, and 5% false positives. We have tested it over large population samples. We apply it to a population of 100 people in which 40% people have the disease. The test works well.
- 40 People will be diagnosed correctly as having the di
An example is False positive paradox : We would like to think that if we devise a Test, we should be able to measure its accuracy in a lab But the accuracy of a Test also depends on the characteristics of the population on which it is being applied. This is surely counter intuitive, but true nevertheless.
Suppose we invent a new Test which gives 95% correct diagnosis, and 5% false positives. We have tested it over large population samples. We apply it to a population of 100 people in which 40% people have the disease. The test works well.
- 40 People will be diagnosed correctly as having the disease.
- 3 People will be diagnosed as having the disease but incorrectly - The False Positive.
- 57 People will be diagnosed correctly as not having the disease.
We can calculate the accuracy of the Test as the probability of correct result. As 40 of the 43 people detected positive are truly positive, we can say the accuracy is about 93%, not far from the initial 95%. So far, the test is good.
The Paradox comes when the same test is applied to another population which has a much lower prior probability of having the disease, lower than the false positive rate. Let's apply the above test to a new population of 100 people in which only 1% of people have the disease.
- 1 person will be diagnosed correctly as having the disease.
- 4.95 People ~ 5 People will be diagnosed incorrectly as having the disease - The False Positive
- The remaining 94 are correctly diagnosed as not having the disease.
If we look at the accuracy of the test now, only 1 person of 6 detected actually has the disease. Something that was 93% accurate is now only 16% accurate. Hence the Paradox.
Wikipedia has a nice segment applying this to counter terrorism measures:
Terrorists are really rare. In a city of twenty million like New York, there might be one or two terrorists, maybe up to ten. 10/20,000,000 = 0.00005 percent, one twenty-thousandth of a percent.
That's pretty rare. Now, say you have software that can sift through all the bank-records, or toll-pass records, or public transit records, or phone-call records in the city and catch terrorists 99 percent of the time.
In a pool of twenty million people, a 99 percent accurate test will identify two hundred thousand people as being terrorists. But only ten of them are terrorists. To catch ten bad guys, you have to investigate two hundred thousand innocent people.
Potato paradox
This is not a logical; but a mathematical paradox. Interesting nonetheless.
The paradox has been described as:
Let’s say you have 100 kg potatoes, which are 99% water by weight.
Now, you leave them outside overnight to dehydrate until they're 98% water.
How much do they weigh now?
The answer is 50 kg.
How so?
When you first had 100 kg of potatoes, water was 99% by weight, it means that there is 99 kg of water, and 1 kg of solids.
It's a 1:99 ratio.
Now, when potatoes dehydrate causing water to decrease to 98%, then the solids will account for 2% of the weight.Right?
So the ratio will becom
Potato paradox
This is not a logical; but a mathematical paradox. Interesting nonetheless.
The paradox has been described as:
Let’s say you have 100 kg potatoes, which are 99% water by weight.
Now, you leave them outside overnight to dehydrate until they're 98% water.
How much do they weigh now?
The answer is 50 kg.
How so?
When you first had 100 kg of potatoes, water was 99% by weight, it means that there is 99 kg of water, and 1 kg of solids.
It's a 1:99 ratio.
Now, when potatoes dehydrate causing water to decrease to 98%, then the solids will account for 2% of the weight.Right?
So the ratio will become 2:98 which will reduce to 1:49.Since the solids still weigh 1 kg, the water must weigh 49 kg.That makes total weight 50 kg.
Mind-boggling, isn’t it?
The ultimate paradox is the plot of the movie 'Predestination (2014)'...
or basically the short story - 'All you Zombies' written by Robert Heinlein.
SPOILERS AHEAD!
Its overall premise is as follows:
----------------------------------------------------------------
In an orphanage lived a small, little girl.
She grows up, turns 18, meets a man, has sex with him and delivers a daughter.
But the father abducts his daughter one night and disappears from the girl's (now a woman) life.
The woman then decides to have a sex change operation.
She turns into a man and discovers time travel.
The man then goes
The ultimate paradox is the plot of the movie 'Predestination (2014)'...
or basically the short story - 'All you Zombies' written by Robert Heinlein.
SPOILERS AHEAD!
Its overall premise is as follows:
----------------------------------------------------------------
In an orphanage lived a small, little girl.
She grows up, turns 18, meets a man, has sex with him and delivers a daughter.
But the father abducts his daughter one night and disappears from the girl's (now a woman) life.
The woman then decides to have a sex change operation.
She turns into a man and discovers time travel.
The man then goes back into the past, meets his younger self (who is still a woman for now) and has sex with her.
The woman delivers a daughter which he abducts and flees.
He then goes further 18 years back into the past and then leaves his baby girl on the doorstep of an orphanage.
No points for guessing who the little baby girl turns into.
--------------------------------------------------------------
This is the ultimate paradox that was formulated in the story.
A perfect and mind-tripping paradox where a human being has no ancestory at all, has no beginning in nature and has just come into existence out of absolutely nowhere!!
And the most boggling part:
If, as and when time travel is discovered in the future, this scenario will ACTUALLY, suddenly become plausible (assuming that the cause and effect relationship is still valid and true after the possibility of time travel is verified).
Sleep on this!
P.S: As an additional food for thought, just mull over the question that what will be his genetic code and who shall he look like?!
P.P.S: Bootstrap paradox ...link for the explanation of all these kinds of existential paradoxes.
An Infinity paradox:
A bouncing ball returns to 1/2 of its initial height; each bounce completes in 1/2 the time of the previous bounce. Eventually it comes to rest. Whenever it hits the ground its color alternates between blue and red.
If it starts out blue, what color is it at rest?
Ant on a rubber band
An ant crawls along a rubber band from one end toward the other at a constant rate of 1 mph (edit: with respect to the rubber band.) The other end is tied to a car that travels away from the ant at a constant rate of 10 mph. (The rubber band has the capacity of being stretched indefinitely withou
An Infinity paradox:
A bouncing ball returns to 1/2 of its initial height; each bounce completes in 1/2 the time of the previous bounce. Eventually it comes to rest. Whenever it hits the ground its color alternates between blue and red.
If it starts out blue, what color is it at rest?
Ant on a rubber band
An ant crawls along a rubber band from one end toward the other at a constant rate of 1 mph (edit: with respect to the rubber band.) The other end is tied to a car that travels away from the ant at a constant rate of 10 mph. (The rubber band has the capacity of being stretched indefinitely without breaking.)
Even though the car’s speed is 10 times that of the ant, the ant eventually reaches the car.
The unexpected hanging
On Saturday a condemned man was sentenced to be hanged. The hanging will take place at noon on one of the seven days of next week, said the judge. You will not know the day of your execution until it is announced to you on the morning of the hanging.
After some thought the prisoner concluded that his execution could not be on the following Saturday, because on Friday afternoon he would be alive with only Saturday remaining as an execution day. So he would know of his execution prior to the announcement on Saturday morning. With Saturday thus ruled out, the same argument can be applied to Friday. Similarly, Thursday, Wednesday, Tuesday, Monday, and finally Sunday can be ruled out. The prisoner reasoned, quite logically, that the hanging therefore could not be carried out at all, without contradicting the judge’s condition of surprise. The prisoner was overjoyed.
On Wednesday, quite to the prisoner’s surprise, the hangman announced his execution.
Non-transitive coin tosses
If a coin is tossed three times, eight equally probable patterns obtain: HHH, HHT, HTH, HTT and the four patterns obtained by exchanging H and T.
A game can be construed where one player picks a pattern and the other player picks a different one. A coin is flipped repeatedly until the pattern picked by one of the players appears as a run, and that player then wins the game. For example if Player 1 chooses THT and Player 2 chooses HHT and the flips are THHHT, Player 2 wins the game.
Even though the patterns have equal likelihood for three flips, Player 2, who chooses after Player 1, can secure a winning probability that is at worst 2/3 (2 to 1 odds) and at best 7/8 (7 to 1 odds).
Edit: For example if Player 1 selects HHH, Player 2 selects THH. Player 1 wins only if HHH are the first three flips (1 chance in 8). If HHH appears anywhere else in the sequence of flips it is preceded by a T and Player 2 wins.
Grand Father Paradox…….
Imagine someone(X) travelling back in time and killing his own Grand Father(Z) before his Marriage.
Firstly as he doesn’t have any kids(Y), X wouldn’t have been born. Thus No one killed Z.
Now Z gets married, has kids, therefore X is in existence, goes back in time ,kills Z before marriage, X is erased from existence , Z is married and so on(X ,no X , X…..).
Creating a PARADOX.
This can be resolved by concluding that when somebody goes back in time , he/her lands in a different reality (so called ALTERNATE TIMELINE or PARALLEL UNIVERSE).
God……! Time Travelling is such a Mess.
1. Card Paradox: Imagine you’re holding a postcard in your hand, on one side of which is written, “The statement on the other side of this card is true.” We’ll call that Statement A. Turn the card over, and the opposite side reads, “The statement on the other side of this card is false” (Statement B). Trying to assign any truth to either Statement A or B, however, leads to a paradox: if A is true then B must be as well, but for B to be true, A has to be false. Oppositely, if A is false then B must be false too, which must ultimately make A true.
2. Pinocchio Paradox: What will happen when pinn
1. Card Paradox: Imagine you’re holding a postcard in your hand, on one side of which is written, “The statement on the other side of this card is true.” We’ll call that Statement A. Turn the card over, and the opposite side reads, “The statement on the other side of this card is false” (Statement B). Trying to assign any truth to either Statement A or B, however, leads to a paradox: if A is true then B must be as well, but for B to be true, A has to be false. Oppositely, if A is false then B must be false too, which must ultimately make A true.
2. Pinocchio Paradox: What will happen when pinnochio says: "My nose will grow"?
If his nose is not growing, he is telling a lie and his nose will grow but then he is telling the truth and it can't happen. If his nose is growing, he is telling the truth, so it can't happen. If his nose will grow, he will be telling the truth, but his nose grows if he lies so it can't happen. If his nose will not grow, he is lying and it will grow but then he would be telling the truth so it can't happen.
3.Crocodile's Paradox: A crocodile snatches a young boy from a riverbank. His mother pleads with the crocodile to return him, to which the crocodile replies that he will only return the boy safely if the mother can guess correctly whether or not he will indeed return the boy. There is no problem if the mother guesses that the crocodile will return him—if she is right, he is returned; if she is wrong, the crocodile keeps him. If she answers that the crocodile will not return him, however, we end up with a paradox: if she is right and the crocodile never intended to return her child, then the crocodile has to return him, but in doing so breaks his word and contradicts the mother’s answer. On the other hand, if she is wrong and the crocodile actually did intend to return the boy, the crocodile must then keep him even though he intended not to, thereby also breaking his word.
4. Barber's Paradox: Suppose there is a town with just one barber; and that every man in the town keeps himself clean-shaven: some by shaving themselves, some by attending the barber. It seems reasonable to imagine that the barber obeys the following rule: He shaves all and only those men in town who do not shave themselves.
Under this scenario, we can ask the following question: Does the barber shave himself?
Asking this, however, we discover that the situation presented is in fact impossible:
– If the barber does not shave himself, he must abide by the rule and shave himself.
– If he does shave himself, according to the rule he will not shave himself
5. A catch-22 is a paradoxical situation from which an individual cannot escape because of contradictory rules. For example:To apply for this job, you would have to be insane; but if you are insane, you are unacceptable for the job
Here are three really nice paradoxes - [The last one is my favorite!]
Achilles & the tortoise paradox:
In the paradox of Achilles and the Tortoise, Achilles is in a footrace with the tortoise. Achilles allows the tortoise a head start of 100 feet. If we suppose that each racer starts running at some constant speed (one very fast and one very slow), then after some finite time, Achilles will have run 100 feet, bringing him to the tortoise’s starting point. During this time, the tortoise has run a much shorter distance, say, 10 feet. It will then take Achilles some further time to run that distanc
Here are three really nice paradoxes - [The last one is my favorite!]
Achilles & the tortoise paradox:
In the paradox of Achilles and the Tortoise, Achilles is in a footrace with the tortoise. Achilles allows the tortoise a head start of 100 feet. If we suppose that each racer starts running at some constant speed (one very fast and one very slow), then after some finite time, Achilles will have run 100 feet, bringing him to the tortoise’s starting point. During this time, the tortoise has run a much shorter distance, say, 10 feet. It will then take Achilles some further time to run that distance, by which time the tortoise will have advanced farther; and then more time still to reach this third point, while the tortoise moves ahead. Thus, whenever Achilles reaches somewhere the tortoise has been, he still has farther to go. Therefore, because there are an infinite number of points Achilles must reach where the tortoise has already been, he can never overtake the tortoise. Of course, simple experience tells us that Achilles will be able to overtake the tortoise, which is why this is a paradox.
The barber’s Paradox:
Suppose there is a town with just one male barber; and that every man in the town keeps himself clean-shaven: some by shaving themselves, some by attending the barber. It seems reasonable to imagine that the barber obeys the following rule: He shaves all and only those men in town who do not shave themselves.
Under this scenario, we can ask the following question: Does the barber shave himself?
Asking this, however, we discover that the situation presented is in fact impossible:
- If the barber does not shave himself, he must abide by the rule and shave himself.
- If he does shave himself, according to the rule he will not shave himself.
The unexpected hanging paradox:
A judge tells a condemned prisoner that he will be hanged at noon on one weekday in the following week, but that the execution will be a surprise to the prisoner. He will not know the day of the hanging until the executioner knocks on his cell door at noon that day. Having reflected on his sentence, the prisoner draws the conclusion that he will escape from the hanging. His reasoning is in several parts. He begins by concluding that the “surprise hanging” can’t be on a Friday, as if he hasn’t been hanged by Thursday, there is only one day left – and so it won’t be a surprise if he’s hanged on a Friday. Since the judge’s sentence stipulated that the hanging would be a surprise to him, he concludes it cannot occur on Friday. He then reasons that the surprise hanging cannot be on Thursday either, because Friday has already been eliminated and if he hasn’t been hanged by Wednesday night, the hanging must occur on Thursday, making a Thursday hanging not a surprise either. By similar reasoning he concludes that the hanging can also not occur on Wednesday, Tuesday or Monday. Joyfully he retires to his cell confident that the hanging will not occur at all. The next week, the executioner knocks on the prisoner’s door at noon on Wednesday — which, despite all the above, will still be an utter surprise to him. Everything the judge said has come true.
You’re damned if you do and damned if you don’t
You’re damned if you do and damned if you don’t
An economist is someone who, upon encountering a stadium of over 100,000 destitute, penniless people who have come to see a talk by Bill Gates, says, "The economy’s doing great! On average, I see a stadium full of millionaires!"
Of course, that doesn’t really illustrate Sympson’s Paradox that well, being just a single data point.
But that same economist recalls being at that same stadium about a decade ago, and there were only 33,000 destitute, penniless people who came to see a talk by Bill Gates, and yet, the economist was able to make the exact same pronouncement, a decade ago!
So now, given t
An economist is someone who, upon encountering a stadium of over 100,000 destitute, penniless people who have come to see a talk by Bill Gates, says, "The economy’s doing great! On average, I see a stadium full of millionaires!"
Of course, that doesn’t really illustrate Sympson’s Paradox that well, being just a single data point.
But that same economist recalls being at that same stadium about a decade ago, and there were only 33,000 destitute, penniless people who came to see a talk by Bill Gates, and yet, the economist was able to make the exact same pronouncement, a decade ago!
So now, given two such data points, the economist can make the following pronouncement: "The economy’s doing great! There are three times as many millionaires than there were a decade ago! The economy is growing at 30% per year!"
Doesn’t that make you feel richer?
I think I can explain it without any math at all but maybe a little arithmetic would help provide an example. Suppose you were drawing from two groups of people, maybe classrooms. Suppose you found out that in class one there was a positive correlation between being a smoker and being male. Then you also found out that in the second class there was a positive correlation between being a smoker and being male.
(Let us assume that all people in these groups fall into two types, smokers and non-smokers. Similarly, all are either male or female.)
If the samples were large enough, you would be ready
I think I can explain it without any math at all but maybe a little arithmetic would help provide an example. Suppose you were drawing from two groups of people, maybe classrooms. Suppose you found out that in class one there was a positive correlation between being a smoker and being male. Then you also found out that in the second class there was a positive correlation between being a smoker and being male.
(Let us assume that all people in these groups fall into two types, smokers and non-smokers. Similarly, all are either male or female.)
If the samples were large enough, you would be ready to conclude that there is a large likelihood that in general classes like these we would find a positive correlation between being male and being a smoker.
Now, instead of doing the statistics on them separately, throw the two classes together and find the correlation. Would you not expect there still to be a positive correlation between being a smoker and being male? Sure, but as it turns out, sometimes it is actually a negative correlation when viewed as one group instead of two. That is the paradox. You can come to two opposite conclusions depending upon how you view the data. And sometimes correlation is causation. It that were true here, what would we say causes what? Is there a best way to always view the data? Hard to see why that would be. But there are arguments that we won’t go into right now.
It would be helpful, though, to have an example to see why that might be. Let’s take a simple one, and take a look. But first we need to decide what a correlation would look like in this case. A simple way is to treat the correlation as the difference in percentages betwwn two groups. If we found that 80% of women smoked, would we then say that there was a positive correlation between being a woman and being a smoker? No! We would have to look at how many men smoked. For instance, if we found out that 90% of men smoked then that would mean that there is a positive correlation between being a smoker and being male. If this were a large enough sample we might say (simply) that there is a 10% positive correlation and being male. (Correlations have a slightly different way of being calculated but this way is good enough to decide if it is positive or negative.)
So, imagine these two groups
In Group A we have:
3 male smokers, 1 male non-smoker
4 female smokers, 2 female non-smokers
so of 4 males 3/4 or 75% smoke, of the females 4 out of 6 smoke, which means 66.7% smoke so there is a positive correlation between being male and being a smoker because 75% is greater than 66.7%
In Group B we have:
3 male smokers, 4 male non-smokers
1 female smoker and 2 female non-smokers
so here we have 3 out of 7 males smoke and 1 out of 3 females smoke. about 42.9% male smokers and about 33.3% of females smoke. Again, there is a positive correlation between being a smoker and being male.
But now let’s throw the two groups together. So we have:
6 male smokers and 5 male non-smokers
5 female smokers and 4 female non-smokers
Now it’s close, but when we do the division 54.5454% of men smoke and 55.5555% women smoke.
So, now the correlation between being male and being a smoker is negative, because 54.54 is less than 55.55.
So, it seems strange that the correlation changed when we just viewed the data differently, and thus the paradox. You might think that these differences aren’t enough to get statistically significant, but we can find a case with the same numbers but multiplying all the initial data by a thousand or a million or whatever number it takes to make it significant. Some number will do that. The correlations would be the same.
Pretty strange, eh?
In a small town in America, a person decided to open up his bar business, which was right opposite to a church. The church & its congregation started a campaign to block the bar from opening with petitions and prayed daily against his business.
Work progressed. However, when it was almost complete and was about to open a few days later, a strong lightning struck the bar and it was burnt to the ground. The church folk were rather smug in their outlook after that, till The bar owner sued the church authorities for $2million on the grounds that the church through its congregation & prayers was
In a small town in America, a person decided to open up his bar business, which was right opposite to a church. The church & its congregation started a campaign to block the bar from opening with petitions and prayed daily against his business.
Work progressed. However, when it was almost complete and was about to open a few days later, a strong lightning struck the bar and it was burnt to the ground. The church folk were rather smug in their outlook after that, till The bar owner sued the church authorities for $2million on the grounds that the church through its congregation & prayers was ultimately responsible For the demise of his bar shop, either through direct or indirect actions or means.
In its reply to the court, the church vehemently denied all responsibility or any connection that their prayers were reasons to the bar shop's demise. In support of their claim they referred to the Benson study at Harvard that inter-cessionary prayer had no impact !
As the case made its way into court, the judge looked over the paperwork and at the hearing and commented:
'I don't know how I am going to decide this case, but it appears from the paperwork, we have a bar owner who believes in the power of prayer and we have an entire church and its devotees that doesn't.'
Simpson’s paradox is the inexplicable ability of someone to be familiar with, and know how to use, Quora, but yet apparently never having heard of Google.
This is a fascinating apparent paradox where the individual groups that make up a population show a shift in one direction on some value while the average for the whole group moves in the other direction.
Here's an example from an excellent blog post by economist Russ Roberts:
Between 2000 and 2012, here are the changes in real median income for women 25 and older who were defined as working full-time, year-round by education level. The data I’m using are census data from here.
Less than 9th grade -3.7%
9th-12th but didn’t finish -6.7%
High school
This is a fascinating apparent paradox where the individual groups that make up a population show a shift in one direction on some value while the average for the whole group moves in the other direction.
Here's an example from an excellent blog post by economist Russ Roberts:
Between 2000 and 2012, here are the changes in real median income for women 25 and older who were defined as working full-time, year-round by education level. The data I’m using are census data from here.
Less than 9th grade -3.7%
9th-12th but didn’t finish -6.7%
High school graduate -3.3%
Some college but no degree -3.7%
Associate’s degree -10.0%
Bachelor’s degree or more -2.7%Looks like a pretty bleak 12 years, doesn’t it?
But wait!
The income of women over the age of 25 who worked full time actually increased between 2000 and 2012. It went up 2.8%. (…all these numbers are corrected for inflation.)
The mistaken assumption that leads to the apparent paradox is that the groups are fixed. In fact, people shift between groups and new people show up in one group or another. In this case, a startling change in the number of college-educated women (a 37% increase!) is largely responsible. One explanation: Many entry-level college-grad workers were added,removing some of the highest earners from the other categories. There's a lot more explanation and detail at the original link, a really good read: When facts aren't facts.
It's worth noting that one of the other answers to this Quora question references A great example of Simpson's Paradox: US median wage decline, which, alarmingly, gets the explanation not just wrong but completely backwards! It's the overall average that get closer to the truth in that case, the individual group averages fool as people shift among groups. (note that sometimes it goes the other way)
#4377
There is plenty of information about this on the world wide interweb. Simply google “Simpson’s paradox”, as I just did.
The most important part of statistics is that it requires an intelligent understanding of the data under study, not blind adherence to a set of rules or algorithms. It is important therefore to look at whether the statistician is a) intelligent, and b) honest. If you can trust the statistician, then you can trust the statistics. Not otherwise.
The US median wage decline mentioned in the other answers is a good example. Another, slightly older example, is the effect of race on death-penalty sentences in Florida.
A 1991 study revealed that 53/483 (11.0%) Caucasian murderers were sentenced to death in Florida, compared to just 15/191 (7.9%) African Americans. However, when the race of the victim was taken into account it turned out that 11/48 (22.9%) of African American killers of Caucasians were sentenced to death, compared to just 53/467 Caucasians (11.3%). And similarly, 4/143 (2.8%) African American killers of African Americans were
The US median wage decline mentioned in the other answers is a good example. Another, slightly older example, is the effect of race on death-penalty sentences in Florida.
A 1991 study revealed that 53/483 (11.0%) Caucasian murderers were sentenced to death in Florida, compared to just 15/191 (7.9%) African Americans. However, when the race of the victim was taken into account it turned out that 11/48 (22.9%) of African American killers of Caucasians were sentenced to death, compared to just 53/467 Caucasians (11.3%). And similarly, 4/143 (2.8%) African American killers of African Americans were sentenced to death compared to 0/16 Caucasians!
For a discussion of Simpson's paradox with a couple more examples not mentioned on the Wikipedia article, see this post on my Quora blog Paradoxicon.
How do I debunk Simpson's paradox?
You don’t. You explain it. But you wouldn’t be the first—it is already well understood.
It can happen when you ignore the effect of a variable, but accounting for the effect of that variable you find that the effect of another variable goes the opposite direction.
A classic example is when a higher proportion of female applicants for a university fail to get admitted. There is an apparent gender bias. However, when considering each faculty, the apparent bias may be in the other direction.
Here is an example:
Here are some paradoxes that I found to be interesting:
- The Fermi Paradox: The Fermi Paradox speaks to the contradiction between the likelihood of alien life and lack of evidence for it. This paradox is named after physicist Enrico Fermi who first proposed this in 1950. It ponders over the question that if the universe is so big and old, and there are countless opportunities for life, why have we not found aliens?
- The Paradox of the heap: Also known as the Sorites paradox, this paradox goes something like this: Imagine a heap of sand. Take away one grain of sand, and you'd probably still call it
Here are some paradoxes that I found to be interesting:
- The Fermi Paradox: The Fermi Paradox speaks to the contradiction between the likelihood of alien life and lack of evidence for it. This paradox is named after physicist Enrico Fermi who first proposed this in 1950. It ponders over the question that if the universe is so big and old, and there are countless opportunities for life, why have we not found aliens?
- The Paradox of the heap: Also known as the Sorites paradox, this paradox goes something like this: Imagine a heap of sand. Take away one grain of sand, and you'd probably still call it a heap. Keep taking away one grain of sand at a time. At what point is it no longer a heap? What is a heap anyway? :|
- The Twin Paradox: It is possible to be older than your identical twin. Einstein's theory of special relativity says this is possible because time can tick at different speeds on how you are moving.
Source: Curosity
I don’t think it does. Breakdowns of income by group are quite skewed to a very small group. Depending on how you look at it, either the top 10%, or top 5%, or top 1% or top 0.1% are seeing relative income gains while those in the remaining groups see little or no gains.
So, while incomes have risen, the top 1% and the next 19% have risen faster than the remainder. If we just focus on the top 1%, other groupings are unlikely to capture this. It can’t just be steadily married couples, single men with no dependents, etc. because it is too small a group. What distinguishes this group is their high
I don’t think it does. Breakdowns of income by group are quite skewed to a very small group. Depending on how you look at it, either the top 10%, or top 5%, or top 1% or top 0.1% are seeing relative income gains while those in the remaining groups see little or no gains.
So, while incomes have risen, the top 1% and the next 19% have risen faster than the remainder. If we just focus on the top 1%, other groupings are unlikely to capture this. It can’t just be steadily married couples, single men with no dependents, etc. because it is too small a group. What distinguishes this group is their high income.
Simpson’s Paradox states that a trend changes when groups are combined. If you combine the groups, you see an increase. If you break the groups out, you see an overall greater increase in the subgroups. The trend is the same, however.
An interesting paradox is Xeno’s Dichotomy paradox.
Essentially if you are walking 100 meters from one spot to another, you must first reach the 50 meter point (1/2 way). Then you will keep walking and eventually hit 75 meters (1/4) left. Then take more steps and you’ll have 1/8 of the way left. Take some more steps and you’ll have 1/16 left, then 1/32, then 1/64 and on and on. To get from one spot to another takes an infinite number of steps, which should be plain impossible. Even the smallest tasks can be halved infinitely!!
How can one ever get from A to B, if an infinite number of (non-insta
An interesting paradox is Xeno’s Dichotomy paradox.
Essentially if you are walking 100 meters from one spot to another, you must first reach the 50 meter point (1/2 way). Then you will keep walking and eventually hit 75 meters (1/4) left. Then take more steps and you’ll have 1/8 of the way left. Take some more steps and you’ll have 1/16 left, then 1/32, then 1/64 and on and on. To get from one spot to another takes an infinite number of steps, which should be plain impossible. Even the smallest tasks can be halved infinitely!!
How can one ever get from A to B, if an infinite number of (non-instantaneous) events can be identified that need to precede the arrival at B, and one cannot reach even the beginning of a "last event"?
Might be why people are always late..
Lets say we have an experiment. After making some change, both group 1 and group 2 conversion is increased however overall conversion decreased. This is the result of not proper pooling. group1/group2 total count changed significantly before and after change.
Here is a great explanation with visualisations. You can change parameters and observe effects.
Lets say we have an experiment. After making some change, both group 1 and group 2 conversion is increased however overall conversion decreased. This is the result of not proper pooling. group1/group2 total count changed significantly before and after change.
Here is a great explanation with visualisations. You can change parameters and observe effects.
Paradoxes of this sort do not have solutions. The point is to make you think about your ideas of causality, free will and randomness.
If someone can predict your actions with certainty, then you have no free will, at least as that term is commonly understood. Your actions were predetermined outside yourself.
If someone can predict your actions with high accuracy, but you have free will, then their prediction does not force your decision. Your optimal choice is to choose to be the kind of person who picks only the opaque box—so the predictor will put $1 million in it—but to actually pick both box
Paradoxes of this sort do not have solutions. The point is to make you think about your ideas of causality, free will and randomness.
If someone can predict your actions with certainty, then you have no free will, at least as that term is commonly understood. Your actions were predetermined outside yourself.
If someone can predict your actions with high accuracy, but you have free will, then their prediction does not force your decision. Your optimal choice is to choose to be the kind of person who picks only the opaque box—so the predictor will put $1 million in it—but to actually pick both boxes.
The question is whether it’s possible to make those two choices simultaneously. Are there layers of free will such that you can choose what kind of person to be, then also choose to behave in different ways?
Psychologically the answer is clearly “yes.” I can choose to be the kind of person who never eats unhealthful food, and then gorge myself on junk food. I can choose to be a ruthless person, but then help an injured child.
Philosophically, it’s not so clear. Looking only at a person’s actions, the things they do tell you what kind of persons they are.
So the paradox forces you to try to reconcile your psychological image of yourself with a consistent external philosophical one. If you can do that, great, you’ve resolved the paradox for yourself. If you can’t, then you know there are inconsistencies between your internal psychology and your views about external reality.
Newcomb’s problem is very often misunderstood, and it seems that the other answerers so far have also fallen to the misconceptions.
I want to make it clear that Newcomb’s problem does not require some mystical, future-gazing, guaranteed-true prophecy. All it takes is someone that is good at predicting people’s actions. Making predictions isn’t rocket science - I predict that the sun will rise tomorrow morning, I predict that if I tell a joke, people will laugh (either with me or at me), and I predict that a rocket traveling at 11.2 km/sec will escape Earth (ok, maybe that last prediction is roc
Newcomb’s problem is very often misunderstood, and it seems that the other answerers so far have also fallen to the misconceptions.
I want to make it clear that Newcomb’s problem does not require some mystical, future-gazing, guaranteed-true prophecy. All it takes is someone that is good at predicting people’s actions. Making predictions isn’t rocket science - I predict that the sun will rise tomorrow morning, I predict that if I tell a joke, people will laugh (either with me or at me), and I predict that a rocket traveling at 11.2 km/sec will escape Earth (ok, maybe that last prediction is rocket science).
Let’s say the setup is thus: The predictor interviews you for 10 minutes. Based on the interview he predicts whether you are likely to one-box or two-box. He puts money in the box accordingly and then allows you to make the choice. His past record indicates that he has guessed correctly 95% of the times.
In the interview he tries to figure out what kind of person you are, what are your thought processes, whether you subscribe to CDT, EDT or UDT, whether you are arrogant enough to believe you are in the rare 5% for whom the predictor will be mistaken, whether you are familiar with precommitments, whether you are the kind of person who will back down in the last minute and two-box even after you resolved to one-box, etc. He uses all this information to establish his prediction.
So there’s no time-traveling causal paradox; the predictor doesn’t see the future, he predicts the future based on the information available to him now, just like any good decision-maker does.
When it comes down to it, you have the choice of what to do. But if you are the kind of person who believes two-boxing arguments, the predictor will correctly predict that you will two-box. If you are the kind of person who believes one-boxing arguments, the predictor will correctly predict that you will one-box.
So it’s not so much that you should one-box. It’s that you should be the kind of person who one-boxes. And of course, the way to be the kind of person who one-boxes, is to one-box.
So, long story short - you should open just box A, and you will very likely get $1M.
Here's a great recent example:
A great example of Simpson's Paradox: US median wage decline
from the blog:
"Since 2000, the median US wage has risen about 1%, adjusted for inflation.
But over the same period, the median wage for:
high school dropouts,
high school graduates with no college education,
people with some college education, and
people with Bachelor’s or higher degrees
have all decreased. In other words, within every educational subgroup, the median wage is lower now than it was in 2000.
How can both things be true: overall wages have risen, but wages within every subgroup have fallen? T
Here's a great recent example:
A great example of Simpson's Paradox: US median wage decline
from the blog:
"Since 2000, the median US wage has risen about 1%, adjusted for inflation.
But over the same period, the median wage for:
high school dropouts,
high school graduates with no college education,
people with some college education, and
people with Bachelor’s or higher degrees
have all decreased. In other words, within every educational subgroup, the median wage is lower now than it was in 2000.
How can both things be true: overall wages have risen, but wages within every subgroup have fallen? This is a great example of Simpson's Paradox. In this particular case, the explanation lies in the changing educational profile of the workforce over the past 13 years: there are now many more college graduates (who get higher-paying jobs) than there were in 2000, but wages for college graduates collectively have fallen at a much slower rate (down 1.2%) than for those of lower educational attainment (whose wages have fallen precipitously, down 7.9% for high school dropouts). The growth in the proportion of college graduates swamps the wage decline for specific groups."
Here are two additional examples of Simpson's Paradox that have popped up recently:
In it’s simplest terms, the Fermi Paradox asks:
“Given that the chances for life to arise
seem abundant, why have we never de-
tected it apart from our own planet?”
Asked another way: “Is there anyone out there?”
Scientists plug their best guesses into the Drake Equation (Photo: Carl Sagan explains the equation to my Astronomy 101 class)—and most come to the conclusion: Yes! It seems very likely that the universe is abundant with life.
If this is the case, then, why have we not detected evidence of radio waves, space probes, industrial constituents in the atmosphere of other planets—or even byprod
In it’s simplest terms, the Fermi Paradox asks:
“Given that the chances for life to arise
seem abundant, why have we never de-
tected it apart from our own planet?”
Asked another way: “Is there anyone out there?”
Scientists plug their best guesses into the Drake Equation (Photo: Carl Sagan explains the equation to my Astronomy 101 class)—and most come to the conclusion: Yes! It seems very likely that the universe is abundant with life.
If this is the case, then, why have we not detected evidence of radio waves, space probes, industrial constituents in the atmosphere of other planets—or even byproducts of plant life or bacteria, etc? Specifically:
- If intelligent extraterrestrial life is abundant, than why has no one tried to contact us, colonize or attack us, or trade with us?
- If simpler life exists, then why have our existing telescopes and space probes found nothing but rocks, minerals and (yes) water—but no life. [continue below Prof Sagan]…
There are several good reasons as to why we have not yet detected life, even if it is very abundant. Here are just a few:
- The galaxy and the universe is vast. It covers far greater distances than we can grasp. Even if there are trillions of populated planets, it would amount to a tiny fraction of star systems. The chances are slim that we would have encountered or identify one of them yet, given our nascent stage of sky searching and space exploration.
- It is possible that intelligent species exist, but have not invented radio or radio telescopes. This removes our key means of detecting them.
- It is possible that intelligent species exist, but have no desire to ponder their place in the universe or communicate with anyone far from home.
- It is possible that intelligent species have existed in the past, but their timeline did not overlap with ours observation distance. That is, even after accounting for the distance and time involved, they may have extinguished themselves with pollution, war, global warming (as we are doing)—or they may have been killed off by a celestial event such as a large asteroid or solar discharge.
- It is possible that intelligent species exist and have already reached out to us (or we have already detected the evidence), but that we simply didn’t understand what we were observing. That is, we may not have any way to communicate with nor even recognize life that is very different than our own. They may use things other than speech, radio, sign language, etc. They may not even have eyes and ears.
In short, there really is no paradox. It is possible that we will detect simple life (plants and bacteria) in the next thousand years. But with the current state of our technology, we can only improve our estimate of the chances for life to exist in the universe. We may find evidence of plant life, but it is very, very unlikely that we will encounter any intelligent life in the next century—even if an extraterrestrial were to also be searching for us.
When I studied under Carl Sagan in 1980, he estimated that the number of inhabited planets (those with ANY life) was somewhere between ONE (that’s us) and 10^23. (That a hundred billion trillion). Of course, this is a terribly imprecise range. In fact, it is meaningless.
But over the past 3 decades, we are gathering evidence that it is not likely to be ONE, and may be toward the higher end of this range.
Dr. Hempel made an important category mistake. Let [math]r[/math] denote a collection of things under the condition of raven-ness, & [math]b[/math], a collection of things under the condition of blackness.
[math]\text{Approach }\boxed{1}\tag*{}[/math]
Dr. Hempel’s hypothesis (all ravens are black) doesn’t represent the situation he considered because it isn’t an equivalence claim & it compares the wrong things. Equivalence claims are bidirectional, & in if - - - then form, they have two clauses. If we convert Dr. Hempel’s hypothesis into an equivalence claim by adding in the second half, then we get:
- If a thing is a raven, then it’s b
Dr. Hempel made an important category mistake. Let [math]r[/math] denote a collection of things under the condition of raven-ness, & [math]b[/math], a collection of things under the condition of blackness.
[math]\text{Approach }\boxed{1}\tag*{}[/math]
Dr. Hempel’s hypothesis (all ravens are black) doesn’t represent the situation he considered because it isn’t an equivalence claim & it compares the wrong things. Equivalence claims are bidirectional, & in if - - - then form, they have two clauses. If we convert Dr. Hempel’s hypothesis into an equivalence claim by adding in the second half, then we get:
- If a thing is a raven, then it’s black ([math]r\subseteq b[/math]); &
- If a thing is black, then it’s a raven ([math]b\subseteq r[/math]).
That makes it clear that Hempel gave the wrong hypothesis outright. In the situation he described, he wanted to know whether the collection of all things under the condition of raven-ness is the same as the collection of all things under the condition of both raven-ness & blackness (that is, whether [math]r = (r\cap b)[/math] rather than whether [math]r = b[/math]). Therefore, he ought to have compared ravens with black ravens rather than ravens with black things in general. The correct equivalence claim is:
- If a thing is a raven, then it’s a black raven ([math]r\subseteq(r\cap b)[/math]); &
- If a thing is a black raven, then it’s a raven ([math](r\cap b)\subseteq r[/math]).
The second part of the equivalence relation & its contrapositive are simply true, so the final form of the corrected hypothesis is, simply, if a thing is a raven, then it’s a black raven; & the corrected contrapositive is, if a thing is not a black raven, then it isn’t a raven. No longer may we search for non-black things in general in order to support the hypothesis; rather, we must seek black ravens. We may seek ravens in general in order to find black ravens, & we may seek black things in general in order to find black ravens, but the existence of non-raven black things tells us about whether the collection of all black things is the collection of all ravens rather than whether the collection of all black ravens is the collection of all ravens. Even if we searched the world & found nothing black at all, then it wouldn’t support the hypothesis that ravens are not black.
[math]\text{Approach }\boxed{2}\tag*{}[/math]
There are four groups:
- Black ravens ([math]r\cap b[/math]),
- Non-black ravens ([math]r\cap\bar b[/math]),
- Non-raven black things ([math]\bar r\cap b[/math]), &
- Non-raven, non-black things ([math]\bar r\cap\bar b[/math]).
The claim that all ravens are black is like the claim that there are no non-black ravens, which is like the claim that the [math]r\cap\bar b[/math] group has no members. As a proportion of the total number of observed things, that claim is written, [math]r\cap\bar b=0[/math]. If we search for non-black things & find that all so far are not ravens, then everything we have encountered in our search belongs in the [math]\bar r\cap\bar b[/math] group. This is written as [math]\bar r\cap\bar b=1[/math]. This supports the claim that [math]r\cap\bar b[/math] has no members, since the total proportion sum for all four groups is [math]1.[/math] However, the claim that all ravens are black presupposes that there are ravens, unlike the claim that there are no non-black ravens. Claiming that there are no non-black ravens has the same form as the claim that there are no non-glittery unicorns; neither is the claim interesting for practical purposes, nor is it assailable.
Rather, we must consider only a world in which we have observed ravens. Thus, the [math]r[/math] group, or the collection of all things that have raven-ness, must have a nonzero proportion of the observations ([math]r>0[/math]). By the law of excluded middle, therefore, the sum of the proportions of observations which belong in the [math]r\cap b[/math] & the [math]r\cap\bar b[/math] groups is nonzero ([math]r\cap b+r\cap\bar b >0[/math]). Suppose that so far, we’ve seen only black ravens ([math]r\cap b>0, r\cap\bar b=0[/math]). Say, e.g., that [math]r\cap b = 0.5[/math] & [math]\bar r\cap\bar b = 0.5[/math]. That is, say that we’ve observed two things: a black raven, & a non-raven, non-black thing. Hempel’s claim is that if we now observe another non-black, non-raven thing, then the hypothesis that all ravens are black is better supported. This is where the definition of support begins to get mathematical. Say we make the proposed observation, so that the proportion of observed things which are black ravens is now ~ [math]0.33[/math], & the proportion of observed things which are non-black non-ravens is now ~ [math]0.67[/math]. I make at least two points: ① the proportions of ravens which are non-black ravens has not changed; ② the proportions for the group of black ravens and non-black ravens have come closer together. If the original hypothesis was rather that there are more black ravens than non-black ravens, then point ② would imply that the observation of the new non-raven, non-black thing refutes the hypothesis because it decreases the difference between the proportion of black ravens and non-black ravens, which means that the number of ravens which are black is more similar to the number of ravens that are not black with respect to the total number of observations. Rather, though, our hypothesis was a direct statement about the proportion of observations which belong only to the non-black raven group. Since the observation of a new non-black, non-raven did not change the absolute number of members in the non-black raven ([math]r\cap\bar b[/math]) group, the new observation neither supported nor refuted our actual hypothesis.