You have to understand that this book is deeply rooted in the research program Tetlock pioneered and ran, the Good Judgment Project, and in the real people he found through that program to be great at forecasting geopolitical events on a three-to-six-month horizon.
My own personal goal before reading the book was to learn rigorous prediction—not necessarily answering geopolitical questions handed down from somewhere else, but more like how many editors will show up on global Wikipedia today, or, possibly more profitably, how much the stock market will lose today. I was prepared for the book to be an overly academic treatise on the minutiae of the research program, or to be operationally useless in any number of other ways.
But in each chapter, Tetlock had something meaningful and useful to say to me about the business of prediction. At each stage he surprised me with the foxlike connections he drew between what he saw his forecasters doing and other research programs and discoveries.
(I was delighted to see my boy Duncan Watts cited on the emphatically post hoc nature of describing “significant” events. Aaron Brown, my poker-playing risk-managing guru, weighs in on the value of small but consistent wins. The once-cool-but-now-fascist-apologist Nassim Taleb makes appearances as the book struggles with whether its brand of forecasting is useful in the face of an extremistan world (answer: it is, because you can make money in an extremistan world). Of course Danny Kahneman is intertwined with the narrative, as Tetlock’s sounding board and colleague for many decades. Not mentioned in the book but in Tetlock’s five-part master-class on forecasting on Edge.org, my main man Anders Ericsson is cited on the trainability of forecasters.)
Chapter one opens by trying to convince us that predictability exists and that forecasting can help us (make or save us money, at the simplest). He invites us to contrast the unpredictability of cloud shapes with the predictability of a clock, and exposes this as the first of many false dichotomies throughout the book.
“We live in a world of clocks and clouds and a vast jumble of other metaphors. Unpredictability and predictability coexist uneasily in the intricately interlocking systems that make up our bodies, our societies, and the cosmos. How predictable something is depends on what we are trying to predict, how far into the future, and under what circumstances.”
He then sets up the core question of the research program that IARPA paid for: how well can we do? A question like this should leave you speechless and befuddled—the US Government asked Tetlock and other university and industry programs to find out how well we can predict geopolitical outcomes over a 3–6 month time horizon, because nobody knew how well we did. Think of all the pundits, all the terrible books Tom Friedman pooped out, all the bloggers and tooters: the realization that all that heat gave us no light should leave you breathless.
In chapter two, Tetlock dives into the ugly story of medicine.
Tetlock revisits this over and over again: medicine only very, very recently became a devotee of the randomized controlled trial. He has a wonderful set of vignettes of the vanguard of physicians who dragged their colleagues kicking and screaming into the evidence-based age after World War II. It's been a few weeks since I read this section, but I think I will forever remember this story:
‘When hospitals created cardiac care units to treat patients recovering from heart attacks, Cochrane proposed a randomized trial to determine whether the new units delivered better results than the old treatment, which was to send the patient home for monitoring and bed rest. Physicians balked. It was obvious the cardiac care units were superior, they said, and denying patients the best care would be unethical. … [but] Cochrane got his trial: some patients, randomly selected, were sent to the cardiac care units while others were sent home for monitoring and bed rest. Partway through the trial, Cochrane met with a group of the cardiologists who had tried to stop his experiment. He told them that he had preliminary results. The difference in outcomes between the two treatments was not statistically significant, he emphasized, but it appeared that patients might do slightly better in the cardiac care units. “They were vociferous in their abuse: ‘Archie,’ they said, ‘we always thought you were unethical. You must stop the trial at once.’ ” But then Cochrane revealed he had played a little trick. He had reversed the results: home care had done slightly better than the cardiac units. “There was dead silence and I felt rather sick because they were, after all, my medical colleagues.”’
Yes, a lot of medicine is terribly “intuition-based” today—“it’s obvious this treatment is better”. But medicine has made great strides, over the last few decades, in acknowledging the risks and flaws of “intuition” and committing itself to the exacting requirements of randomized controlled trials.
Programmers are slowly learning this, thankfully with less loss of life. Andrei Alexandrescu, in his “Writing Quick Code in C++, Quickly” talk in 2013, discussed this at length:
‘You must measure everything. We all have intuition. And the intuition of programmers is always wrong. Outdated. Intuition ignores a lot of aspects of a complex reality. Today’s machine architectures are so complicated, there’re so many variables in flight at any point in time that it’s essentially impossible to consider them deterministic machines any more. They are not deterministic any more. So we make very often big mistakes when assuming things about what’s going to make fast code. [E.g.,] fewer instructions do not equal faster code. Data [access] is not always faster than computation. The only good intuition is “I should measure this stuff and see what happens.” To quote a classic, who is still alive, Walter Bright: “Measuring gives you a leg up on experts who are so good they don’t need to measure.” Walter and I have been working on optimizing bits and pieces of a project we work on and … whenever we think we know what we’re doing, we measure, and it’s just the other way around.’
Here are two of the world’s leading experts in programming language design and implementation, openly saying “whenever we think we know what we’re doing, we measure, and it’s just the other way around.” That is probably worth tattooing on one’s forehead.
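In that spirit, here is a minimal sketch of the measure-first habit, in Python rather than the C++ of the talk, with functions I made up for illustration. It times a recompute-on-demand version against a table-lookup version, because, per the quote, data access is not always faster than computation:

```python
# Hypothetical micro-benchmark: does a lookup table beat recomputing?
# Intuition says yes; the only trustworthy answer comes from timing it.
import timeit

def squares_computed(n):
    # Recompute each square on demand.
    return [i * i for i in range(n)]

TABLE = [i * i for i in range(10_000)]

def squares_looked_up(n):
    # Read precomputed squares out of a table.
    return [TABLE[i] for i in range(n)]

for fn in (squares_computed, squares_looked_up):
    t = timeit.timeit(lambda: fn(10_000), number=1_000)
    print(f"{fn.__name__}: {t:.3f}s")
```

Whatever the result on your machine, the point is that it took ten lines to stop guessing.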
This is relevant to the Good Judgment Project because it is the first time randomized controlled trials have been applied to geopolitical prediction, but also because gathering and weighing evidence is a key component of forecasting. Putting a collar on intuition helps keep it from ruining your predictions.
The next chapter (chapter three) discusses the intricacies of keeping score. A project like this, and the task of improving one’s own forecasting, lives and dies by the pesky, pernicious questions of exactly how you measure performance, along with other experimental details. Tetlock shows how nearly everything we currently accept as “forecasting” (intelligence agencies, the revolting Thomas Friedman) is trash: weasel-worded, unscorable garbage. He details the kinds of questions amenable to his experiment, how to elicit probability estimates, how to enforce time horizons, how to factor in update frequencies, how to fuse predictions from groups, various research tools, etc.
This chapter also details how the Brier score works: forecasters answer a yes/no question with a probability, and the score rewards emphatically correct answers while punishing confident mistakes.
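To make that concrete, here is a sketch of the Brier score in the two-outcome form the book describes, where 0 is perfect, 0.5 is what permanent 50/50 hedging earns, and 2.0 is maximal wrongness (the example numbers are mine):

```python
# Brier score, summed over both outcomes of a yes/no question.
# forecasts[i] is the stated P(event happens); outcomes[i] is 1 if it
# happened, 0 if not. Lower is better.
def brier(forecasts, outcomes):
    total = 0.0
    for p, o in zip(forecasts, outcomes):
        total += (p - o) ** 2 + ((1 - p) - (1 - o)) ** 2
    return total / len(forecasts)

print(brier([0.9], [1]))  # 0.02 -- confident and right
print(brier([0.5], [1]))  # 0.5  -- fence-sitting
print(brier([0.9], [0]))  # 1.62 -- confident and wrong, crushed
```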
Chapter four gives a detailed overview of the project’s findings: how well Tetlock’s superforecasters did, and analyses of how and why they did so well. Tetlock really surprised me here by offering a humble and honestly rigorous analysis of regression to the mean. He explains how, in games of chance, regression to the mean crushes the winners after repeated rounds, whereas in exercises of skill the winners tend to keep winning round after round.
“Each year, roughly 30% of the individual superforecasters fall from the ranks of the top 2% next year. But that also implies a good deal of consistency over time: 70% of superforecasters remain superforecasters.”
Tetlock could have gone all business-book “Good to Great” (trash) on me. No. This is a well-reasoned and thoughtful argument that honestly explores the role of luck in forecasting. 30% annual turnover suggests some luck, but a lot of skill. That is a very powerful finding.
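A toy simulation (mine, not the book’s) makes the luck-versus-skill contrast visible: when scores are pure luck, one round’s top 2% almost entirely regress out of the elite the next round; when skill dominates, a large fraction persist, which is what the 70% figure is telling us:

```python
# Contrast regression to the mean under pure luck with persistence
# under skill: how many of round 1's top 2% are still on top in round 2?
import random

def top_set(scores, frac=0.02):
    cutoff = sorted(scores, reverse=True)[int(len(scores) * frac)]
    return {i for i, s in enumerate(scores) if s > cutoff}

N = 10_000
skill = [random.gauss(0, 1) for _ in range(N)]

def round_scores(skill_weight):
    # Each round's score is skill (weighted) plus fresh noise.
    return [skill_weight * skill[i] + random.gauss(0, 1) for i in range(N)]

for w, label in [(0.0, "pure luck"), (1.5, "mostly skill")]:
    r1, r2 = round_scores(w), round_scores(w)
    stay = len(top_set(r1) & top_set(r2)) / len(top_set(r1))
    print(f"{label}: {stay:.0%} of round-1 top 2% stay on top")
```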
Chapter five breaks down the intelligence of superforecasters, and chapter six their math savvy. Findings: superforecasters are intelligent and generally math-savvy, but intelligence and math skills are neither necessary nor sufficient for superforecasting performance.
Chapter five has a really interesting discussion of Fermi analyses—you know, “how many piano tuners are in Chicago”. I did this piano tuner exercise for the first time while reading this book (despite having read about it here and there in the past), and that was a very insightful experience. Rather than making one big prediction that might carry a lot of error, a Fermi analysis breaks the problem into smaller estimates whose individual errors are easier to bound and tend to partially cancel when combined. Fermi analysis is cool, and I finally appreciate it.
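Here is the classic worked through in code, with my own assumed inputs; every number is only order-of-magnitude right, and that is the point, since over- and under-estimates tend to cancel when multiplied:

```python
# Chicago piano-tuner Fermi estimate. All inputs are rough guesses.
population            = 2_700_000   # people in Chicago
people_per_household  = 2.5
households_with_piano = 1 / 20      # fraction owning a piano
tunings_per_year      = 1           # per piano
tunings_per_day       = 4           # one tuner's daily workload
workdays_per_year     = 250

pianos = population / people_per_household * households_with_piano
demand = pianos * tunings_per_year             # tunings needed per year
supply = tunings_per_day * workdays_per_year   # tunings one tuner delivers
print(round(demand / supply))                  # ~54 tuners
```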
Chapter six also deals, almost spiritually, with the misguided “quest for meaning” and the herculean discipline needed to maintain a probabilistic outlook on life:
‘Even in the face of tragedy, the probabilistic thinker will say, “Yes, there was an almost infinite number of paths that events could have taken, and it was incredibly unlikely that events would take the path that ended in my child’s death. But they had to take a path and that’s the one they took. That’s all there is to it.” In Kahneman’s terms, probabilistic thinkers take the outside view toward even profoundly identity-defining events, seeing them as quasi-random draws from distributions of once-possible worlds.’
Forget living Biblically for a year. Try living like this for a day.
Chapter seven examines whether superforecasters are plugged into the global news streams. Answer: yes, to some degree, but being plugged in doesn’t really explain their performance relative to regular, non-super forecasters.
Chapter eight examines the twin vexing problems of updating beliefs in light of new evidence, and getting better at making predictions in light of your past predictions. Because Black Lives Matter, consider police officers:
“police officers spend a lot of time figuring out who is telling the truth and who is lying, but research has found they aren’t nearly as good at it as they think they are and they tend not to get better with experience. That’s because experience isn’t enough. It must be accompanied by clear feedback. … Psychologists who test police officers’ ability to spot lies in a controlled setting find a big gap between their confidence and their skill. And that gap grows as officers become more experienced and they assume, not unreasonably, that their experience has made them better lie detectors. As a result, officers grow confident faster than they grow accurate, meaning they grow increasingly overconfident.”
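The belief-updating half of the chapter is, mechanically, Bayes’ rule: start with a prior, weigh how much likelier the new evidence is under your hypothesis than under its negation, and move accordingly. A minimal sketch, with made-up numbers:

```python
# Bayesian belief updating; the prior and likelihoods are illustrative.
def update(prior, p_evidence_if_true, p_evidence_if_false):
    """Return P(hypothesis | evidence) via Bayes' rule."""
    numerator = prior * p_evidence_if_true
    return numerator / (numerator + (1 - prior) * p_evidence_if_false)

# Start at 30% that a regime falls within six months; a credible
# defection is three times likelier if the regime really is collapsing.
belief = 0.30
belief = update(belief, p_evidence_if_true=0.6, p_evidence_if_false=0.2)
print(f"{belief:.0%}")  # 56%: a real but not wild jump
```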
This chapter also talks about the discipline to keep hindsight bias in the kennel, and the difficulty in acknowledging the role of luck:
“People often assume that when a decision is followed by a good outcome, the decision was good, which isn’t always true, and can be dangerous if it blinds us to the flaws in our thinking.”
In a book full of actionably valuable insights (to the aspiring forecaster), this discussion might be the most helpful.
Chapter nine deals with teams, team dynamics, algorithms for fusing individuals’ predictions into one, and lots of interesting related things. Chapter ten discusses how leaders might respond to a team of forecasters, and the changes leaders have to make to best utilize them.
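On the fusing side, one trick the book describes is extremizing: average the team’s probabilities, then push the average toward 0 or 1, on the logic that each forecaster holds only a slice of the available evidence. A sketch, with an exponent I picked for illustration:

```python
# Aggregate a team's probabilities, then "extremize" the average.
# The exponent a controls how hard we push away from 0.5; a = 2.5 is
# an illustrative choice, not a value from the book.
def extremized_mean(probs, a=2.5):
    p = sum(probs) / len(probs)           # plain average
    return p**a / (p**a + (1 - p)**a)     # push toward 0 or 1

team = [0.65, 0.7, 0.6, 0.75]
print(f"mean: {sum(team) / len(team):.2f}")        # 0.68
print(f"extremized: {extremized_mean(team):.2f}")  # ~0.86
```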
Chapter eleven talks about the problems with Tetlock’s research platform. (As he himself states earlier in the book, a scientist will always specify the conditions under which they would change their minds.)
“I see Kahneman’s and Taleb’s critiques as the strongest challenges to the notion of superforecasting.”
Kahneman’s critique is: can forecasters permanently tame their cognitive biases and keep churning out winning forecasts year after year (or at least long enough to be useful)? Taleb’s critique is: can forecasters say anything about the black swan events that dominate history?
Both of these critiques, in my personal opinion, are surmountable, making Tetlock’s research agenda and this book well worth reading.
Chapter twelve closes with Tetlock’s hopes for a future world where we keep score on forecasts. It could be awesome. But we’d get used to it fast and start worrying about the next problem.