Can't think of anything better than a t-test, but I'm open to suggestions.
If a forecaster is consistently off by something like 10 percentage points, I think that is a difference that matters. But even in that extreme scenario, where the (simulated) difference between two forecasters is in fact quite large, we have a hard time picking it up with standard significance tests.
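To make that concrete, here is a minimal sketch of the kind of simulation I mean (not the exact setup from the post): one forecaster predicts the true probability, the other is off by 10 percentage points on every question, and we run a paired t-test on the per-question Brier scores. The number of questions, the Brier scoring, and the noise structure are all assumptions for illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n_questions = 100  # assumed number of resolved binary questions

# True probabilities and resolved outcomes
true_p = rng.uniform(0.1, 0.9, n_questions)
outcomes = rng.binomial(1, true_p)

# Forecaster A predicts the true probability;
# forecaster B is consistently off by 10 percentage points
pred_a = true_p
pred_b = np.clip(true_p + rng.choice([-0.1, 0.1], n_questions), 0.01, 0.99)

brier_a = (pred_a - outcomes) ** 2
brier_b = (pred_b - outcomes) ** 2

# Paired t-test on per-question Brier scores
t_stat, p_value = stats.ttest_rel(brier_a, brier_b)
print(f"mean Brier A={brier_a.mean():.3f}, B={brier_b.mean():.3f}, p={p_value:.3f}")
```

In runs like this the expected gap in Brier scores (about 0.01) is small relative to the per-question noise, which is exactly the power problem described above.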
In principle, yes. In practice also usually yes, but the specifics depend on whether the average user who predicted on a question receives a positive number of points. So if you predicted very late and your points are close to zero, but the mean number of points that forecasters on that question received is positive, then you will end up with a negative update to your reputation score.
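In other words, the update is relative to the field on that question. A simplified sketch of that logic (not the exact scoring rule):

```python
# Simplified sketch of the relative update, not the exact scoring rule:
# your points on a question are compared to the average points earned
# by everyone who forecast that question.
def reputation_update(your_points: float, all_points: list[float]) -> float:
    mean_points = sum(all_points) / len(all_points)
    return your_points - mean_points

# A late forecaster with near-zero points on a question where the
# average forecaster earned positive points gets a negative update.
print(reputation_update(0.5, [0.5, 20.0, 35.0, 15.0]))  # -17.125
```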
Completely agree that a lot hinges on that reputation score. It seems to work decently for the Metaculus Prediction, but it would be good to see what the results look like for a different metric of past performance.
I slightly tend towards yes, but that's mere intuition. As someone on Twitter put it, "Metaculus has a more hardcore user base, because it's less fun". I find it plausible that the Metaculus user base and the Manifold user base differ. But I think higher trading volume would have helped.
For this particular analysis I'm not sure correcting for the number of forecasters would really be possible in a sound way. It would be great to get the MetaculusBot more active again to collect more data.
Good comment, thank you!