
Elo is a stochastic gradient descent approximation of logistic regression.

You can do much better just by actually running the logistic regression over the games. In this framework, any per-game bias, such as the characters chosen, is a trivial variable to add to the model and fit jointly.

Our ranking systems are holdovers from a time when the calculations had to be done by hand. If the whole set of games fits in RAM, there's no need to use ancient optimization methods.
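A minimal sketch of what this looks like: a Bradley-Terry-style logistic regression over the full game history, with a per-game character feature fit jointly with player skill. The game data, character names, learning rate, and regularization strength are all made-up toy values, not anything from the thread.

```python
import numpy as np

# Toy data (hypothetical): each game is (winner, loser, winner_char, loser_char).
games = [(0, 1, "mage", "mage"), (0, 1, "mage", "mage"),
         (1, 2, "mage", "mage"), (1, 2, "rogue", "mage"),
         (0, 2, "mage", "rogue")]
n_players = 3
chars = {"mage": 0, "rogue": 1}

# Design matrix: one column per player (+1 for the winner, -1 for the loser)
# and one per character (+1 if the winner played it, -1 if the loser did),
# so character advantage is estimated jointly with player skill.
X = np.zeros((len(games), n_players + len(chars)))
for row, (w, l, wc, lc) in enumerate(games):
    X[row, w], X[row, l] = 1.0, -1.0
    X[row, n_players + chars[wc]] += 1.0
    X[row, n_players + chars[lc]] -= 1.0
y = np.ones(len(games))  # every row is written winner-first

# Full-batch gradient ascent on the L2-regularized log-likelihood,
# instead of Elo's single stochastic pass over the games.
beta = np.zeros(X.shape[1])
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-X @ beta))          # predicted P(winner wins)
    beta += 0.1 * (X.T @ (y - p) - 0.01 * beta)  # gradient step + ridge

ratings, char_effect = beta[:n_players], beta[n_players:]
```

Skill is only identified up to an additive constant, which is why the light ridge penalty is there: it anchors the ratings around zero. With the toy data above, player 0 (3-0) ends up rated above player 1 (2-2), who ends up above player 2 (0-3).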




Even that still assumes you can only update parameters once per game, and only for the players in that game. If I've played a large number of games against someone and the win rate is 50/50, and then that player plays in a tournament, my skill should move up or down in accordance with their performance in that tournament.


Not necessarily. I don't know how this works in Smash, but in competitive fencing I'd see people go 50-50 consistently locally, yet one would always do drastically better at nationals, year after year after year.

Right, like there are A-rank fencers, and then there are A-rank fencers who actually have a shot at placing on the points table.

I'm not sure why.


If you told me these facts about a random video game I'd guess the following:

- A high rank player can consistently execute a strategy that wins against the majority of players most of the time ("beats the meta")

- The above has a counter strategy, but this strategy often fails against the majority of the players ("loses to the meta")

When these two players meet, they go 50-50, but they have very different results in tournaments. Alternatively, one player is generally bad but exploits a particularly hard-to-observe weakness in the first.
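With toy numbers (all assumed for illustration), the arithmetic behind this is easy to see: an even head-to-head record coexists with very different expected tournament results.

```python
# Hypothetical win probabilities: A "beats the meta" (80% vs a typical
# opponent), B counters A but "loses to the meta" (40% vs typical).
p_a_vs_b = 0.5       # they go 50-50 head to head
p_a_vs_field = 0.8
p_b_vs_field = 0.4

# Expected wins in a 10-player round robin: one game against each other,
# eight against generic opponents from the field.
exp_wins_a = p_a_vs_b + 8 * p_a_vs_field
exp_wins_b = (1 - p_a_vs_b) + 8 * p_b_vs_field
print(exp_wins_a, exp_wins_b)  # roughly 6.9 vs 3.7
```

So a pairwise 50-50 record tells you almost nothing about relative tournament strength once results are intransitive.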

I know nothing about fencing, but I suspect something similar is going on here.


Yeah, I suspect you may be right. The ones I saw who did better in tournaments tended to have a more controlled, standard style. Nothing too fancy.


I agree in principle, and having new data affect the interpretation of old results was one of the goals for the rating system for a game I run [0]. But while I believe it's the right thing to do if the goal is to predict results more accurately, there are downsides.

Basically, players want rating systems to be reward loops: they hate systems where their rating can change randomly, and they want the system to be very volatile in response to their own results. If they go on a statistically insignificant winning streak, they want their rating to shoot up, not to hear the system say "meh, it's probably just random chance."

[0] https://www.snellman.net/blog/archive/2015-11-18-rating-syst...


I think if the system provides reliable results, people will come around. There are a lot of preferences that players have, but I think they ultimately come to respect systems that work.




