What's quality_1vs1 really shows?

I won't speak for that function now but quickly:

The source code is easy to browse: https://github.com/sublee/trueskill/blob/master/trueskill/__init__.py#L516
Match quality in Trueskill is a measure of the probability of a draw given the skills of the two players (within what they call the draw margin - a configuration for the game). It is intended to describe the quality of the match in the sense that it's much more fun if players are equally skilled and much less fun (a lower quality match) if one player is way better than the other and you can predict up front they'll win. Match quality is a function only of the current ratings of the competing players and has nothing to with their history (or the 1000 matches). What you presumably want to look at after a 1000 match test, is the ratings of the two players and assess the probability of victory for the two players. I don't think this library implemented win_probability but I did, and if you're keen I can take a closer look later.

sublee / trueskill