CCRL Update 21st September

Designed for posting all types of tournaments and Games (e.g. Man vs. Machine, Computer vs. Computer and basement matches.)

Moderators: Harvey Williamson, Watchman

Forum rules
This textbox is used to restore diagrams posted with the [d] tag before the upgrade.
Post Reply
Ray

CCRL Update 21st September

Post by Ray »

The September 21st update of the CCRL Rating Lists and Statistics is now available for viewing at:
http://www.computerchess.org.uk/ccrl/4040/

The links to the various rating lists can be found just beneath the default Best Versions list.
For example there is a 32-bit Single CPU list.

Our standard testing is at 40 moves in 40 minutes repeating while our current blitz testing is at both 40 moves in 4 minutes repeating and 40 moves in 12 minutes repeating, all adjusted to the AMD64 X2 4600+ (2.4GHz).

Currently active testers in our team are:
Graham Banks, Ray Banks, Shaun Brewer, Kirill Kryukov, Dom Leste, Tom Logan, Andreas Schwartmann, Charles Smith, George Speight, Chris Taylor, Chuck Wilson, Gabor Szots and Martin Thoresen.

A big thanks to all testers as usual for their efforts this week.


40/40 Notes

There currently 76,274 games in our 40/40 database.

Many engines on our list have few games and in many cases their ratings are likely to fluctuate (markedly for some) until a lot more games are played. Therefore no conclusions should be drawn about their strength yet.
To illustrate this point, when an engine has 200 games played, the error margin is still approximately +-40 ELO, after 500 games +-25 ELO, after 1000 games +-17 ELO and even after 2000 games there is a +-13 ELO error margin!
This of course highlights the importance of looking at other rating lists that are also available in order to draw comparisons and get a more accurate overall picture.


Multi CPU Engines

Although we've only just started testing Zappa Mexico 64-bit 4CPU, we are not expecting it to challenge the hold that Rybka 2.3.2a 64-bit 4CPU has on top spot, based on what we've seen and read. However, still early days of course to be making such an assumption!

Naum 2.2 64-bit 4CPU and Hiarcs 11.1 4CPU are numbers three and four in this list, a good distance back.
Hiarcs 11.2 4CPU still requires more games.

Loop M1-T 64-bit 4CPU is next in the pecking order, ahead of the oldies - Deep Shredder 10 64-bit 4CPU, Deep Fritz 10 4CPU and Deep Junior 10 4CPU.
On our growing list of things to do, we would like to give more games to Deep Fritz 10 2CPU to see whether performs better than Deep Fritz 10 4CPU as has been reported elsewhere.

Glaurung 2 epsilon/5 64-bit 2CPU is the strongest free engine on this list.


Single CPU Engines

Rybka 2.3.2a leads the ratings here as well, although by a slightly larger margin.
The 64-bit version has a good edge over the 32-bit version.

We would expect Zappa Mexico to slot into second spot on this list also, once we've tested it in earnest.

Loop M1-T, Toga II 1.3.1, Hiarcs 11.1 and Naum 2.2, Fruit 051103, Fritz 10, Shredder 10 and Strelka 1.8 are all within a 25 ELO range of each other.
Hiarcs 11.2 now has 300+ games and remains slightly lower in rating than Hiarcs 11.1.

Deep Sjeng 2.7 has disappointingly slipped back to around the level of Spike 1.2 Turin and Junior 10.

Ktulu 8.0, SmarThink 1.00, Glaurung 2 epsilon/5 and Chess Tiger 2007.1 are 40-50 ELO further back.


Free Single CPU Engines

Rybka 1.0 narrowly retains its crown as the top free engine ahead of Toga II 1.3.1.

Fruit 051103 seems to the strongest Fruit ahead of 2.3.1, but both need many more games before the picture becomes clearer.

Strelka 1.8 and Spike 1.2 Turin come in next on this list.

Glaurung 2 epsilon/5 and Naum 2.0 are well ahead of Scorpio 1.91.

Alaric 707, Movei 0.08.438, SlowChess Blitz WV2.1, Delfi 5.1, Zappa 1.1, WildCat 7, Pro Deo 1.2 and List 512 are within a 30 ELO point range of each other.
With 500+ games now under its belt, Movei 0.08.438 is a massive 100+ ELO improvement over previous versions. Well done Uri!

Other recent releases to watch out for are Petir 4.9999 and the rapidly improving Hamsters 0.5.
The latest versions of Booot, BugChess2 and DanaSah also seem to have made good gains over previous versions.

We test a very extensive range of amateur engines through our Amateur Championship divisions (32-bit 1CPU) plus other tournaments, all of which can be followed in our public forum.

Our aim is of course to ensure that all engines lower on our lists get 200+ games.


Blitz Notes

There are currently 169,074 games in our 40/4 database.

The 40/4 update is usually done separately to our 40/40 update.
The latest ratings can be found here:
http://computerchess.org.uk/ccrl/404/

An updated list will be available during the next few days with many Zappa Mexico games included.


FRC Notes

There are currently 20,600 games in the FRC 40/4 database.

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
If engine authors have a new and stable version of their engine that will run under this GUI, they should contact Ray if they wish to see it tested.

Although Rybka 2.3.2 FRC tops the list, it is a private engine, therefore Hiarcs 11.1 is still the best available FRC engine.

Ray has recently tested Rybka 2.3.2, Hiarcs 11.2, Naum 2.2, Fruit 2.3, Fruit 051103, Hamsters 0.4, Hermann 2.0, Movei 0.08.438 and Deep Sjeng 2.7.
All are now included in the ratings.

For FRC the best list to look at is the pure list.
http://www.computerchess.org.uk/ccrl/404FRC/


Stats/Presentation Notes

The LOS stats to the right hand side of each rating list are "likelihood of superiority" stats. They tell you the likelihood in percentage terms of each engine being superior to the engine directly below them.

A list of games played this week per engine can be found in the update thread in the CCRL public forum, accessible through the link given at the top of this post.

All games are available for download through the link given at the top of this post. They can be downloaded by engine or by month.
ELO ratings are now saved in all game databases for those engines that have 200 games or more.

Clicking on an engine name will give details as to opponents played plus homepage links where applicable.
Clicking on an engine name now also gives you a ratings history graph for that engine over time (a bit further down the page). The green line is the actual rating. The red lines are the upper and lower error bars, and the blue line represents the number of games.

Custom lists of engines can be selected for comparison.

An openings report page (link at bottom of index page) lists the number of games played by ECO codes with draw percentage and White win percentage. Clicking on a column heading will sort the list by that column.
Games can now be downloaded by ECO code.
Post Reply