Your Country Needs You!

This forum is for general discussions and questions, including Collectors Corner and anything to do with Computer chess.

Moderators: Harvey Williamson, Steve B, Watchman

Forum rules
This textbox is used to restore diagrams posted with the fen tag before the upgrade.
Post Reply
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

I have completed playing Excalibur Ivan and Igor on Level 53 30 seconds average time. Therefore below is a closer comparison at this level between Ivan, Igor, Grandmaster, Mirage and Avenger. I have also dropped in CXG Legend so that it gives you an opportunity to compare them at this level.

TEST GAME 1 - 15TH CENTURY MASTERS

Image

As you can see from above everyone is a keeper, not one is exactly the same playing at this level. But you still clearly see that Mirage 10 MHz and Igor 12 MHz are still almost identical at this level. The additional 2 MHz did not help Igor in this first test.

TEST GAME 2 - THE TURK

Image

In Test Game 2 CXG Legend scored best with Igor being the best compared Excalibur. As is Test 1 every computer plays a little differently.


RATING COMPARISON CHART AFTER 2 TESTS


Image

The above results I think clearly also show the continuing debate. The moves played are different indicating a possible different programmer or a Tune up carried out by somebody. The continuous similarity in playing strength which is consistent if you play them against each other or test them as above always repeats itself. The tuned up Horvath or different programmer debate continues because of this.

Best regards
Nick
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

Regarding not being able to use the spreadsheet, which version of Excel are you using. Unfortunately the pop downs etc might not work with later versions but it should work just fine with Version 2010 onwards.
Nick, I'm using Excel for Mac 2011. It's possible that I botched your instructions.

I will add Game 2 for Super Connie later this week, and then add the Saitek Turbo King.

If I get time, I'll throw in the Radio Shack 2150L, to add yet more fuel to the Kaplan debate. I don't think we've quite settled the question on 2150L, which may or may not contain Kaplan's C program. An earlier debate on the forum led to a general (but not unanimous) agreement that TK contains Kaplan's D or D+ program (stronger than C). No one said much about RS 2150 - I don't own it, but it has a much larger book than 2150L, and I wonder if it mirrors/clones the Turbo King (which has the same size book).

- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

More results. Here's Super Constellation tackling Test Game 2 at 30s/move:

As White:
7.0-0
8.Nc3
9.Nb5
10.Nd4
11.Re1
12.Nxe6
13.Nxd5
14.c4
15.Nb5
16.Be3
17.Nbxc7
18.Rxe3
19.Nxd5
20.c4
21.Rxe8
22.Qf1
23.Qf1
24.Qb3
25.Rxb7+
26.Rxa7+
27.Qb6+

As Black:
6. ... d5
7. ... Nf6
8. ... Qf5
9. ... Bc5
10. ... Qg6
11. ... Bg4
12. ... Bd7
13. ... 0-0-0
14. ... Nxc6
15. ... Bd6
16. ... Bxe3+
17. ... Bxe3+
18. ... Bf7
19. ... Qd6
20. ... g5
21. ... Kb8
22. ... g5
23. ... b5
24. ... b6
25. ... Kc8
26. ... Kc8

And here's the Saitek/SciSys Turbo King at 30s/move, both games:

Game 1 (White)
5.d4
6.Bxf7+
7.Qxf3
8.Qxb7
9.Nb5
10.Nxa7
11.Bb5+
12.Qa8
13.Qc6+
14.Qxb5+
15.Bf4
16.Qxd5
17.Qxd5
18.c3
19.Qb7
20.Qxd7+
21.Qd8+

Game 1 (as Black):

5. ... Bf5
6. ... Bxf3
7. ... c6
8. ... Nbd7
9. ... Bd6
10. ... Rb8
11. ... Nxc4
12. ... Bd6
13. ... Ke7
14. ... Nd7
15. ... exd5
16. ... c5
17. ... Qf6
18. ... c6
19. ... Qe4+
20. ... Kf8

Game 2 (as White):
7.0-0
8.Nc3
9.Bd3
10.Nd4
11.Bd3
12.Nxe6
13.Nxd5
14.Nxe7+
15.Be3
16.Be3
17.Nbxc7
18.Rxe3
19.Nxd5
20.c4
21.Rxe8
22.Rh3
23.Qd3
24.Qb3
25.Rxb7+
26.Rxa7+
27.Qb6+

Game 2 (as Black):

6. ... Nb4
7. ... Nf6
8. ... Qg6
9. ... Nge7
10. ... Qf6
11. ... 0-0-0
12. ... Bd7
13. ... 0-0-0
14. ... Bxc6
15. ... Bd6
16. ... Bb4
17. ... Bxe3+
18. ... Bf7
19. ... Kb8
20. ... Kb8
21. ... Kb8
22. ... Re4
23. ... Nxb4
24. ... Nc6
25. ... Kc8
26. ... Kc8


- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

Here's the Radio Shack 2150L, tackling games 1 and 2:

Game 1 (as White):

5.Nf3
6.Bxf7+
7.Qxf3
8.Qxb7
9.Nb5
10.Nxa7
11.Bb5+
12.0-0
13.Qc6+
14.Qxb5+
15.Bf4
16.Qxd5
17.Qxd5
18.c3
19.Qb7
20.Qxd7+
21.Qd8+

Game 1 (as Black):

4. ... Nf6
5. ... Nc6
6. ... Bxf3
7. ... Nc6
8. ... Nbd7
9. ... Bd6
10. ... Rb8
11. ... Nxc4
12. ... Bd6
13. ... Ke7
14. ... Nd7
15. ... exd5
16. ... c5
17. ... 0-0
18. ... c6
19. ... Qe6+
20. ... Kf8

Game 2 (as White):

7.0-0
8.Nc3
9.Ng5
10.Nd4
11.Be3
12.Nxe6
13.Nxd5
14.Nxc6
15.Be3
16.Be3
17.Nbxc7
18.Rxe3
19.Nxd5
20.Rd3
21.Rh3
22.Rh3
23.Kh1
24.Rbb3
25.Qb2
26.Rxa7+
27.Qb6+

Game 2 (as Black):

6. ... Nb4
7. ... Bf5
8. ... Qf5
9. ... Nge7
10. ... Qg6
11. ... 0-0-0
12. ... Bd7
13. ... 0-0-0
14. ... Bxc6
15. ... Bd6
16. ... Bd6
17. ... Bxd5
18. ... Bf7
19. ... Kb8
20. ... Kb8
21. ... Re4
22. ... Qf8
23. ... Nxb4
24. ... Nc6
25. ... Kxb7
26. ... Kc8

- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

And just for grins, let's see how Mephisto Atlanta stacks up at 30s/move. Here's goes:

Atlanta Test game 1 (as White):

5.Nf3
6.Bxf7+
7.Qxf3
8.Qxb7
9.Nb5
10.Nxa7
11.Bb5+
12.Bb5+
13.Qc6+
14.Qxb5+
15.0-0
16.Qxd5
17.Qxd5
18.c3
19.0-0
20.Qxd7+
21.Qd8+

Atlanta Test game 1 (as Black):

5. ... e6
6. ... Bh5
7. ... c6
8. ... Nbd7
9. ... Bd6
10. ... Rb8
11. ... Nxc4
12. ... Nd6
13. ... Ke7
14. ... Nd7
15. ... exd5
16. ... c5
17. ... 0-0
18. ... c6
19. ... Qe4+
20. ... Kf8

Atlanta Test game 2 (as White):

7.Nc3
8.Nbd2
9.Bb5
10.Bd3
11.Bxc6+
12.Nxe6
13.Nxd5
14.Bc4
15.Nb5
16.Be3
17.Nbxc7
18.Rxe3
19.Nxd5
20.Rd3
21.Rxe8
22.Rd3
23.b4
24.Qd4
25.Rxb7+
26.Rxa7+
27.Qb6+

Atlanta Test game 2 (as Black):

6. ... Nb4
7. ... Be6
8. ... Qf5
9. ... Nge7
10. ... Qg6
11. ... Bh3
12. ... Bd7
13. ... 0-0-0
14. ... Bxc6
15. ... Nb4
16. ... Bd6
17. ... Bb4
18. ... Bf7
19. ... Rd7
20. ... g5
21. ... h6
22. ... h6
23. ... Re4
24. ... b6
25. ... Kc8
26. ... Kc8

- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

Hi Reinfeld,

Wow you are on a roll! I will get them all added this evening and post their scores. Thanks a lot for doing all these.

Best regards
Nick
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

Hi Reinfeld,

The Turbo King for Game 1 has the first Black move missing 4. ... ? Can you please provide.

Thanks
Nick
Nick
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

spacious_mind wrote:Hi Reinfeld,

The Turbo King for Game 1 has the first Black move missing 4. ... ? Can you please provide.

Thanks
Nick
Also Atlanta's first Black move in Game 1 is missing.

Thanks
Nick
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

Hi Nick - so sorry. The first Game 1 move for Turbo King and Atlanta (as Black) is ...Nf6. My apologies.

Cutting and pasting too fast regards,

- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

So far, these tests lead to three observations, one of which may force me to eat crow while Steve watches:

1. Igor, Mirage and Excalbur GM are closely related, and I'm betting standard Excalibur GM is closer than Platinum.

2. Krypton Legend/Systema Challenge (an accepted Horvath program) is not, NOT a clone of Igor. The deviations are too great. Igor and Legend deviate 42 percent of the time in Nick's tests. That's well beyond the generally accepted clone horizon of 90 percent, and it suggests more than tuning.

3. Avenger is an unsolved mystery. It deviates from Legend and Ivan in significant ways. I can't agree with mclane - Avenger deviates too much to qualify as a Legend clone - moves differed 77 percent of the time in the two test games.

- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

Here is the updated chart with Reinfeld's tests. I have also included Mephisto Milano to see Schroeder in action.

Image

Atlanta did well in game 1 but quite poorly in game 2. Turbo King and Super Constellation did quite well. Milano also under performs in game 2.

Best regards
Nick
User avatar
Steve B
Site Admin
Posts: 10144
Joined: Sun Jul 29, 2007 10:02 am
Location: New York City USofA
Contact:

Post by Steve B »

Reinfeld wrote:So far, these tests lead to three observations, one of which may force me to eat crow while Steve watches:

1. Igor, Mirage and Excalbur GM are closely related, and I'm betting standard Excalibur GM is closer than Platinum.

2. Krypton Legend/Systema Challenge (an accepted Horvath program) is not, NOT a clone of Igor. The deviations are too great. Igor and Legend deviate 42 percent of the time in Nick's tests. That's well beyond the generally accepted clone horizon of 90 percent, and it suggests more than tuning.

3. Avenger is an unsolved mystery. It deviates from Legend and Ivan in significant ways. I can't agree with mclane - Avenger deviates too much to qualify as a Legend clone - moves differed 77 percent of the time in the two test games.

- R.
No Need For You To Eat Crow..R
The Question as to who programmed the later Excaliburs could very well be a question never fully answered although I suspect it will not stop those from trying(which can only be a good thing)

for me the question was settled years ago ..but I wont repeat all of that again because i have nothing new to add to the debate other then what I have repeated so may times now that it borders on

Ad Nauseam Regards
Steve
Reinfeld
Member
Posts: 486
Joined: Thu Feb 17, 2011 3:54 am
Location: Tacoma, WA

Post by Reinfeld »

Correction - bad math on my part (see bolded/fixed number below):
3. Avenger is an unsolved mystery. It deviates from Legend and Ivan in significant ways. I can't agree with mclane - Avenger deviates too much to qualify as a Legend clone - moves differed 23 percent of the time in the two test games.
The underlying point still feels right to me, however. A dedicated that matches moves less than 80 percent of the time with a dedicated of roughly equal strength doesn't feel like a clone.

- R.
"You have, let us say, a promising politician, a rising artist that you wish to destroy. Dagger or bomb are archaic and unreliable - but teach him, inoculate him with chess."
– H.G. Wells
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

Reinfeld wrote:Correction - bad math on my part (see bolded/fixed number below):
3. Avenger is an unsolved mystery. It deviates from Legend and Ivan in significant ways. I can't agree with mclane - Avenger deviates too much to qualify as a Legend clone - moves differed 23 percent of the time in the two test games.
The underlying point still feels right to me, however. A dedicated that matches moves less than 80 percent of the time with a dedicated of roughly equal strength doesn't feel like a clone.

- R.
Your statement would be acceptable for clones but for identifying same programmers it is not that simple and not really correct in all cases. Take the Horvath Test post for example that I posted and compare Test Game 1 Style 9 with Legend to the standard out of the box style 5. There are 17 move deviations out of 34 moves. That would make it a similarity of 50% coming out of the exact same computer by just changing the style. Based on your comments you would discount it as an absolute certainty that style 9 is a different programmer to style 5.

I had posted these tests so that people can get a complete picture of the difficulty of these discussions.

The Avenger plays very similar to a Horvath so therefore I agree with Thorsten on this one. Besides Igor and Mirage had more variations at level 53 because it is an average time and a more random setting than the test I posted at Level 22 (Igor) and Level 23 (Mirage). You also have to factor in the slight speed difference. The similarity of Avenger would probably be seen even clearer if someone were to play it at level 23 (assuming it has 10 MHz) or Level 22 (if 12 MHz).

Best regards
Nick
User avatar
spacious_mind
Senior Member
Posts: 4014
Joined: Wed Aug 01, 2007 10:20 pm
Location: Alabama
Contact:

Post by spacious_mind »

PS... Since I don't have an Avenger, here is an obvious question. Does it have the built-in play styles and style parameter settings to create your own out of the hundred thousand billion possible settings?
Nick
Post Reply