|
Main
Date: 19 Sep 2008 22:18:24
From: Guest
Subject: Random number mobility scores
|
Some time back it was mentioned that Hyatt had mentioned a test by Don Beal on random mobility scores for a material only program. I finally got curious enough to dig out an old chess program and try it. The program is crap. It is a mailbox program that was going through a lot of modification when I packed it away. But it seems to be working, and it has enough Winboard ability that works for me to run the Win-At-Chess test through it. The program used only material and a random number generator as a mobility score. No piece tables, no pawn scoring, etc. Just material and a random number generator from Numerical Recipes (I didn't want to use the one that comes with the compiler, of course.) I did a random number of 128 and put half negative and half positive. So about a half a pawn worth of mobility balance. The move ordering was done MVV/LVA with History Heuristic. No SEE move ordering. The capture search does only captures, and all of them. No checks or promotions, etc. are included. There are no search extensions like Null Move, check extensions, etc. There are no pruning methods beyond alpha-beta. No futility moves, no LMR, etc. etc. No transposition tables. Pretty much a basic, niave alpha-beta chess program with an eval of only material and random numbers. I didn't even bother to compile it with optimization enabled. I left it in debug mode. I used epd2wb and ran the Win-At-Chess test through my program, giving it 10 seconds to think. I used a 2ghz laptop. Only one core was used. (Note: This is the original WAC test, not the revised / fixed version.) I got +203 / -77 = 67%. The later ones were where most of the failures occured. As another test, I ran WAC through the same program without the random number generator. Scoring only material. I got +197 / -103 = 65% As a lark, I also tried it with the original evaluator I had in the program. This eval is junk. It was stuff I had cut & pasted from some other simple public domain programs. No effort was made to tune it. I ran very few tests with it. I just added stuff so I had an eval that did more than material. It got +197 / -103 = 65%... (grin) Pretty bad.... I then decided to try the WAC tests again, this time without the mates. The idea being the search itself is finding the mates, not the evaluator. Random: +148 / -80=64%. Material: +148 / -80=64% Odd. Maybe the mobility helped it find a few mates? Or at least picked the right move for the wrong reason? Or maybe it provided enough extra guidence in the search that it was able to go to the next ply and see the mate? I also tried the Bratko-Kopec test. With the random one. It got a total of 11 right. That gives a BK rating of 2000-2199. (And no, Sanny, that's not ELO, even though it's supposed to be.) For the Material only version, it got only 8 right. That's a 1800-1999 BK Rating. The one with the old evaluator, it got a score of 16.5, which is around 2400. (The score table doesn't handle fractional points. For 16, it'd be 2000-2399 and for 17 it'd be 2400+. So around 2400.) Finally, note that these results are just one test. Due to the nature of random numbers, a different see could conceivably cause all of them to give the right answer or wrong answer. But it does appear that a random number generator mobility score is at least as good as plain material and in some cases better. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
|
|
Date: 23 Sep 2008 15:28:09
From: David Richerby
Subject: Re: Random number mobility scores
|
Guest <[email protected] > wrote: > [By using an evaluator that returned material score plus a random > number in the range +/- 50 centipawns,] I got +203 / -77 = 67%. The > later ones were where most of the failures occured. > > As another test, I ran WAC through the same program without the random > number generator. Scoring only material. I got +197 / -103 = 65% This doesn't really surprise me. As has been mentioned, minimax on the randomly perturbed scores approximates mobility. But mobility itself approximates material, since the very reason that a queen is more powerful than a knight is that it has more mobility in typical positions. So, your two tests are, on the one hand, material and, on the other hand, material plus something that approximates material. Now, you'd expect material plus mobility to do slightly better than just material (which it does) because mobility factors in how well the available material can be deployed. So, I'm not surprised by your results. But it's really good that somebody actually sat down and did the experiment. Thank you. Dave. -- David Richerby Psychotic Voodoo Cheese (TM): www.chiark.greenend.org.uk/~davidr/ it's like a lump of cheese that has mystical powers but it wants to kill you!
|
|
Date: 22 Sep 2008 13:18:31
From: Kenneth Sloan
Subject: Re: Random number mobility scores
|
The more interesting test is: actual mobility - with no other term, not even material. -- Kenneth Sloan [email protected] Computer and Information Sciences +1-205-932-2213 University of Alabama at Birmingham FAX +1-205-934-5473 Birmingham, AL 35294-1170 http://KennethRSloan.com/
|
| |
Date: 23 Sep 2008 13:21:11
From: Simon Krahnke
Subject: Re: Random number mobility scores
|
* Guest <[email protected] > (06:02) schrieb: > "Simon Krahnke" <[email protected]> wrote in message > news:[email protected]... >>* Guest <[email protected]> (03:48) schrieb: >> >>>> Compare >>>> >>>> rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - KkQq 0 1 >>>> >>>> with >>>> >>>> 1n2k1n1/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - - 0 1 >>> >>> Compare them how? >> >> Well, using the pure mobility scoring Kenneth supposed. >> >> In the second position white has two more possible moves than black, >> giving it a better score than the first position, where in reality the >> first position is far better in white's perspective. > > I don't think it would show anything. > > It may have a different score, but that would be based on the search from > that point. > > So whatever score or 'best move' would be based on the starting position, > rather than comparing the two. > > And comparing the scores or best moves among the programs for the two > positions wouldn't really show a lot since there's nothing to judge their > result by. No 'best move to make' or 'move to avoid', etc. That might not be the best example positions, but they are easy to come up with. Surely there are lots of real positions where one or two rooks have little or no mobility and changing that is no immediate concern. Your tests also show that taking out material from the equation does no good. mfg, simon .... l
|
| |
Date: 23 Sep 2008 04:25:55
From: Simon Krahnke
Subject: Re: Random number mobility scores
|
* Guest <[email protected] > (03:48) schrieb: > "Simon Krahnke" <[email protected]> wrote in message > news:[email protected]... >>* Kenneth Sloan <[email protected]> (20:18) schrieb: >> >>> The more interesting test is: actual mobility - with no other term, not >>> even material. >> >> Compare >> >> rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - KkQq 0 1 >> >> with >> >> 1n2k1n1/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - - 0 1 > > Compare them how? Well, using the pure mobility scoring Kenneth supposed. In the second position white has two more possible moves than black, giving it a better score than the first position, where in reality the first position is far better in white's perspective. mfg, simon .... beware of errors in the handcrafted fen
|
| | |
Date: 22 Sep 2008 23:02:50
From: Guest
Subject: Re: Random number mobility scores
|
"Simon Krahnke" <[email protected] > wrote in message news:[email protected]... >* Guest <[email protected]> (03:48) schrieb: > >>> Compare >>> >>> rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - KkQq 0 1 >>> >>> with >>> >>> 1n2k1n1/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - - 0 1 >> >> Compare them how? > > Well, using the pure mobility scoring Kenneth supposed. > > In the second position white has two more possible moves than black, > giving it a better score than the first position, where in reality the > first position is far better in white's perspective. I don't think it would show anything. It may have a different score, but that would be based on the search from that point. So whatever score or 'best move' would be based on the starting position, rather than comparing the two. And comparing the scores or best moves among the programs for the two positions wouldn't really show a lot since there's nothing to judge their result by. No 'best move to make' or 'move to avoid', etc. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
| |
Date: 22 Sep 2008 21:25:26
From: Simon Krahnke
Subject: Re: Random number mobility scores
|
* Kenneth Sloan <[email protected] > (20:18) schrieb: > The more interesting test is: actual mobility - with no other term, not > even material. Compare rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - KkQq 0 1 with 1n2k1n1/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - - 0 1 mfg, simon .... l
|
| | |
Date: 22 Sep 2008 20:48:53
From: Guest
Subject: Re: Random number mobility scores
|
"Simon Krahnke" <[email protected] > wrote in message news:[email protected]... >* Kenneth Sloan <[email protected]> (20:18) schrieb: > >> The more interesting test is: actual mobility - with no other term, not >> even material. > > Compare > > rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - KkQq 0 1 > > with > > 1n2k1n1/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w - - 0 1 Compare them how? The score returned? Do a tournament of the different mobility versions using thos positions as a starter? Too much trouble. > > mfg, simon .... l ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
| |
Date: 22 Sep 2008 14:13:30
From: Guest
Subject: Re: Random number mobility scores
|
"Kenneth Sloan" <[email protected] > wrote in message news:[email protected]... > The more interesting test is: actual mobility - with no other term, not > even material. I said I wasn't going to run any more tests.... But alright. That's somewhat interesting. I'll rerun some of the tests with the material removed, leaving just the various forms of mobility. It'll take a day or so to get around to it. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
| | |
Date: 22 Sep 2008 15:16:00
From: Kenneth Sloan
Subject: Re: Random number mobility scores
|
Guest wrote: > "Kenneth Sloan" <[email protected]> wrote in message > news:[email protected]... >> The more interesting test is: actual mobility - with no other term, not >> even material. > > I said I wasn't going to run any more tests.... > > But alright. That's somewhat interesting. > > I'll rerun some of the tests with the material removed, leaving just the > various forms of mobility. > > It'll take a day or so to get around to it. > Also - it's worth noting that a random term (don't know why you want to *call* it "mobility") is, in practice, a good *indirect* measure of actual mobility (back one ply). -- Kenneth Sloan [email protected] Computer and Information Sciences +1-205-932-2213 University of Alabama at Birmingham FAX +1-205-934-5473 Birmingham, AL 35294-1170 http://KennethRSloan.com/
|
| | | |
Date: 22 Sep 2008 20:49:33
From: Guest
Subject: Re: Random number mobility scores
|
"Kenneth Sloan" <[email protected] > wrote in message news:[email protected]... > Guest wrote: >> "Kenneth Sloan" <[email protected]> wrote in message >> news:[email protected]... >>> The more interesting test is: actual mobility - with no other term, not >>> even material. >> >> I said I wasn't going to run any more tests.... >> >> But alright. That's somewhat interesting. >> >> I'll rerun some of the tests with the material removed, leaving just the >> various forms of mobility. >> >> It'll take a day or so to get around to it. >> > > Also - it's worth noting that a random term (don't know why you want to > *call* it "mobility") is, in practice, a good *indirect* measure of actual > mobility (back one ply). Right. That's why I simplify and call it 'mobility'. The random number is taking the place of the mobility score. As far as the sarch is concerned, it really is the mobility score. Hence, random mobility score. For this discussion, we all know what's being tested. Although it's only 'accurate' at the ply above, rather than the current ply, the closer to root, the greater the chance of it being closer to being accurate. I don't know what the math would be to calculate how many plies would be needed for it to be close enough as to make no statistical difference. Figuring 35 moves per position.... (frown) I'd say at least 8 plies. So I'm nto sure I'd call it "a good *indirect* measure of actual mobility". Better than expected, tolerable, etc. But not 'good'. What it does have going for it is that it's faster than a regular mobility evaluation, and therefor you can search a little deeper. My 10 second tests aren't super deep, but a longer 3 minute search would get better results. Speaking of which, I wonder how much of the BK test is influenced by the deeper search, over the 10 second search I'm doing for the WAC tests. These tests aren't meant to be super accurate, either. I'm not doing nearly enough tests, and for the random mobility, I'd have to try various ranges of random numbers, etc. (Doing the -64..+63 is likely too large. A smaller number would be more realistic, but I didn't think about that when I ran that first test. I was thinking more about making sure I was getting enough randomness in it to have a noticable effect.) I'm not interested enough to redo the tests. It's just a curiosity. I'm not actually planning on using a random number generator in my program. This was just a novelty. > > -- > Kenneth Sloan [email protected] > Computer and Information Sciences +1-205-932-2213 > University of Alabama at Birmingham FAX +1-205-934-5473 > Birmingham, AL 35294-1170 http://KennethRSloan.com/ > ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
|
Date: 20 Sep 2008 12:21:43
From: Guest
Subject: Re: Random number mobility scores
|
I have some more results for the random mobility score. I was originally interested in how it compared to just material alone, but then I wondered how it compared to real mobility scoring. Here are the program versions. Same junk program as before. Mtl is material scoring only. Rand: Addition of -64..+63 random value to simulate mobility. I just lopped off some bits and made half negative. The asymetry might cause some issues, but I don't care enough to repeat it with balanced results or a reduced range. Eval: The original eval in the program. Primative. A little pawn knowledge, a little mobility, queen/king closeness, a tiny bit of king info. Just stuff hacked together, mostly Mob: Material and two calls to GenAllMoves(Side). Mob2: Same as "Mob" but I don't bother to store the moves, pawn promotions are countd as 1 instead of 4, and I don't do castling. Faster. PMob: Material and pawn based mobility. The mobility the sliders would have if there were only pawns on the board. Only the sliders are scored. (This was alread in the full eval, so I just cut out the unrelated stuff.) Bratko-Kopec test. (2=1/2 4=1/4) Mtl : 100000100001011001001100 = 8 = 1800-1999 Rand: 100100100101011100101001 = 11 = 2000-2199 Mob : 100100110001011001101000 = 10 = 2000-2199 Mob2: 100110140001011001001000 = 9.25= 2000-2199 PMob: 101110110001111101101001 = 15 = 2200-2399 Eval: 101111110201011111101100 = 16.5 = 2400'ish Tact: T---T-T--T-T-TTT-TT-TT-- Pos : -PPP-P-PP-P-P---P--P--PP (The BK test are 12 tactical & 12 positional / Levers, as indicated above.) Tactical Bias = 12*(T-L)/S Tact: T---T-T--T-T-TTT-TT-TT-- Mtl : 100000100001011001001100 = 8 Bias = 12 Rand: 100100100101011100101001 = 9 Bias = 7.6 Mob : 100100110001011001101000 = 8 Bias = 7.2 Mob2: 100110140001011001001000 = 8 Bias = 8.75 PMob: 101110110001111101101001 = 11 Bias = 5.6 Eval: 101111110501011111101100 = 11.5 Bias = 4.7 Tact: T---T-T--T-T-TTT-TT-TT-- So we can see that just material is entirely matierially biased. No surprise. Adding any form of mobility improves the bias. It's still very materially biased, but it helps. It is a little odd that Mob2 did worse than Mob. It could be sheer statistical randomness, though. More tests might have different results. Classic Win-At-Chess test. Mtl +197 / -103 = 65% Rand +203 / -77 = 67% Mob +179 / -121 = 59% Mob2 +189 / -111 = 63% PMob +194 / -106 = 64% Eval +197 / -103 = 65% The WAC tests are very tactically oriented. Very simple tests, too. In this case, the random mobility provides just a little guidence with minimal impact on the speed of the search. A better quiscient search would probably help these tests, regardless of the eval. So how do I interpret the results... Well, the mobility ones and the full eval do run slower. WAC is very tactical so a slow prgram is at a little disadvantage. Mine is particularly slow (compared to modern programs), and it's dumb (by nearly any standard), so anything that slows down the program is going to be at a disadvantage for a tactically niave / slow program. Hence, the material only does pretty well. And the random one does pretty well too because it provides a tiny bit of smarts to the program at very little cost. The BK positions are a combination of tactics and positional. In this case, the full evaluator makes up for its slowness by being smart enough to find the right move anyway. Even mobility, in any form, helps in the BK tests. Although they do slow down the program, it helps over all. So, my final conclusion is this.... Random mobility is a good thing if you are doing just material. It's quick, it's easy to add, and it actually helps some. But it's no substitute for real mobility scoring, especially if you can do that fast, rather than just doing calls to GenAllMoves(). It looks like pawn based mobility is a good compromise. Quick to generate but decent results. If you don't have time to program that.... then go for random mobility. And of course, your results may vary depending on the random generator you use, the speed of the program, the number of tests you do, etc. etc. I have no plans to run more tests to see if piece square tables would benefit from random moblity, etc. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
| |
Date: 22 Sep 2008 20:48:16
From: Guest
Subject: Re: Random number mobility scores
|
At Kenneth's suggestion I've run some more tests. He wondered how things would perform without material balance. Just mobility. The program is the same as the one used before. Primitive alpha-beta, MVV/LVA & History Heuristic, capture search only, no trans table, simple eval. Possibly buggy, even. Same 2ghz computer. The WAC (old version, not new version) is done at 10 seconds per position. The Bratko-Kopec test is done for 2 minutes, as prescribed. Here are the program versions. Mtl is material scoring only. Rand: Addition of -64..+63 random value to simulate mobility. I just lopped off some bits and made half negative. The asymetry might cause some issues, but I don't care enough to repeat it with balanced results or a reduced range. Eval: The original eval in the program. Primative. A little pawn knowledge, a little mobility, queen/king closeness, a tiny bit of king info. Just stuff hacked together, mostly Mob: Material and two calls to GenAllMoves(Side). Mob2: Same as "Mob" but I don't bother to store the moves, pawn promotions are countd as 1 instead of 4, and I don't do castling. Faster. PMob: Material and pawn based mobility. The mobility the sliders would have if there were only pawns on the board. Only the sliders are scored. (This was alread in the full eval, so I just cut out the unrelated stuff.) Kenneth suggested the following tests, without the material score. 0eval: The original evaluator but with the material removed. For reference. 0mob: Two calls to GenAllMoves(Side). No material score. 0mob2: faster, slightly modified GenAllMoves, but no material score. 0pmob: Pawn based mobility. No material score. 0rand: random number mobility score. No material score. I also ran: eval-NoMob: The original evaluator, but with the mobility (pawn based mobility for the bishop & rook) removed. Just to test how much the P-Mob helps the evaluator. Bratko-Kopec test. (2=1/2 3=1/3 4=1/4) Mtl : 100000100001011001001100 = 8 = 1800-1999 Rand : 100100100101011100101001 = 11 = 2000-2199 Mob : 100100110001011001101000 = 10 = 2000-2199 Mob2 : 100110140001011001001000 = 9.25= 2000-2199 PMob : 101110110001111101101001 = 15 = 2200-2399 Eval : 101111110201011111101100 = 16.5 = 2400'ish 0rand: 100110000001000000001000 = 5 = 1600-1799 0mob : 100001010011000100000000 = 6 = 1600-1799 0mob2: 100004000011000000004000 = 3.5 = 1300-1599 0pmob: 101101010000000000001001 = 7 = 1800-1999 0eval: 101011010301000000000000 = 6.3 = 1700-1800 EvNoM: 101111110101011100101100 = 15 = 2200-2399 111111111122222 123456789012345678901234 Tact : T---T-T--T-T-TTT-TT-TT-- Pos : -PPP-P-PP-P-P---P--P--PP (The BK test are 12 tactical & 12 positional / Levers, as indicated above.) Tactical Bias = 12*(T-L)/S Tact : T---T-T--T-T-TTT-TT-TT-- Mtl : 100000100001011001001100 = 8 Bias = 12 Eval : 101111110501011111101100 = 11.5 Bias = 4.7 EvNoM: 101111110101011100101100 = 11 Bias = 5.6 0eval: 101011010301000000000000 = 3.3 Bias = 0.5 Rand : 100100100101011100101001 = 9 Bias = 7.6 Mob : 100100110001011001101000 = 8 Bias = 7.2 Mob2 : 100110140001011001001000 = 8 Bias = 8.75 PMob : 101110110001111101101001 = 11 Bias = 5.6 Tact : T---T-T--T-T-TTT-TT-TT-- 0rand: 100110000001000000001000 = 4 Bias = 7.2 0mob : 100001010011000100000000 = 3 Bias = 0.0 0mob2: 100004000011000000004000 = 2.25 Bias = 3.4 0pmob: 101101010000000000001001 = 2 Bias = -5.1 Tact : T---T-T--T-T-TTT-TT-TT-- The 0pmob (pawn based slider mobility only, no material) gets a bias of -5.1 which says it's pretty positionally oriented, rather than tactially.) Classic Win-At-Chess test. Mtl +197 / -103 = 65% Rand +203 / -77 = 67% Mob +179 / -121 = 59% Mob2 +189 / -111 = 63% PMob +194 / -106 = 64% Eval +197 / -103 = 65% 0rand +133 / -167 = 44% 0mob +132 / -168 = 44% 0mob2 +141 / -159 = 47% 0pmob +155 / -145 = 51% 0eval +131 / -169 = 43% evNoMob +205 / -95 = 68% For the BK tests, it's obvious that a lack of material makes a significant difference. In spite of that, the full eval manages to do pretty well. And pawn based mobility still does better than common pseudo-legal move mobility. For the WAC, all of them are also effected by lack of material. Not too surprising considering WAC is mostly tactics. Again, no surprises there. Unless you consider that they did as well as they did without any material balance.... The eval with no mobility got a higher WAC score than with mobility. Could be the eval is smart enough it doesn't matter much, or it could be the mobility scoring is taking enough extra time that it's slowing the search. Without the extra cost, it can do better tacticall. If that supposition is true, a better, smarter Q-Search would likely be an advantage Considering the performance in the BK test, I suspect a better Q-search is what's needed. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
|
Date: 20 Sep 2008 17:28:15
From: Simon Krahnke
Subject: Re: Random number mobility scores
|
* Guest <[email protected] > (05:18) schrieb: > Some time back it was mentioned that Hyatt had mentioned a test by Don Beal > on random mobility scores for a material only program. > > I finally got curious enough to dig out an old chess program and try it. Cool! > The program used only material and a random number generator as a mobility > score. No piece tables, no pawn scoring, etc. Just material and a random > number generator from Numerical Recipes (I didn't want to use the one that > comes with the compiler, of course.) [...] Of what kind is that random generator? A Mersenne Twister? > Pretty much a basic, niave alpha-beta chess program with an eval of only > material and random numbers. I didn't even bother to compile it with > optimization enabled. I left it in debug mode. > > > I used epd2wb and ran the Win-At-Chess test through my program, giving it 10 > seconds to think. I used a 2ghz laptop. Only one core was used. > > (Note: This is the original WAC test, not the revised / fixed version.) Which one did you post here? > I got +203 / -77 = 67%. The later ones were where most of the failures > occured. > > As another test, I ran WAC through the same program without the random > number generator. Scoring only material. I got +197 / -103 = 65% > > > As a lark, I also tried it with the original evaluator I had in the program. > This eval is junk. It was stuff I had cut & pasted from some other simple > public domain programs. No effort was made to tune it. I ran very few > tests with it. I just added stuff so I had an eval that did more than > material. > > It got +197 / -103 = 65%... (grin) Pretty bad.... It would be interesting to know if all they solved the same 197 positions. Do you have detailed logs? > I then decided to try the WAC tests again, this time without the mates. The > idea being the search itself is finding the mates, not the evaluator. The evaluator should help. :-) > Random: +148 / -80=64%. > > Material: +148 / -80=64% > > Odd. Maybe the mobility helped it find a few mates? Or at least picked the > right move for the wrong reason? Or maybe it provided enough extra guidence > in the search that it was able to go to the next ply and see the mate? Answers a log could give. Finalfun scored 188 in 5s each on a Pentium 4 at 3.4GHz. It must be doing something right. Transposition table? > But it does appear that a random number generator mobility score is at least > as good as plain material and in some cases better. Thanks for testing. mfg, simon .... l
|
| |
Date: 21 Sep 2008 17:54:06
From: Simon Krahnke
Subject: Re: Random number mobility scores
|
* Guest <[email protected] > (19:17) schrieb: > "Simon Krahnke" <[email protected]> wrote in message > news:[email protected]... >>* Guest <[email protected]> (05:18) schrieb: >> >> Of what kind is that random generator? A Mersenne Twister? > > Numerical Recipes 3rd ed., RanLim32. It's the only 32 bit random generator > they recommend and provide code for. Well, that's an investment. 50.95 Euros at Amazon. >> Finalfun scored 188 in 5s each on a Pentium 4 at 3.4GHz. It must be >> doing something right. Transposition table? > > That could certainly be it. > > This program has no trans table at all, and poor move ordering. The best move from the TT is all the move ordering Finalfun has. I never got to implement MVV/LVA. > It's entirely possible that if you ran these tests, you'd get very different > results than what I do. Let's see if I can get my new engine to actually generate moves. Right now, it knows what a board is, and that there are piece on it. That's not really much. I'm kind of working on preparation for chess960 compatible castling now. mfg, simon .... slow pace
|
| | |
Date: 21 Sep 2008 11:40:08
From: Guest
Subject: Re: Random number mobility scores
|
"Simon Krahnke" <[email protected] > wrote in message news:[email protected]... >* Guest <[email protected]> (19:17) schrieb: > >> Numerical Recipes 3rd ed., RanLim32. It's the only 32 bit random >> generator >> they recommend and provide code for. > > Well, that's an investment. 50.95 Euros at Amazon. You can read it online for free on their website. They make it available as PDFs. They don't provide seperate code to download, but it's only a few lines to type in. Plus, if you look hard enough, you can find places online where people have already typed it in for you. I did some random number testing some time back, and it's actually pretty hard to find a good random number generator even for things like the hash keys. A lot of the simplier ones on the web have very poor hamming distances for the hash keys, even if they are okay for normal random numbers. On the other hand, even some of the classic simple ones can do okay for hash keys if you do things right. They aren't random in the mathematical sense, but the hamming distances can be good enough for a hash table. I think many people just end up using Hyatt's method. You make a few throw-away calls until you get to the point where you get the best hamming distances for your hash table. Hyatt says it's based on one in NumRec, which was based on one in Knuth, but I haven't seen the sources. The only thing I don't like about it is it uses precomputed data. >>> Finalfun scored 188 in 5s each on a Pentium 4 at 3.4GHz. It must be >>> doing something right. Transposition table? >> >> That could certainly be it. >> >> This program has no trans table at all, and poor move ordering. > > The best move from the TT is all the move ordering Finalfun has. I never > got to implement MVV/LVA. Wow. Most people do MVV/LVA & history heuristic long before they do a trans table. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
| |
Date: 20 Sep 2008 12:17:27
From: Guest
Subject: Re: Random number mobility scores
|
"Simon Krahnke" <[email protected] > wrote in message news:[email protected]... >* Guest <[email protected]> (05:18) schrieb: > > Of what kind is that random generator? A Mersenne Twister? Numerical Recipes 3rd ed., RanLim32. It's the only 32 bit random generator they recommend and provide code for. I could have done a 64 bit using long long, but I wanted to keep the program in 32 bit clean. >> (Note: This is the original WAC test, not the revised / fixed version.) > > Which one did you post here? I posted the original one. When I ran my tests, that was handy and I wasn't expecting to run so many tests. Plus, some of those 'corrected' moves actually take quite a bit of effort. Not much chance my program would get them right anyway. >> As a lark, I also tried it with the original evaluator I had in the >> program. >> This eval is junk. It was stuff I had cut & pasted from some other >> simple >> >> It got +197 / -103 = 65%... (grin) Pretty bad.... > > It would be interesting to know if all they solved the same 197 > positions. Do you have detailed logs? Yeah. I dumped the results of each test to seperate files. But I don't really have the urge to look into them. I was just curious about the end results, not in tuning my evaluator. My guess is the poor results are due to a slow down in the program and a poor Q-search. It's doing just a capture search. And there could even be bugs in the program. It was a program 'in transition'. It had never really been finished and had been in a constant state of change. > Finalfun scored 188 in 5s each on a Pentium 4 at 3.4GHz. It must be > doing something right. Transposition table? That could certainly be it. This program has no trans table at all, and poor move ordering. It's entirely possible that if you ran these tests, you'd get very different results than what I do. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! >100,000 Newsgroups ---= - Total Privacy via Encryption =---
|
|