Jump to content

Bad simulation or bad rule?


thorvald

Recommended Posts

[hv=url=https://www.bridgebase.com/tools/handviewer.html?bbo=y&lin=pn%7CHuman%2CRobot%2CRobot%2CRobot%7Cst%7C%7Cmd%7C3SQ5HAK63DAKQ74CA7%2CSK732H852DJ63CJT6%2CSAT8HQJ74D5CQ8543%2CSJ964HT9DT982CK92%7Csv%7CN%7Cah%7CBoard%205%7Cmb%7CP%7Cmb%7CP%7Cmb%7C2C%7Can%7CStrong%20two%20club%20--%2019%2B%20HCP%3B%2023%2B%20total%20points%3B%20forcing%20to%202N%7Cmb%7CP%7Cmb%7C2D%7Can%7C2D%20bid%20waiting%20--%2011-%20HCP%3B%2012-%20total%20points%3B%20forcing%20to%202N%7Cmb%7CP%7Cmb%7C2N%7Can%7CBalanced%20minimum%20--%202-5%20%21C%3B%202-5%20%21D%3B%202-5%20%21H%3B%202-5%20%21S%3B%2022-24%20HCP%3B%2023%2B%20total%20points%7Cmb%7CP%7Cmb%7C3C%7Can%7CStayman%20--%2011-%20HCP%3B%203-12%20total%20points%7Cmb%7CP%7Cmb%7C3H%7Can%7C2-5%20%21C%3B%202-5%20%21D%3B%204-5%20%21H%3B%202-4%20%21S%3B%2022-24%20HCP%3B%2023%2B%20total%20points%7Cmb%7CP%7Cmb%7C3S%21%7Can%7Cforcing%20H%20raise%20--%204%2B%20%21H%3B%2011-%20HCP%3B%208-12%20total%20points%7Cmb%7CP%7Cmb%7C4C%7Can%7CCue%20bid%20--%202-5%20%21C%3B%202-5%20%21D%3B%204-5%20%21H%3B%202-4%20%21S%3B%2022-24%20HCP%3B%20%21CA%3B%2025%2B%20total%20points%3B%20forcing%7Cmb%7CP%7Cmb%7C7H%7Can%7C4%2B%20%21H%3B%2011-%20HCP%3B%2011-12%20total%20points%7Cmb%7CP%7Cmb%7CP%7Cmb%7CP%7Cpc%7CCJ%7Cpc%7CCQ%7Cpc%7CCK%7Cpc%7CCA%7Cpc%7CDA%7Cpc%7CD3%7Cpc%7CD5%7Cpc%7CDT%7Cpc%7CD4%7Cpc%7CD6%7Cpc%7CH7%7Cpc%7CD2%7Cpc%7CHQ%7Cpc%7CHT%7Cpc%7CH3%7Cpc%7CH2%7Cpc%7CHJ%7Cpc%7CH9%7Cpc%7CH6%7Cpc%7CH8%7Cpc%7CH4%7Cpc%7CD9%7Cpc%7CHA%7Cpc%7CH5%7Cpc%7CDK%7Cpc%7CDJ%7Cpc%7CC3%7Cpc%7CD8%7Cpc%7CDQ%7Cpc%7CS3%7Cpc%7CC4%7Cpc%7CS9%7Cpc%7CD7%7Cpc%7CS2%7Cpc%7CS8%7Cpc%7CS4%7Cpc%7CC7%7Cpc%7CC6%7Cpc%7CC5%7Cpc%7CC9%7Cpc%7CC2%7Cpc%7CHK%7Cpc%7CCT%7Cpc%7CC8%7Cpc%7CS5%7Cpc%7CS7%7Cpc%7CSA%7Cpc%7CS6%7Cpc%7CST%7Cpc%7CSJ%7Cpc%7CSQ%7Cpc%7CSK%7C]399|300[/hv]

 

Trying this on the 2012-version North bids 5 over 4. Bidding 7, simply just seems wrong

 

This was from a challenge, and I gained 3 IMP by accepting I would go down

Link to comment
Share on other sites

Trying this on the 2012-version North bids 5 over 4.

Not too relevant here, since it also plays 4 as a help suit slam try rather than a cue bid.

 

But it's the exactly the same as discussed in previous threads; the GIB database will never, ever, bid a grand slam unless holding the appropriate combined number of points. North can tell that's impossible in this situation, so every single simulated bid will lead to an identical result of 6.

 

(And indeed, over the help suit slam try, 5 ties 4N and 6, it's only picking 5 based on an arbitrary ordering. It didn't consider 7 then because it wasn't allowed to).

 

If enough hands where you can make grand turn up, then it will leap to 7, since that will work better on average. It is not capable of realising that if you go slow, you can simulate to find 7 if it's best later.

Link to comment
Share on other sites

Not too relevant here, since it also plays 4 as a help suit slam try rather than a cue bid.

 

But it's the exactly the same as discussed in previous threads; the GIB database will never, ever, bid a grand slam unless holding the appropriate combined number of points. North can tell that's impossible in this situation, so every single simulated bid will lead to an identical result of 6.

 

(And indeed, over the help suit slam try, 5 ties 4N and 6, it's only picking 5 based on an arbitrary ordering. It didn't consider 7 then because it wasn't allowed to).

 

If enough hands where you can make grand turn up, then it will leap to 7, since that will work better on average. It is not capable of realising that if you go slow, you can simulate to find 7 if it's best later.

 

When simulating and calculating that on average 7 will give net positive larger than 6, it probably lacks the the percentage used to bid grands, by pure odds it is 2-1, but in reality most want better odds - and it is probably not calculating the odds for getting doubled.

Link to comment
Share on other sites

When simulating and calculating that on average 7 will give net positive larger than 6, it probably lacks the the percentage used to bid grands, by pure odds it is 2-1, but in reality most want better odds - and it is probably not calculating the odds for getting doubled.

No, all of that part works perfectly fine. It doesn't need to use human methods for working out what percentage it needs to make to be profitable; all that is computed from the simulated hands and theoretically should work better than the human methods.

 

(But of course, as usual, a low number of simulations is not always 100% accurate).

Link to comment
Share on other sites

No, all of that part works perfectly fine. It doesn't need to use human methods for working out what percentage it needs to make to be profitable; all that is computed from the simulated hands and theoretically should work better than the human methods.

 

(But of course, as usual, a low number of simulations is not always 100% accurate).

 

The simulation does not take into factor that the other table might stop in Game. I just had such a board tonight, where I calculated the odds in favor of a grand. It went down and my opponent played 4

 

[hv=pc=n&s=s8532hakqj64da6ca&d=n&v=b&b=13&a=1d3c3h4c4hp4np6cp?]133|200[/hv]

 

7 looks to be with odds.

 

I also tried to let the bot bid my hand, and noticed, that after the response 6C, there was no simulation and the bot just bid 6H

Link to comment
Share on other sites

The simulation does not take into factor that the other table might stop in Game.

Right. But there is no chance a robot could ever figure out what humans at other tables would do; trying to add that in would more likely cause disasters than solve anything. Assuming the same contract at the other table is very reasonable given the robots' consistency.

Link to comment
Share on other sites

Hardcoding a completely arbitrary figure like that in doesn't really make any sense at all to me, and wouldn't fit into the entire concept of the way it bids, so I can't agree with you there.

 

Using Milton Points, distribution points etc are al arbitrary values :-)

 

Learning about risk - reward in gaming is very interesting

 

If I offer you a bet on toss of a coin giving you odds 6:5 on tails you would (and should) take it, but a factor might be how much is at stake.

 

There has been a lot of studies in that area - what is your limit on these odds (a car, a house or something even more valuable) ?

 

But as we both know the chances of getting any changes in GIB looks to be non-existing

Link to comment
Share on other sites

How many hands does it simulate to have any real chance of bidding 7 (1 or 2 perhaps)

I appreciate there are thousands of tables being played simultaneously. Trying to do some sums on CPUs and threads and processing time and how much is actually done on servers rather than on client machines?

One would think the requirements to simulate all likely contracts once beforehand should be enough - with a decent number of sims - not every time

Maybe BBO use a small cluster of Pentiums

 

As a small test I reduced my number of simulated hands to around 10 and was starting to get a few >60% chances and only used a few percent of a second of an 8-thread processor - bot sure exactly how much resource it used. It takes a lot of hands to get 10 though

 

Does it have any rules at all (common sense) or is everything just DD sims

 

Surely South could be thinking along which slam to go for but not North :)

Link to comment
Share on other sites

Using Milton Points, distribution points etc are al arbitrary values :-)

Precisely, that's the point - simulations are all about trying to avoid using these arbitrary values / rules of thumb that humans have come up with as much as possible.

 

There was a classic example on BridgeWinners a while back when, defending 6NT at IMPs, a robot defender won a trick, had the setting trick in hand, but failed to cash it, despite the fact it was clear that doing so would allow the contract to make on some layouts (and make it did).

 

Humans would always cash the winner, because your only aim is beat the contract.

 

The robot saw that cashing the winner would guarantee down 1, and its return would sometimes allow the slam to make.. but it would also take the contract down 2 or more tricks a significant proportion of the time; enough to average out to a better IMP score, and was actually theoretically better.

 

Introducing hardcoded values would result in so many more bugs - the most obvious would be how are you going to define sacrificing anyway, or a two-way shot for sacrificing when it might make? Or situations when it's completely clearcut both tables will be in slam, and all you're doing is programming in a losing strategy.

 

Agree there is definitely a lot of interesting aspects of game theory here, but while there are lots of things I'd change about GIB, but I wouldn't want to go near this one. Especially when the hand in question is easily resolvable via other means.

Link to comment
Share on other sites

Precisely, that's the point - simulations are all about trying to avoid using these arbitrary values / rules of thumb that humans have come up with as much as possible.

 

There was a classic example on BridgeWinners a while back when, defending 6NT at IMPs, a robot defender won a trick, had the setting trick in hand, but failed to cash it, despite the fact it was clear that doing so would allow the contract to make on some layouts (and make it did).

 

Humans would always cash the winner, because your only aim is beat the contract.

 

The robot saw that cashing the winner would guarantee down 1, and its return would sometimes allow the slam to make.. but it would also take the contract down 2 or more tricks a significant proportion of the time; enough to average out to a better IMP score, and was actually theoretically better.

 

Introducing hardcoded values would result in so many more bugs - the most obvious would be how are you going to define sacrificing anyway, or a two-way shot for sacrificing when it might make? Or situations when it's completely clearcut both tables will be in slam, and all you're doing is programming in a losing strategy.

 

Agree there is definitely a lot of interesting aspects of game theory here, but while there are lots of things I'd change about GIB, but I wouldn't want to go near this one. Especially when the hand in question is easily resolvable via other means.

 

Yes, I remember that board, and Bots have one advantage over humans. A bad board will never affect the next boards :-)

 

And yes, game theory is very interesting.

 

But I don't think you can just base the play purely on statistics.

 

As an example:

 

You are playing against a vulnerable 4Ma and make a simulation (with 20 boards), where 1 card will defeat the contract in one simulation, but giving away an overtrick in 15 other.

 

Statistically you will lose in the long run playing that card, but when you look in the greater perspective, it is rare that the Imp will cost the match, but 13 Imps might well be decisive :-)

 

This is in fact the other side of playing safe, where we pay a small insurance to make sure the contract will win against a bad layout.

 

And I agree, that it is not where I would start if I ever got the chance to improve GIB

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...