So i'm trying to find the probability of how often can you win the "Impossible bet".

Question

1 vote

The general way you solve the bet is on YouTube already (Minute Physics) but now I'm trying to find the probability that you win. I made code that follows the logic explained in the video but it doesn't seem to work and will give me a 0% succes rate at the end.

Here is the YouTube video:

https://www.youtube.com/watch?v=C5-I0bAuEUE

And here is my code:

clear;
clc;
close all;
%Impossible bet
%100 people have 1 dollar with a number on it which get put on boxes with numbers on it 
%100 boxes with the numbers were shuffled randomly with different numbers inside
Per = 1:100;
Box = randperm(100);
success = 1;
nb_success = 0;
[~,nb_box] = size(Box);
nb_max_open = nb_box/2;
for bet = 1:10^4
for index_person = 1:nb_box
index_box_to_open = index_person;
nb_opened_box = 0;
found = 0;
while (~ found) && nb_opened_box < nb_max_open
    found = (index_person == Box(index_box_to_open));
    index_bo_to_open = Box(index_box_to_open);
    nb_opened_box = nb_opened_box + 1;
end
success = found && success;
ifnot success;
    break
end
 if success == 1
    nb_success = nb_success + 1;
   end
end
prob = (nb_success/bet).*100;
disp(prob);

4 Comments
Show 2 older comments Hide 2 older comments

Fisnik Reci on 21 Dec 2019

The way i see it is that if you don't use the way that is said on the video it is basically impossible but if you do he says the chances go to about 30%

Now this is an excercise that was given to me as practice to see what precentage i get but i still can't do it.

the cyclist on 21 Dec 2019

I would recommend thoroughly commenting your code, explaining what it does. This will help us decipher what is the intended behavior of each part of the code, and possibly spot any errors.

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

John D'Errico on 22 Dec 2019

Edited: John D'Errico on 22 Dec 2019

Open in MATLAB Online

3 votes

First, some logical thought. (That has absolutely nothing to do with MATLAB.) Does the argument made hold water?

We can think of any permutation of the numbers 1:100 (or 1:n) in general as representing a graph, in the sense that box n maps to some other box, based on the number the box contains. (A box may even rarely map to itself for some boxes.)

But because this is a permutation of the numbers 1:n, then any box can only have one other box that maps to it, and there will always be exactly one box that maps into box k. This is true because we are working with a permutation of the numbers 1:n.

Next, if we start out in box #k, MUST we always return to box #k? The only way this could fail is if it was possible for us to have a sequence that would go something like

A --> B --> C --> D --> B --> C --> D --> B ...

Can you see why this is impossible, under the conditions we have set up the boxes? For a sequence like the above to happen, we MUST have TWO boxes that each point to box B. Thus in the postulated search sequence, box A and box D must both point to box B. Since we set up the numbers in the boxes as a permutation of the numbers 1:n, that can never happen. Therefore, if we start in box #k, then we will always returns to box number k. A cycle must always exist that contains box k, so if we start at box k, we will eventually return to box k, although the number of steps required for that event may be the maximum possible.

The only way we can fail then is if the cycle containing box k is if the associated cycle happens to pass throguh more boxes than 50.

In fact, any permutation of 1:n can result in multiple cycles, some of a short length, some longer. For example, consider the numbers 1:20. I'll just pick one random sample.

[1:20;randperm(20)]
ans =
  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20
 12 20  2 14 15 10 13 18  8  9  1  5 17 11  7  6 16  3  4 19

What cycles can I find there? Only 1 cycle in this permutation.

1--> 12 --> 5 --> 15 --> 7 --> 13 --> 17 --> 16 --> 6 --> 10 --> 9 --> 8 --> 18 --> 3 --> 2 --> 20 --> 19 --> 4 --> 14 --> 11 --> 1

Hmm, this case would fail. Any box I start in will take exactly 21 steps before I return to that box.

[1:20;randperm(20)]
ans =
  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20
  8  9 17 14 18 12  2 10 16 20  3  7 19  4  1 13 11  5 15  6

Here, we find 4 distinct cycles.

1 --> 8 --> 10 --> 20 --> 6 --> 12 --> 7 --> 2 --> 9 --> 16 --> 13 --> 19 --> 15 --> 1

3 --> 17 --> 11 --> 3

4 --> 14 --> 4

5 --> 18 --> 5

As you can see, each cycle is disjoint from the other cycles. At the same time, one of the 4 disjoint cycles was of length greater than 50% of the total number of elements. So this permutation is also a failed permutation, in that if we started in any of the 13 boxes in that first cycle, it will take exactly 13 steps to return to that box. And returning to the start box means we have found the box containing our number.

One thing you do need to recognize if that IF we start in box #k, then we must ALWAYS return to box #k by following the chain thus induced. The only question is the expectation of how many steps it will take at most. The question really reduces to determining the probability that a permutation of 1:100 will contain no cycle of length greater than 50. (The claim made is the frequency is a wee bit over 30% for permutations of the number 1:100.)

Can we use the tools in MATLAb to do this for us? Well, yes, we can do a lot.

First, I'll create a random permutation graph. I'll do it for the numbers 1:20. The function conncomp really does it all.

s = 1:20;
t = randperm(20);
G = graph(s,t);
conncomp(G)
ans =
     1     1     1     1     1     1     1     1     1     1     2     1     1     1     1     1     1     2     3     2

So this time we had 3 disjoint cycles, but one of them was a long one.

Do I really need to write code to estimate that probability using simulation? After all, that was your assignment. I'll look back in tomorrow morning. In fact, the functions graph and conncomp do almost all the work for you. accumarray would do the rest, IF you understand what it tells you. (If you are not allowed to use the graph tools to solve the problem, too bad. I won't re-write them.) Consider what this little code fragment tells you.

t = randperm(20);
G = graph(s,t);
conncomp(G)
ans =
     1     1     2     2     2     3     2     2     1     3     2     2     2     3     1     1     2     1     2     3
     
accumarray(conncomp(G)',1)
ans =
     6
    10
     4
     
max(accumarray(conncomp(G)',1))
ans =
    10

Edit: Since I know the answer to that silly question above, what does this tell you?

n = 100;
s = 1:n;
N = 100000;
success = 0;
for i = 1:N
  t = randperm(n);
  G = graph(s,t);
  if max(accumarray(conncomp(G)',1)) <= n/2
    success = success + 1;
  end
end
disp(success/N)
    
      0.31181

Of course, if you turn in this terse little code fragment as the solution to your homework assignment, my guess is your teacher will know you did not do it yourself. Que sera, sera...

Post edit:

There are a couple of important points to understand here. I think some of the commenters missed the point, not fully thinking through the algorithm as described.

Can you randomly have fallen onto the wrong cycle? NO! If you have number k, then you start by looking at box #k. That induces some sequence of numbers that starts with the number k. Eventually, if you end up searching through every box, this tells us only that there is only one cycle induced by the particular permutation, and that is a really large cycle. In that case, everybody fails to find their original number in a sufficiently low time by this algorithm.
Suppose a really large cycle exists that spanns ALL of the boxes. Will this algorithm start you at some random point in that cycle, so that even though there is a cycle of length 100, you might find your number before 100 boxes have been searched? NO! That is like the joke about why are your missing car keys always in the last place you look - you stop looking because there is no reason to look anymore. Effectively, if a long cycle exists, and you start loooking for #k in box number k, then if a box ever points you back into box #k befoe you have looked in every box, then the cycle is not one that is the fullest possible length.
Can some people happen to have numbers that allow them to suceed in fewer than 100/2=50 steps? Yes. But the gist of the algorithm posed is that EVERYBODY needs to succeed, or the case is deemed to have failed. For example...

I ran off a couple of sample graphs, looking for one that proves my point, but here is the one I chose:

n = 100;
s = 1:n;
t = randperm(n);
G = graph(s,t);
accumarray(conncomp(G)',1)
ans =
    77
    21
     1
     1

Here is a permutation where out of 100 numbers, there is one cycle of length 77. So if you start from ANY of those boxes, say box #k, while looking for the number k? In every case, it will require you to search through EXACTLY 77 boxes before you find the box that directs you back to the box you started from. That final box contains the number you were searching for.

However, there are also two unit cycles, where a box contains its own number. So this permutation is not a "derangement". A derangement is a permutation that has no numbers that appear in their original position. (Yes, there is even a name for such a result.)

https://en.wikipedia.org/wiki/Derangement

So if you happen to be one of those two people for this permutation, if you start with that number, then the box you will open first contains your own number. So this particular permutation is NOT a derangement.

There is also a short cycle here, of length 21. So for this particular permutation, there are actually 23 people who will succeed in finding their own number. Two will find it immediately. And 21 of them will find their number after 21 steps. But if you happen to lie in the cycle of length 21, no matter what number you hold, you will always run through all 21 boxes before you are done. The important point is that for any given permutation, unless ALL 100 numbers lead to a successful conclusion, then the algorithm is deemed to have failed on that permutation. And that explains why the simulation I ran was sufficient to determine if a solution exists for a given permutation.

Finally, why did the code that Fisnik wrote fail? I can best only say that it was not written to solve the problem. There are literally too many problems with that code for me to list. I'm sorry, but that is true. You need to fully understand the algorithm described to search through the boxes, and what that entails in terms of permutations. I think you also need to understand things like derangements to fully appreciate things here. Could code have been written by me to solve this problem without using the graph tools in MATLAB? Of course, but I won't write that code. As well, I have not written what I would consider a complete proof that the probability is around 31% that a permutation allows success using this algorithm. That would get deeply into the theory of things like derangements, and I won't go there. This answer is already far too long.

I think this now covers all possible things that need be discussed, at least in my eyes. If you are confused, then you should start by a careful re-reading of my answer, but I can try to entertain further thoughtfully posed questions, as long as you have thoughtfully read my full answer here and the analysis contained therein.

3 Comments
Show 1 older comment Hide 1 older comment

John D'Errico on 22 Dec 2019

Edited: John D'Errico on 22 Dec 2019

Open in MATLAB Online

Use MATLAB. You cannot learn to use a tool unless you use it. While that may seem a bit silly, it is true. The point is, you can use MATLAB to solve problems of all sorts. Many good ones can be found in Cody, as a good place to start. These are problems posed to be answered in MATLAB. They are of all levels, some are trivial, but some can be difficult, especially as you are starting out. A nice thing about Cody is you get reinforcement, by earning site rep. After you solve a problem though, you can also learn about how others may have solved it. (The flaw with Cody is it is too easy to game the site, with solutions that employ what are essentially poor programming skills. But if your goal is to learn MATLAB, Cody is a great place to do so.)

https://www.mathworks.com/matlabcentral/cody/?s_tid=gn_mlc_cody

There are other problem sites that can be found online, with varying degrees of difficulty. If Cody and others all seem too trivial, then go on to Project Euler.

https://projecteuler.net/archives

These form a large set of problems, all of which require a numerical solution of some sort. I've done something like 300-400 of them over some years, all but one of the problems I solved using MATLAB. (The odd one out I did essentially on paper. But for a while my goal was to solve one PE problem per day.) If you start on the first page of problems, there should be at least a few that are doable with only basic programming skills.

A great thing about the PE problems is they can also teach you a great deal about mathematics. They will stretch your abilities though, as they probably will for almost anyone.

Another way to learn MATLAB is to scan through the tools. For example, maybe the find function seems interesting to you, and you have found it to be useful recently. So, use the which utility, to see where find lives:

which find
built-in (/Applications/MATLAB_R2019b.app/toolbox/matlab/elmat/find)

find is grouped in the elmat tools in MATLAB. So, now use MATLAb to get help about the tools in elmat.

help elmat
  Elementary matrices and matrix manipulation.
 
  Elementary matrices.
    zeros       - Zeros array.
    ones        - Ones array.
    eye         - Identity matrix.
    repmat      - Replicate and tile array.
    repelem     - Replicate elements of an array.
    linspace    - Linearly spaced vector.
    logspace    - Logarithmically spaced vector.
    freqspace   - Frequency spacing for frequency response.
    meshgrid    - X and Y arrays for 3-D plots.
    accumarray  - Construct an array with accumulation.
    :           - Regularly spaced vector and index into matrix.
 
  Basic array information.
    size        - Size of array.
    length      - Length of vector.
    ndims       - Number of dimensions.
    numel       - Number of elements.
    
 ...

There is much more to be found there, but I've just shown the first few functions listed in elmat. Do any of those functions seem useful to you? If you know what they all do, then great. But odds are, there are a few utilities in there that will do something you might find useful. So then go on to read the help, perhaps for accumarray or some other tool. Look at the examples. At the end of the help fro accumarray, you will find this line:

See also full, sparse, sum, function_handle.

That suggests some functions the authors thought might be related to accumarray. Follow down the trail. Read the help for anything that strikes you as interesting.

What else can you do? You can find friends in class who have the same problems as you. I'm NOT suggesting that you do the work together. That is a bad way to collaborate, and should get you in a great deal of trouble, at least in my opinion. However, a good way to work is as I did on at least one difficult class when I was in grad school. There were only three students in one of my classes. After EVERY class, we would all sit down and discuss the homework as assigned, making sure we understood all of the deliverables for that assignment, and sometimes how to attack a difficult problem.

The best way to learn a tool is to actively need to use it. No matter what, this will take some effort, certainly so if you lack general programming experience. But MATLAB skills can be of value to you in the future, so it might be worth the investment of time and energy.

Fisnik Reci on 28 Dec 2019

Thanks again John I will make sure that i start to use MATLAB more and more.

Sign in to comment.

So i'm trying to find the probability of how often can you win the "Impossible bet".

4 Comments
Show 2 older comments Hide 2 older comments

Accepted Answer

3 Comments
Show 1 older comment Hide 1 older comment

More Answers (0)

Categories

Products

Tags

Community Treasure Hunt

So i'm trying to find the probability of how often can you win the "Impossible bet".

4 Comments Show 2 older comments Hide 2 older comments

Accepted Answer

3 Comments Show 1 older comment Hide 1 older comment

More Answers (0)

Categories

Products

Tags

See Also

Community Treasure Hunt

4 Comments
Show 2 older comments Hide 2 older comments

3 Comments
Show 1 older comment Hide 1 older comment