running your code in SWI-Prolog, I get
?- solve(X),label(X). X = [3, 5, 2, 1, 4].
Without label :
?- solve(X). X = [3, _G3351, _G3354, 1, _G3360], _G3351 in 4..5, all_different([_G3351, _G3386, _G3389, 2, _G3395]), all_different([3, _G3351, _G3354, 1, _G3360]), _G3386 in 3..5, all_different([_G3386, _G3444, 1, _G3450, _G3360]), _G3389 in 1\/3..5, _G3389+1#=_G3478, _G3492+1#=_G3389, _G3395 in 1\/3..5, _G3478 in 2..6, _G3444#=_G3478#<==>_G3529, _G3444 in 2..5, _G3444#=_G3556#<==>_G3553, _G3444#=_G3568#<==>_G3565, _G3444#=_G3492#<==>_G3577, _G3450 in 2\/5, all_different([_G3354, 4, 3, _G3450, _G3614]), _G3360 in 2\/4..5, _G3354 in 2\/5, _G3614 in 1..2\/5, _G3614+1#=_G3556, _G3568+1#=_G3614, _G3556 in 2..3\/6, _G3553 in 0..1, _G3565#\/_G3553#<==>1, _G3565 in 0..1, _G3568 in 0..1\/4, _G3492 in 0..4, _G3577 in 0..1, _G3577#\/_G3529#<==>1, _G3529 in 0..1.
If I change all_different to all_distinct I get a solution without a label:
.... all_distinct(Nationalities), all_distinct(Colors), all_distinct(Beverages), all_distinct(Cigarettes), all_distinct(Pets), .... ?- solve(X). X = [3, 5, 2, 1, 4].
As you can see, the documents show stronger distribution for all_distinct vs all_different . Running the proposed sample will help to understand the difference between the two:
?- maplist(in, Vs, [1\/3..4, 1..2\/4, 1..2\/4, 1..3, 1..3, 1..6]), all_distinct(Vs). false. ?- maplist(in, Vs, [1\/3..4, 1..2\/4, 1..2\/4, 1..3, 1..3, 1..6]), all_different(Vs). Vs = [_G419, _G422, _G425, _G428, _G431, _G434], _G419 in 1\/3..4, all_different([_G419, _G422, _G425, _G428, _G431, _G434]), _G422 in 1..2\/4, _G425 in 1..2\/4, _G428 in 1..3, _G431 in 1..3, _G434 in 1..6.