Contrast is what focuses attention. In this case you are drawn to the face due to the background image being largely made up of highlights. The catchlights are the key here along with the white fur framing the face.
I would add that there's no detail in the white to hold your attention, so your eyes slide over to where they can see detail.











However, there is something about the colors in the first image that is bothering me but I can't figure out what (or don't know how to make it better). Any thoughts? 




