Decision trees don't have to be representations of decision making ... Information gain is itself calculated using a measure called entropy, which we first define for the case of a binary decision ...