Probability theory in machine learning