Reinforcement-Learning-Advertisements Upper Confidence Bound and Thompson Sampling Two text based user behavior learning programs.