MIT researchers have developed a reinforcement learning method, RLCR, that trains AI models to provide calibrated confidence estimates alongside answers, reducing overconfidence by up to 90% without ...