Reduced Pavlovian Value Updating Alters Decision-Making in Sign-Trackers.
Luigi A E Degni, Lorenzo Mattioni, Claudio Danti, Valentina Bernardi, Gianluca Finotti, Marco Badioli, Francesca Starita, Alireza Soltani, Giuseppe di Pellegrino, Sara Garofalo
Abstract
Open AccessSuccessful reward-guided behavior relies not only on learning actions to obtain rewards but also on learning cues that predict the reward, which motivate and prepare the animal to pursue and consume it. Although these two types of learning-instrumental learning and pavlovian conditioning-have been extensively studied, it remains unclear how the brain updates and arbitrates between these systems, especially when pavlovian signals are irrelevant to decision-making. To address this, we used eye-tracking, pupillometry, and computational modeling in a pavlovian-to-instrumental transfer task with 60 humans (30 females), consisting of three phases: pavlovian phase (learning stimulus-outcome associations), instrumental phase (learning response-outcome associations), and transfer phase (testing pavlovian bias on instrumental responses). Using this approach, we aimed to identify different types of learners and their strategies, especially how individual differences in sign-trackers versus goal-trackers influence pavlovian bias. To that end, we used eye gaze data to categorize participants as sign- or goal-trackers and found that although both groups learned the task, sign-trackers' performance was lower when exposed to pavlovian cues, as they favored options based on their cue-outcome associations. Fitting data with multiple computational models revealed that participants dynamically arbitrated between values estimated through pavlovian and instrumental systems. Importantly, lower performance in sign-trackers was due to slower updating of pavlovian cue values during the transfer phase, not overweighting of pavlovian cue values relative to instrumental action values. Overall, our study offers a computational framework for understanding inflexible decision-making and potential interventions for disorders marked by maladaptive cue reactivity.