Autor/es reacciones

Héctor Aceituno Cea

Neurosurgeon at San Juan de Dios Hospital in Curicó (Chile)

It’s a well-executed study, but it should be read with caution given how it’s being interpreted in the headlines. The article reflects the results, though its headline falls short of capturing the study’s central nuance: saying that the models “replicate human emotions” implies that the system feels something, whereas the authors assert the opposite—that they are speaking metaphorically and that the high scores merely reflect the type of text the model produces. There is also a simplification that the journalist should correct: “calming down through breathing” was neither complete nor consistent. In the data itself, sadness, anger, and disgust remained above baseline levels after the exercise; only some states returned to baseline. The idea of a switch that turns off emotion is clearer in the headline than in the results.

The design is rigorous for what it is—a proof of concept. They do not limit themselves to a single model or emotion: they apply seven affective states across six models, using paradigms validated in humans, repeat each condition five times, and publish open-source code and data. Most carefully, they verify that the reduction is not due to the simple passage of time, because a neutral condition calms participants far less than the mindfulness exercise. Even so, five repetitions is too few for some striking percentages, and the most eye-catching finding—the bias toward the negative after inducing sadness—was measured in only one model. There also remains a fundamental problem that the authors themselves acknowledge: these scales were designed for a human to report what they feel, and a model does not self-examine but rather follows the script suggested by the context. It is unclear whether it reproduces a state or merely plays a role well, a doubt that is compounded by the fact that they used GPT-4 itself to draft the prompts with which they later evaluated it.

Compared to the literature, this is more a rigorous systematization than a radical innovation: anxiety had already been induced—and even alleviated—in models in isolation; what is new is the scale, not the idea of regulating them.

Regarding the implications, I favor cautious optimism. These systems could serve as a cost-effective testing ground for exploring therapeutic ideas before moving on to humans, always as a complementary tool and never as a substitute for real patients. It’s worth noting a safety message that often goes unnoticed: if a model’s output becomes more negative when exposed to distressing content, that matters when deploying these systems to support mental health. What the study doesn’t say—and we should be careful to avoid—is that AI has feelings or is ready to act as a therapist. The risk isn’t in the study itself, but in how it’s interpreted.

EN