German China

Empathy by Algorithm AI Outperforms Humans in Emotional Intelligence Tests, Study Finds

Source: Université de Genève 2 min Reading Time

Related Vendor

A joint study by the Universities of Geneva and Bern reveals that generative AIs outperform humans in emotional intelligence tests — and can even design their own. The results challenge long-held assumptions about what AI can (and should) do in emotionally complex situations.

Generative AI models like ChatGPT are now outperforming humans in emotional intelligence tests, according to new research from Unige and Unibe.(Source:  free licensed /  Pixabay)
Generative AI models like ChatGPT are now outperforming humans in emotional intelligence tests, according to new research from Unige and Unibe.
(Source: free licensed / Pixabay)

Is artificial intelligence (AI) capable of suggesting appropriate behaviour in emotionally charged situations? A team from the University of Geneva (Unige) and the University of Bern (Unibe) put six generative AIs — including ChatGPT — to the test using emotional intelligence (EI) assessments typically designed for humans. The outcome: these AIs outperformed average human performance and were even able to generate new tests in record time. These findings open up new possibilities for AI in education, coaching, and conflict management. The study is published in Communications Psychology.

Large Language Models (LLMs) are artificial intelligence (AI) systems capable of processing, interpreting and generating human language. The ChatGPT generative AI, for example, is based on this type of model. LLMs can answer questions and solve complex problems. But can they also suggest emotionally intelligent behaviour? These results pave the way for AI to be used in contexts thought to be reserved for humans.

Emotionally Charged Scenarios

To find out, a team from Unibe, Institute of Psychology, and Unige’s Swiss Center for Affective Sciences (CISA) subjected six LLMs (ChatGPT-4, ChatGPT-o1, Gemini 1.5 Flash, Copilot 365, Claude 3.5 Haiku and DeepSeek V3) to emotional intelligence tests. ‘‘We chose five tests commonly used in both research and corporate settings. They involved emotionally charged scenarios designed to assess the ability to understand, regulate, and manage emotions,’’ says Katja Schlegel, lecturer and principal investigator at the Division of Personality Psychology, Differential Psychology, and Assessment at the Institute of Psychology at Unibe, and lead author of the study.

For example: One of Michael’s colleagues has stolen his idea and is being unfairly congratulated. What would be Michael’s most effective reaction?

  • a) Argue with the colleague involved
  • b) Talk to his superior about the situation
  • c) Silently resent his colleague
  • d) Steal an idea back

Here, option b) was considered the most appropriate.

In parallel, the same five tests were administered to human participants. “In the end, the LLMs achieved significantly higher scores — 82 % correct answers versus 56 % for humans. This suggests that these AIs not only understand emotions, but also grasp what it means to behave with emotional intelligence,” explains Marcello Mortillaro, senior scientist at the Unige’s Swiss Center for Affective Sciences (CISA), who was involved in the research.

New Tests in Record Time

In a second stage, the scientists asked ChatGPT-4 to create new emotional intelligence tests, with new scenarios. These automatically generated tests were then taken by over 400 participants. ‘‘They proved to be as reliable, clear and realistic as the original tests, which had taken years to develop,’’ explains Katja Schlegel. ‘‘LLMs are therefore not only capable of finding the best answer among the various available options, but also of generating new scenarios adapted to a desired context. This reinforces the idea that LLMs, such as ChatGPT, have emotional knowledge and can reason about emotions,’’ adds Marcello Mortillaro.

These results pave the way for AI to be used in contexts thought to be reserved for humans, such as education, coaching or conflict management, provided it is used and supervised by experts.

Original Article: Large language models are proficient in solving and creating emotional intelligence tests; Communications Psychology; DOI:10.1038/s44271-025-00258-x

(ID:50438981)

Subscribe to the newsletter now

Don't Miss out on Our Best Content

By clicking on „Subscribe to Newsletter“ I agree to the processing and use of my data according to the consent form (please expand for details) and accept the Terms of Use. For more information, please see our Privacy Policy. The consent declaration relates, among other things, to the sending of editorial newsletters by email and to data matching for marketing purposes with selected advertising partners (e.g., LinkedIn, Google, Meta)

Unfold for details of your consent