
Multimodal sentiment analysis combining language, audio, visual, and physiological signals.
The multimodal neural network is used to predict user sentiment from multimodal features such as text, audio, and visual data. In a new study, researchers from Japan account for physiological signals in sentiment estimation while talking with the system, greatly improving the system’s performance.
Image courtesy: Shogo Okada from JAIST
Researchers integrate biological signals with gold-standard machine learning methods to enable emotionally intelligent speech dialog systems
Artificial intelligence (AI) is at the forefront of modern technology. Making AI “emotionally intelligent” could open doors to more natural human-machine interactions. To do this, it needs to pick up on the user’s sentiment during a dialog. Physiological signals could provide a direct route to such sentiments. Now, researchers from Japan take things to the next level with an AI with sentiment-sensing capabilities comparable to that of humans.
Speech and language recognition technology is a rapidly developing field, which has led to the emergence of novel speech dialog systems, such as Amazon Alexa and Siri. A significant milestone in the development of dialog artificial intelligence (AI) systems is the addition of emotional intelligence. A system able to recognize the emotional states of the user, in addition to understanding language, would generate a more empathetic response, leading to a more immersive experience for the user.
“Multimodal sentiment analysis” is a group of methods that constitute the gold standard for an AI dialog system with sentiment detection. These methods can automatically analyze a person’s psychological state from their speech, voice color, facial expression, and posture and are crucial for human-centered AI systems. The technique could potentially realize an emotionally intelligent AI with beyond-human capabilities, which understands the user’s sentiment and generates a response accordingly.
However, current emotion estimation methods focus only on observable information and do not account for the information contained in unobservable signals, such as physiological signals. Such signals are a potential gold mine of emotions that could improve the sentiment estimation performance tremendously.
In a new study published in the journal IEEE Transactions on Affective Computing, physiological signals were added to multimodal sentiment analysis for the first time by researchers from Japan, a collaborative team comprising Associate Professor Shogo Okada from Japan Advanced Institute of Science and Technology (JAIST) and Prof. Kazunori Komatani from the Institute of Scientific and Industrial Research at Osaka University. “Humans are very good at concealing their feelings. The internal emotional state of a user is not always accurately reflected by the content of the dialog, but since it is difficult for a person to consciously control their biological signals, such as heart rate, it may be useful to use these for estimating their emotional state. This could make for an AI with sentiment estimation capabilities that are beyond human,” explains Dr. Okada.
The team analyzed 2468 exchanges with a dialog AI obtained from 26 participants to estimate the level of enjoyment experienced by the user during the conversation. The user was then asked to assess how enjoyable or boring they found the conversation to be. The team used the multimodal dialogue data set named “Hazumi1911,” which uniquely combined speech recognition, voice color sensors, facial expression and posture detection with skin potential, a form of physiological response sensing.
“On comparing all the separate sources of information, the biological signal information proved to be more effective than voice and facial expression. When we combined the language information with biological signal information to estimate the self-assessed internal state while talking with the system, the AI’s performance became comparable to that of a human,” comments an excited Dr. Okada.
These findings suggest that the detection of physiological signals in humans, which typically remain hidden from our view, could pave the way for highly emotionally intelligent AI-based dialog systems, making for more natural and satisfying human-machine interactions. Moreover, emotionally intelligent AI systems could help identify and monitor mental illness by sensing a change in daily emotional states. They could also come handy in education where the AI could gauge whether the learner is interested and excited over a topic of discussion, or bored, leading to changes in teaching strategy and more efficient educational services.
Original Article: Physiological Signals Could be the Key to “Emotionally Intelligent” AI, Scientists Say
More from: Japan Advanced Institute of Science and Technology | Osaka University
The Latest on: Emotionally intelligent artificial intelligence
- Cover story: How digital technologies have upped the game for healthcare across GCCon May 27, 2022 at 10:00 pm
As technologies drive the future of most industries, digital solutions are also reshaping the healthcare sector.
- Artificial Intelligence Has Rising Impact on Financial Marketson May 24, 2022 at 4:59 pm
Automation and artificial intelligence are profoundly transforming ... databases provide the analytical capability of the new intelligent market. Like all new inventions, the new markets will ...
- KangoGift CEO to Talk About Emotionally Intelligent Recognition at WorldatWork Conferenceon May 19, 2022 at 2:49 pm
During this session, KangoGift chief executive officer Todd Horton and Healthesystems vice president for human resources, Laura Wood will be addressing how emotionally intelligent ...
- Wysa Receives FDA Breakthrough Device Designation for AI-led Mental Health Conversational Agenton May 12, 2022 at 5:50 am
Wysa, a leading artificial intelligence (AI ... Wysa is intended to support individuals with the help of an “emotionally intelligent” conversational agent. The bot uses evidence-based ...
- The Other AI: Augmented Intelligenceon April 25, 2022 at 5:00 pm
Most people would say, "artificial intelligence ... One type of augmented intelligence, augmented reality (AR), is a rapidly emerging technology driving a new form of intelligent service engagement.
- Artificial intelligence that can read your emotionson April 25, 2022 at 5:00 pm
Developers and researchers have been advancing artificial intelligence to ... that enable developers to incorporate this type of emotional intelligence into their applications.
- Can AI Be Emotionally Intelligent?on April 22, 2022 at 1:54 pm
This pioneering study suggests that focusing on human physiological signals may be the key to creating artificial intelligence machine learning systems with high emotional intelligence.
- Artificial Intelligence and its significance in the growth of the gaming sectoron April 16, 2022 at 3:50 am
Better level progression: Game AI can detect a player’s skill level and emotional ... intelligent game characters who act as though they are controlled by human players. Artificial intelligence ...
- Robotics Startup Miko Raises $28 Mn In Series B Round led by IIFL AMCon August 12, 2021 at 6:50 am
Founded in 2015 by Sneh Vaswani, Prashant Iyengar and Chintan Raikar, Miko is an advanced robotics company that creates emotionally intelligent robots leveraging artificial intelligence and the ...
- Amazon Is Teaching Alexa to Analyze Your Emotionson May 21, 2019 at 1:55 pm
Amazon researchers think they’ve found a better way to make emotionally-savvy artificial intelligence ... could bring about a truly emotionally-intelligent — maybe even empathetic — smart ...
via Bing News
The Latest on: Emotionally intelligent artificial intelligence
via Google News
Add Comment