Research News

Jan 15, 2025

  • Informatics

Reading signs: New method improves AI translation of sign language

Additional data can help differentiate subtle gestures, hand positions, facial expressions

Improving AI accuracy

Adding data on hand shapes and facial expressions, as well as skeletal information on the position of the hands relative to the body, to information on the general movements of the signer’s upper body improves word recognition.

Credit: Osaka Metropolitan University

Sign languages have developed around the world to fit local communication styles, and each consists of thousands of signs. This has made sign languages difficult to learn and understand. Using artificial intelligence to automatically translate signs into words, known as word-level sign language recognition, has now gained a boost in accuracy through the work of an Osaka Metropolitan University-led research group.

Previous research methods focused on capturing information about the signer’s general movements. Accuracy has suffered because a sign’s meaning can change with subtle differences in hand shape and in the position of the hands relative to the body.

Graduate School of Informatics Associate Professor Katsufumi Inoue and Associate Professor Masakazu Iwamura worked with colleagues, including researchers at the Indian Institute of Technology Roorkee, to improve AI recognition accuracy. They added data on hand shapes and facial expressions, as well as skeletal information on the position of the hands relative to the body, to the information on the general movements of the signer’s upper body.
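The paper describes a multi-stream neural network, so a minimal sketch of that idea may help; the PyTorch-style code below is an illustration under stated assumptions, not the authors’ exact architecture. It gives each information stream its own encoder (global upper-body motion, local hand/face appearance, and skeletal keypoints) and fuses them before classifying the word. All module names, feature sizes, and the concatenation fusion are hypothetical.

import torch
import torch.nn as nn

class MultiStreamSignRecognizer(nn.Module):
    # Illustrative multi-stream model: one encoder per information stream,
    # fused before classifying the sign as a word. Layer choices and sizes
    # are assumptions, not the paper's exact design.
    def __init__(self, num_words: int, feat_dim: int = 256):
        super().__init__()
        # Each GRU summarizes one stream's per-frame features over time.
        self.body_stream = nn.GRU(512, feat_dim, batch_first=True)      # global upper-body motion features
        self.local_stream = nn.GRU(512, feat_dim, batch_first=True)     # cropped hand/face region features
        self.skeleton_stream = nn.GRU(42, feat_dim, batch_first=True)   # e.g., 2D coordinates of 21 hand keypoints
        self.classifier = nn.Linear(3 * feat_dim, num_words)

    def forward(self, body_seq, local_seq, skeleton_seq):
        # Inputs are (batch, time, features); the final hidden state of
        # each GRU serves as that stream's clip-level summary.
        _, h_body = self.body_stream(body_seq)
        _, h_local = self.local_stream(local_seq)
        _, h_skel = self.skeleton_stream(skeleton_seq)
        fused = torch.cat([h_body[-1], h_local[-1], h_skel[-1]], dim=-1)
        return self.classifier(fused)  # logits over the sign vocabulary

# Example: 4 clips of 30 frames each, with a hypothetical 2,000-word vocabulary.
model = MultiStreamSignRecognizer(num_words=2000)
logits = model(torch.randn(4, 30, 512), torch.randn(4, 30, 512), torch.randn(4, 30, 42))

Concatenation is the simplest possible fusion; the point of the sketch is only that the local-region and skeletal streams enter the model alongside the global motion stream instead of being folded into a single input.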

“We were able to improve the accuracy of word-level sign language recognition by 10–15% compared to conventional methods,” said Professor Inoue. “In addition, we expect that the method we have proposed can be applied to any sign language, hopefully leading to improved communication with speaking- and hearing-impaired people in various countries.”

The findings were published in IEEE Access.

Funding

This work was supported by JSPS KAKENHI JP19K12023.

Paper information

Journal: IEEE Access
Title: Word-Level Sign Language Recognition With Multi-Stream Neural Networks Focusing on Local Regions and Skeletal Information
DOI: 10.1109/ACCESS.2024.3494878
Authors: Mizuki Maruyama, Shrey Singh, Katsufumi Inoue, Partha Pratim Roy, Masakazu Iwamura, and Michifumi Yoshioka
Published: 11 November 2024
URL: https://doi.org/10.1109/ACCESS.2024.3494878

Contact

Katsufumi Inoue

Graduate School of Informatics
Email: inoue[at]omu.ac.jp

*Please change [at] to @.

SDGs

  • SDG 4: Quality Education