Chanda, Sushovan and Fitwe, Kedar and Deshpande, Gauri and Schuller, Björn W. and Patel, Sachin (2021) A Deep Audiovisual Approach for Human Confidence Classification. Frontiers in Computer Science, 3. ISSN 2624-9898
Abstract
Research on self-efficacy and confidence spans several subfields of psychology and neuroscience. Confidence plays a crucial role in the formation of attitudes and communication skills, so distinguishing between levels of confidence matters in this domain. With recent advances in extracting behavioral insight from signals across multiple applications, confidence detection has gained considerable importance. One prominent application is detecting confidence in interview conversations. We collected an audiovisual data set of interview conversations with 34 candidates. Every response from each candidate in this data set is labeled with one of three levels of confidence: high, medium, or low. We also developed algorithms to efficiently compute such behavioral confidence from speech and video. We propose a deep learning architecture for detecting the confidence level (high, medium, or low) from an audiovisual clip recorded during an interview. The unweighted average recall (UAR) reaches 85.9% on audio data and 73.6% on video data captured from an interview session.
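
For readers unfamiliar with the metric reported above, UAR is the mean of the per-class recalls, so each confidence level counts equally however many responses carry that label. The following is a minimal sketch of that calculation, not the authors' evaluation code; the function name `unweighted_average_recall` and the toy labels are hypothetical and are not taken from the paper's data set.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recalls over the classes present in y_true."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one labeled sample is required")

    total = defaultdict(int)    # number of true samples per class
    correct = defaultdict(int)  # number of correctly predicted samples per class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1

    # Recall per class, then a plain (unweighted) mean across classes.
    recalls = [correct[label] / total[label] for label in total]
    return sum(recalls) / len(recalls)


# Hypothetical, deliberately imbalanced toy labels (illustration only).
y_true = ["high"] * 6 + ["medium"] * 2 + ["low"] * 2
y_pred = ["high"] * 6 + ["medium", "high"] + ["high", "medium"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.3f}, accuracy = {accuracy:.3f}")
```

On these toy labels the per-class recalls are 1.0 (high), 0.5 (medium), and 0.0 (low), so the UAR is 0.5 while plain accuracy is 0.7. The gap shows why UAR is a common choice when the classes are unevenly represented.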
Item Type: | Article |
---|---|
Subjects: | STM Library > Computer Science |
Depositing User: | Managing Editor |
Date Deposited: | 18 Nov 2022 04:39 |
Last Modified: | 24 Feb 2024 04:09 |
URI: | http://open.journal4submit.com/id/eprint/244 |