Kalin Stefanov
Verified email at monash.edu
Title | Cited by | Year
MARLIN: Masked Autoencoder for facial video Representation LearnINg
Z Cai, S Ghosh, K Stefanov, A Dhall, J Cai, H Rezatofighi, R Haffari, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 46 | Year: 2023
Multimodal Analysis and Estimation of Intimate Self-Disclosure
M Soleymani, K Stefanov, SH Kang, J Ondras, J Gratch
2019 International Conference on Multimodal Interaction, 59-68, 2019
Cited by: 38 | Year: 2019
Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Z Cai, K Stefanov, A Dhall, M Hayat
2022 International Conference on Digital Image Computing: Techniques and …, 2022
Cited by: 33 | Year: 2022
Vision-based active speaker detection in multiparty interaction
K Stefanov, J Beskow, G Salvi
International Workshop on Grounding Language Understanding 2017, 5, 2017
Cited by: 28 | Year: 2017
Multimodal Learning for Identifying Opportunities for Empathetic Responses
L Tavabi, K Stefanov, S Nasihati Gilani, D Traum, M Soleymani
2019 International Conference on Multimodal Interaction, 95-104, 2019
Cited by: 27 | Year: 2019
Public Speaking Training with a Multimodal Interactive Virtual Audience Framework
M Chollet, K Stefanov, H Prendinger, S Scherer
Proceedings of the 2015 ACM on International Conference on Multimodal …, 2015
Cited by: 25 | Year: 2015
Multimodal Automatic Coding of Client Behavior in Motivational Interviewing
L Tavabi, K Stefanov, L Zhang, B Borsari, JD Woolley, S Scherer, ...
Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020
Cited by: 20 | Year: 2020
A multi-party multi-modal dataset for focus of visual attention in human-human and human-robot interaction
K Stefanov, J Beskow
Proceedings of the Tenth International Conference on Language Resources and …, 2016
Cited by: 18 | Year: 2016
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition
K Stefanov, J Beskow, G Salvi
IEEE Transactions on Cognitive and Developmental Systems 12 (2), 250-259, 2019
Cited by: 17 | Year: 2019
Look who's talking: visual identification of the active speaker in multi-party human-robot interaction
K Stefanov, A Sugimoto, J Beskow
Proceedings of the 2nd Workshop on Advancements in Social Signal Processing …, 2016
Cited by: 16 | Year: 2016
A Kinect Corpus of Swedish Sign Language Signs
K Stefanov, J Beskow
Proceedings of the 2013 Workshop on Multimodal Corpora: Beyond Audio and Video, 2013
Cited by: 16 | Year: 2013
Analysis of behavior classification in motivational interviewing
L Tavabi, T Tran, K Stefanov, B Borsari, JD Woolley, S Scherer, ...
Proceedings of the conference. Association for Computational Linguistics …, 2021
Cited by: 14 | Year: 2021
OpenSense: A Platform for Multimodal Data Acquisition and Behavior Perception
K Stefanov, B Huang, Z Li, M Soleymani
Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020
Cited by: 14 | Year: 2020
Modeling of Human Visual Attention in Multiparty Open-World Dialogues
K Stefanov, G Salvi, D Kontogiorgos, H Kjellström, J Beskow
ACM Transactions on Human-Robot Interaction (THRI) 8 (2), 1-21, 2019
Cited by: 14 | Year: 2019
Multimodal multiparty social interaction with the furhat head
S Al Moubayed, G Skantze, J Beskow, K Stefanov, J Gustafson
Proceedings of the 14th ACM international conference on Multimodal …, 2012
Cited by: 12 | Year: 2012
Glitch in the matrix: A large scale benchmark for content driven audio–visual forgery detection and localization
Z Cai, S Ghosh, A Dhall, T Gedeon, K Stefanov, M Hayat
Computer Vision and Image Understanding 236, 103818, 2023
Cited by: 9 | Year: 2023
Group-Level Focus of Visual Attention for Improved Next Speaker Prediction
C Birmingham, K Stefanov, MJ Mataric
Proceedings of the 29th ACM International Conference on Multimedia, 4838-4842, 2021
Cited by: 9 | Year: 2021
Emotion or expressivity? an automated analysis of nonverbal perception in a social dilemma
S Lei, K Stefanov, J Gratch
2020 15th IEEE International Conference on Automatic Face and Gesture …, 2020
Cited by: 9 | Year: 2020
Tutoring Robots: Multiparty multimodal social dialogue with an embodied tutor
S Al Moubayed, J Beskow, B Bollepalli, A Hussen-Abdelaziz, ...
Innovative and Creative Developments in Multimodal Interaction Systems: 9th …, 2014
Cited by: 8 | Year: 2014
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Z Cai, S Ghosh, AP Adatia, M Hayat, A Dhall, K Stefanov
arXiv preprint arXiv:2311.15308, 2023
Cited by: 7 | Year: 2023