CoDesign Lab / Cognitive Vision

Cognitive Vision pursues an emerging line of research bringing together a novel and unique combination of methodologies from Artificial Intelligence, Vision and Machine Learning, Cognitive Science and Psychology, Visual Perception, and Spatial Cognition and Computation.

COMPUTATIONAL FOCUS:

General methods for the processing and semantic interpretation of dynamic visuo-spatial imagery with a particular emphasis on the ability to abstract, learn, and reason with cognitively rooted structured characterisations of commonsense knowledge pertaining to space and motion.

Semantic interpretation of dynamic visual imagery calls for general and systematic methods integrating techniques in knowledge representation and computer vision. Our research emphasises deep semantics, denoting the existence of declarative models (e.g., pertaining to space and motion) together with corresponding formalisations and methods supporting (domain-independent) reasoning capabilities such as semantic question-answering, relational visuospatial learning, and (non-monotonic) visuospatial abduction.

Presently, we emphasise applications of developed methods and tools in two settings:
(1) explainable visual abduction for active visual sensemaking and control (emphasising human-centred, ethical considerations in autonomous driving); and (2) semantic interpretation of multimodal human behavioural stimuli (emphasising AI foundations for empirically driven behavioural research in psychology, social sciences, and visual art).

DISSEMINATION

COMING SOON!

We are presently putting together a new set of materials for wider dissemination. For collaboration purposes, we are happy to share material directly until the new website is released.

Watch this space, or get in touch if you would like to be updated about the materials to be released.

Contact

KEY PUBLICATIONS

Monsen, J., Suchan, J., Bhatt, M., Karlsson, L. (2026). Reasonable Motion: A General ASP Foundation for Environment Constrained Movement Trajectory Computation. In: LPNMR 2026: 18th International Conference on Logic Programming and Non-monotonic Reasoning (LPNMR), September 2026 - Klagenfurt, Austria. (to appear)

Suchan, J., Baloch, S., Bhatt, M. (2026). Towards a VLM-based Foundation for Generalised Neurosymbolic Visual Commonsense. In: FoIKS 2026: 14th International Symposium on Foundations of Information and Knowledge Systems, Hanover, Germany (position statement). (to appear)

Suchan, J., Bhatt, M., Monsen, J. (2025). ASP-Driven Visual Commonsense: A General Framework for Reasoning about Embodied Interaction in the Wild. In: KR 2025 - 22nd International Conference on Principles of Knowledge Representation and Reasoning, Melbourne, Australia.

Monsen, J., Suchan, J., Bhatt, M. (2025). Probabilistic Answer Set Programming Driven Ranking of Dynamic Space-Time Belief Models. In: RuleML+RR 2025 - Rules and Reasoning - Second International Joint Conference, part of: Declarative AI 2025 - Rules, Reasoning, Decisions, and Explanations (RuleML+RR 2025).

Bhatt, M. (2025). Towards Responsible AI Foundations for Neurocognitive Analytics of Vision. In: ECVP 2025: 47th European Conference on Visual Perception, Mainz, Germany.

Nair, V., Bhatt, M., Suchan, J., Billing, E., Hemeren, P. (2025). How do Naturalistic Visuo-Auditory Cues Guide Human Attention? - Insights from Systematic Explorations in Visual Perception of Embodied Multimodal Interaction. ACM Transactions on Applied Perception (ACM TAP). (to appear)

Nair, V., Bhatt, M., Suchan, J., Billing, E., Hemeren, P. (2025). A Naturalistic Embodied Human Multimodal Interaction Dataset: Systematically Annotated Behavioral Visuo-Auditory Cue and Attention Data. PsyArXiv (April 2025), 41 pages.

Kondyli, V., Bhatt, M. (2024). Effects of Temporal Load on Attentional Engagement: Preliminary Outcomes with a Change Detection Task in a VR Setting. ACM SAP (Symposium on Applied Perception), 2024. DOI: https://doi.org/10.1145/3675231.3687149

Bhatt, M., Suchan, J. (2023). Artificial Visual Intelligence: Perceptual Commonsense for Human-Centred Cognitive Technologies. In: Advanced Course on Artificial Intelligence (Human-Centred AI), Ed: M. Chetouani et al. Springer. DOI: https://doi.org/10.1007/978-3-031-24349-3_12

Kondyli, V., Bhatt, M., Levin, D., Suchan, J. (2023). How do drivers mitigate the effects of naturalistic visual complexity? - On attentional strategies and its implications under a change blindness protocol. Cognitive Research: Principles and Implications (CRPI), 8, 54. Springer Nature. DOI (Open Access): https://doi.org/10.1186/s41235-023-00501-1

Kondyli, V., Suchan, J., Bhatt, M. (2022). Grounding Embodied Multimodal Interaction: Towards Behaviourally Established Semantic Foundations for Human-Centered AI. In: The 1st International Workshop on Knowledge Representation for Hybrid Intelligence (KR4HI 2022), part of International Conference on Hybrid Human-Artificial Intelligence (HHAI 2022), Amsterdam, The Netherlands, June 13-17, 2022.

The Economist (2021). Is a self-driving car smarter than a seven-month-old? How to improve the intelligence of self-driving cars. Independent media coverage of our research in Cognitive Vision for Autonomous Driving. (The Economist, Sep 4th 2021 edition)

Suchan, J., Bhatt, M., Varadarajan, S. (2021). Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics. Artificial Intelligence Journal (AIJ), Volume 299, 2021. (Open Access)

Bhatt, M., Suchan, J. (2020). Cognitive Vision and Perception: Deep Semantics Integrating AI and Vision for (Declarative) Reasoning about Space, Action, and Motion. ECAI 2020: the 24th European Conference on Artificial Intelligence (ECAI), June 2020, Santiago de Compostela, Spain.

Suchan, J., Bhatt, M. (2020). Driven by Commonsense: On the Role of Human-Centred Visual Explainability for Autonomous Vehicles. ECAI 2020: the 24th European Conference on Artificial Intelligence (ECAI), June 2020, Santiago de Compostela, Spain.

Kondyli, V., Bhatt, M. (2020). Multimodality on the Road: Towards Evidence-Based Cognitive Modelling of Human Interactions in Everyday Roadside Situations. 6th International Digital Human Modeling Symposium 2020 (DHM2020), Skövde, Sweden.

Kondyli, V., Bhatt, M., Suchan, J. (2020). Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving. STAIRS @ ECAI 2020: 9th European Starting AI Researchers Symposium (STAIRS), at ECAI 2020, the 24th European Conference on Artificial Intelligence (ECAI), Santiago de Compostela, Spain.

Suchan, J., Bhatt, M., Varadarajan, S. (2019). Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving. IJCAI 2019: the 28th International Joint Conference on Artificial Intelligence (IJCAI) 2019, August 10 - 16, Macao. (Distinguished Paper Nomination, Honourable Mention)

Bhatt, M., Suchan, J., Varadarajan, S. (2019). Deep Semantics for Explainable Visuospatial Intelligence: Perspectives on Integrating Commonsense Spatial Abstractions and Low-Level Neural Features. NeSy 2019: Proceedings of the 14th International Workshop on Neural-Symbolic Learning and Reasoning (NeSy), at IJCAI 2019, August 12, 2019 (accepted for publication - to appear).

Suchan, J., Bhatt, M., Walega, P., Schultz, C. (2018). Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects. In AAAI 2018: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, February 2-7, 2018, New Orleans, USA.

Bhatt, M., Kersting, K. (2017). Semantic Interpretation of Multi-Modal Human-Behaviour Data - Making Sense of Events, Activities, Processes. KI / Artificial Intelligence, 31(4): 317-320.

Suchan, J., Bhatt, M., Varadarajan, S., Amirshahi, S. A., and Yu, S. (2018). Semantic Analysis of (Reflectional) Visual Symmetry: A Human-Centred Computational Model for Declarative Explainability. Advances in Cognitive Systems, Vol. 6: 65-84, 2018.

Suchan, J., Bhatt, M. (2017). Deep Semantic Abstractions of Everyday Human Activities - On Commonsense Representations of Human Interactions. ROBOT (1) 2017: 477-488.

Suchan, J., Bhatt, M. (2017). Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions. In ICCV 2017 Workshop: Vision in Practice on Autonomous Robots (ViPAR), International Conference on Computer Vision (ICCV), Venice, Italy.

Suchan, J. (2017). Declarative Reasoning about Space and Motion with Video. KI, 31(4): 321-330.

Suchan, J., Bhatt, M. (2016). Semantic Question-Answering with Video and Eye-Tracking Data: AI Foundations for Human Visual Perception Driven Cognitive Film Studies. IJCAI 2016: 25th International Joint Conference on Artificial Intelligence, New York City, USA.

Spranger, M., Suchan, J., Bhatt, M. (2016). Robust Natural Language Processing - Combining Reasoning, Cognitive Semantics, and Construction Grammar for Spatial Language. IJCAI 2016: 2908-2914.

Suchan, J., Bhatt, M. (2016). The Geometry of a Scene: On Deep Semantics for Visual Perception Driven Cognitive Film Studies. In: WACV 2016: IEEE Winter Conference on Applications of Computer Vision, Lake Placid, NY, USA, IEEE.

Suchan, J., Bhatt, M., Santos, P. (2014). Perceptual Narratives of Space and Motion for Semantic Interpretation of Visual Data. In: Proceedings of International Workshop on Computer Vision + Ontology Applied Cross-Disciplinary Technologies (CONTACT). ECCV 2014: European Conference on Computer Vision, Zurich, Switzerland.

Suchan, J., Spranger, M., Bhatt, M., Eppe, M. (2014). Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction, in: 13th Pacific Rim International Conference on Artificial Intelligence (PRICAI-2014), Queensland, Australia, 2014.

Bhatt, M., Suchan, J., Schultz, C. (2013). Cognitive Interpretation of Everyday Activities - Toward Perceptual Narrative Based Visuo-Spatial Scene Interpretation. Computational Models of Narrative (CMN) 2013, a satellite workshop of CogSci 2013: The 35th meeting of the Cognitive Science Society. Editors: M. Finlayson, B. Fisseni, B. Löwe, J. C. Meister. OASIcs proceedings volume. OpenAccess Series in Informatics (OASIcs). Dagstuhl, Germany.

Bhatt, M., Schultz, C., Freksa, C. (2013). The 'Space' in Spatial Assistance Systems: Conception, Formalisation, and Computation. In: Tenbrink, T., Wiener, J., Claramunt, C. (editors). Representing space in cognition: Interrelations of behavior, language, and formal models. Series: Explorations in Language and Space. Oxford University Press, 2012. ISBN: 978-0-19-967991-1.

Bhatt, M. (2012). Reasoning about Space, Actions and Change: A Paradigm for Applications of Spatial Reasoning. In: Hazarika, S. (editor). Qualitative Spatio-Temporal Representation and Reasoning: Trends and Future Directions. IGI Global (PA, USA). DOI: 10.4018/978-1-61692-868-1. ISBN13: 9781616928681.

Bhatt, M., Guesgen, H., Woelfl, S., Hazarika, S. (2011). Qualitative Spatial and Temporal Reasoning: Emerging Applications, Trends and Future Directions. Journal of Spatial Cognition and Computation. Issue: Emerging Applications of Spatial and Temporal Reasoning. 11(1). ISSN: 1387-5868 print/1542-7633 online, Taylor & Francis Group 2011.

Bhatt, M., Lee, J. H., Schultz, C. (2011). CLP(QS): A Declarative Spatial Reasoning Framework. Proceedings of the 10th International Conference on Spatial Information Theory (COSIT 11). Belfast, Maine.

Bhatt, M., and Loke, S. (2008). Modelling Dynamic Spatial Systems in the Situation Calculus. Spatial Cognition and Computation, 8(1-2):86–130, 2008.
