- Tapu, R., Mocanu, B., & Chiva, I. C. Multimodal Visual Speech Recognition for Under-Resource Languages via Cross-Modal Learning and Large Language Models. Romanian Journal of Information Science and Technology (ROMJIST). Q1 journal, impact factor 3.9.
- Minulescu, D.-E., & Toma, Ș.-A. (2025). Whisper Based Speech Recognition for Emergency Services. In 2025 17th International Conference on Electronics, Computers and Artificial Intelligence (ECAI) (pp. 1–6). https://doi.org/10.1109/ECAI65401.2025.11095458
- Mitrut, V. D., Stefanescu, S., & Toma, Ș.-A. (2025). Transcription and Identification of Compound and Special Numeral Entities Using Artificial Intelligence and Rule-Based Methods. In 13th Conference on Speech Technology and Human-Computer Dialogue.
- Tapu, R., & Mocanu, B. (2025). Automatic Audio Description: A Training-Free Approach Using Foundation Models. In Proceedings of the 21st International Conference on Computer Analysis of Images and Patterns (CAIP 2025) (Vol. 15622, pp. 173–183).
- Tapu, R., & Mocanu, B. (2025). Lip Reading Across Languages: A Cross-Modal Framework Leveraging Foundation Models. In Proceedings of the 2025 IEEE International Conference on Content-Based Multimedia Indexing (CBMI 2025).
- Mocanu, B., & Tapu, R. (2025). A Lightweight Audio-Visual Speaker Detection System for Assistive Video Captioning. In Proceedings of the 2025 13th European Workshop on Visual Information Processing (EUVIP 2025).
- Mocanu, B., & Tapu, R. (2025). Seeing Through Words: A Zero-Shot Multimodal Audio Description System with Foundation Models. In Proceedings of the 20th International Symposium on Visual Computing (ISVC 2025). Springer.
- Grosu, M., Mocanu, B., Tapu, R., & Datcu, O. (2025). Evaluating Speech Emotion Recognition Systems: From Traditional Low-Level Features to Transformer-Based Models. In Proceedings of the International Conference on E-Health and Bioengineering (EHB 2025).
- Constantin, O., Tapu, R., Mocanu, B., & Grosu, M. (2025). Food Image Recognition: From CNNs to Transformers and Multimodal Learning. In Proceedings of the International Conference on E-Health and Bioengineering (EHB 2025).
- Ionescu, B., Müller, H., Stanciu, D.-C., Radzhabov, A., de Herrera, A. G. S., Andrei, A.-G., … Xie, Z. (2026). ImageCLEF 2026: Multimodal Challenges in Medicine, Science, Agritech, and Security. In Proceedings of the 48th European Conference on Information Retrieval (ECIR 2026).