Latest research projects:

project image

ATDD: Multi-lingual Dataset for Auto-Tune Detection in Music Recordings


Mahyar Gohari, Paolo Bestagini, Sergio Benini, Nicola Adami
Accepted in Data in Brief, 2025

This research presents a new multilingual dataset aimed at differentiating between auto-tuned music and genuine performances, filling an important void in current datasets. It includes tracks in English, Mandarin, and Japanese to capture a wide range of linguistic settings.

project image

Audio Features Investigation for Singing Voice Deepfake Detection


Mahyar Gohari, Davide Salvi, Paolo Bestagini, Nicola Adami
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Paper / Code /

This research investigates the effectiveness of various audio representations and features for discriminating real and synthetically generated singing voice signals. This work achieved the highest performance at the Singing Voice Deepfake Detection Challenge at ISMIR 2024.

project image

Spectrogram-Based Detection of Auto-Tuned Vocals in Music Recordings


Mahyar Gohari, Paolo Bestagini, Sergio Benini, Nicola Adami
16th IEEE International Workshop on Information Forensics and Security (WIFS), 2024
Paper / Code / Website /

This study introduces a data-driven approach leveraging triplet networks for the detection of Auto-Tuned songs.

project image

PGNN-based Approach for Robust 3D Light Direction Estimation in Outdoor Images


Marcello Zanardelli, Mahyar Gohari, Riccardo Leonardi, Sergio Benini, Nicola Adami
2024 International Conference on Content-based Multimedia Indexing (CBMI), 2024
Paper / Code /

This research introduces a novel methodology utilizing a physics-guided neural network (PGNN) to achieve precise global 3D light direction estimation. The proposed architecture integrates an illumination model, allowing the network to indirectly acquire geometric information, thereby enhancing the accuracy of the estimated light direction.

project image

SynthOutdoor: a synthetic dataset for 3D outdoor light estimation


Marcello Zanardelli, Mahyar Gohari, Riccardo Leonardi, Sergio Benini, Nicola Adami
Data in Brief, 2024
Paper / Code / Website /

In this work, we present SynthOutdoor dataset, comprising 39086 high-resolution images, aimed at addressing the data scarcity in the field of 3D light direction estimation.

project image

Detection and Localization of Ripe Tomatoes Using Machine Vision


Mahyar Gohari
Turkish Journal of Computer and Mathematics Education (TURCOMAT), 2021
Paper /

The study aimed to design a model for a tomato harvester robot using machine vision to be able to detect and locate tomatoes automatically and in real-time.





Source code from Leonid Keselman.