Research Article

, 04 Dec 2025 | 10.6234610.62346/ijcn_q4_v13_no4_25_07
Year : 2025 | Volume: 13 | Issue: 4 | Pages : 1-7

DESIGN AND IMPLEMENTATION OF A SPOOF-RESISTANT VOICE-BASED SMART LOCKER SYSTEM USING EMBEDDED MFCC AND CHALLENGE-RESPONSE AUTHENTICATION

Muralikrishnan P1 *, Annushka V, Harini CS, Kiruthika A, Manushri A
  • 1Anna University Chennai, Faculty, Department of ECE, K. Ramakrishnan College of Engineering, IN
Traditional bank locker security systems that depend on mechanical keys, static passwords, or single-modality biometrics, like fingerprints or fixed voice phrases, have several weaknesses. These include risks of spoofing, replay attacks, and reliance on external infrastructure. Current voice-based solutions often do not include liveness detection and are not fully integrated, which limits their practical use. This study introduces a standalone, spoof-resistant voice-based smart locker system that is entirely implemented on an ESP32-S3 microcontroller. The system uses a dynamic challenge-response method, where users receive randomized digit sequences (for example, β€œ3-8-1-5”) to guard against replay attacks. Voice features are extracted using 39- dimensional Mel-Frequency Cepstral Coefficients (MFCCs), and speaker verification is conducted using lightweight Gaussian Mixture Models (GMMs) tailored for each user. To combat spoofing, liveness cues such as spectral flux at the start of speech, variance in zero- crossing rate, and response latency (under 3 seconds) are incorporated. The system was tested with 20 users (10 male and 10 female) in various environments: quiet, office (60 dB), and cafΓ© (65 dB), and it was evaluated against replay attacks using a smartphone speaker from different distances. In quiet conditions, the system achieved an Equal Error Rate (EER) of 2.1%, while under 60 dB noise, the EER was 4.8%. The False Acceptance Rate (FAR) against replay attacks was less than 1%, which is a significant improvement over fixed-phrase systems that had FARs greater than 30%. The average time to unlock was 2.4 seconds, with all processing done offline on the device. The solution requires no more than 100 KB of flash storage per user and functions without needing cloud or PC support. This work showcases a practical, embedded voice authentication system that effectively addresses major issues in current locker security, such as the absence of liveness detection, reliance on static credentials, and lack of embedded designs. Tested under realistic conditions, the proposed system provides a strong, cost- effective, and deployable solution for secure access control in banking and institutional environments.

References

[1]     B. Matrouf, J. Bonastre, and C. Fredouille, β€œA comparative study of various speaker verification approaches for embedded systems,” Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), pp. 233–236.

[2]     Z. Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, and H. Li, β€œSpoofing and countermeasures for speaker verification: A survey,” Speech Communication, vol. 66, pp. 130–153.

[3]     Ali and Q. Al-Azze, β€œDesign of voice recognition module for secure access system,” International Journal of Advanced Computer Science Applications, vol. 10, no. 4, pp. 112– 118.

[4]     R. Todisco, H. Delgado, and N. Evans, β€œA new feature for automatic speaker verification anti-spoofing: Constant-Q cepstral coefficients,” Proc. Odyssey Speaker and Language Recognition Workshop, pp. 283–290.

[5]     P. Jadhav and A. Agrawal, β€œA secure locker access system using RFID, fingerprint and password authentication,” International Journal of Engineering Research and Technology, vol. 8, no. 6.

[6]     V. Veeramallu, S. Reddy, and A. Rao, β€œDual-factor smart locker authentication using real-time face and voice recognition,” International Journal of Scientific Research in Engineering and Management, vol. 5, no. 3, pp. 1–7.

[7]     K. Reddy, S. Kumar, and D. Sekhar, β€œSmart locker using ESP32-CAM for face recognition and GSM-based OTP verification,” International Journal of Emerging Technology and Computer Science, vol. 12, no. 2.

[8]     Ali and Q. Al-Azze, β€œLow-cost voice-based security system using speaker-dependent VRM module,” International Journal of Electronics and Communication Engineering.

[9]     Nagrani, J. S. Chung, and A. Zisserman, β€œVoxCeleb: A large-scale speaker identification dataset,” Proc. INTERSPEECH, 2017.

[10] INMP441 Digital MEMS Microphone Datasheet, InvenSense. Espressif Systems, β€œESP32-S3 Technical Reference Manual.”

[11] H. Sakoe and S. Chiba, β€œDynamic programming algorithm optimization for spoken word recognition,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 1,pp. 43–49, 1978.


Keywords: Voice-based authentication, embedded biometrics, anti-spoofing, challenge- response, ESP32-S3

Citation: Muralikrishnan P*,Muralikrishnan P ( 2025), DESIGN AND IMPLEMENTATION OF A SPOOF-RESISTANT VOICE-BASED SMART LOCKER SYSTEM USING EMBEDDED MFCC AND CHALLENGE-RESPONSE AUTHENTICATION. , 13(4): 1-7

Received: 15/11/2025; Accepted: 30/11/2025;
Published: 04/12/2025

Edited by:

Mr.ERES JOURNALS

Reviewed by:

Copyright: @eres journals.

*Correspondence: Muralikrishnan P, pmuralikrishnanece@krce.ac.in


Copyright Β© 2013-2026 ERES Publications