How to automate Audio in Robotframework

alice88 · 30 June 2025 11:05

Hi everyone, can someone help me?
I am currently automating a mobile banking application..When I successfully make a transfer, the app plays an audio message saying “You have successfully transferred 2 dollars.”
I want to verify whether the audio message is exactly what I expect.
Is there a way to automate the verification of this audio?
Thanks for support.

damies13 · 30 June 2025 16:18

Hi Alice,

I’ve not seen an audio library for robot framework, but it should be possible to create one using python audio modules.

My guess what you’ll need to do is use the python audio module to create a loopback device and then redirect the audio from the os into this device and capture the audio to a wav file, then compare that wav file to a known source.

How are your python programming skills? While I wouldn’t call this a trivial python project it should be achievable as a few python functions that you can make callable from RF as keywords.

Dave.

alice88 · 1 July 2025 07:09

Thanks so much for your suggestion – it’s really helpful! I’ve actually been exploring this idea and found a similar approach mentioned on ChatGPT as well: using a Python audio module to capture and save the system audio, then comparing it to a known file.
I think I’ll give it a try and see how it goes.
Thanks again for your support !!!

damies13 · 1 July 2025 08:14

Hi Alice,

There’s quite good documentation on writing python library files here: Robot Framework User Guide

A few of us on this forum have done it (just not for audio) so just ask if something’s not clear.

Dave.

Many · 1 July 2025 20:15

I would also look into visualizing your recorded audio as a waveform and doing a visual comparison against a reference waveform.
I’m sure there are python modules out there to record audio and convert it.

Many · 1 July 2025 20:18

If the importance is to only verify the text content - there are ways to transcribe your audio to text (there are a lot of AI services offering that, but I’m sure there are also models and tools that you can run locally) .
Then you would just compare the transcription against a reference text

rasjani · 2 July 2025 10:49

I’m sure there are also models and tools that you can run locally

Whisper runs locally and its pretty much standard tooling to translate speech to text. There’s free mac tool in app store but binaries can be downloaded with brew (openai-whisper). Other platforms most likely have binaries available too..

rasjani · 2 July 2025 11:01

Recorded a small audio clip and ran whisper against the wav file:

rasjani@Mac ~/tmo/bounssit$ time whisper 02.wav  --output_format txt --language en
/opt/homebrew/Cellar/openai-whisper/20250625/libexec/lib/python3.13/site-packages/whisper/transcribe.py:132: UserWarning: FP16 is not supported on CPU; using FP32 instead
  warnings.warn("FP16 is not supported on CPU; using FP32 instead")
[00:00.000 --> 00:06.600]  you have successfully transferred two dollars

real	0m13.261s
user	1m10.391s
sys	0m5.182s
rasjani@Mac ~/tmo/bounssit$ cat 02.txt
you have successfully transferred two dollars
rasjani@Mac ~/tmo/bounssit$

Works

Topic		Replies	Views
Verify text in html towards predefined text Robot Framework	4	61	2 August 2024
Passing a Class Instance as a Library Robot Framework	2	712	11 June 2024
Emmbedded QA Automation Robot Framework	7	1796	5 March 2025
Robotframework and Autoit Library Libraries	9	2843	8 March 2024
Automate windows application using robot framework Robot Framework	15	2635	20 May 2025

How to automate Audio in Robotframework

Related topics