ExecutionResult timeout (Killed) for large output.xml input

For a personalized view of our test case results we process our output.xml files.
We now have some stability test cases that run for over 48 hours, resulting in very large output.xml files of about 8.5 GB.

When we try to process them with:
result = ExecutionResult(output_file)
the code exits after about 900 seconds, with the only output “Killed”.
No exception is thrown; the Python process just exits.

Is there any timeout that can be tweaked?
Any other ways to get around this?

Hi Magnus,

Are these files on a local disk or some type of network drive? Though I’m not familiar with the code behind ExecutionResult, the 900 seconds leads me to think it might be a slow disk I/O issue. If that’s the case, I’d try copying the file locally first (into a temp dir?).

Here’s the API doc for ExecutionResult (if you didn’t have it). The only options that might help are include_keywords and flattened_keywords, which get passed through to ExecutionResultBuilder. Maybe as a test you could try adding the option include_keywords=False just to see if that makes a difference?
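
A minimal sketch of that test (assuming the file is named output.xml; with keywords skipped, only suite- and test-level data is available afterwards):

    from robot.api import ExecutionResult

    # Skip building keyword objects entirely; suite and test results
    # are still parsed and available.
    result = ExecutionResult("output.xml", include_keywords=False)
    print(result.suite.status)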

Hope that’s helpful,

Dave.

Hi,

There are also --splitlog and --removekeywords with some options (for example PASSED) that can lighten the log.
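
For example, something like this (an untested sketch; file names are placeholders) should strip keyword data from passed tests while keeping it for failures:

rebot --removekeywords PASSED --output smaller.xml output.xml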

Regards
Charlie


Thanks for the ideas on how to solve this.

After testing different ways of reprocessing using rebot, I found that it is not a timing issue, it’s a size issue:

rebot --output small.xml --removekeywords ALL output.xml
Killed

rebot --flattenkeywords ITERATION --output small.xml output.xml
Resulted in successful processing and a 9 MB small.xml file from an input of 8.5 GB!

I also noticed that we had output.xml files up to 2 GB that had been successfully processed, so that seems to be the limit.

I don’t have direct control of how the output.xml is created, so my way forward here is that I’ll check if my input output.xml file is larger than 2 GB. In that case I’ll pre-process it with “rebot --flattenkeywords ITERATION --output small.xml output.xml”.
Then I can process it with result = ExecutionResult(output_file).
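
In case it helps others, here’s a sketch of that workflow in Python (the 2 GB threshold and the small.xml temp name are just my observations from this thread, not hard limits):

    import os
    import subprocess
    from robot.api import ExecutionResult

    SIZE_LIMIT = 2 * 1024**3  # ~2 GB, the rough limit observed above

    def load_result(output_file):
        if os.path.getsize(output_file) > SIZE_LIMIT:
            # Flatten loop iterations first so the file becomes manageable.
            # --log NONE --report NONE skips generating log.html/report.html.
            # Note: rebot's exit code is the number of failed tests, so
            # don't use check=True here.
            subprocess.run(
                ["rebot", "--flattenkeywords", "ITERATION",
                 "--output", "small.xml", "--log", "NONE", "--report", "NONE",
                 output_file])
            output_file = "small.xml"
        return ExecutionResult(output_file)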

Thanks
/Magnus


The main problem with processing huge output.xml files is typically memory usage. If more memory is needed than the machine has physical RAM, the operating system starts swapping and execution slows down. There’s also a limit to swapping, and if that’s exceeded, the process is killed.

The reason --flatten-keywords helps so much is that it affects XML parsing at a very low level (see the code here). It basically throws away unnecessary XML nodes (which saves memory) and avoids creating Robot’s keyword objects (which saves even more memory). Because flattening is so powerful, we added a dedicated robot:flatten keyword tag to support it during execution. It doesn’t, at least yet, support flattening only loop iterations the way --flatten-keywords iteration does (and yes, --flatten-keywords is badly named as it also affects control structures).
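
For reference, the tag is set like any other keyword tag; a minimal sketch (the keyword name and its contents here are made up):

    *** Keywords ***
    Process Items
        [Arguments]    @{items}
        [Tags]    robot:flatten
        FOR    ${item}    IN    @{items}
            Log    ${item}
        END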

If you use --remove-keywords, the end result can be as good as with --flatten-keywords, but that option requires first parsing the whole output.xml, and thus you may never see those end results.
