Assert a Get Text with <strong> in the HTML markup

northernHemisphere · 30 January 2023 20:21

I am trying to assert contents of a

tag in a test case.

HTML fragment contains

<div id="result"><strong>42</strong> results</div>

In my test case, this assertion fails because there are some invisible characters before and after “42”.

Get Text	id=result	==	42 results

What is the way to do this assertion?

northernHemisphere · 30 January 2023 20:33

The markdown engine is eating my HTML fragment.

<div id="result"><strong>42</strong> results</div>

a-mamlouk · 30 January 2023 21:24

i remember i wrote this code a while ago, hope it helps

 Count_Test

    ${alllinkscount}=   get element count    xpath://a
    log to console    ${alllinkscount}
#    ${alllinkscount}=    Evaluate    ${alllinkscount} + 1
#    log to console  \r${alllinkscount}
    @{linkItems}    create list
    FOR    ${i}     IN RANGE    1       ${alllinkscount}+1
        ${linktext}=    get text    xpath:(//a)[${i}]
        log to console  \r[${i}], ${linktext}
    END

René · 30 January 2023 22:02

Hi @northernHemisphere

I tried your html snipped and the <strong> does not influence the Get Text .

What sometimes is an issue is a   (no-break space) which is a different character than a normal space.
Often Web developers use these to ensure that the spaces does not lead to a line break.

Maybe you can post the error message you get?

Ps: Three “Back Ticks” ``` before and after the preformatted code does the trick here in the forum.

René · 30 January 2023 22:21

Now maybe as a last resort:
you can log the “non printable” characters like this:


    ${text} =    Get Text    id=result
    ${escaped} =    Evaluate     [c if c in string.printable else r'\x{0:02x}'.format(ord(c)) for c in $text]   modules=string
    Log To Console    ${escaped}

That weird python expression escapes non printable characters.
then you can figure out what there is.

northernHemisphere · 31 January 2023 01:48

I thought I’d jump to last resort to get an idea of what is in the string…

['x2068', '1', '7', 'x2069', ' ', 'r', 'e', 's', 'u', 'l', 't', 's']

René · 31 January 2023 08:01

it looks like that \u2068 is a start of strong.

See ⁨ - First Strong Isolate: U+2068 - Unicode Character Table

But i doubt that this has anything to do with the html <strong> .
I would guess that the content has additionally these two start and end characters.

René · 31 January 2023 08:27

So i tested a bit more and you can in JavaScript send these unicode characters to the element like this.

Evaluate JavaScript    id=result   e => e.innerHTML = "<strong>\u206842\u2069</strong> result"

without the <strong> it is not rendered as strong on the page.

You can however replace these special characters when reading it with

Get Text    id=result    validate    re.sub('[\u2068\u2069]', '', value) == '42 result'

Or just return it with

Get Text    id=result    evaluate    re.sub('[\u2068\u2069]', '', value)

northernHemisphere · 31 January 2023 13:36

Thanks Rene.

Well you learn something everyday. Our internationalization library is putting these Unicode characters in by default to support display of bi-directional text. The HTML fragment I am testing is generated through that library (in our case to make sure pluralization of the word “result” matches the number of results) so it is putting in these Unicode characters. Turns out this is great as we will be working in right to left languages in the future.

The FSI 0x2068 and PDI 0x2069 are Unicode characters to help with the correct display of bi-directional text. For the benefit of future readers of this chat, see Unicode Isolation · projectfluent/fluent.js Wiki · GitHub.

Now that I know what is going on, I can design my test cases.

Regards.

northernHemisphere · 31 January 2023 14:46

Because the FSI and PDI characters are there, I think I’ll write the test to explicitly check for them.
Get Text id=result \u206842\u2069 results

The alternative would be to strip them out, which would also get the test passing.

Topic		Replies	Views
How to get a hidden text Browser	1	1445	9 January 2023
'Element text should be' in the Browser library Browser	4	5840	27 March 2022
How to assert on page titles when crawling Browser	2	534	10 January 2024
How to get value from UI? Robot Framework	1	2453	19 December 2022
How to get text from random tags Robot Framework	5	90	26 February 2025

Assert a Get Text with <strong> in the HTML markup

Related topics