It appears the inbound text is double-encoded UTF-8 - the problem character is the fancy-quote. I've seen this in IRIS pipelines where the UTF8 data is read into a character stream without setting the TranslateTable, then exported out through a UTF8 encoder (eg a REST call)
%SYS>s x="Can’t"
%SYS>zzdump x
0000: 0043 0061 006E 2019 0074 Can’t
%SYS>s y=$ZCVT(x,"O","UTF8")
%SYS>zw y
y="Canâ"_$c(128,153)_"t"
%SYS>w y
Can�t
%SYS>s z=$ZCVT(y,"O","UTF8")
%SYS>zw z
z="CanâÂ"_$c(128)_"Â"_$c(153)_"t"
%SYS>w z
Can�t
- Log in to post comments