Add a Java parser option to generate target files with \uxxxx escapes for all non-ASCII characters, encoded as UTF-8 with BOM

Under Community Review

The current behavior of the Java parser in Passolo 2018 (18.0.97.0) is that the target file is generated in one of the following formats:
1. UTF-8 with BOM, with non-ASCII characters not escaped
2. ANSI or UTF-8 without BOM, with non-ASCII characters escaped

I would like the Java parser to offer a third option: generate target files in which all non-ASCII characters are escaped as \uxxxx and the file is also encoded as UTF-8 with BOM.
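Until such an option exists, the requested output could in principle be produced by post-processing the generated target file outside Passolo. The following is only a minimal sketch of that idea, not Passolo functionality: the class name EscapedBomWriter and its two methods are hypothetical, and it assumes the target text is already available as a Java string.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Passolo: escapes non-ASCII characters
// and prepends a UTF-8 byte order mark to the encoded result.
final class EscapedBomWriter {

    // The three bytes that make up the UTF-8 BOM.
    private static final byte[] UTF8_BOM = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF};

    // Replaces every char above 0x7F with a backslash, the letter u and
    // four hex digits. Characters outside the BMP are emitted as two
    // escapes, one per UTF-16 surrogate.
    static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c > 0x7F) {
                out.append(String.format("\\u%04x", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    // Escapes the text, encodes it as UTF-8 (pure ASCII after escaping)
    // and puts the BOM in front of it.
    static byte[] toUtf8WithBom(String text) {
        byte[] body = escapeNonAscii(text).getBytes(StandardCharsets.UTF_8);
        byte[] result = new byte[UTF8_BOM.length + body.length];
        System.arraycopy(UTF8_BOM, 0, result, 0, UTF8_BOM.length);
        System.arraycopy(body, 0, result, UTF8_BOM.length, body.length);
        return result;
    }
}
```

The bytes returned by toUtf8WithBom could then be written to the target file, for example with java.nio.file.Files.write. Because every non-ASCII character is escaped first, the file content after the BOM is plain ASCII.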

  • Hi, I was recently handling similar issues.

    I wanted UTF-8 without a BOM header, escaped. I had to set the general Java parser setting to UTF-8, and then, in the individual source list parsers, set the source file encoding to "AutoDetect" and the target file encoding to "same as source files", without BOM. (Also, the segmenter was not set up.)

    See this topic on the Community, which has a relevant answer from Achim Herrmann: community.sdl.com/.../13431. So this behaviour is intended in Passolo. Also keep in mind that you set the parser in general first and then again at the source list level.