Helloit is an Excel file with html codesI want to extract only text from each cell so one cell to many segmentswhat is the setting to use this regex<[^>]*>Any suggestion will be very appreciated
I found an online tool to extract text from html tagshtml to text converterfor segmentation:____________________________________________https://onlinetexttools.com/split-textsample:Sample-html-coding-20skus.xlsx
aazzoma khateeb
I'll eat my hat if your file is not an XML file that has been converted to Excel. You can use the normal Excel file type and check "Enable embedded content processing", which should do the job.
Remy Blattler published more granular HTML processing rules, if that is a help: https://remy.supertext.ch/2014/08/translate-excel-files-with-embedded-html-content-in-sdl-trados-studio/
You should not need to change any segmentation settings.
If you want more control, you can convert Excel->XML and define a custom XML file type, but I wonder whether it's worth the hassle.
Daniel
Thank You Dear @Daniel Hug there are about 100 Excel file, 90 rows eachit is worthy to segment cell using regex rule and reuse repetitionsI haven't enough experience Regex so I'm looking for a solution from experts would you please suggest a RegEx for segmentation rulesKind Regards
Hi aazzoma khateeb
Again, you don't need to toy around with segmentation rules here. This is a matter of processing embedded content:
Preview:
Preview works fine for me, but I can't prepare the file for translation. (This error: https://gateway.sdl.com/apex/communityknowledge?articleName=000004908) I can prepare the file for translation if I delete all rows except the first two (header and first row of content).
I notice the HTML is quite flaky, like a <li> element without a closing </li> and <ul> without closing </ul>. They are causing this error.
A workaround is to declare these tag pairs as mere placeholders, which is sloppy, but seems to solve the problem:
If you run into that error more often, you might have to do the same with the <p> and <div> elements.
Hope this helps. (settings file below.)
Excel_with_html_for_flaky_html_code.zip
Dear Daniel Hug I really appreciate your help
Thanks a million