SDL Trados Studio
SDL Trados GroupShare
SDL Trados Business Manager
SDL Trados Live
SDL MultiTerm
SDL Passolo
SDL Speech to Text
SDL Managed Translation - Enterprise
SDL MultiTrans
SDL TMS
SDL WorldServer
Translation Management Connectors
SDL LiveContent S1000D
SDL Contenta S1000D
SDL XPP
SDL Tridion Docs
SDL Tridion Sites
SDL Content Assistant
SDL Machine Translation Cloud
SDL Machine Translation Connectors
SDL Machine Translation Edge
Language Developers
Tridion Developers
Tridion Docs Developers
Xopus Developers
Community Help
SDL User Experience
Language Products - GCS Internal Community
SDL Community Internal Group
SDL Access Customer Portal
SDL Professional Services
SDL Training & Certification
Style Guides
Language Technology Partner Group
SDL Academic Partners
SDL Approved Trainers
SDL Enterprise Technology Partners
XyUser Group
ETUG (European Trados User Group) Public Information
Machine Translation User Group
Nordic SDL Tridion Docs User Group
SDL Tridion UK Meetup
SDL Tridion User Group New England
SDL Tridion West Coast User Group
SDL WorldServer User Group
Tridion Docs Europe & APAC User Group
Tridion User Group Benelux
Tridion User Group Ohio Valley
SDL MultiTerm Ideas
SDL Passolo Ideas
SDL Trados GroupShare Ideas
SDL Trados Studio Ideas
SDL Machine Translation Cloud Ideas
SDL Machine Translation Edge Ideas
SDL Language Cloud TMS Ideas
SDL Language Cloud Terminology Ideas
SDL Language Cloud Online Editor Ideas
SDL Managed Translation - Enterprise Ideas
SDL TMS Ideas
SDL WorldServer Ideas
SDL Tridion Docs Ideas
SDL Tridion Sites Ideas
SDL LiveContent S1000D Ideas
SDL XPP Ideas
Events & Webinars
To SDL Documentation
To SDL Support
What's New in SDL
Detecting language please wait for.......
Hi Community,
I have received a Wordpress XML file and creating a custom XML for it.
It looks good so far, the parsing works, but it has HTML embedded content, and it is hard to clean it up.
It has some of the below content:
************************************<strong> Mise en œuvre</strong>L'ensemble des notions théoriques seront illustrées par des cas concrets sur le logiciel elec calc.[/av_textblock]
[av_heading tag='h3' padding='10' heading='Dates' color='' style='blockquote modern-quote' custom_font='' size='' subheading_active='' subheading_size='15' custom_class='' admin_preview_bg='' av-desktop-hide='' av-medium-hide='' av-small-hide='' av-mini-hide='' av-medium-font-size-title='' av-small-font-size-title='' av-mini-font-size-title='' av-medium-font-size='' av-small-font-size='' av-mini-font-size='' margin=''][/av_heading]
[av_textblock size='16' font_color='' color='' av-medium-font-size='' av-small-font-size='' av-mini-font-size='' av_uid='av-50tak4l' admin_preview_bg='']<span style="color: #000000;"><strong>Les 24, 25 et 26 novembre de 9h00 à 12h30 et de 14h00 à 17h30</strong></span>[/av_textblock]
[av_heading tag='h3' padding='10' heading='Programme' color='' style='blockquote modern-quote' custom_font='' size='' subheading_active='' subheading_size='15' custom_class='' admin_preview_bg='' av-desktop-hide='' av-medium-hide='' av-small-hide='' av-mini-hide='' av-medium-font-size-title='' av-small-font-size-title='' av-mini-font-size-title='' av-medium-font-size='' av-small-font-size='' av-mini-font-size='' margin=''][/av_heading]
[av_textblock size='16' font_color='' color='' av-medium-font-size='' av-small-font-size='' av-mini-font-size='' av_uid='av-juva8v6m' admin_preview_bg='']
********************************
I am a newbie to regex, but with testing I came up with the below to filter out the [av...] bits (works in RegExr):
\[[a-z\s\S]+\]
However, I cannot seem to add this to Studio. Do I have to add this to the HTML embedded content processor?Studio seems to have a problem with it each time I add it here (something is missing, have to add an attribute, etc):
Can you please advise? Added the source file for reference.
Thank you!Greta
export-tsi-page-octobre-2020.xml
Greta Magyar
As usual Wordpress do a really poor job of exporting even XML for translation!
You may have to define more rules if you want to do this in the filetype by setting it up like this for example…
Thank you kindly Paul, it works like a charm!
You may have to define more rules if you want to do this in the filetype by setting it up like this for example:
Then use the text delimited ECP like this (I only added one more rule):
And now you have this sort of thing:
Alternatively, and this may be a better solution for you. Stick to your current approach but then use the SDL Data Protection Suite to handle the stuff in the square brackets... like this:
Now you have this:
Maybe a bit easier to handle like this. You just need to remember to remove the protection before you save the target file.