Does anyone know whether it is possible to export TMX files from Studio without also exporting the metadata (created by, created on, and other attribute fields), in order to have clean training files for upload to Edge (ETS)? Or is there an easy way to remove these lines from the TMX files afterwards?
Many thanks in advance,
You might use the good old Olifant and its advanced Export options: untick everything under "TU-level data", and leave both "Export notes" and "Export empty segments" unticked.
Olifant does not always deal well with large files, so you had better double-check that no other content gets lost.
By the way, you might find some more tips and useful links on TM cleanup in my workshop and article for the BDÜ Conference 2019: Localization Engineering für Übersetzer - mit Freeware- und OpenSource-Helferlein (https://cattmatters.de/.cm4all/uproc.php/0/Localization_Engineering_BDUE2019_ChristineBruckner.pdf?cdp=a&_=16e9ec1ac38)
I might add your use case to the adapted English version that I am currently preparing ;-)
Great, many thanks for those very helpful tips! The first hurdle will probably be getting approval to install freeware on my work PC, but it certainly looks like a much better solution than my current method of applying regular expressions to a substantial number of Studio Export files :-)
Another hurdle with Olifant might be the old .NET 2.0.50727 version that it requires (under Windows 10, the older .NET Framework has to be activated in the Windows Features).
How about using Notepad++ (potentially the portable version, requires no installation) and the "Find in Files" command with Replace to batch-process your TMX export files with your regex?
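If a scripting language happens to be available on the work PC, the same regex idea can be sketched in Python. This is only a rough sketch, not a tested cleanup tool: the attribute names are the TU-level attributes defined in TMX 1.4b, and whether your Studio exports contain exactly these is an assumption to check against one of your own files. The names strip_tmx_metadata, clean_folder and META_ATTRS are made up for this example, and the script assumes UTF-8 files with double-quoted attribute values.

```python
import re
from pathlib import Path

# TU-level attributes defined in TMX 1.4b. Which of them your Studio
# export actually contains is an assumption; adjust the list as needed.
META_ATTRS = ("creationdate", "creationid", "changedate", "changeid",
              "lastusagedate", "usagecount")

# One metadata attribute, including the whitespace in front of it.
_ATTR_RE = re.compile(r'\s+(?:%s)="[^"]*"' % "|".join(META_ATTRS))
# Opening <tu ...> and <tuv ...> tags (attributes are only stripped here).
_TAG_RE = re.compile(r'<(?:tu|tuv)\b[^>]*>')
# Whole <prop>...</prop> and <note>...</note> elements, plus their line break.
_PROP_NOTE_RE = re.compile(
    r'[ \t]*<(prop|note)\b[^>]*>.*?</\1>[ \t]*\r?\n?', re.DOTALL)


def strip_tmx_metadata(text: str) -> str:
    """Remove metadata attributes from <tu>/<tuv> tags and drop <prop>/<note>."""
    text = _TAG_RE.sub(lambda m: _ATTR_RE.sub("", m.group(0)), text)
    return _PROP_NOTE_RE.sub("", text)


def clean_folder(src_dir: str, dst_dir: str) -> None:
    """Write a cleaned copy of every *.tmx file in src_dir to dst_dir."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    for path in sorted(Path(src_dir).glob("*.tmx")):
        # utf-8-sig drops a BOM if present (assumes UTF-8 exports).
        with open(path, encoding="utf-8-sig", newline="") as fh:
            text = fh.read()
        with open(dst / path.name, "w", encoding="utf-8", newline="") as fh:
            fh.write(strip_tmx_metadata(text))
```

Calling clean_folder("exports", "cleaned") with two folder names of your choice leaves the original exports untouched, so the result can be compared before anything is uploaded. Note that the pattern also removes the <prop> elements in the TMX header, and that attributes such as tuid and xml:lang are kept.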
OK, thanks, I'll bear that in mind!
I am currently using Notepad++, which is already installed on my computer and is actually a great tool! It might well turn out that Notepad++ is more reliable than Olifant if there is a risk of losing valuable data during the export process. I'll definitely try it out though!