Non-US ASCII characters corrupted in foreign language export

Non-US ASCII characters corrupted in foreign language export

I'm trying to export Notes document contents in XML format (using UTF-8). The doc contents can be in any of the 20 languages we deal with, ranging from German to Japanese, Russian or Greek. The problem is that when we export the documents all non-US ASCII characters are corrupted: They either break the XML flow (and Explorer stops rendering the XML file) or just appear as squares. Any hints?

    Requires Free Membership to View

    Register today to access targeted resources from our editorial writers and independent industry experts focused on Lotus Domino, Notes, Workplace and other related technologies.

    By submitting your registration information to SearchDomino.com you agree to receive email communications from TechTarget and TechTarget partners. We encourage you to read our Privacy Policy which contains important disclosures about how we collect and use your registration and other information. If you reside outside of the United States, by submitting this registration information you consent to having your personal data transferred to and processed in the United States. Your use of SearchDomino.com is governed by our Terms of Use. You may contact us at webmaster@TechTarget.com.

How are you doing the export? Are you using R6, and using the NotesDXLExporter functions?

If you're trying to roll your own, you might try that as an alternative. Also, remember Notes stores it's data internally in the Lotus Multi-Byte Character set, if you have it set up to do so. There might be a conversion problem between LMBCS and Unicode, so take a look at the settings on the machine and client you are exporting from to see how it's set.

If you are using NotesDXLExporter and Notes 6, then you might have uncovered a bug...er undocumented feature of Notes.


This was first published in April 2002