  1. Module Tools
  2. MODTOOLS-34

usfm2osis.py can fail on non-BMP text

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Upstream Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: usfm2osis.py
    • Labels:
      None

      Description

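      UCS-2 is the default internal representation of Unicode in CPython builds before Python 3.3, so non-BMP characters (code points above U+FFFF) in the input are stored as surrogate pairs and may cause problems such as miscounted string lengths or characters split in half. usfm2osis.py should print a warning in that case.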

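      Use the following to check whether the interpreter was compiled as UCS-4 or UCS-2:
      import sys
      # sys.maxunicode is 0xFFFF on a narrow (UCS-2) build and 0x10FFFF on a wide (UCS-4) build
      'UCS4' if sys.maxunicode > 0xFFFF else 'UCS2'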
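
      The warning proposed above could look roughly like the sketch below. This is not code from usfm2osis.py: the function names and the warning text are hypothetical, and it assumes the input has already been decoded to a Unicode string. On a narrow build a non-BMP character shows up as two surrogate code units, so the scan checks for both cases. A lone surrogate on a wide build would also be flagged.

```python
import sys
import warnings


def is_narrow_build():
    """Return True when the interpreter stores Unicode as UCS-2 code units."""
    return sys.maxunicode <= 0xFFFF


def contains_non_bmp(text):
    """Return True if text holds a code point above U+FFFF.

    On a narrow build such a character appears as a surrogate pair,
    so surrogate code units (U+D800..U+DFFF) are treated as a match too.
    """
    for ch in text:
        code = ord(ch)
        if code > 0xFFFF or 0xD800 <= code <= 0xDFFF:
            return True
    return False


def warn_if_non_bmp_unsafe(text):
    """Warn (and return True) when non-BMP input meets a narrow build."""
    if is_narrow_build() and contains_non_bmp(text):
        # Hypothetical message; the real script may word this differently.
        warnings.warn("Input contains non-BMP characters, but this Python "
                      "build is UCS-2; output may be corrupted.")
        return True
    return False
```

      On Python 3.3 and later every build is wide, so warn_if_non_bmp_unsafe never fires there; the check only matters for older narrow builds.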

            People

            • Assignee:
              chrislit Chris Little
              Reporter:
              chrislit Chris Little
            • Votes:
              0
              Watchers:
              1

              Dates

              • Created:
                Updated:
                Resolved: