Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: added mostly-proper Unicode support. #7

Merged
merged 1 commit into from
Oct 26, 2023

Conversation

Marcus10110
Copy link
Contributor

Note, the single character decoding obviously won't work with multi code-unit characters. To fix that, the code-unit by code-unit decoding would need to be modified to detect code-units in the surrogate range, and continue decoding more code-units until a complete code-point is found. Fortunately, most languages are within the Basic Multilingual Plane.

Note, a USB capture with non-ascii characters can be found in support ticket 83241

Note, the single character decoding obviously won't work with multi code-unit characters.
To fix that, the code-unit by code-unit decoding would need to be modified to detect code-units in the surrogate range, and continue decoding more code-units until a complete code-point is found.
Fortunately, most languages are within the Basic Multilingual Plane.
@Marcus10110 Marcus10110 requested a review from huffman October 12, 2023 22:00
@Marcus10110 Marcus10110 merged commit ed2934a into master Oct 26, 2023
4 checks passed
@Marcus10110 Marcus10110 deleted the fix/utf-16-decoding branch October 26, 2023 21:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants