Skip to content

Multiply word_size by 4 in parse_npy_header for dtype 'U'#55

Open
sjmielke wants to merge 1 commit intorogersce:masterfrom
sjmielke:patch-1
Open

Multiply word_size by 4 in parse_npy_header for dtype 'U'#55
sjmielke wants to merge 1 commit intorogersce:masterfrom
sjmielke:patch-1

Conversation

@sjmielke
Copy link

For Unicode data, the NumPy "length" refers to characters, not bytes, specifically UCS-4/UTF-32 encoded characters, so we need 4 bytes of storage for each element.

For Unicode data, the NumPy "length" refers to characters, not bytes, specifically UCS-4/UTF-32 encoded characters, so we need 4 bytes of storage for each element.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant