Characters described as own simplified and traditional variant

The Unihan database currently contains 431 instances of characters described as their own variants. This is logically inconsistent. The correct traditional variants should of course remain, but the logically incorrect entries need to be removed.

I have previously reported four of these instances - U+575B 坛, U+5978 奸, U+6784 构 and U+9759 静 - through the official channel. Rather than doing this more than four hundred more times, I instead generated a complete list of all the instances, which is attached:

[simplified.txt](https://github.com/user-attachments/files/17264607/simplified.txt)

How would you like to proceed on this issue? Since kSimplifiedVariant and kTraditionalVariant are provisional fields, we could work through files already here in this repository, once updated to the current Unicode version. For a mass update like this, however, it might be simpler for you to update the official copy of the database directly, then regenerate files here for a final check.

Let me know.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Characters described as own simplified and traditional variant #408

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Characters described as own simplified and traditional variant #408

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions