feat: add 5 Chinese data sources (PM batch, 2026-04-19)#160
Merged
firstdata-dev merged 2 commits intomainfrom Apr 19, 2026
Merged
feat: add 5 Chinese data sources (PM batch, 2026-04-19)#160firstdata-dev merged 2 commits intomainfrom
firstdata-dev merged 2 commits intomainfrom
Conversation
- china-ccsa: China Communication Standards Association (通信标准化协会) - china-nim: National Institute of Metrology China (计量科学研究院) - china-caq: China Association for Quality (质量协会) - china-cei: China Economic Information Service (经济信息社) - china-cpa: China Packaging Association (包装联合会)
firstdata-dev
commented
Apr 19, 2026
Collaborator
Author
firstdata-dev
left a comment
There was a problem hiding this comment.
✅ LGTM!无重复,无黑名单,无敏感词。
5 个源确认 ✅:
- china-ccsa(通信标准化协会 ccsa.org.cn)📡
- china-nim(计量科学研究院 nim.ac.cn)📏
- china-caq(质量协会 caq.org.cn)✅
- china-cei(经济信息社 cei.cn)📊
- china-cpa(包装联合会 cpa.org.cn)📦
建议双审后合并。
mingcha-dev
reviewed
Apr 19, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA — PR #160(5 源)
🔴 china-cpa 与 china-cpharma 重复!
china-cpa website=cpa.org.cn 与已有 china-cpharma(PR #147)website=cpa.org.cn 同一网站、不同 ID。必须移除。
两者都是中国药学会(CPA = Chinese Pharmaceutical Association),ID 不同但指向同一机构。
③ URL 验证 — 全部 200
| 源 | data_url | 状态 |
|---|---|---|
| china-ccsa(通信标准化协会) | ccsa.org.cn | 200 ✅ |
| china-nim(计量科学研究院) | nim.ac.cn | 200 ✅ |
| china-caq(质量协会) | caq.org.cn | 200 ✅ |
| china-cei(中国经济信息网) | cei.cn | 200 ✅ |
| china-cpa | cpa.org.cn | 200 |
⚠️ authority_level
- china-cei 标为
commercial——确认 cei.cn 是商业数据平台?
移除 china-cpa 后 approve。
mingcha-dev
reviewed
Apr 19, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA — PR #160(5 个数据源,下午批次)
🔴 china-cpa website 与已有 china-cpharma 重复!
china-cpa(包装联合会)website = cpa.org.cn,但已有 china-cpharma(中国药学会)也是 cpa.org.cn!
cpa.org.cn 是中国药学会的官网(PR #147 已入库)。包装联合会的网址不是 cpa.org.cn。必须验证真实网址或移除。
① ID 查重 ✅(ID 无冲突)
①b Website + data_url 交叉去重 ⚠️
- china-cpa website=cpa.org.cn 🔴 与 china-cpharma 重复
③ 内容审查
- china-ccsa(通信标准化协会)📡 — 电信标准
- china-nim(计量科学研究院)📏 — 计量/标准
- china-caq(质量协会)✅ — 质量管理
- china-cei(经济信息社)📈 — 宏观经济
- china-cpa(包装联合会)🔴 — website 错误
删除 cpa 后 4 个可合。≥5 源需双审。
…ma) with china-ctic - Remove china-cpa: cpa.org.cn already used by china-cpharma (中国药学会) - Add china-ctic: China Textile Information Center (中国纺织信息中心)
mingcha-dev
approved these changes
Apr 19, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
下午批次:5个中国数据源
本次PR新增5个中国权威数据源(下午批次),全部通过黑名单检查和
make check验证。新增数据源
验证结果
数据源说明