|
Open Chinese Convert 1.3.2
A project for conversion between Traditional and Simplified Chinese
|

Open Chinese Convert (OpenCC, 開放中文轉換) is an open source project for conversions between Traditional Chinese, Simplified Chinese and Japanese Kanji (Shinjitai). It supports character-level and phrase-level conversion, character variant handling, and regional vocabulary variants across Mainland China, Taiwan and Hong Kong. This is not a translation tool between Mandarin and Cantonese, etc.
中文簡繁轉換開源項目,支持詞彙級別的轉換、異體字轉換和地區習慣用詞轉換(中國大陸、台灣、香港)及日本新字體轉換。不提供普通話與粵語之間的轉換。
Discussion (Telegram): https://t.me/open_chinese_convert
詳情參閱OpenCC 設計思想及地區詞收錄標準。
winget install opencc 命令可直接安裝 opencc.exe 應用程式,含 Jieba 分詞插件npm install -g opencc 命令可安裝 OpenCC Node.js CLInpm install -g opencc opencc-jieba 命令可同時安裝 OpenCC Node.js CLI 及 Jieba 分詞插件pip install opencc 命令可安裝 Python API 及 Python CLIhttps://opencc.js.org/converter?config=s2t
npm install opencc
The npm package supports Node.js >=20.17. It uses bundled Node-API prebuilds when available and falls back to a local node-gyp build when the current platform does not have a matching prebuild.
To install the npm CLI:
The npm CLI supports basic text conversion. Plugins, --inspect, and --segmentation require the native OpenCC CLI.
See demo.js and ts-demo.ts.
pip install opencc (Windows, Linux, macOS)
The Python package also installs a basic CLI:
The Python CLI supports basic text conversion, --include-tofu-risk-dictionaries, and --resource-zip. Diagnostic modes such as --inspect and --segmentation still require the native OpenCC CLI.
When OpenCC is embedded in a server binary or self-contained application, the JSON config can stay small while dictionary resources are loaded from explicit resource directories:
FilesystemResourceProvider searches directories in order. Existing SimpleConverter("s2t.json") and CLI behavior continue to use the config file location, current directory, explicit paths, and installed OpenCC data directory as before.
Unless otherwise noted, this section describes the native OpenCC CLI built from the C++ toolchain. The Python and npm CLIs support basic file/stdin conversion only, plus --include-tofu-risk-dictionaries; the Python CLI also supports --resource-zip.
opencc --helpopencc_dict --helpOpenCC CLI supports two diagnostic modes that output JSON instead of converted text:
**--segmentation** — Output segmentation result only (no conversion):
**--inspect** — Output full inspection result (segmentation + per-stage conversion + final output):
These modes are useful for diagnosing conversion issues:
--segmentation to verify that the input is segmented as expected.--inspect to see which conversion stage produces an unexpected result.Rules:
--segmentation and --inspect are mutually exclusive.The following ports are maintained within the OpenCC ecosystem and are generally up to date with current configuration and dictionary data.
These ports are community-maintained and may not always track upstream updates.
s2t.json Simplified Chinese to Traditional Chinese (OpenCC Standard) / 簡體 到 OpenCC 標準繁體t2s.json Traditional Chinese (OpenCC Standard) to Simplified Chinese / OpenCC 標準繁體 到 簡體s2tw.json Simplified Chinese to Traditional Chinese (Taiwan Standard) / 簡體 到 台灣正體tw2s.json Traditional Chinese (Taiwan Standard) to Simplified Chinese / 台灣正體 到 簡體s2hk.json Simplified Chinese to Traditional Chinese (Hong Kong variant) / 簡體 到 香港繁體hk2s.json Traditional Chinese (Hong Kong variant) to Simplified Chinese / 香港繁體 到 簡體s2twp.json Simplified Chinese to Traditional Chinese (Taiwan Standard, with Taiwan Phrases) / 簡體 到 台灣正體(含台灣常用詞彙)tw2sp.json Traditional Chinese (Taiwan Standard) to Simplified Chinese (Mainland China Phrases) / 台灣正體 到 簡體(含中國大陸常用詞彙)t2tw.json Traditional Chinese (OpenCC Standard) to Traditional Chinese (Taiwan Standard) / OpenCC 標準繁體 到 台灣正體tw2t.json Traditional Chinese (Taiwan Standard) to Traditional Chinese (OpenCC Standard) / 台灣正體 到 OpenCC 標準繁體t2hk.json Traditional Chinese (OpenCC Standard) to Traditional Chinese (Hong Kong variant) / OpenCC 標準繁體 到 香港繁體hk2t.json Traditional Chinese (Hong Kong variant) to Traditional Chinese (OpenCC Standard) / 香港繁體 到 OpenCC 標準繁體下列配置文件仍在開發中,歡迎貢獻新詞組:
s2hkp.json Simplified Chinese to Traditional Chinese (Hong Kong variant, with Hong Kong Phrases) / 簡體 到 香港繁體(香港常用詞彙)hk2sp.json Traditional Chinese (Hong Kong variant) to Simplified Chinese (Mainland China Phrases) / 香港繁體 到 簡體(含中國大陸常用詞彙)下列配置文件僅供探索性研究,不建議用於生產環境:
t2jp.json Old Japanese Kanji (Kyūjitai) to New Japanese Kanji (Shinjitai) / 日文舊字體 到 日文新字體jp2t.json New Japanese Kanji (Shinjitai) to Old Japanese Kanji (Kyūjitai) / 日文新字體 到 **日文舊字體**,並將少量日文詞組轉換爲對應中文通过环境变量OPENCC_DATA_DIR加载指定路径下的配置文件
配置檔中的字典可使用 type: "inline",直接在 JSON 裡定義小型自訂詞彙, 不必修改外部字典檔。例如在 group.dicts 最前面加入覆寫規則:
規則與限制:
entries 必須是 JSON 物件。entries 的 key/value 必須是非空字串。group.dicts 的順序決定。conversion_chain 步驟,不提供鎖定最終輸出。備註:OpenCC 1.3.2+ 解析器支援有限 JSONC 語法(//、/* */ 註解與尾逗號)。 若需跨實作相容,建議使用嚴格 JSON,不依賴 JSONC 擴充。
更多完整示例可見 examples/config/。該目錄僅供學習與自訂參考,不屬於官方內建 配置列表。
OpenCC 現已支援外部 C++ 分詞插件。當前第一個插件為 opencc-jieba, 可通過 s2t_jieba.json、s2tw_jieba.json、s2hk_jieba.json、 s2twp_jieba.json、tw2sp_jieba.json 等插件配置啓用。
OpenCC now supports external C++ segmentation plugins. The first plugin is opencc-jieba, which can be enabled through plugin-backed configs such as s2t_jieba.json, s2tw_jieba.json, s2hk_jieba.json, s2twp_jieba.json, and tw2sp_jieba.json.
注意:
jieba 插件是可選組件,預設 OpenCC 構建、Python 套件和 Node.js 套件都不要求它。opencc-jieba 額外依賴 cppjieba 及其配套詞典資源,這些依賴僅在構建或分發該插件時需要。Notes:
jieba plugin is optional and is not required for the default OpenCC build, Python package, or Node.js package.opencc-jieba additionally depends on cppjieba and its dictionary resources. These dependencies are only needed when building or distributing the plugin itself.g++ 4.6+ or clang 3.2+ is required.
詳情見 doc/benchmark.md 檔案。
Please update if your project is using OpenCC.
Apache License 2.0
opencc-jieba plugin.opencc-jieba 插件使用的可選依賴。opencc, opencc-js 与 opencc-wasm 三个 NPM packages 區別的說明 https://github.com/nk2028/opencc-js/blob/HEAD/README-zh-TW.md#%E8%88%87-opencc-npm-package-%E7%9A%84%E5%8D%80%E5%88%A5Please feel free to update this list if you have contributed OpenCC.