Chinese Buddhist Canon Digitization: A Review and Prospects
Abstract
1. Introduction
- (1)
- The first category comprises Image Databases, which provide high-quality digital reproductions of canonical collections through modern photographic and scanning technologies. By granting direct access to primary sources, these databases support essential philological tasks such as producing critical editions and analyzing textual variants. Representative examples include the National Library of China’s databases for the Zhaocheng Jin Canon and Sixi Canon, as well as the Three Editions of Buddhist Canons Database from Zōjōji Temple 增上寺.
- (2)
- Buddhist Canon Full-Text Databases, on the other hand, transcribe the Canon’s content into searchable text. While image databases offer proximity to original sources, full-text databases are favored for daily research due to their efficiency in retrieval and reference. Among these, CBETA stands out as a preeminent example, surpassing the printed Taishō edition in both accessibility and textual accuracy. Moreover, CBETA has established an academic standard with its concise citation format indicating volume, text number, page, column, and line.
- (3)
- The third category is Image–Text Parallel Databases, exemplified by Japan’s SAT Daizōkyō Text Database. These integrate the advantages of both image and full-text repositories by pairing searchable transcriptions with corresponding original images, thereby greatly facilitating textual verification and comparative analysis. A distinctive feature of the SAT database is its interoperability with other major resources, including the Digital Dictionary of Buddhism (DDB), INBUDS, and CiNii, signaling a forward-looking model for the integrated development of Buddhist digital scholarship (Tong 2017).
2. Digitization of Texts of the Chinese Buddhist Canon
- (1)
- The Zen-Knowledgebase Project, developed by the International Research Institute for Zen Buddhism (IRIZ) at Hanazono University in Japan. Directed by Dr. Urs App, then Deputy Director of the Institute, its primary goal was to digitize Zen Buddhist texts and build a comprehensive database. The content encompasses ancient Zen texts, modern research, reference materials, maps, and Zen art.
- (2)
- The Chinese Buddhist Electronic Text Association (CBETA) Project, led by Venerable Huimin 惠敏法師, Christian Wittern, and others. Its primary objective was the digitization of Chinese Buddhist Canons, beginning with the Taishō and later expanding to include other major editions.
- (3)
- The SAT Daizōkyō Text Database, founded by Professor Yasunori Ejima 江島惠教 of Japan. SAT is the abbreviation for the Sanskrit phrase “Saṃgaṇikīkṛtaṃ Taiśotripiṭakaṃ” (Digitized Taishō). Its primary goal was the digitization of the 85 volumes of the Taishō.
- (4)
- The Research Institute of the Tripitaka Koreana (RITK), established in 1993 to ensure the permanent preservation of the 81,340 wooden printing blocks of the Second Carved Edition (1236–1251) of the Goryeo Canon housed at Haeinsa Temple. The institute undertook the comprehensive digitization of both the physical printing blocks and the full-text images of the Goryeo Canon.
2.1. The Chinese Buddhist Electronic Text Association (CBETA): Historical Genesis and Methodological Paradigms
2.1.1. The Philology of Encoding: TEI and the “Gaiji” 缺字 Solution
2.1.2. Digitizing the Apparatus Criticus: The XML-Based Collation Structure
2.1.3. Infrastructure for the Future: Integration and Semantic Discovery
2.2. The SAT Daizōkyō Text Database: From Digital Text to Research Ecosystem
2.2.1. Evolution of Editorial Philosophy and Web Strategy
2.2.2. Interoperability: The API Ecosystem
2.2.3. The AI Turn: SATed and High-Precision Text–Image Alignment
3. Prospects for the Digitization of the Chinese Buddhist Canon
3.1. The “Taishō Bottleneck”: Lineage Bias and Collation Limitations
3.2. The Material Turn and the Dialectics of Systematic Collation
- (1)
- National Library of China’s Zhaocheng Jin Canon, China: In the winter of 1931, Zhu Qinglan 朱慶瀾, Ye Gongchuo 葉恭綽, and others initiated the “Photolithographic Song Canon Association” to photolithographically reproduce the Song-Yuan edition Qisha Canon held at the Kaiyuan Temple 開元寺 and Wolong Temple 臥龍寺 in Shaanxi Province for wider circulation. The monk Fan Cheng 范成 was tasked with investigating ancient Buddhist scriptures held in various temples that could supplement missing fascicles of the Qisha Canon. In the summer of 1932, following local leads, Fan Cheng discovered the Zhaocheng Jin Canon at the Guangsheng Temple 廣勝寺 in Zhaocheng 趙城, Shanxi. The majority of the texts in the Zhonghua Dazangjing, published by Zhonghua Book Company 中華書局, have been photolithographically reproduced based on the National Library’s Zhaocheng Jin Canon as the base text (Zhonghua Dazangjing Bianjiju 1983). In recent years, as part of the National Library’s ancient text digitization project, the Zhaocheng Jin Canon has been digitized and included in the “Chinese Ancient Books Resource Database.” (Appendix A).
- (2)
- National Library of China’s Sixi Canon, China: Originally acquired by Yang Shoujing from Japan in the late Qing dynasty, the Tian’an Temple edition of the Sixi Canon consisted of over 4600 fascicles but lacked the 600-fascicle Mahāprajñāpāramitā Sūtra. 6After passing through the Songpo Library 松坡圖書館 in Beijing, the collection was integrated into the National Library of China (NLC) in 1950. The NLC later completed the set by repurchasing the missing sutra, and the canon has since been digitized and included in the “Chinese Ancient Books Resource Database.” (Appendix A).
- (3)
- Fuzhou Canon Hybrid Edition held by the Imperial Household Agency 宮內廳, Japan: Produced in the Fuzhou region during the Northern and Southern Song dynasties, the Fuzhou Canon includes the Dongchan Temple edition 東禪寺版 (Chongning Canon 崇寧藏) and Kaiyuan Temple edition 開元寺版 (Pilu Canon 毗盧藏), with surviving sets predominantly hybrid in composition. While no complete versions exist in China today, Japan houses several hybrid sets, including one preserved by the Imperial Household Agency’s Shoryōbu. This edition, comprising 1454 texts across 5733 fascicles, was utilized as a collational reference for the Taishō, indicated by the abbreviation “宫” in its collation notes. Full-image access is provided through the “Imperial Household Agency Shoryōbu Collection of Chinese Classics Database.” (Appendix A) More recently, Cultural Relics Press issued a facsimile of the Fuzhou Canon held in the Imperial Household Agency’s Archives and Mausolea Department (Shoryōbu 書陵部) (Wang 2025).
- (4)
- First and Second Carved Editions of the Goryeo Canon held by Haeinsa Temple 海印寺, Korea: Initially digitized by the Korean Goryeo Canon Research Institute, both the First and Second Carved Editions were temporarily available online before being relocated to Dongguk University’s “Archives of Buddhist Culture” database, where they are currently accessible. (Appendix A).
- (5)
- Yuan Official Canon 元官藏 held by Yunnan Provincial Library, China: The Yuan Official Canon was identified relatively late. As early as the beginning of the last century, Ono Gemmyō 小野玄妙 noted the existence of a Yuan Dynasty Canon distinct from both the Qisha Canon and the Puning Canon, but its name remained uncertain. In 1984, Tong Wei 童瑋, Fang Guangchang 方廣錩, and Jin Zhiliang 金志良, in their article “The Discovery of the Officially Carved Yuan Dynasty Canon,” named the approximately 32-volume Canon found at the Yunnan Provincial Library the Yuan Official Canon (Tong et al. 1984). Subsequently, Li Jining 李際寧, Chi Limei 池麗梅, and others published articles researching the Yuan Official Canon (Li 2010; Chi 2023). The Yuan Official Canon held by the Yunnan Provincial Library is currently accessible via the library’s “Yunnan Ancient Books Digital Library” database. (Appendix A).
- (6)
- Jingshan Canon 徑山藏 held by the University of Tokyo, Japan: This Ming dynasty canon, also known as the Jiaxing Canon 嘉興藏, is notable for including Chan Buddhist discourse records absent in earlier canons. As extant versions vary significantly in content across collections, determining its definitive catalog remains a scholarly priority. The University of Tokyo’s digitized collection provides valuable primary sources for this research. (Appendix A).
- (7)
- Yongle Northern Canon 永樂南藏 and Yongle Southern Canon 永樂北藏 held by Shandong Provincial Library, China: The Yongle Southern Canon, initially carved under Emperor Chengzu, was expanded through three subsequent supplements, totaling 678 cases containing 1618 texts across 6325 fascicles. Shandong Provincial Library’s collection comprises 5200 fascicles in 5104 volumes. The Yongle Northern Canon, representing the third imperial Ming edition, was produced across the Yongle and Wanli reigns, containing 636 main cases and 41 supplementary cases. The library’s Wanli 20th-year (1592) imprint preserves 5386 fascicles. Both canons are accessible through the specialized “Ming Dynasty Buddhist Canon Database.” (Appendix A).
- (8)
- Goryeo, Sixi, and Puning Canon 普寧藏 at Zōjōji Temple, Japan: These three canons served as collation references for the Taishō, with the Second Carved Editions of Goryeo Canon as base text and the Sixi and Puning Canons as reference editions marked “Song” and “Yuan”, respectively. In November 2023, Zōjōji Temple launched digital access to its collections, comprising 5342 Sixi Canon fascicles, 5228 Puning Canon fascicles, and 1357 bound volumes of the Goryeo Canon. (Appendix A).
3.3. Case Study: Structural Divergence in the Mahāsaṃnipāta Sūtra
“The Newly Combined Mahāsaṃnipāta Sūtra: the work mentioned above, in 60 fascicles, was newly combined in the 6th year of the Kaihuang era(586) by the śramaṇa Sengjiu of the Zhaoti Monastery. Sengjiu entered the monastic life in his youth and devoted himself to the study of the scriptural treasury. According to the Sanskrit manuscript, this Mahāsaṃnipāta Sūtra contains a total of 100,000 verses. If translated in full, it would comprise about 300 fascicles. … However, as we are growing distant from the Sage [the Buddha], ordinary understanding is gradually dimming; people cannot retain the whole and thus copy and translate according to the portions they receive. Consequently, the Sanskrit manuscripts that arrived were incomplete in their divisions and bundles: abbreviated versions were translated briefly, expanded versions were translated extensively. For this reason, the translations produced by earlier masters—Zhi [Qian] and [Dharm]arakṣa, as well as those by Kumārajīva—resulted in versions of sometimes 27, sometimes 30, sometimes 31 fascicles; the scrolls were not fixed. Sengjiu, while propagating the scripture, often lamented this. When he later saw that [Narendra]yaśas, during the Northern Qi period, had translated the Candragarbha Sūtra in 12 fascicles, and again in the present Kaihuang era the same Yaśas translated the Sūryagarbha Sūtra in 15 fascicles—and since both were expanded old sections of the Mahāsaṃnipāta—he was inwardly delighted and promptly combined them to form 60 fascicles. Although Sengjiu incorporated them, the work was not yet refined. Later, the śramaṇa Hongqing of the Daxingshan Monastery, a man of profound insight and clarity, was commissioned by the Empress to collate and copy the scriptures of two canons. He then corrected the titles and headings in Sengjiu’s combined version, putting it into excellent order.”
《新合大集經》,右一部六十卷。招提寺沙門釋僧就開皇六年新合。就少出家,專寶坊學。依如梵本,此《大集經》凡十萬偈。若具足翻,可三百卷。……然去聖將遠,凡識漸惛,不能總持,隨分撮寫。致來梵本,部夾弗全,略至略翻,廣來廣譯。緣是前哲支、曇所翻及羅什出,或二十七,或復三十,或三十一,卷軸匪定。就既宣揚,每恒嗟歎。及覩耶舍高齊之世出《月藏經》一十二卷。至今開皇復屬耶舍譯《日藏經》一十五卷。既並《大集》廣本舊品,內誠欣躍,即依合之成六十軸。就雖附入,未善精。比有大興善寺沙門洪慶者,識度淵明,奉為皇后檢校抄寫眾經兩藏,遂更正就所合名題,甚為整頓。7
“For this sūtra, both the State Edition and the Song Edition comprise 60 fascicles in 17 sections. The Dan Canon contains 30 fascicles in 11 sections. Furthermore, at the beginning of the sūtra, both the State and Song editions include the section titled ‘Yingluo Pin’ (Jewel-Net Section), which is absent in the Dan Canon. The ‘Xukongzang Pin’ (Space-Storage Section) appears in the two editions after the ‘Bukeshuo [Pin]’ (Inexpressible Section), but in the Dan Canon it is placed before the ‘Wuyan Pin’ (Wordless Section). Also, after the ‘Baoji Pin’ (Jeweled Topknot Section), the two editions contain the ‘Wujinyi Pin’ (Inexhaustible Meaning Section) in four fascicles, which the Dan Canon lacks; instead, it has the ‘Rimi Fen’ (Sun-Secret Division) in three fascicles…”
此經《國本》《宋本》皆六十卷,凡十七品。《丹藏》中三十卷,十一品。又經初首《國》《宋》兩本則有《瓔珞品》名,丹藏所無。其《虛空藏品》兩本在《不可說》後,丹藏在《無言品》前。又於《寶髻品》後兩本有《無盡意品》四卷,《丹藏》即無,而有《日密分》三卷……8
3.4. Future Directions: AI-Driven “Hybrid Digital Canon”
4. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| API | Application Programming Interface |
| CBETA | Chinese Buddhist Electronic Text Association |
| CiNii | Citation Information by National institute of informatics. A database suite operated by the National Institute of Informatics (NII). |
| DDB | Digital Dictionary of Buddhism. |
| DOI | Digital Object Identifier |
| EBTI | Electronic Buddhist Text Initiative |
| IRIZ | International Research Institute for Zen Buddhism |
| INBUDS | Indian and Buddhist Studies Treatise Database |
| LLM | Large Language Models |
| NLC | National Library of China |
| NLP | Natural Language Processing |
| OCR | Optical Character Recognition |
| RITK | Research Institute of the Tripitaka Koreana |
| SAT | Saṃgaṇikīkṛtaṃ Taiśo Tripiṭakaṃ/The SAT Daizōkyō Text Database |
| TEI | Text Encoding Initiative |
Appendix A
| Database | Website | Canon |
|---|---|---|
| Chinese Ancient Books Resource Database | https://www.nlc.cn/pcab/zy/zhgj_zyk/ (Last accessed: 28 October 2025) | Zhaocheng Jin Canon; Sixi Canon |
| Imperial Household Agency Shoryōbu Collection of Chinese Classics Database | https://db2.sido.keio.ac.jp/kanseki (Last accessed: 28 October 2025) | Fuzhou Canon |
| Archives of Buddhist Culture Database | https://kabc.dongguk.edu/ (Last accessed: 28 October 2025) | First and Second Carved Editions of the Goryeo Canon |
| Yunnan Ancient Books Digital Library | http://msq.ynlib.cn/ (Last accessed: 28 October 2025) | Yuan Official Canon |
| “University of Tokyo General Library Wanli Edition Buddhist Canon (Jiaxing Canon) Electronic Edition” Database | https://dzkimgs.l.u-tokyo.ac.jp/kkz/ (Last accessed: 28 October 2025) | Jiaxing Canon |
| Ming Dynasty Buddhist Canon Database | http://58.59.15.37:9500/#/home (Last accessed: 28 October 2025) | Yongle Northern Canon and Yongle Southern Canon |
| “Three Editions of Buddhist Sacred Canons Stored at Zōjōji” Database | https://jodoshuzensho.jp/zojoji_sandaizo/ (Last accessed: 28 October 2025) | Goryeo Canon, Sixi Canon, and Puning Canon |
| 1 | For instance, Kanichi Ogawa 小川貫一 argued that the Buddhist canon is not merely a collection or series of Buddhist texts, but carries implications of “specific organization and intentional content” (Daizōkai 1964, pp. 5–6). Li Fuhua and He Mei described the Chinese Buddhist canon as “a comprehensive collection of Chinese Buddhist scriptures—including works composed by Chinese monks—that have been identified, systematized, and organized according to a certain sequence by Buddhist bibliographers since the Sui and Tang dynasties,” one that exhibits “rich content, rigorous sequence, and refined structure” (Li and He 2003, p. 14). K. R. Norman noted that a canon represents “it is a collection of scriptures (oral or written), which gives a certain authority to those texts included in it” (Norman 1997, p. 131). |
| 2 | Prior to CBETA’s official formation, a notable attempt was made in 1994 by Yang Guoping, convened under the same name (“Chinese Buddhist Electronic Text Association”), involving institutions like Fo Guang Shan and Academia Sinica. Additionally, lay practitioner Shen Jiazhen established the “Buddhist Computer Information Merit Association” in the US. However, these initiatives stalled due to difficulties in achieving consensus regarding copyright and editorial responsibility. See Fang (1996) and CBETA (2007). |
| 3 | Examining the inclusion catalogues 入藏錄 from the Sui and Tang dynasties reveals that manuscript canons of that period primarily contained the Tripitaka (Sūtra, Vinaya, Śāstra) translated from Sanskrit and Central Asian languages. (Zhang 2024) By the 18th year of the Kaiyuan era (730 CE), Zhi Sheng’s 智升 Kaiyuan Shijiao Lu 開元釋教錄 shows that works written by Chinese monks began to be gradually included in the canon, though their number remained very limited (approx. 40 works, 368 fascicles), mainly comprising Buddhist catalogues and historical/biographical texts. Notably, commentaries by eminent Tang dynasty masters were excluded, a situation incongruent with the flourishing state of Tang Buddhism. By the Song-Yuan printed canons and the Ming Jingshan Canon, works by local monks, Chan discourse records, and transmission of the lamp records 燈錄 were increasingly incorporated. In modern times, the Japanese-compiled Taishō and Zokuzōkyō significantly increased the inclusion of ancient Chinese exegetical literature. CBETA continues this trend of expanding inclusion criteria, incorporating stone inscription rubbings, monastic gazetteers, extra-canonical texts, and works by modern masters, marking a further transformation of the Buddhist canon from sacred scripture towards a comprehensive encyclopedia. |
| 4 | For a list of canons referenced by CBETA, see CBETA Canon Codes: https://cbetaonline.cn/doc/zh/02-02_id.php (accessed on 28 October 2025). |
| 5 | For CBETA citation examples, see CBETA Copy & Cite: https://cbetaonline.cn/doc/zh/04-aca_quote.php (accessed on 28 October 2025). |
| 6 | For details on the Sixi Canon held at the NLC, please see (Li 2007). |
| 7 | CBETA 2025.R3, T49, no. 2034, p. 103a9-b10 (CBETA 2025). |
| 8 | CBETA 2025.R3, K38, no. 1402, p. 513c15-21 (CBETA 2025). |
| 9 | Zhisheng described this 30-fascicle edition as follows: “Upon examining the scripture text, it differs from [Seng]You’s Record: First, the Tuoluoni Zizai Wang Pusa Pin (Dhāraṇī-Sovereign King Bodhisattva Section) (some scripture texts split this into the Yingluo Pin; this is incorrect. This is one continuous passage and should not be divided into two. The later Da’ai Jing corresponds to this section.); Second, the Baonü Pin (Jewel Maid Section); Third, the Buxuan Pusa Pin (Unwavering Bodhisattva Section); Fourth, the Haihui Pusa Pin (Oceanic Wisdom Bodhisattva Section); Fifth, the Xukongcang Pusa Pin (Space-Storage Bodhisattva Section); Sixth, the Wuyan Pusa Pin (Wordless Bodhisattva Section); Seventh, the Bukeshuo Pusa Pin (Inexpressible Bodhisattva Section); Eighth, the Baochuang Fen (Jewel-Banner Division); Ninth, the Xukongmu Fen (Space-Eye Division); Tenth, the Baoji Pusa Pin (Jeweled Topknot Bodhisattva Section); Eleventh, the Rimi Fen (Sun-Secret Division); (Upon checking various catalogues, the fascicle count for this Mahāsaṃnipāta Sūtra is not fixed: some say 29, 30, 31, 32, or 40 fascicles. Nowadays, the Mahāsaṃnipāta is mostly in 30 fascicles. The text of the Rimi Fen is incomplete, lacking about one fascicle. The 31-fascicle version should be complete, but it has not yet been obtained.); … From the Tuoluoni Zizai Wang Pin to the Rimi Fen, there are eleven divisions in total. The Rizang Jing and the Rimi Fen are different translations of the same original and also constitute the eleventh division. (Both the Rimi Fen and the Rizang Jing begin by stating: ‘Having explained the Xukongmu and the ānāpānasmṛti nectar-gate, this sūtra is then taught.’) Furthermore, since the Rimi Fen is taught after the Xukongmu Fen, by principle it should not be separated by the Baoji Pin. In the present scripture text, this section appears separated; the reason is unclear. Also, although the Rimi Fen and the Rizang Jing share the same original, the text of the Rimi Fen is extremely abbreviated, and its latter part is further missing, amounting to roughly less than a fascicle.)”; 今檢經本與祐《記》不同:第一《陀羅尼自在王菩薩品》,(亦有經本分為《瓔珞品》者,不然。此是一段,不合分二,後《大哀經》即是此品。)第二《寶女品》,第三《不眩菩薩品》,第四《海慧菩薩品》,第五《虛空藏菩薩品》,第六《無言菩薩品》,第七《不可說菩薩品》,第八《寶幢分》,第九《虛空目分》,第十《寶髻菩薩品》,第十一《日密分》(尋檢群錄,此《大集經》卷無定準,或云二十九,或云三十,或三十一,或三十二,或四十卷。今時《大集》,多分三十。其《日密分》文不具足,合少一卷。其三十一卷者,文應備具,今尋求未獲。)……今從《陀羅尼自在王品》至《日密分》,總十一分,其《日藏經》與《日密分》同本異譯,亦是第十一分,(《日密》《日藏》初俱云:”說《虛空目》安那般那甘露門已,次說此經。”)又《日密分》既於《虛空目》後說,准義不合,隔《寶髻品》。今經本中,有此品隔,未詳所以。又《日密》《日藏》雖是同本,其《日密分》又文極撮略,後文復闕,可少卷餘。(CBETA 2025.R3, T55, no. 2154, p. 588a14-b1 (CBETA 2025)). |
References
- Bhikkhu, Huimin 釋惠敏, Zhengmin Du 杜正民, Bangxin Zhou 周邦信, and Zhipan Wang 王志攀. 2005. Techniques for Collating Multiple Text Versions in the Digitization of Classical Texts: The CBETA Taishō Buddhist Canon as an Example. Chung-Hwa Buddhist Journal 中華佛學學報 18: 299–325. [Google Scholar]
- Bhikkhu, Huimin 釋惠敏, Zhenzhou Hung 洪振洲, and Jiayu Xu 許佳瑜. 2018. Hanji yuyi lianjie de tantao yu yingyong yanjiu: Yi fodian shuwei ziyuan wei li: CBETA shuwei yanjiu pingtai 漢籍語意鏈結的探討與應用研究—以佛典數位資源為例:CBETA數位研究平台 [Exploration and Applied Research on Semantic Linking of Chinese Classics: Using Buddhist Digital Resources as an Example: The CBETA Digital Research Platform]. Paper presented at Digital Archives and Digital Humanities International Conference (9th), Taiwan, China, December 18–21; Taipei: Taiwan Digital Humanities Association, p. 163. [Google Scholar]
- CBETA. 2007. Hui xiang Shen Jiazhen jushi 回向沈家楨居士 [Dedication to Layman C. T. Shen]. Zhonghua dianzi fodian xiehui xinwen dianzi bao 中華電子佛典協會新聞電子報. 103. Available online: https://archive.cbeta.org/data/news/200711/index.htm (accessed on 28 October 2025).
- CBETA. 2025. 2025.R3. Taishō Shinshū Daizōkyō 大正新脩大藏經, vols. T49, T55. Goryeo Canon 高麗大藏經, vols. K38. Available online: https://www.cbeta.org (accessed on 28 October 2025).
- Chen, Shu-Fen 陳淑芬. 2013. Jingang jing biaodian yanjiu: Yi Dazhengzang yu CBETA Jiumoluoshi yi ben wei li 《金剛經》標點研究:以《大正藏》與CBETA鳩摩羅什譯本為例 [A Study on the Punctuation of the Diamond Sūtra: Based on the Kumārajīva Translation in the Taishō Canon and CBETA]. Yuan Kuang Journal of Buddhist Studies 圓光佛學學報 22: 33–88. [Google Scholar]
- Chi, Limei 池麗梅. 2023. Gen Kanban Daizōkyō Kenkyū (1) Genzon Tekisuto no Gaikan 元官版大蔵経研究(一)現存テキストの概観 [A Study of the Official Buddhist Canon of the Yuan Dynasty (I): Existing Texts in China and Japan]. Kokusai Bukkyōgaku Daigakuin Daigaku Kenkyū Kiyō 國際仏教學大學院大學研究紀要 26: 63–81. [Google Scholar]
- Chikusa, Masaaki 竺沙雅章. 2000. Sō Gen ban Daizōkyō no keifu 宋元版大藏經の系譜 [A Genealogy of the Song–Yuan Editions of the Buddhist Canon]. In Sō Gen Bukkyō bunkashi kenkyū宋元佛教文化史研究 [Studies in the Cultural History of Song Yuan Buddhism]. Tokyo: Kyūko Shoin, pp. 271–362. [Google Scholar]
- Daizōkai 大蔵會, ed. 1964. Daizōkyō: Seiritsu to hensen 大蔵経: 成立と変遷 [The Buddhist Canon: Formation and Transformation]. Nagoya: Hyakkaen 百華苑. [Google Scholar]
- Deng, Sanhong 鄧三鴻, Haotian Hu 胡昊天, Hao Wang 王昊, and Dongbo Wang 王東波. 2021. Guwen zidong chuli yanjiu xianzhuang yu xinshidai fazhan qushi zhanwang 古文自動處理研究現狀與新時代發展趨勢展望 [The Current State and New Era Development Trends of Classical Chinese Automated Processing Research]. Keji Qingbao Yanjiu 科技情報研究 3: 1–20. [Google Scholar]
- Du, Zhengmin 杜正民. 1998. Dangdai guoji fodian dianzihua xiankuang: Dianzi fodian tuijin xieyihui (EBTI) jianjie 當代國際佛典電子化現況:電子佛典推進協議會(EBTI)簡介 [The Current State of International Buddhist Text Electronization: An Introduction to the Electronic Buddhist Text Initiative (EBTI)]. Fojiao Tushuguan Xun 佛教圖書館訊 15: 28–39. [Google Scholar]
- Du, Zhengmin 杜正民. 1999a. Hanwen dianzi Dazangjing de zhizuo yuanqi yu zuoye liucheng—yi “Zhonghua Dianzi Fodian Xiehui” wei li 漢文電子大藏經的製作緣起與作業流程——以”中華電子佛典協會”為例 [The Genesis and Workflow of the Chinese Electronic Buddhist Canon: Taking the Chinese Buddhist Electronic Text Association as an Example]. Foxue Zhongxin Xuebao 佛學中心學報 4: 347–69. [Google Scholar]
- Du, Zhengmin 杜正民. 1999b. Yi CBETA wei li: Tan daliang wenxian zhi jianli: Hanwen Zangjing dianzihua zuoye jianshuo 以CBETA為例—談大量文獻之建立—漢文藏經電子化作業簡說 [Using CBETA as an Example: On the Establishment of Large-Scale Texts—A Brief Introduction to the Electronization of the Chinese Buddhist Canon]. Computing Center Newsletter, Academia Sinica 中央研究院計算中心通訊 15: 117–22. [Google Scholar]
- Du, Zhengmin 杜正民. 2012. Foxue shuwei ziyuan de jianzhi yu kaizhan 佛學數位資源的建置與開展 [The Construction and Development of Buddhist Digital Resources]. Dharma Drum Journal of Buddhist Studies 法鼓佛學學報 10: 147–210. [Google Scholar]
- Du, Zhengmin 杜正民, Jiaming Li 李家名, Haiwen Zhou 周海文, Baolian Zheng 鄭寶蓮, and Xinyan Lin 林心雁. 2008. Zangjing yu Fojiao gongjushu de shuweihua bianzuan: Yi CBETA dianzi fodian yu shuwei jinglu jihua wei li 藏經與佛教工具書的數位化編纂:以CBETA電子佛典與數位經錄計畫為例 [The Digital Compilation of Buddhist Canons and Reference Works: Using the CBETA Electronic Buddhist Canon and Digital Catalog Project as Examples]. Fojiao Tushuguan Guankan 佛教圖書館館刊 47: 22–35. [Google Scholar]
- Fang, Guangchang 方廣錩. 1991. Fojiao Dazangjing shi: Ba–shi shiji 佛教大藏經史:八——十世紀 [History of the Buddhist Canon: 8th–10th Centuries]. Beijing: Zhongguo Shehui Kexue Chubanshe 中國社會科學出版社. [Google Scholar]
- Fang, Guangchang 方廣錩. 1996. Haiwai Dazangjing bianji ji dianziban Dazangjing de qingkuang 海外大藏經編輯及電子版大藏經的情況 [Overseas Buddhist Canon Compilation and the Status of Digitized Canons]. Fayin 法音 5: 3–11. [Google Scholar]
- Fang, Guangchang 方廣錩. 1997. Dazheng xinxiu Dazangjing pingshu 《大正新修大藏經》評述 [A Review of the Taishō Shinshū Daizōkyō]. In Wensi: Jinling kejingchu 130 zhounian jinian zhuanji 聞思:金陵刻經處130周年紀念專輯 [Wensi: Commemorative Collection for the 130th Anniversary of Jinling Scriptural Press]. Edited by Nanjing Jinling Kejingchu 南京金陵刻經處. Beijing: Hauwen chubanshe 华文出版社, pp. 230–53. [Google Scholar]
- Fang, Guangchang 方廣錩. 2006. Zhongguo xieben dazangjing yanjiu 中國寫本大藏經研究 [Studies on the Dazangjing in Chinese Manuscripts]. Shanghai: Shanghai Guji Chubanshe 上海古籍出版社, pp. 9–10. [Google Scholar]
- Fang, Guangchang 方廣錩. 2015. Guji shuzihua shiye zhong de Dazhengzang yu fodian zhengli 古籍數位化視野中的《大正藏》與佛典整理 [The Taishō Canon and Buddhist Text Collation from the Perspective of Ancient Text Digitization]. Shanghai Shifan Daxue Xuebao (Zhexue Shehui Kexue Ban) 上海師範大學學報 (哲學社會科學版) 4: 17–25. [Google Scholar]
- Hong, Tao 洪濤, Ruixue Cheng 程瑞雪, Sixi Liu 劉思汐, and Kaiqi Fang 方凱齊. 2021. Yizhong jiyu Transformer moxing de guji zidong biaodian jishu 一種基於 Transformer模型的古籍自動標點技術 [An Automatic Punctuation Technique for Classical Chinese Based on Transformer Model]. Shuzi Renwen 數字人文 2: 111–22. [Google Scholar]
- Hong, Zhenzhou 洪振洲. 2016. You ziliaoku dao shuwei yanjiu pingtai: Tan fodian wenxian shuwei yanjiu gongju zhi fazhan yu yanbian 由資料庫到數位研究平台-談佛典文獻數位研究工具之發展與演變 [From Database to Digital Research Platform: On the Development and Evolution of Digital Research Tools for Buddhist Literature]. Hanxue Yanjiu Tongxun 漢學研究通訊 35: 1–14. [Google Scholar]
- Hong, Zhenzhou 洪振洲. 2018. Shuweishidai Hanyi fodian zhi yanjiu liqi: CBETA shuwei yanjiu pingtai 數位時代漢譯佛典之研究利器-CBETA數位研究平臺 [A Powerful Tool for Studying Chinese Buddhist Scriptures in the Digital Age: The CBETA Digital Research Platform]. Journal of Digital Archives and Digital Humanities 數位典藏與數位人文 1: 149–74. [Google Scholar]
- Hou, Kunhong 侯坤宏. 2007. Xingtan nalu: Hengqing fashi fangtan lu 杏壇衲履:恒清法師訪談錄 [Cassock in the Scholarly Grove: An Interview with Ven. Hengching]. Taipei: Guoshiguan 國史館, pp. 203–8. [Google Scholar]
- Hou, Kunhong 侯坤宏, and Zunhong Zhuo 卓遵宏. 2015. Liushi gan’en ji: Huimin fashi fangtan lu (zengding ban) 六十感恩紀:惠敏法師訪談錄(增訂版) [Sixty Years of Gratitude: An Interview with Ven. Huimin (Expanded Edition)]. Taipei: Fagu Wenhua 法鼓文化, pp. 301–16. [Google Scholar]
- Huang, Qiangang 黃乾綱, Jiaming Li 李家名, and Fayuan Shi 釋法源. 2018. Yi shuwei fojing wei jichu, yanfa guwenxian yanjiu suoxu zixun jishu de jingyan yu chengguo 以數位佛經為基礎,研發古文獻研究所需資訊技術的經驗與成果 [Experience and Results of Developing Information Technologies for Classical Text Research Based on Digital Buddhist Sutras]. Renwen yu Shehui Kexue Jianxun 人文與社會科學簡訊 19: 124–31. [Google Scholar]
- Lancaster, Lewis R. 2003. Buddhism and the Digital Age. Hsi Lai Journal of Humanistic Buddhism 4: 79–86. [Google Scholar]
- Lancaster, Lewis R. 2008. Digital Buddhist Texts and Buddhist Universities. The Journal of the International Association of Buddhist Universities 1: 133–54. [Google Scholar]
- Li, Fuhua 李富華, and Mei He 何梅. 2003. Hanwen Fojiao Dazangjing yanjiu 漢文佛教大藏經研究 [Research on the Chinese Buddhist Canon]. Beijing: Zongjiao Wenhua Chubanshe 宗教文化出版社. [Google Scholar]
- Li, Jining 李際寧. 2007. Guotu xin shoucang Sixi ban Da bore boluomi jing de jingguo ji qi wenwu banben jiazhi 國圖新入藏思溪版《大般若波羅蜜經》的經過及其文物版本價值 [The Process and Value of the National Library’s Newly Acquired Sixi Canon of the Mahāprajñāpāramitā Sūtra]. In Fojiao Dazangjing yanjiu lungao 佛教大藏經研究論稿 [Studies on the Buddhist Canons]. Beijing: Zongjiao Wenhua Chubanshe 宗教文化出版社, pp. 171–84. [Google Scholar]
- Li, Jining 李際寧. 2010. Guanyu jinnian faxian de Yuan guanzang 關於近年發現的《元官藏》 [On the Recently Discovered Yuan Official Canon]. Zangwai Fojiao wenxian 藏外佛教文獻 13: 352–88. [Google Scholar]
- Muller, A. Charles, Masahiro Shimoda, and Kiyonori Nagasaki. 2017. The SAT Taishō Text Database: A Brief History. In Reinventing the Tripitaka: Transformation of the Buddhist Canon in Modern East Asia. Edited by Jiang Wu and Greg Wilkinson. Lanham: Lexington Books, pp. 175–85. [Google Scholar]
- Nagasaki, Kiyonori 永崎研宣. 2015a. Bukkyō bunken no tame no kōzō-teki na dejitaru tekisuto no kijutsu to katsuyō 仏教文献のための構造的なデジタルテクストの記述と活用 [Structured Digital Text Description and Utilization for Buddhist Literature]. Indogaku Bukkyōgaku Kenkyū 印度學佛教學研究 63: 1088–94. [Google Scholar]
- Nagasaki, Kiyonori 永崎研宣. 2015b. SAT Daizōkyō tekisuto dētabēsu: Jinbun-gaku ni okeru ōpundēta no katsuyō ni mukete SAT大蔵経テキストデータベース: 人文学におけるオープンデータの活用に向けて [The SAT Daizōkyō Text Database: Towards the Use of Open Data in the Humanities]. Jōhō Kanri 情報管理 58: 422–37. [Google Scholar]
- Nagasaki, Kiyonori 永崎研宣. 2024. Bukkyō kenkyū to tekisuto kōzōka 仏典研究とテキスト構造化 [Buddhist Text Research and Text Structuring]. Indogaku Bukkyōgaku Kenkyū 印度學佛教學研究 72: 725–30. [Google Scholar]
- Nagasaki, Kiyonori 永崎研宣, Ikki Ohmukai 大向一輝, and Masahiro Shimoda 下田正弘. 2023. OCR no kōseidoka o fumeta dejitaru gakujutsu henshūban no shintenkai OCRの高精度化を踏まえたデジタル学術編集版の新展開 [New Developments in Digital Scholarly Editions Based on Advanced OCR Accuracy]. Jinmon Kon 2023 Ronbunshū じんもんこん2023論文集 2023: 177–82. [Google Scholar]
- Nagasaki, Kiyonori 永崎研宣, Jun Homma 本間淳, and Masahiro Shimoda 下田正弘. 2024. TEI koten-seki byūwā no kaihatsu: Higashi Ajia koten-seki ōpun saiensisu no jitsugen ni mukete TEI 古典籍ビューワの開発 ―東アジア古典籍オープンサイエンスの実現に向けて—[Development of a TEI Classical Text Viewer: Towards Realizing Open Science for East Asian Classical Texts]. Jinmon Kon 2024 Ronbunshū じんもんこん2024論文集 2024: 59–66. [Google Scholar]
- Norman, Kenneth Roy. 1997. A Philological Approach to Buddhism: The Bukkyō Dendō Kyōkai Lectures 1994. London: School of Oriental and African Studies. [Google Scholar]
- Shanxi Sheng Wenwuju 山西省文物局, and Zhongguo Lishi Bowuguan 中國歷史博物館. 1991. Yingxian muta Liao dai mizang 應縣木塔遼代秘蔵 [Liao Dynasty Hidden Treasures from the Wooden Pagoda at Ying County]. Beijing: Wenwu chubanshe 文物出版社. [Google Scholar]
- Su, Qi 蘇祺, Renfen Hu 胡韌奮, Yuchen Zhu 諸雨辰, Chengxi Yan 嚴承希, and Jun Wang 王軍. 2021. Guji shuzihua guanjian jishu pingshu 古籍數字化關鍵技術評述 [A Review of Key Technologies in Ancient Book Digitization]. Shuzi Renwen Yanjiu 數字人文研究 1: 83–88. [Google Scholar]
- Tong, Ran 通然. 2017. Hanyu Fojiao wenxian dianzihua de xianzhuang yu zhanwang 漢語佛教文獻電子化的現狀與展望 [The Current Situation and Prospects of the Electronization of Chinese Buddhist Documents]. Foxue yanjiu 佛學研究 1: 267–75. [Google Scholar]
- Tong, Wei 童瑋, Guangchang Fang 方廣錩, and Zhiliang Jin 金志良. 1984. Yuandai guanke Dazangjing de faxian 元代官刻大藏經的發現 [The Discovery of the Officially Published Buddhist Canon in the Yuan Dynasty]. Wenwu 文物 12: 82–86. [Google Scholar]
- Tu, Aming. 2015. Appendix 2. The Creation of the CBETA Chinese Electronic Tripitaka Collection in Taiwan. In Spreading Buddha’s Word in East Asia. Edited by Jiang Wu and Lucille Chia. New York: Columbia University Press, pp. 321–36. [Google Scholar]
- Wang, Jun 王軍, Chenglin Liu 劉成林, Lianwen Jin 金連文, Yongge Liu 劉永革, Chixuan Zhang 張弛宜, Yinfei Wang 王胤斐, Hui Zhu 朱慧, Jingwen Han 韓靜雯, and Xuan Xu 徐璇. 2022. Xilie bitan zhi si: Zhineng shidai guji OCR jishu 系列筆談之四:智能時代古籍OCR技術 [The Fourth of a Series of Written Discussions: Ancient Book OCR Technology in the Age of AI]. Shuzi renwen 數字人文, 95–125. [Google Scholar]
- Wang, Song 王頌, ed. 2025. Songban Fuzhou zang: Dongchan Dengjue Chanyuan yu Kaiyuansi ban hebi 宋版福州藏:東禪等覺禪院與開元寺版合璧 [The Song Dynasty Fuzhou Canon: A Combination of the Dongchan Dengjue Chan Monastery and Kaiyuan Monastery Editions]. Beijing: Zhongguo shudian chubanshe 中國書店出版社. [Google Scholar]
- Wang, Zhipan 王志攀. 2000. CBETA dianzi fodian quezi shiwu CBETA電子佛典缺字實務 [Practical Handling of Missing Characters in CBETA Electronic Buddhist Texts]. Buddhist Library Newsletter 佛教圖書館館訊 24: 31–40. [Google Scholar]
- Wittern, Christian. 2017. The Digital Tripitaka and the Modern World. In Reinventing the Tripitaka: Transformation of the Buddhist Canon in Modern East Asia. Edited by Jiang Wu and Greg Wilkinson. Lanham: Lexington Books, pp. 155–73. [Google Scholar]
- Zhang, Xu 張旭. 2024. Zhongjing mulu yu Sui Tang shiqi de huangguan guancang 《眾經目錄》與隋唐時期的皇家官藏 [Zhongjing mulu and the Royal Official Collection of Buddhist Scriptures in the Sui and Tang Dynasties]. Shijie Zongjiao Wenhua 世界宗教文化 6: 172–79. [Google Scholar]
- Zhongguo Fojiao Xiehui 中國佛教協會, and Zhongguo Fojiao Tushu Wenwuguan 中國佛教圖書文物館, eds. 2000. Fangshan shijing 房山石經 [Fangshan Stone Scriptures]. Beijing: Huaxia chubanshe 華夏出版社. [Google Scholar]
- Zhonghua Dazangjing Bianjiju 中華大藏經編輯局. 1983. Zhonghua Dazangjing 中華大藏經 [Chinese Buddhist Canon]. Beijing: Zhonghua shuju 中華書局. [Google Scholar]
- Zhou, Bokan 周伯戡. 2002. Ping CBETA dianzi Dazhengzang 評 CBETA 電子大正藏 [Evaluating the CBETA Electronic Taishō Canon]. Foxue Zhongxin Xuebao 佛學研究中心學報 7: 379–90. [Google Scholar]
| Year Range | Buddhist Texts Incorporated |
|---|---|
| 1998–2002 | Taishō |
| 2003–2007 | Selections from the Manji Zokuzōkyō |
| 2008–2011 | Supplement to Historical Canons; Classified Buddhist Materials from Official Histories; Extra-Canonical Buddhist Texts (Collections 1–9); One Hundred Northern Dynasties Buddhist Stone Inscription Rubbings |
| 2012–2014 | Rare Buddhist Texts from the “National Central Library” (Taiwan, China); Chinese Translation of the Pāli Canon |
| 2015–2016 | Supplement to the Buddhist Canon; Chinese Buddhist Monastic Gazetteers (15 works) |
| 2017–2018 | Collection of Ven. Yinshun’s Buddhist Works; Chinese Buddhist Monastic Gazetteers (21 works) |
| 2019–2020 | Extra-Canonical Buddhist Texts (Collections 10–12); Collection of Lü Cheng’s Buddhist Works |
| 2021–2022 | Complete Works of Master Taixu |
| Feature | CBETA | SAT |
|---|---|---|
| Editorial Philosophy | Critical Edition Model: Aiming for a new, digitally evolved standard text. Heavily emended using >20 versions. | Diplomatic Edition Model: Strict adherence to the printed Taishō layout and text as a historical artifact. |
| Data Structure | TEI/XML: Semantic tagging (paragraphs, speakers). Line referencing is abstract (e.g., p0001a01). | Page-Line Grid: Text linked strictly to image coordinates. Line referencing mirrors the physical page. |
| Variant Handling | Dynamic/Computable: Variants are embedded in XML tags (<rdg>), allowing users to toggle versions textually. | Visual/Reference: Variants appear as footnotes or linked images via SATed; focus on visual verification. |
| Platform Scope | Expansive: Includes Taishō, Zokuzokkyō, rare manuscripts, and modern commentaries. | Focused & Interconnected: Primarily Taishō, but links extensively to external databases (DDB, INBUDS, CiNii). |
| AI Integration | NLP-Focused: Automatic punctuation, semantic search, and quotation detection (Huang et al. 2018). | Vision-Focused: AI-OCR and image–text alignment (SATed) for visual philology (Nagasaki 2024). |
| Section | Qidan Canon | Fangshan | Goryeo Canon | Fuzhou Canon | Sixi Canon |
|---|---|---|---|---|---|
| Fascicle | 30 | 30 | 60 | 30 | 30 |
| 1. Yingluo Pin | 1 | 1 | 1 | 1 | 1 |
| 2. Tuoluoni Zizai Wang Pin | 2 | ||||
| 3. Baonü Pin | 2 | 2 | 3 | 2 | 2 |
| 4. Buxuan Pusa Pin | 3 | 3 | 4 | 3 | 3 |
| 5. Haihui Pusa Pin | 4 | 4 | 5 | 4 | 4 |
| 6. Wuyan Pusa Pin | 6 | 6 | 6 | 6 | 6 |
| 7. Bukeshuo Pusa Pin | 7 | 7 | 7 | 7 | 7 |
| 8. Xukongzang Pusa Pin | 5 | 5 | 8 | 5 | 5 |
| 9. Baochuang Fen | 8 | 8 | 9 | 8 | 8 |
| 10. Xukongmu Fen | 9 | 9 | 10 | 9 | 9 |
| 11. Baoji Pusa Pin | 10 | 10 | 11 | 10 | 10 |
| 12. Wujinyi Pin | 12 | ||||
| 13. Rimi Fen | 11 | 11 | 13 | 11 | 11 |
| 14. Rizang Jing | 14 | ||||
| 15. Yuezang Jing | 15 | ||||
| 16. Xumicang Jing | 16 | ||||
| 17. Shifang Pusa Pin | 17 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhang, X. Chinese Buddhist Canon Digitization: A Review and Prospects. Religions 2026, 17, 52. https://doi.org/10.3390/rel17010052
Zhang X. Chinese Buddhist Canon Digitization: A Review and Prospects. Religions. 2026; 17(1):52. https://doi.org/10.3390/rel17010052
Chicago/Turabian StyleZhang, Xu. 2026. "Chinese Buddhist Canon Digitization: A Review and Prospects" Religions 17, no. 1: 52. https://doi.org/10.3390/rel17010052
APA StyleZhang, X. (2026). Chinese Buddhist Canon Digitization: A Review and Prospects. Religions, 17(1), 52. https://doi.org/10.3390/rel17010052
