Article published in:
Orthographic Databases and Lexicons
Edited by Lynne Cahill and Terry Joyce
[Written Language & Literacy 20:1] 2017
pp. 27–51
Constructing an ontology and database of Japanese lexical properties
Handling the orthographic complexity of the Japanese writing system
Terry Joyce | Tama University, Japan
Bor Hodošček | Osaka University, Japan
Hisashi Masuda | Hiroshima Shudo University, Japan
As a significant milestone in ongoing efforts to construct a comprehensive database, in the form of a lexical resource (LR) of Japanese Lexical Properties (JLP-LR), this paper outlines the initial construction of an Ontology of Japanese Lexical Properties (JLP-O) (Joyce & Hodošček 2014). In particular, it describes the key aspects of the ontology that were incorporated specifically to handle the orthographic complexity of the Japanese writing system (Joyce 2013, 2016; Joyce, Hodošček & Nishina 2012). Although motivated primarily by issues of orthographic representation for the Japanese lexicon, these features may have wider implications for the effective construction of integrated orthographic databases and lexicons.
Keywords: ontology, database, Japanese lexical properties, Japanese writing system, orthography, lexical entry (LE), decomposition
Article outline
- 1. Introduction
- 2. Ontology of Japanese lexical properties (JLP-O)
- 3. Handling aspects of the Japanese writing system
- 3.1 Character LEs and character module
- 3.2 canonicalForm and orthographicForm
- 3.3 Forms of decomposition
- 3.3.1 Orthographic decomposition
- 3.3.2 Phonological decomposition
- 3.3.3 Morphological decomposition
- 4. Conclusion
- Notes
- References
Published online: 19 October 2017
https://doi.org/10.1075/wll.20.1.03joy
References
Adelman, James S.
Backhouse, A. E.
Bunkachō [Agency for Cultural Affairs] (2010) Jōyōkanjihyō [Jōyō kanji list]. Available at http://kokugo.bunka.go.jp/kokugo_nihongo/joho/kijun/naikaku/pdf/joyokanjihyo_20101130.pdf (13 November 2016).
Den, Yasuharu, Toshinobu Ogiso, Hideki Ogura, Atsushi Yamada, Nobuaki Minematsu, Kiyotaka Uchimoto & Hanae Koiso (2007) Kōpasu nihongogaku no tame no gengo shigen: Keitaisokaisekiyō denshijisho no kaihatsu to ōyō [The development of an electronic dictionary for morphological analysis and its application to Japanese corpus linguistics]. Nihongo Kagaku [Japanese Linguistics], 221: 101–122.
Guarino, Nicola
Guarino, Nicola, Daniel Oberle & Steffen Staab
Huang, Chu-Ren, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci, Alessandro Oltramari & Laurent Prévot
Joyce, Terry
Joyce, Terry & Bor Hodošček (2014) Constructing an ontology of Japanese lexical properties: Specifying its property structures and lexical entries. In Michael Zock, Reinhard Rapp & Chu-Ren Huang (Eds.), Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex4) (pp. 174–185). 23 August, 2014. Dublin, Ireland.
Joyce, Terry, Bor Hodošček & Hisashi Masuda
Joyce, Terry, Bor Hodošček & Kikuko Nishina
Joyce, Terry, Hisashi Masuda & Bor Hodošček
Joyce, Terry, Hisashi Masuda & Taeko Ogawa
Masuda, Hisashi (2014) Kanjinijihyōkigokan no imitekikankeisei ni kansuru dētabēsu no kōchiku [Constructing a database of the semantic relationships within two-kanji orthographic words]. Kagaku Kenkyūhi Josei Jigyō Kenkyū Seika Hōkokusho [Research Report for Grant-in-Aid for Scientific Research from the Japan Society for the Promotion of Science].
Masuda, Hisashi & Terry Joyce
Masuda, Hisashi, Terry Joyce, Taeko Ogawa, Masahiro Kawakami & Chikako Fujita (2014) A database of semantic transparency ratings for two-kanji Japanese compound words. Poster presentation given at ‘Orthographic Databases and Lexicons’: 9th International Workshop on Writing Systems and Literacy, 4–5 September, 2014. University of Sussex, Brighton, UK.
Maekawa, Kikuo, Makoto Yamazaki, Toshinobu Ogiso, Takehiko Maruyama, Hideki Ogura, Wakako Kashino, Hanae Koiso, Masaya Yamaguchi & Yasuharu Den
Morohashi, Tetsuji
Nation, I. S. P.
Ogura, Hideki, Toshinobu Ogiso, Hanae Koiso, Yutaka Hara & Sayaka Miyauchi (2010, March) Keitaiso kaiseki jisho UniDic ni okeru goiso midashi no rikkō hōshin [Criteria for the lemmatization of UniDic], Tokuteiryōiki kenkyū “nihongo kōpasu” heisei 21 nendo kōkai wākushoppu (Kenkyū seika hōkokukai) yokōshū [Priority-Area Research “Japanese Corpus”: Proceedings of the 2010 public workshop]. Tokyo: General Headquarters, Priority-Area Research “Japanese Corpus”.
Oltramari, Alessandro, Piek Vossen, Lu Qin & Eduard Hovy
Prévot, Laurent, Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci & Alessandro Oltramari (2010) Ontology and the lexicon: a multidisciplinary perspective. In Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci, Alessandro Oltramari & Laurent Prévot (Eds.), Ontology and the lexicon: a natural language processing perspective (Studies in Natural Language Processing) (pp. 3–24). Cambridge: Cambridge University Press.
Spohr, Dennis
Cited by
Cited by 3 other publications
Joyce, Terry & Hisashi Masuda
Joyce, Terry & Hisashi Masuda
Santoso, Joan, Esther Irawati Setiawan, Christian Nathaniel Purwanto, Eko Mulyanto Yuniarno, Mochamad Hariadi & Mauridhi Hery Purnomo
This list is based on CrossRef data as of 11 November 2021. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.