Japanese Lexical Database
Covers approximately 290,000 entries
Optimized for NLP applications
Various grammatical and phonological attributes
Overview
CJKIโsย Japanese Lexical Databaseย (JLD) is a comprehensive monolingual lexical database that includes a rich set of grammatical attributes.ย JLDย contains about 290,000ย entries covering general vocabulary, both free forms and bound forms, and includes a significant number of affixes, particles, auxiliaries and conjugation patterns to account for all the inflectional, derivational and lexical morphology in Japanese. This enables NLP software to easily recognize inflected, conjugated and derived forms even though they are not explicitly listed in the lexicon.
Developed by CJKIโs team of experienced Japanese editors and linguists over more than a decade,ย JLDย is a significant contribution to the field of Japanese natural language processingย and information processing.
Main Features
Phonological information
Such as hiragana and romanized readings
Grammatical information
Such as part-of-speech codes
Morphological information
Such as derivational affixes and conjugation patterns
Japanese Lexical Database
Japanese | Kana | POS | Sub | Conj. | Type |
---|---|---|---|---|---|
่ฒทใใใใ | ใใใใใ | V5 | R | ||
่ฒทใ็ ฝใ | ใใใใใ | V5 | R | t | |
่ฒท็ ฝใ | ใใใใใ | V5 | R | ||
ๆนๆช | ใใใใ | VN | t | ||
่ฒทใใใ | ใใใใ | NC | |||
่ฒทใไธใ | ใใใใ | NC | |||
่ฒทไธ | ใใใใ | NC | |||
่ฒทไธใ | ใใใใ | NC | |||
่ฒทใใใใ | ใใใใใ | V1 | |||
่ฒทใไธใใ | ใใใใใ | V1 | S | t | |
่ฒทใไธใใ | ใใใใใ | V2 | |||
่ฒทไธใใ | ใใใใใ | V1 | |||
่ฒทใใใใ | ใใใใใ | V5 | R | ||
่ฒทใๆผใ | ใใใใใ | V5 | R | t | |
่ฒทใๆผใ | ใใใใใ | V4 | |||
่ฒทๆผใ | ใใใใใ | V5 | R | ||
่ฒๅ | ใใใใใ | NC | |||
่ฒๅใ | ใใใใใ | NC | |||
่ฒๅใใ | ใใใใใ | NC | |||
ไปๆ | ใใใ | VN | t | ||
ไผๆ | ใใใ | NC | |||
่งฃ้ ค | ใใใ | NC | |||
ๆช็ฐ | ใใใ | AN | 2 | ||
ๆช็ฐ | ใใใ | NC | |||
้ญๅ | ใใใ | AN | 0 | ||
้ญๅ | ใใใ | AN | 2 | ||
ๆตทๅฐ | ใใใ | NC | |||
็ใ | ใใใ | AJ | |||
ๆตทๅ | ใใใใ | NC | |||
ๆตทๅ็ค | ใใใใใใใ | NC | |||
ๆตทๅๅถๅพก | ใใใใใใใใ | NC | |||
ๆตทๅๅฉ็จ | ใใใใใใใ | NC | |||
้ญๅใ | ใใใใ | NC | |||
่ฒทใๆฅใ | ใใใใใ | V5 | G | t | |
้ฃผใ็ฌ | ใใใใฌ | NC | |||
้ฃผ็ฌ | ใใใใฌ | NC | |||
้ฃผใ็ฌใซๆใๅใพใใ | ใใใใฌใซใฆใใใพใใ | V1 | |||
้ฃผ็ฌใซๆใๅใพใใ | ใใใใฌใซใฆใใใพใใ | EJ | |||
่ฒทใใใใ | ใใใใใ | V1 | |||
่ฒทใๅ ฅใใ | ใใใใใ | V1 | S | t | |
่ฒทใๅ ฅใใ | ใใใใใ | V2 | |||
่ฒทๅ ฅใใ | ใใใใใ | V1 | |||
ไผๅก | ใใใใ | NC | |||
ๆๅผ | ใใใใ | VN | t | ||
ๆนๅฐ | ใใใใ | VN | i | ||
ๆตทๅก | ใใใใ | NC | |||
้้ข | ใใใใ | VN | r | ||
่ชจๆทซ | ใใใใ | NC | |||
ไผๅกไผ็คพ | ใใใใใใใใ | NC | |||
ๆตทๅก็ตๅ | ใใใใใใฟใใ | NC | |||
ไผๅกๅธ | ใใใใใใ | NC | |||
ไผๅกๆจฉ | ใใใใใใ | NC | |||
ไผๅกๆจฉๅๆณ | ใใใใใใใใใใปใ | NC | |||
ไผๅก่จผ | ใใใใใใใ | NC | |||
ไผๅกๆฐ | ใใใใใใ | NC | |||
ไผๅกๅถ | ใใใใใใ | NC | |||
ไผๅก็ต็น | ใใใใใใใ | NC | |||
ๆตท่ | ใใใ | NC | |||
่ฒทใใใใ | ใใใใใ | V1 | |||
่ฒทใๅใใ | ใใใใใ | V1 | S | t | |
่ฒทใๅใใ | ใใใใใ | V2 | |||
่ฒทๅใใ | ใใใใใ | V1 | |||
่ฒทใๅใใ | ใใใใใ | V1 | |||
่ฒทๅใใ | ใใใใใ | V1 | |||
ๆตท้ | ใใใใ | NC | |||
้้ | ใใใใ | NC | |||
ๆตท้ๅฑ | ใใใใใใใ | NC | |||
ๆตท้ๆฅญ | ใใใใใใใ | NC | |||
ๆตท้ๅ็ | ใใใใใฉใใใ | NC | |||
ใซใคใจ | ใใใ | NC | |||
ๅฟซๆณณ | ใใใใ | NC | |||
้ๆ | ใใใใ | VN | |||
ๆนๆ | ใใใใ | VN | t | ||
ๅฟซๆผ | ใใใใ | VN | |||
ๆตทๅกฉ | ใใใใ | NC | |||
ๆตทๆทต | ใใใใ | NC | |||
้ๅ | ใใใใ | VN | r | ||
้ๅฎด | ใใใใ | VN | |||
้ๆผ | ใใใใ | VN | i | ||
ๆตท็ๆ | ใใใใใใ | NC | |||
่ฒ่ฆ | ใใใใใ | NC | |||
่ฒ่ฆใ | ใใใใใ | NC | |||
่ฒทใ็ฝฎใ | ใใใใ | VN | |||
่ฒท็ฝฎ | ใใใใ | VN | |||
่ฒท็ฝฎใ | ใใใใ | VN | |||
ๅฃๅฑ | ใใใใ | NC | |||
่ฒทใ็ฝฎใ | ใใใใ | V5 | K | ||
่ฒท็ฝฎใ | ใใใใ | V5 | K | ||
้ฃผใๆกถ | ใใใใ | NC | |||
้ฃผๆกถ | ใใใใ | NC | |||
่ฒทใใชใ | ใใใใบ | NC | |||
่ฒทใชใ | ใใใใบ | NC | |||
่ฒทใใชใใฌใผใทใงใณ | ใใใใบใใผใใใ | NC | |||
่ฒทใชใใฌใผใทใงใณ | ใใใใบใใผใใใ | NC | |||
ไป้ณ | ใใใใ | NC | |||
ๅฟซ้ณ | ใใใใ | NC | |||
ๆช้ณ | ใใใใ | NC | |||
ๆตทๆธฉ | ใใใใ | NC | |||
้้ณ | ใใใใ | NC | |||
้้ณ็ฏ | ใใใใใใค | NC | |||
ไผๆญ | ใใใ | NC | |||
ๆช็ซ | ใใใ | NC | |||
้ๅ | ใใใ | VN | i | ||
้ๆถ | ใใใ | VN | |||
้่ฑ | ใใใ | VN | i | ||
้ไธ | ใใใ | NC | |||
่ซงๅ | ใใใ | VN | i | ||
ๆชใ | ใใใใ | AN | 0 | ||
ๆชๆช | ใใใใ | AN | 0 | ||
ๆขใ | ใใใใ | AN | 1 | ||
ๆขๆข | ใใใใ | AN | 1 | ||
ๆขๆข | ใใใใ | AN | 2 | ||
้ไผ | ใใใใ | VN | r | ||
้ไผๅผ | ใใใใใใ | NC | |||
่ฒทใใใใ | ใใใใใ | V5 | S | ||
่ฒทใ่ฟใ | ใใใใใ | V5 | S | t | |
่ฒทใ่ฟใ | ใใใใใ | V4 | |||
่ฒท่ฟใ | ใใใใใ | V5 | S | ||
่ฒทใๆใใ | ใใใใใ | V1 | |||
่ฒทใๆฟใใ | ใใใใใ | V1 | |||
่ฒทๆใใ | ใใใใใ | V1 | |||
่ฒทๆฟใใ | ใใใใใ | V1 | |||
่ฒทๆฟใ | ใใใใใ | V1 | |||
้่ฑๆ | ใใใใ | NC | |||
ๆน้ฉ | ใใใใ | VN | t | ||
ๆตท่ง | ใใใใ | NC | |||
่ฒทใๆใ | ใใใใ | NC | |||
่ฒทๆ | ใใใใ | NC | |||
่ฒทๆใ | ใใใใ | NC | |||
่ฒทใๆใ้ | ใใใใใใ | NC | |||
่ฒทๆใ้ | ใใใใใใ | NC | |||
่ฒทๆ้ | ใใใใใใ | NC | |||
้่ฑๅ็ท | ใใใใใใใ | NC | |||
่ฒทใๆน | ใใใใ | NC | |||
่ฒทๆน | ใใใใ | NC | |||
ๅฟซๆดป | ใใใใค | AN | 0 | ||
ๅฟซๆดป | ใใใใค | AN | 2 | ||
ๅฟซ่ฑ | ใใใใค | AN | 0 | ||
ๅฟซ่ฑ | ใใใใค | AN | 2 | ||
ๅฟซ้ | ใใใใค | AN | 0 | ||
ๅฟซ้ | ใใใใค | AN | 2 | ||
้่ฑ | ใใใใค | AN | 0 | ||
้่ฑ | ใใใใค | AN | 2 | ||
ๅฟซๆดปใ | ใใใใคใ | NC | |||
้ๅใฉใใถใ | ใใใใฉใใถใ | NC | |||
้ๅไธผ | ใใใใฉใใถใ | NC | |||
่ฒทใใใถใ | ใใใใถใ | VN | |||
่ฒทใ่ขซใ | ใใใใถใ | VN | |||
่ฒท่ขซ | ใใใใถใ | VN | |||
่ฒท่ขซใ | ใใใใถใ | VN | |||
่ฒทใใใถใ | ใใใใถใ | V5 | R | ||
่ฒทใ่ขซใ | ใใใใถใ | V5 | R | t | |
่ฒทใ่ขซใ | ใใใใถใ | V4 | |||
่ฒท่ขซใ | ใใใใถใ | V5 | R | ||
้่ฑใใซใขใณ | ใใใใปใใใ | NC | |||
่ฒทใ็บๆฟ | ใใใใใ | NC | |||
่ฒท็บๆฟ | ใใใใใ | NC | |||
ไผ้คจ | ใใใใ | NC | |||
ไผ่ | ใใใใ | NC | |||
ๅฟซๆ | ใใใใ | NC | |||
ๅฟซๆผข | ใใใใ | NC | |||
ๆชๆผข | ใใใใ | NC | |||
ๆตท้ข | ใใใใ | NC | |||
้ๅทป | ใใใใ | VN | |||
้้คจ | ใใใใ | VN | r |
Practical Applications
JLDย is being used by major IT companies to enhance their Japanese morphological analysis technology, and is especially suitable for natural language processing (NLP) applications for:
Segmentation and tokenization
Input method editors
Information retrieval
Morphological analysis
Part-of-speech tagging
Reference Documents
To makeย JLDย robust for information retrieval and morphological analysis, it is highly recommended to supplement it with ourย JODย (Japanese Orthographic Database), described in detail in the papers below.
The Challenges of Intelligent Japanese Searching
Linguistic issues that need to be addressed by advanced information retrieval technologies
Morphological Attributes in Japanese
Describes derivational affixes and binding valency
JLD Related Resources
Chinese Lexical Database
Monolingual general vocabulary for NLP applications
Korean Lexical Database
Monolingual general vocabulary for NLP applications
Japanese Wordlist
General vocabulary, proper nouns and technical terms