LSAJ

Japanese｜English

［Last update］January 10, 2024

2025年11月19日: I-JAS中納言のデータを更新しました。データバージョン2025.9

2025年01月10日: 第九回学習者コーパス・ワークショップ＆シンポジウム　【開催のお知らせ】

2024年01月10日: 第八回学習者コーパス・ワークショップ＆シンポジウム　【開催のお知らせ】

2022年08月10日: 第七回学習者コーパス・ワークショップ＆シンポジウム　【開催のお知らせ】

2022年05月23日: 「I-JAS 外国語母語話者コーパス（I-JAS FOLAS）」を公開しました

2021年04月20日: 第六回学習者コーパス・ワークショップ＆シンポジウム　【開催のお知らせ】

2021年03月15日: 【重要】　C-JAS検索システム　中納言に移行

2020年06月02日: 「I-JAS」完成記念シンポジウム／第5回コーパスワークショップ　【開催のお知らせ】

2020年04月07日: 『日本語学習者コーパスI-JAS入門研究・教育にどう使うか』が刊行されました

2020年03月25日: 【2020年3月25日】『多言語母語の日本語学習者横断コーパス』（I-JAS）が完成しました

2019年05月10日: 【2019年5月10日】『多言語母語の日本語学習者横断コーパス（I-JAS）』の第四次データを公開しました

2018年10月02日: 第四回学習者コーパス・ワークショップ&シンポジウム　開催案内（イベントページに一部予稿集を追加しました）

2018年05月21日: 【2018年5月21日】『多言語母語の日本語学習者横断コーパス（I-JAS: International Corpus of Japanese as a Second Language）』の第三次データを公開しました

C-JAS 中国語・韓国語母語の日本語学習者縦断発話コーパス

This is a spoken corpus, based on a series of longitudinal studies consisting of data collected from 3 Chinese and 3 Korean learners of Japanese. The title of this corpus is "Spoken Corpus of Longitudinal Research on Chinese and Korean Learners of Japanese", and the abbreviated title is "C-JAS (Corpus of Japanese as a Second Language)".
The data comprises 570,000 words and about 46.5 hours worth of speech (utterance). An online system can be used to search the data for examples by morpheme unit, character string, etc.

I-JAS 多言語母語の日本語学習者横断コーパス

This corpus is based on cross-sectional research and collects data on spoken and written words of 1000 Japanese learners of 12 different native languages. The title of this corpus is "International Corpus of Cross-sectional Research of Japanese learners", and the abbreviated title is "I-JAS (International Corpus of Japanese as a Second Language)". The targeted Japanese learners were requested to take the Japanese proficiency test to assess their langauge proficiency. Therefore, the data can be compared by level, native language, tasks, and learning environment. Examples can be searched online, along with audio data from speech research.