研究業績 / Achievements

2025年度

Papers

Hiroyuki Deguchi, Go Kamoda, Yusuke Matsushita, Chihiro Taguchi, Kohei Suenaga, Masaki Waga, Sho Yokoi (2025),
“A Soft and Fast Pattern Matcher for Billion-Scale Corpus Search”,
The Thirteenth International Conference on Learning Representations (ICLR 2025).
Makoto Shing, Kou Misaki, Han Bao, Sho Yokoi, Takuya Akiba (2025),
“TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models”,
The Thirteenth International Conference on Learning Representations (ICLR 2025).
Kanako Komiya, Soma Asada, Masayuki Asahara (2025),
“Automatic Tagging of Word Senses for a Large-Scale Historical Japanese Corpus”,
Digital Humanities Conference 2025 (DH 2025).
大村舞・川端良子・小西光・浅原正幸・竹内誉羽 (2025),
「地図を刺激に用いた位置情報・経路情報参照表現の収集」,
『自然言語処理』, 第32巻第2号.
Ryo Yoshida, Shinnosuke Isono, Kohei Kajikawa, Taiga Someya, Yushi Sugimoto, Yohei Oseki (2025),
“If Attention Serves as a Cognitive Model of Human Memory Retrieval, What is the Plausible Memory Representation?”,
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025).
Ryo Kishino, Hiroaki Yamagiwa, Ryo Nagata, Sho Yokoi, Hidetoshi Shimodaira (2025),
“Quantifying Lexical Semantic Shift via Unbalanced Optimal Transport”,
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025).
Masaki Sakata, Sho Yokoi, Benjamin Heinzerling, Takumi Ito, Kentaro Inui (2025),
“On Entity Identification in Language Models”,
Findings of the Association for Computational Linguistics: ACL 2025.
Hiroaki Ozaki, Mai Omura, Kanako Komiya, Masayuki Asahara, Toshinobu Ogiso (2025),
“Structure Modeling Approach for UD Parsing of Historical Modern Japanese”,
The 1st Joint Workshop on Large Language Models and Structure Modeling (ACL 2025).
Hiroshi Matsuda, Chunpeng Ma, Masayuki Asahara (2025),
“Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs”,
Universal Dependencies Workshop 2025 (UDW 2025).
Hiroshi Tamano, Hideitsu Hino, Daichi Mochihashi (2025),
“Misspecifying Non-compensatory as Compensatory IRT: Analysis of Estimated Skills and Variance”,
Behaviormetrika.
Yusuke Kubota, Robert Levine (2025),
“Against Successive Cyclicity: A Proof-theoretic Account of Extraction Pathway Marking”,
Journal of Language Modelling.
Koyo Akuzawa, Yusuke Kubota (2025),
“On Recent Debates on the Tense Alternation Generalization: A Reply to Fujii et al. (2023)”,
Gengo Kenkyu.
Yusuke Kubota, Robert Levine (2026),
Type-Logical Grammar: A Syntactic Introduction,
Cambridge University Press.

Awards

人工知能学会研究会優秀賞
- 鈴木あすみ（国立障害者リハビリテーションセンター研究所／東北大学大学院）, 小磯花絵（国語研究所）, 「日本語母語話者の自閉スペクトラム症者を対象とする公開コーパス活用の可能性―間投助詞・終助詞「ね」の使用率・用法の比較―」

Collaborative Researches

【共同利用型(B)】そり舌音などの調音に関する客観的観察 - 阿栄娜 (早稲田大学) 　(2025/05/01-2026/04/30)
【共同利用型(B)】発話のしにくさの感覚に関連する脳活動の抽出の試み - 北村達也 (甲南大学)　(2025/07/01-2026/06/30)

2024年度

Books

呉軍著・持橋大地監訳・井上朋也訳 (2024), 「数学の美」, 東京化学同人.
【URL】

Papers

浅原正幸 (2025), 「Bayesian Linear Mixed Model に基づく漢字親密度推定の試みーー言語資源としての統計モデルの検討」, 『日本語・日本学研究』, 15号, pp.1-20. 東京外国語大学国際日本研究センター.
大村舞・若狭絢・松田寛・浅原正幸 (2025),「日本語日常会話コーパスの Universal Dependencies: UD_Japanese-CEJC」,　『自然言語処理』, 32巻1号, pp.55-90. 【DOI】 | 【BIB】
Hajime Kiyama, Taichi Aida, Mamoru Komachi, Toshinobu Ogiso, Hiroya Takamura, Daichi Mochihashi (2025), “Analyzing Continuous Semantic Shifts with Diachronic Word Similarity Matrices”, Proceedings of the 31st International Conference on Computational Linguistics, pp. 1613-1631. 【URL】
Jeong Hyeonjeong, Jungho Kim, Masataka Yano, Haining Cui, Sachiko Kiayama, Masatoshi Koizumi (2025), “The crucial role of the left inferior frontal gyrus (BA44) in synergizing syntactic structure and information structure during sentence comprehension”, Brain and Language, Vol.262, Article 105533. 【DOI】
Sachi Kato, Masayuki Asahara (2024), “Assigning Impression Rating Information to the ‘Balanced Corpus of Contemporary Written Japanese’”, Proceedings of PACLIC 38.
Risa Murakami, Masataka Yano (2024), “The role of prosodic phrasing in silent reading”, Glossa Psycholinguistics, Vol.3 No.1, pp. 1-15. 【DOI】
Kohei Kajikawa, Yusuke Kubota, Yohei Oseki (2024), “Is Structure Dependence Shaped for Efficient Communication?: A Case Study on Coordination”, Proceedings of the 28th Conference on Computational Natural Language Learning, pp. 291-302. 【URL】
Natsumi Funasaki, Masataka Yano (2024), “Role of prosody and word order in identifying focus: evidence from pupillometry”, Language, Cognition, and Neuroscience, pp. 1-18.
【DOI】
Lis Pereira, Fei Cheng, Wan Jou She, Masayuki Asahara, Ichiro Kobayashi (2024), “Prior Knowledge-Guided Adversarial Training”, Proceedings of the 9th Workshop on Representation Learning for NLP (RepL4NLP-2024), pp. 51-57.
【URL】
Yoshiko Kawabata, Mai Omura, Hikari Konishi, Masayuki Asahara, Johane Takeuchi (2024), “Collection of Japanese Route Information Reference Expressions Using Maps as Stimuli”, Proceedings of the 4th Workshop on Spatial Language Understanding and Grounded Communication for Robotics (SpLU-RoboNLP 2024), pp. 30-35.
【URL】
Hiroaki Ozaki, Kanako Komiya, Masayuki Asahara, Toshinobu Ogiso (2024), “Long Unit Word Tokenization and Bunsetsu Segmentation of Historical Japanese”, Proceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4AL 2024), pp. 48-55.
【URL】
川端良子・大村舞・浅原正幸・竹内誉羽 (2024), 「Double Cross Model に基づく相対位置情報アノテーション」, 『自然言語処理』, 31巻3号, pp. 1356-1375.
【DOI】 | 【BIB】
Masataka Yano, Keiyu Niikuni, Ruri Shimura, Natsumi Funasaki, Masatoshi Oizumi (2024), “Producing non-basic word orders in (in)felicitous contexts: evidence from pupillometry and functional near-infrared spectroscopy (fNIRS)”, Language, Cognition, and Neuroscience, pp. 1-22.
【DOI】

Awards

言語処理学会30周年記念論文賞, (10年間で283編中2編）, 言語処理学会 (2024.10)
- 浅原正幸・金山博・宮尾祐介・田中貴秋・大村舞・村脇有吾・松本裕治
  「Universal Dependencies 日本語コーパス」【URL】
Outstanding Senior Area Chairs, （101名中10名）, The 2024 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics (2024.11)
- Masayuki Asahara 【X】
Best Paper Award, （42件中1件）, Proceedings of the 28th Conference on Computational Natural Language Learning (2024.12)
- Kohei Kajikawa, Yusuke Kubota, Yohei Oseki
  “Is Structure Dependence Shaped for Efficient Communication?: A Case Study on Coordination”
  【URL】
第31回言語処理学会年次大会優秀賞, （対象765件中14件）, 言語処理学会【URL】 (2025.3)
- 内海慶・持橋大地
  「ベイズ教師なし文境界認識」
第31回言語処理学会年次大会委員特別賞, （対象726件中32件）, 言語処理学会【URL】 (2025.3)
- 永田亮・持橋大地・井戸美里・窪田悠介・高村大也・川崎義史・大谷直輝
  「対象言語・対象単語を選ばない汎用的な文法化度の定量化手法」
- 近藤泰弘・持橋大地
  「語形のベクトル化による最適な言語地図の描画」
- 岸野稜・山際宏明・永田亮・横井祥・下平英寿
  「不均衡最適輸送を用いた意味変化検出」
- 磯野真之介・梶川康平・杉本侑嗣・浅原正幸・大関洋平
  「CCGによる日本語脳波データのモデリング」
- 浅原正幸
  「whole-NWJC: 『国語研日本語ウェブコーパス』全データ」
第31回言語処理学会年次大会若手奨励賞, （対象487件中20件）, 言語処理学会【URL】 (2025.3)
- 梶川康平
  「認知負荷の最適化戦略としての自由語順と項省略」

Collaborative Researches

【共同利用型(B)】大規模言語モデルによる日本語古典語の生成 - 近藤泰弘 (青山学院大学)　　(2024/06/01-2025/05/31)
【共同利用型(B)】ヘブライ語・ロシア語の語頭子音連続における調音の実態調査 - 横江百合子（東洋学園大学）(2024/06/01-2025/03/31)
【共同利用型(B)】発話のしにくさの感覚に関連する脳活動の抽出の試み - 北村達也 (甲南大学)　(2025/01/01-2025/03/31)
【共同利用型(C)】オープンかつ日本語に強い大規模言語モデルの構築 - 岡崎直観（東京工業大学）(2024/09/01-)
【共同利用型(C)】可搬性に優れた大規模言語モデル技術の開発 - 高村大也（産業技術総合研究所）(2024/09/01-)
【共同利用型(C)】日本語汎用大規模言語モデルの追加学習と学術・医療ドメインへの適応 - 相澤彰子（国立情報学研究所）(2024/12/01-)
【他機構】国立情報学研究所大規模言語モデル研究開発センター
【他大学】東北大学言語AI研究センター
【企業】SB Intuitions 株式会社（～2025年3月）
【企業】株式会社ホンダ・リサーチ・インスティチュート・ジャパン（～2026年3月）
【企業】NTTコミュニケーション科学基礎研究所（～2025年3月）

2023年度以前より継続のもの

Collaborative Researches

【共同利用型(C)】係り受け情報を用いた日本語母語話者の情報処理過程解明のための基礎的研究 - 庵功雄（一橋大学）（～2025年3月）
【共同利用型(C)】BCCWJ/CSJ への生理指標アノテーション付加 - 小泉政利 (東北大学)
【企業】株式会社リクルート Megagon Labs「日本語版Universal Dependenciesに基づく日本語依存構造解析モデルの研究開発」
【企業】株式会社ワークスアプリケーションズ「日本語事前学習モデルchiTraの研究開発」
【企業】シェルパ・アンド・カンパニー株式会社「高速な日本語形態素解析器の開発及び解析モデルの公開」