Hi! I'm Issa Sugiura
I am a first year PhD student at Institute of Science Tokyo, majoring in Computer Science.
I like to learn by building things.
Research Interests
My research focuses on building vision-language models (VLMs) with strong Japanese language and cultural understanding. To get there, I work across the full development pipeline — building training datasets and benchmarks, and developing the Japanese models themselves:
- constructing large-scale training datasets (e.g., WAON, Jagle)
- developing reliable evaluation benchmarks (e.g., JAMMEval, HakushoBench)
- building Japanese-centric VLMs (e.g., LLM-jp-4-VL 9B beta)
I am also interested in benchmarking LLM agents — particularly on economically valuable tasks that go beyond coding and math, and on long-horizon tasks (e.g., CoffeeBench, EDINET-Bench).
Education
- Apr 2026 – present
Doctoral Program (Expected)
Institute of Science Tokyo, Japan
Course of Artificial Intelligence, Department of Computer Science, School of Computing
- Apr 2024 – Mar 2026
Master's Degree
Kyoto University, Japan
Course of Communications and Computer Engineering, Graduate School of Informatics
- Apr 2020 – Mar 2024
Bachelor's Degree
Osaka University, Japan · GPA 3.57/4.00
Software Science Course, Department of Information and Computer Sciences, School of Engineering Science
Experience
- Apr 2026 – present
Research Assistant @ Institute of Science Tokyo
Research on VLMs.
- Dec 2024 – present
Student Intern @ Sakana AI
Building benchmarks to evaluate LLM agents on economically valuable, long-horizon tasks, including financial analysis (EDINET-Bench, ICLR 2026) and multi-agent economies (CoffeeBench).
- Feb 2024 – present
Research Assistant @ National Institute of Informatics
Developing Japanese vision-language models across the full pipeline — large-scale dataset construction (WAON, Jagle), evaluation benchmark design (JAMMEval, HakushoBench), and model training (LLM-jp-4-VL).
Publications
Peer-Reviewed Publications
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
Preprints
- 1.
- 2.
- 3.
- 4.
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models
Issa Sugiura, Shuhei Kurita, Yusuke Oda, Daisuke Kawahara, Yasuo Okabe, Naoaki Okazaki
arXiv, Oct 2025
- 5.
- 6.
- 7.
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
LLM-jp (incl. Issa Sugiura; 80 authors, listed in alphabetical order)
arXiv, Jul 2024
Domestic Conference (Non-Refereed)
- 1.
Jagle: 視覚言語モデルのための大規模日本語マルチモーダル事後学習データセットの構築
杉浦一瑳, 笹川慶人, 中尾圭佑, 前田航希, Yin Ziqi, Yang Zhishen, 栗田修平, 小田悠介, 徳久良子, 河原大輔, 岡崎直観
JSAI2026, 2026年6月
- 2.
- 3.
- 4.
WAON: 視覚言語モデルのための大規模かつ高品質な日本語画像・テキスト対データセット
委員特別賞杉浦一瑳, 栗田修平, 小田悠介, 河原大輔, 岡部寿男, 岡崎直観
NLP2026, 2026年3月
- 5.
Common Crawlを用いた大規模音声音響データセットの構築
浅井航平, 杉浦一瑳, 中田亘, 栗田修平, 高道慎之介, 小川哲司, 東中竜一郎
日本音響学会 2025年秋季研究発表会, 2025年9月
- 6.
- 7.
- 8.
- 9.
LLM-jp-3 VILA: 日本語マルチモーダルデータセット及び強力な日本語マルチモーダルモデルの構築
委員特別賞笹川慶人, 前田航希, 杉浦一瑳, 栗田修平, 岡崎直観, 河原大輔
NLP2025, 2025年3月
- 10.
Models / Systems
- 1.
Certifications / Qualifications
- Oct 19, 2025 TOEIC L&R: L 420 + R 425 = 845
- Aug 2022 Security Camp organized by IPA (Information-technology Promotion Agency), Web Security Course