Tsunehiko Tanaka

I am working as a Software Engineer at IBM in Tokyo, Japan.

I am interested in controllable embodied agents using formal representations and reinforcement learning (RL). I received my Ph.D. from Waseda University in March 2026, under the supervision of Dr. Edgar Simo-Serra. Previously, focusing on games as an application domain, I have worked on the performance control of RL agents and board game design using LLMs. Additionally, I was fortunate to have the opportunity to intern at IBM Research in 2021, where I worked on developing RL agents that utilize commonsense reasoning. I also conducted collaborative research under Dr. Matthew Stephenson at Flinders University.

I’ll be attending ICML’26—feel free to reach out via LinkedIn or email.

Email | Google Scholar | X | Github | Linkedin

Publications

	Return-Aligned Decision Transformer Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra. TMLR 2025 J2C Certification (To be presented at ICML'26) paper \| code \| DOI
	Grammar and Gameplay-aligned RL for Game Description Generation with LLMs Tsunehiko Tanaka, Edgar Simo-Serra Conference on Games 2025 paper \| code \| DOI
	Grammar-based Game Description Generation using Large Language Models Tsunehiko Tanaka, Edgar Simo-Serra Transactions on Games 2024 paper \| code \| DOI
	DiffG-RL: Leveraging Difference between State and Common Sense Tsunehiko Tanaka, Daiki Kimura, Michiaki Tatsubori Findings of EMNLP 2022 paper \| code \| DOI
	Commonsense Knowledge from Scene Graphs for Textual Environments Tsunehiko Tanaka, Daiki Kimura, Michiaki Tatsubori AAAI Workshop on Reinforcement Learning in Games 2022 paper \| code \| DOI
	Challenges in Explainability and Knowledge Extraction Daiki Kimura, Tsunehiko Tanaka, Michiaki Tatsubori, Asim Munawar Wordplay: When Language Meets Games @ NAACL 2022 paper
	LoL-V2T: Large-Scale Esports Video Description Dataset Tsunehiko Tanaka, Edgar Simo-Serra CVPRW (CVSports) 2021 project \| DOI