I am an undergraduate at Kobe University, currently an exchange student at
TUM (2025/26). My interests lie in safe and general physical intelligence, especially Generative models, Reinforcement Learning and Control Theory.
An in-depth note bridging reinforcement learning and constrained optimization. It covers fundamental structures including KKT conditions, and demonstrates how to achieve end-to-end safe set learning via implicit differentiation.
A demo of FSNet, an architecture designed for differentiable constrained optimization. It implements a differentiable projection layer that unrolls optimization processes to learn policies under hard constraints.