I wrote in a previous post how transformers are kind of like system identification methods applied to some sequence in a state space. In this post we try to understand if the other way round is …
Transformer models have captivated a lot of AI research in most of the past decade, and in this post my goal is to make them seem more interesting to controls people, who have much to contribute to …
Consider an $$n+1$$ dimensional matrix as follows:
$$ A = \begin{pmatrix} \frac{\pi^2}{6} & 1 & \frac{1}{4} & ... & \frac{1}{n^2} \\ 1 & \frac{\pi^2}{6} & \frac{1}{4} & ... & \frac{1}{n^2} \\ \vdots & …
This blog post summarizes the papers Settling the Reward Hypothesis1 and Utility Theory for Sequential Decision Making 2.
The reward hypothesis is at the core of Reinforcement Learning (RL) in that …
Recent news of the SpaceX catching its Starship Super Heavy booster1 is quite discussed and marveled upon in the media, and rightly so. Executing what was done can perhaps be explained as trying to …