#ml
This blog post summarizes the papers Settling the Reward Hypothesis1 and Utility Theory for Sequential Decision Making 2.
The reward hypothesis is at the core of Reinforcement Learning (RL) in that …
Recent news of the SpaceX catching its Starship Super Heavy booster1 is quite discussed and marveled upon in the media, and rightly so. Executing what was done can perhaps be explained as trying to …
Almost always we hear about classification or machine learning problems, the go-to methods to solve the problem are neural networks, or multi-layered percetrons (MLP). Now function approximation …
The internet is filled with machine learning resources, and one of the most annoying things about them is the sheer volume. There are many attempts at making compilations of papers, code, and current …