I make the case why people iteratively training any model should learn some core concerns of reinforcement learning.
Industry labs power up their robot learning research with parallelization!
How RL is starting to be used by industry and how RL is heading to a framing more suited for industrial scales.
Multi-agent scenarios make reward maximization a risk. Discussing when, rather than if, we should believe in the Reward Hypothesis.
How simulator exploitation, a dual of over-optimization, in AI is the canary in the coal mine for what negative implications could come from weakly-bou…
I don’t even like the idea of flying delivery drones, but that will be all we have.
Some debates that will be settled en route to RL being used in all corners of the modern world.
How English being the language of progress in AI could bias our machine’s minds and what we think they are capable of. Uncoupling AI from our notion of…
See all