(1 - 8 von 8
)
Reinforcement Learning & Bandits 2
ecmlpkdd2019.org
Debabrota Basu (Chalmers University of Technology), Pierre Senellart (DI ENS, ENS, CNRS, PSL University; INRIA), Stéphane Bressan (National University of Singapore) We propose a Bayesian information-geometric approach to the exploration-exploitation trade-off in stochastic multi-armed bandits. The uncertainty on reward generation and belief is ...
Neuroscience Seminar Series “Introduction to Fine-Grained ...research.pasteur.fr › event › neuroscience-seminar-s...
research.pasteur.fr
· Pierre Senellart. (University Professor and Deputy Director of the Computer Science Department Ecole Normale Supérieure, Paris ) ...
sortiert nach Relevanz / Datum