Reinforcement Learning researcher. These days, looking into multi-agent systems.
PhD in Machine Learning, obtained in 2021.
Pinned Loading
-
logistic_bandit
logistic_bandit PublicLogistic Bandit experiments. Official code for the paper "Jointly Efficient and Optimal Algorithms for Logistic Bandits".
-
hessian_free_dnn
hessian_free_dnn PublicSecond order (Hessian-Free, Martens 2015) experiments with deep neural networks, in Tensorflow.
Python 2
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.