Thompson Sampling for Multi-Objective Multi-Armed Bandits Problem. SQ Yahyaa, B Manderick ESANN, 2015 | 32 | 2015 |
Annealing-pareto multi-objective multi-armed bandit algorithm SQ Yahyaa, MM Drugan, B Manderick 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014 | 24 | 2014 |
The scalarized multi-objective multi-armed bandit problem: An empirical study of its exploration vs. exploitation tradeoff SQ Yahyaa, MM Drugan, B Manderick 2014 International Joint Conference on Neural Networks (IJCNN), 2290-2297, 2014 | 24 | 2014 |
Knowledge Gradient for Multi-objective Multi-armed Bandit Algorithms. SQ Yahyaa, MM Drugan, B Manderick ICAART (1), 74-83, 2014 | 24 | 2014 |
Thompson Sampling in the Adaptive Linear Scalarized Multi Objective Multi Armed Bandit. SQ Yahyaa, MM Drugan, B Manderick ICAART (2), 55-65, 2015 | 15 | 2015 |
Knowledge gradient for online reinforcement learning S Yahyaa, B Manderick Agents and Artificial Intelligence: 6th International Conference, ICAART …, 2015 | 5 | 2015 |
The exploration vs exploitation trade-off in the multi-armed bandit problem: An empirical study SQ Yahyaa, B Manderick Proceedings of the 20th European Symposium on Artificial Neural Networks …, 2012 | 5 | 2012 |
Shortest path gaussian kernels for state action graphs: An empirical study S Yahyaa, B Manderick BNAIC 2012 The 24th Benelux Conference on Artificial Intelligence, 250, 2012 | 5 | 2012 |
Multivariate normal distribution based multi-armed bandit pareto algorithm SQ Yahyaa, MM Drugan, B Manderick The 7th European Conference on Machine Learning and Principles and Practice …, 2014 | 4 | 2014 |
Correlated Gaussian multi-objective multi-armed bandit across arms algorithm SQ Yahyaa, MM Drugan 2015 IEEE Symposium Series on Computational Intelligence, 593-600, 2015 | 3 | 2015 |
Knowledge Gradient Exploration in Online Least Squares Policy Iteration. SQ Yahyaa, B Manderick ICAART (2), 263-269, 2013 | 3 | 2013 |
Scalarized and pareto knowledge gradient for multi-objective multi-armed bandits S Yahyaa, MM Drugan, B Manderick Transactions on Computational Collective Intelligence XX, 99-116, 2015 | 2 | 2015 |
Linear Scalarized Knowledge Gradient in the Multi-Objective Multi-Armed Bandits Problem. SQ Yahyaa, MM Drugan, B Manderick ESANN, 2014 | 2 | 2014 |
Knowledge gradient exploration in online kernel-based LSPI S Yahyaa, B Manderick Proceedings of the 25th Belgium-Netherlands Artificial Intelligence …, 2013 | 2 | 2013 |
Explorations in Reinforcement Learning: Online Action Selection and Value Function Approximation SQ Yahyaa | 1 | 2015 |
Annealing linear scalarized based multi-objective multi-armed bandit algorithm SQ Yahyaa, MM Drugan, B Manderick 2015 IEEE Congress on Evolutionary Computation (CEC), 1738-1745, 2015 | | 2015 |
Online Knowledge Gradient Exploration in an Unknown Environment. SQ Yahyaa, B Manderick ICAART (1), 5-13, 2014 | | 2014 |
Empirical Evaluation of Shortest Path Gaussian Kernels over State Action Graphs. SQ Yahyaa, B Manderick ICAART (2), 225-231, 2013 | | 2013 |
The Exploration vs Exploitation Trade-Off in Bandit Problems: An Empirical Study S Yahyaa, B Manderick rn 1, 2, 2012 | | 2012 |