Tabular and Deep Learning of Whittle Index

Francisco Robledo; Urtzi Ayesta; Konstantin Avrachenkov; Vivek S Borkar

Conference poster, Year: 2022


Francisco Robledo

Role: Author
PersonId : 1160530
IdHAL : francisco-robledo
ORCID : 0000-0003-1040-1513

University of the Basque Country = Euskal Herriko Unibertsitatea

Laboratoire de Mathématiques et de leurs Applications [Pau]

Urtzi Ayesta

Role: Author
PersonId : 14024
IdHAL : urtzi-ayesta
IdRef : 087245019

Konstantin Avrachenkov

Role: Collaborator
PersonId : 11963
IdHAL : konstantin-avrachenkov
ORCID : 0000-0002-8124-8272
IdRef : 087245280

Network Engineering and Operations

Vivek S Borkar

Role: Collaborator
PersonId : 994265
ORCID : 0000-0003-0756-5402

Department of Electrical Engineering [IIT-Bombay]

Abstract

- The Whittle index policy is an asymptotically optimal heuristic for Restless Multi-Armed Bandit Problems (RMABP).
- We propose two algorithms, QWI and QWINN, to compute these indices.
- Both use a two-timescale scheme to compute the indices and the Q-values of each state-action pair.
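The two-timescale idea in the abstract can be illustrated with a small tabular sketch. This is not the poster's code: the update rules are an assumption, following the usual two-timescale Q-learning scheme for Whittle indices, in which Q-values are updated with a fast step size and, for each reference state x, a subsidy lam[x] for the passive action is adjusted with a slower step size until both actions are equally attractive in x. The function name `qwi`, the arguments `P0`, `P1`, `r0`, `r1`, the discounted criterion with `gamma`, the uniform exploration, and the step-size schedules are all hypothetical choices made for illustration.

```python
import random


def qwi(P0, P1, r0, r1, gamma=0.5, n_steps=20000, seed=0):
    """Tabular two-timescale estimate of Whittle indices for one arm (sketch).

    P0, P1: transition matrices (lists of rows) for passive / active action.
    r0, r1: per-state rewards for passive / active action.
    Returns one index estimate per state.
    """
    rng = random.Random(seed)
    n = len(r0)
    P, r = (P0, P1), (r0, r1)
    # Q[x][s][a]: Q-value of (s, a) when the passive subsidy is lam[x].
    Q = [[[0.0, 0.0] for _ in range(n)] for _ in range(n)]
    lam = [0.0] * n
    s = 0
    for k in range(n_steps):
        # Assumed schedules: beta / alpha -> 0, so the indices move on the
        # slower timescale and see almost-converged Q-values.
        alpha = 0.5 / (1 + k / 1000) ** 0.6
        beta = 0.05 / (1 + k / 1000)
        a = rng.randrange(2)  # uniform exploration (assumption)
        s_next = rng.choices(range(n), weights=P[a][s])[0]
        for x in range(n):
            # The passive action earns the subsidy of reference state x.
            subsidy = lam[x] if a == 0 else 0.0
            target = r[a][s] + subsidy + gamma * max(Q[x][s_next])
            Q[x][s][a] += alpha * (target - Q[x][s][a])  # fast update
            # Slow update: raise the subsidy while the active action is
            # still preferred in x, lower it otherwise.
            lam[x] += beta * (Q[x][x][1] - Q[x][x][0])
        s = s_next
    return lam


if __name__ == "__main__":
    # Hypothetical two-state arm, used only to show the call signature.
    P0 = [[0.9, 0.1], [0.5, 0.5]]
    P1 = [[0.2, 0.8], [0.1, 0.9]]
    print(qwi(P0, P1, r0=[0.0, 0.0], r1=[1.0, 2.0]))
```

A quick sanity check of the sketch: for an arm with a single state, passive reward 0 and active reward 1, the only subsidy that makes both actions equally attractive is 1, so the returned index should approach 1.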

Domains

Computer Science [cs]; Machine Learning [cs.LG]; Artificial Intelligence [cs.AI]

Main file

poster.pdf (2.63 MB)

Origin: Files produced by the author(s)

Contributor: Francisco Robledo

https://univ-pau.hal.science/hal-03810695

Submitted on: Tuesday, October 11, 2022, 15:40:00

Last modified on: Tuesday, March 12, 2024, 14:16:04

Dates and versions

hal-03810695, version 1 (11-10-2022)

Identifiers

HAL Id: hal-03810695, version 1

Cite

Francisco Robledo, Urtzi Ayesta, Konstantin Avrachenkov, Vivek S Borkar. Tabular and Deep Learning of Whittle Index. EWRL 2022 - 15th European Workshop on Reinforcement Learning, Sep 2022, Milan, Italy. ⟨hal-03810695⟩


Collections

CNRS INRIA UNIV-PAU LMA-PAU INRIA2 UNIV-COTEDAZUR ANR CIMI-TOULOUSE

53 views

43 downloads
