NDLI: Region-based dynamic programming for partially observable markov decision processes.

Please wait, while we are loading the content...

Region-based dynamic programming for partially observable markov decision processes.

Content Provider	CiteSeerX
Author	Feng, Zhengzhu
Abstract	We present a major improvement to the dynamic programming (DP) algorithm for solving partially observable Markov decision processes (POMDPs). Our technique first targets the cross-sum pruning step of the DP update, a key source of complexity in POMDP algorithms. Unlike previous approaches, which reason about the whole belief space, the algorithms we present divide the belief space into smaller regions and perform independent pruning in each region. Because the number of useful vectors over each region can be much smaller than those over the whole belief space, we show that the linear programs used in the pruning process can be made exponentially smaller. With this exponential improvement to cross-sum pruning, we shift our attention to the next bottleneck, the maximization pruning step. Using the same region-based reasoning, we identify two types of structures in the belief space of a POMDP and show how to exploit them to reduce significantly the number of constraints in the linear programs used for maximization pruning. We discuss future research directions on extending these techniques to improve the scalability of POMDP algorithms.
File Format	PDF
Access Restriction	Open
Subject Keyword	Region-based Dynamic Programming Partially Observable Markov Decision Process Linear Program Belief Space Pomdp Algorithm Whole Belief Space Observable Markov Decision Process Dynamic Programming Future Research Direction Key Source Maximization Pruning Useful Vector Cross-sum Pruning Step Exponential Improvement Previous Approach Pruning Process Dp Update Next Bottleneck Region-based Reasoning Major Improvement Cross-sum Pruning
Content Type	Text

Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302

See location in the Map
03222 282435
Mail: support@ndl.gov.in

Sl.	Authority	Responsibilities	Communication Details
1	Ministry of Education (GoI), Department of Higher Education	Sanctioning Authority	https://www.education.gov.in/ict-initiatives
2	Indian Institute of Technology Kharagpur	Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project	https://www.iitkgp.ac.in
3	National Digital Library of India Office, Indian Institute of Technology Kharagpur	The administrative and infrastructural headquarters of the project	Dr. B. Sutradhar bsutra@ndl.gov.in
4	Project PI / Joint PI	Principal Investigator and Joint Principal Investigators of the project	Dr. B. Sutradhar bsutra@ndl.gov.in Prof. Saswat Chakrabarti will be added soon
5	Website/Portal (Helpdesk)	Queries regarding NDLI and its services	support@ndl.gov.in
6	Contents and Copyright Issues	Queries related to content curation and copyright issues	content@ndl.gov.in
7	National Digital Library of India Club (NDLI Club)	Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach	clubsupport@ndl.gov.in
8	Digital Preservation Centre (DPC)	Assistance with digitizing and archiving copyright-free printed books	dpc@ndl.gov.in
9	IDR Setup or Support	Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops	idr@ndl.gov.in