eprintid: 1676
rev_number: 10
eprint_status: archive
userid: 46
dir: disk0/00/00/16/76
datestamp: 2013-09-11 10:28:00
lastmod: 2013-09-16 12:02:59
status_changed: 2013-09-11 10:28:00
type: conference_item
metadata_visibility: no_search
creators_name: Gnecco, Giorgio
creators_name: Gaggero, Mauro
creators_name: Sanguineti, Marcello
creators_name: Zoppoli, Riccardo
creators_id: giorgio.gnecco@imtlucca.it
creators_id: 
creators_id: 
creators_id: 
title: Dynamic Programming And Value-Function Approximation With
Application To Optimal Consumption
ispublished: pub
subjects: QA75
divisions: CSA
full_text_status: none
pres_type: paper
note: 43rd Conference of the Italian Operational Research Society
abstract: Sequential decision problems are considered, where a reward additive over a number of stages has to be maximized. Instances arise in scheduling eets of vehicles,
allocating resources, selling assets, optimizing transportation or telecommunication networks, inventory forecasting, financial planning, etc. At each stage, Dynamic Programming (DP) introduces the value function, which gives the value of the reward to be incurred at the next stage, as a function of the state at the current stage. The solution is formally obtained via recursive equations. However, closed-form solutions can be derived only in particular cases. We investigate how DP and suitable approximations of the value functions can be combined, providing a methodology to face high-dimensional sequential
decision problems. Approximations of the value functions are considered, expressed as linear combinations of basis functions obtained from a "mother function" (e.g., the Gaussian), by varying some "inner parameters" (e.g., variance and center coordinates) [1-5]. The accuracies of such suboptimal solutions are estimated. It is shown that
one can cope with the \curse of dimensionality" in value-function approximation (i.e., an exponential growth of the number of basis functions, required to guarantee a desired solution accuracy). The theoretical analysis is applied to a multidimensional version of the optimal consumption problem. (In the classical version, a consumer aims at maximizing the discounted value of the consumption of a good, given a time horizon, a sequence of interest rates, an initial wealth, and an income earned at each stage. Here, more consumers are considered.) The proposed approximation scheme is compared with classical linear approximators, i.e., linear combinations of a-priori
fixed basis functions. It is shown via simulations that the our approach provides a better solution accuracy, the number of computational units being the same as in fixed-basis approximation.
date: 2012-09
date_type: published
pagerange: 159
event_title: AIRO 2012
event_location: Vietri sul Mare, Italy
event_dates: September 4th-7th, 2012
event_type: conference
refereed: TRUE
related_url_url: http://www.airo2012.it/index.html
citation:   Gnecco, Giorgio and Gaggero, Mauro and Sanguineti, Marcello and Zoppoli, Riccardo  Dynamic Programming And Value-Function Approximation With Application To Optimal Consumption.  In: AIRO 2012, September 4th-7th, 2012, Vietri sul Mare, Italy p. 159.        (2012)