Temat: adaptive control - Prolib Integro

Skocz do pozycji: 1.

Tytuł:: On adaptive control of Markov chains using nonparametric estimation
Autorzy:: Drabik, Ewa
Stettner, Łukasz
Tematy:: controlled Markov chain
estimation
adaptive control; Pokaż więcej
Data publikacji:: 2000
Powiązania:: https://bibliotekanauki.pl/articles/1208175.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 2000, 27, 2; 143-152
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: Two adaptive procedures for controlled Markov chains which are based on a nonparametric window estimation are shown.
Dostawca treści:: Biblioteka Nauki

Artykuł

Skocz do pozycji: 2.

Tytuł:: A generalization of Uenos inequality for n-step transition probabilities
Autorzy:: Nowak, Andrzej
Tematy:: adaptive control
transition probabilities
stochastic control
Markov chains; Pokaż więcej
Data publikacji:: 1998
Powiązania:: https://bibliotekanauki.pl/articles/1338973.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 1998-1999, 25, 3; 295-299
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: We provide a generalization of Ueno's inequality for n-step transition probabilities of Markov chains in a general state space. Our result is relevant to the study of adaptive control problems and approximation problems in the theory of discrete-time Markov decision processes and stochastic games.
Dostawca treści:: Biblioteka Nauki

Artykuł

Skocz do pozycji: 3.

Tytuł:: Adaptive control of discrete time Markov processes by the large deviations method
Autorzy:: Duncan, T.
Pasik-Duncan, B.
Stettner, Łukasz
Tematy:: discrete time controlled Markov processes
adaptive control
large deviations; Pokaż więcej
Data publikacji:: 2000
Powiązania:: https://bibliotekanauki.pl/articles/1208162.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 2000, 27, 3; 265-285
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: Some discrete time controlled Markov processes in a locally compact metric space whose transition operators depend on an unknown parameter are described. The adaptive controls are constructed using the large deviations of empirical distributions which are uniform in the parameter that takes values in a compact set. The adaptive procedure uses a finite family of continuous, almost optimal controls. Using the large deviations property it is shown that an adaptive control which is a fixed almost optimal control after a finite time is almost optimal with probability nearly 1.
Dostawca treści:: Biblioteka Nauki

Artykuł

Skocz do pozycji: 4.

Tytuł:: Recursive self-tuning control of finite Markov chains
Autorzy:: Borkar, Vivek
Tematy:: controlled Markov chains
stochastic approximation
relative value iteration
self-tuning control
adaptive control; Pokaż więcej
Data publikacji:: 1997
Powiązania:: https://bibliotekanauki.pl/articles/1339274.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 1996-1997, 24, 2; 169-188
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: A recursive self-tuning control scheme for finite Markov chains is proposed wherein the unknown parameter is estimated by a stochastic approximation scheme for maximizing the log-likelihood function and the control is obtained via a relative value iteration algorithm. The analysis uses the asymptotic o.d.e.s associated with these.
Dostawca treści:: Biblioteka Nauki

Artykuł

Skocz do pozycji: 5.

Tytuł:: Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost
Autorzy:: Borkar, V.
Associate, S.
Tematy:: time-averaged cost
adaptive control
asymptotic optimality
cost-biased estimate
Bayesian estimation; Pokaż więcej
Data publikacji:: 1998
Powiązania:: https://bibliotekanauki.pl/articles/1339007.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 1998-1999, 25, 3; 339-358
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: This paper considers Bayesian parameter estimation and an associated adaptive control scheme for controlled Markov chains and diffusions with time-averaged cost. Asymptotic behaviour of the posterior law of the parameter given the observed trajectory is analyzed. This analysis suggests a "cost-biased" estimation scheme and associated self-tuning adaptive control. This is shown to be asymptotically optimal in the almost sure sense.
Dostawca treści:: Biblioteka Nauki

Artykuł

Skocz do pozycji: 6.

Tytuł:: On adaptive control of a partially observed Markov chain
Autorzy:: Di Masi, Giovanni
Stettner, Łukasz
Tematy:: uniform ergodicity
long run average cost
filtering process
adaptive control
approximate filter
partially observed systems; Pokaż więcej
Data publikacji:: 1994
Powiązania:: https://bibliotekanauki.pl/articles/1340597.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 1993-1995, 22, 2; 165-180
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: A control problem for a partially observable Markov chain depending on a parameter with long run average cost is studied. Using uniform ergodicity arguments it is shown that, for values of the parameter varying in a compact set, it is possible to consider only a finite number of nearly optimal controls based on the values of actually computable approximate filters. This leads to an algorithm that guarantees nearly selfoptimizing properties without identifiability conditions. The algorithm is based on probing control, whose cost is additionally assumed to be periodically observable.
Dostawca treści:: Biblioteka Nauki

Artykuł

Skocz do pozycji: 7.

Tytuł:: Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion
Autorzy:: Minjárez-Sosa, J.
Tematy:: Markov control process
discounted and average cost criterion
adaptive policy; Pokaż więcej
Data publikacji:: 1999
Powiązania:: https://bibliotekanauki.pl/articles/1338768.pdf Link otwiera się w nowym oknie
Źródło:: Applicationes Mathematicae; 1999, 26, 3; 267-280
1233-7234
Pojawia się w:: Applicationes Mathematicae
Opis:: We introduce average cost optimal adaptive policies in a class of discrete-time Markov control processes with Borel state and action spaces, allowing unbounded costs. The processes evolve according to the system equations $x_{t+1}=F(x_t,a_t,ξ _t)$, t=1,2,..., with i.i.d. $ℝ^k$-valued random vectors $ξ_t$, which are observable but whose density ϱ is unknown.
Dostawca treści:: Biblioteka Nauki

Artykuł

Informacja