Berry D.A., Fristedt B. — Bandit problems :: Электронная библиотека попечительского совета мехмата МГУ

Главная Ex Libris Книги Журналы Статьи Серии Каталог Wanted Загрузка ХудЛит Справка Поиск по индексам Поиск Форум

Авторизация

Поиск по указателям

Berry D.A., Fristedt B. — Bandit problems

Обсудите книгу на научном форуме

Нашли опечатку?
Выделите ее мышкой и нажмите Ctrl+Enter

Название: Bandit problems

Авторы: Berry D.A., Fristedt B.

Аннотация:

Our purpose in writing this monograph is to give a comprehensive treatment of the subject. We define bandit problems and give the necessary foundations in Chapter 2. Many of the important results that have appeared in the literature are presented in later chapters; these are interspersed with new results. We give proofs unless they are very easy or the result is not used in the sequel. We have simplified a number of arguments so many of the proofs given tend to be conceptual rather than calculational. All results given have been incorporated into our style and notation.

Язык:

Рубрика: Математика/

Статус предметного указателя: Готов указатель с номерами страниц

ed2k: ed2k stats

Год издания: 1985

Количество страниц: 275

Добавлена в каталог: 04.12.2005

Операции: Положить на полку | Скопировать ссылку для форума | Скопировать ID

Предметный указатель

-metric      24
-norm      24
Abdel Hamid, A.R.      207
Adaptive reinvestment      236
Admissible strategy      see “Strategy admissible”
Advantage of an arm      70 151 152
Anscombe, F.J.      208 211 219 221 222 223 233 241 243
Antle, C.E.      3 8 210 227 242 255
ARM      1
Arm, advantage of      70
Arm, definition of      9
Arm, infinitely many      242
Arm, optimal      2 4 17 27 70 227
Arm, risky      61 214
Armitage, P.      208
Asano, C.      221 231
Asymptotic optimality      208—211 221 222 227 229 232 234 238 241—243 247—249 251 254—258 260
Backwards induction      see “Dynamic programming”
Bandit      1 11 17 62 223
Bandit with a goal      213 214
Bandit, arm-acquiring      224 250 255
Bandit, continuous-time      7 131 166 179
Bandit, discrete-time      192
Bandit, finite-memory      3 7 207 222 223 231 241 242 248 249 251 256 257
Bandit, finite-state      202 231 257
Bandit, quasi-      139
Barnett, B.N.      3 7 63 64
Bather, J.A.      viii 178 189 197 201 206 209 210 211 213 220 221 228 229 230 242 243 245 247 250 252 254 259
Bayesian approach      2 4 191 208 210
Beckmann, M.J.      211 225 230 231
Begg, C.      8 211 219 221 241 246
Bellman, R.      2 7 86 88 134 210 212 213 227 228 229 240 251 257 259
Benzig, H.      212 231 232 239 240
Berger, J.      0 135
Bernoulli arm      211—224 28 30 34 35 36 37 38 65 82 107 129 134 142 150 192 209—219 223—226 228—240 243—245 247—249 251—256 259 260
Bernoulli arm, moments of      72
Berry, D.A.      50 64 73 82 88 90 92 99 107 125 128 129 131 134 150 165 210 212 213 214 215 217 218 220 221 223 224 225 228 233 234 235 236 238 239 240 245 248 252 253 254 255 256 259 260
Beta distribution      3 74 126 153 158 161 213 228 229 234 235 240 244 245 247 250 255
Blackwell, D.      236
Borel field      13—15
Borel subset      10 13 16 53 169
Boundary function      175
Bradt, R.N.      2 3 7 28 49 86 88 92 114 134 155 165 207 210 212 213 215 220 223 225 235 237 239 240 248 257 259
Brand, H.      215 216 217 244 259
Break-even observation      128
Break-even value of a known arm      39 86 100 107 123 127 133 172 210 212 227—229 238 250 255 256
Break-even value, calculation of      39 107 113 114 129 219 238
Break-even value, continuity of      101
Break-even value, effect of failure on      104
Break-even value, effect of success on      see “Staying with a winner”
Break-even value, existence of      86 100 123 133
Break-even value, formula for      101
Break-even value, graph of      109 110
Break-even value, lower bounds for      110 240
Break-even value, monotonicity of      87 102
Break-even value, upper bound for      114 240
Breakwell, J.      175 178 189 211 216 220
Brownian arm      167 175 182 184 185 201 210 218 220
Brownian motion      167 168 170—175 182 202 210 218
Bush, R.R.      216 217 232 233 244 251
Buyukkoc, C.      228 230 249 253 255 256
Cane, V.R.      217 251
Canner, P.L.      208 217 221 238 244 260
Cardiac arrhythmias      260
Chandrasekaran, B.      222 241 249
Chernoff, H.      167 175 178 179 189 211 216 217 218 219 220 221 227 246
Chow, Y.S.      20 49 258
Christensen, R.      134
Chung, F.      220
Classical statistical tests      208 221 233
Clayton, M.K.      viii 63 125 128 131 134 220
Clinical trial      1 5 51 59 63 208 211 213 214 218—221 223 225 226 229 231 232 240 241 244—246 253 259
Colton, T.      207 208 209 211 215 218 219 220 221 222 223 226 227 231 232 238 241 242 243 244 245 246 249 253 254 255 260 261
complete      14 15
Computational technique, numerical      47 114
Computer program      114
Concomitant variable      258
Conjecture of Berry      164 165 235 236
Conjecture of Chernoff      218 227
Conjecture of Clayton and Berry      131
Conserving selection      45
Continuity of $\Lambda$       101
Continuity of V      40 42 66 83
Continuity of W      45
Continuous time      7 59 218 246 247 253
Continuous-time approximation      83 219 220 242 246
Convergence in distribution      13 14 15 19 53
Cornfield, J.      208 221 222 260
Cost of switching      167 239
Counter example to continuity of V      42 56
Counter example to monotonicity of       36
Counter example to monotonicity of $\Delta$ in       80
Counter example to monotonicity of $\Delta$ in $\lambda$       87
Counter example to optimality of myopic strategy      4 30 34 215
Counter example to optimality of staying with a known arm      89 90
Counter example to optimality of staying with a winner      28
Counter example to optimality of staying with the known arm      89 90
Counter example to transitivity of preference among arms      137
Cover, T.M.      222 249
Cox, T.F.      230
Day, N.E.      221 223
DeGroot, M.H.      11 12 49 80 82 223 225 238 250 260
Delayed information      225 241 250
Dellacherie, C.      181 190
Dempster, M.A.H.      228
Dependent arms      28 65 81 212 222 224 232 238 240 243 247 248 250 251 255
Diffusion      174 177 210 238
Dirichlet measure      85 125 220
discount factor      1 225
Discount factor, nonobservable      53
Discount factor, observable      55
Discount function      60 131 180
Discount function, exponential      60 131 134 167 185 202 210 238
Discount function, regular      132
Discount function, shifted      132
Discount function, uniform      60 134 175 218—220
Discount sequence, condition for regularity      90
Discount sequence, definition of      9
Discount sequence, finite horizon      4
Discount sequence, geometric      2 5 6 47 50 51 54 91 107 108 113 115 117 120 136 197 212 215 218 223—225 227—231 235—241 244 250 251 253 255—259
Discount sequence, mixture of      51
Discount sequence, mixture of geometrics      54
Discount sequence, monotone      76 85
Discount sequence, nonmonotone      63
Discount sequence, norm of      24
Discount sequence, random      6 50 213 253 256
Discount sequence, random equivalent to nonrandom      54
Discount sequence, regular      90 99 111 120
Discount sequence, role of      50
Discount sequence, space of      24
Discount sequence, superregular      90 214
Discount sequence, topology for      24
Discount sequence, topology for random      53
Discount sequence, truncated geometric      235 239 250
Discount sequence, uniform      2—4 9 50 54 91 108 113 119 126 136 150 175 199 209 212—15 217—219 221 223—27 232—45 247—50 252—55 259
Discount sequence, unknown      50
Discrete-time approximation      61 134 175 219
Dose response      260
Drift process, deterministic      167 175
Drift process, random      167 175 220
Dubins, L.E.      19 45 49 223 225
Dynamic allocation index      see “Break-even value of a known arm”
Dynamic programming      4 6 9 25 36 46 114 212 219 224 233 235 248 252
Dynamic programming, advantages      106
Dynamic programming, disadvantages      106
Dynamic programming, fundamental equation of      27
Eick, S.G.      viii 20 36 49
Electronic device      216 243
Emrich, L.J.      224
Equal allocation      208 211 223 226 238 243—245 254 260

Estes, W.K.      216 217 224 233 244
Excessivity      223
Experimental phase      208 211 217 219 221—223 226 231 241 244 245 254—256 260
Exponential arm      225 227 229 232
Fabius, J.      81 82 210 211 224 225 231 238 242 250 254 260
Fahrenholtz, S.K.      224
Feldman, D.      2 4 7 80 81 82 210 214 215 219 222 223 224 225 238 240 243 245 248 250 254 255 259 260
Feller, W.      198 206
Ferguson, T.S.      viii 85 125 126 135
Finite horizon      2 9 24
Fischer, J.      225 230 231 240
Fisk, G.      243
Flehinger, B.J.      221 225 226 242 249
Fox, B.L.      209 210 221 226 227 245 249 250 253 255
Free boundary problem      178 217 220
Freedman, D.      19 49
Fristedt, B.      88 90 92 99 107 134 171 190 202 206 210 212 213 214 215 220 223 228 234 235 237 240 253 254 259
Fu, K.S.      222
Furukawa, N.      227
Gait, P.A.      227
Gambling, fundamental theorem, of      225
Game-theoretic approach      191
Gani, J.      7 135 149 228
Generator      174
Gittins index      see “Break-even value of a known arm”
Gittins — Jones theorem      6 139 227 228 230 238 244 249 250 253 255 256
Gittins — Jones theorem, converse to      145
Gittins, J.C.      vi viii 6 7 85 86 135 136 137 138 145 149 210 211 212 213 214 218 224 227 228 229 230 235 238 239 244 249 250 251 253 254 255 256 259
Glazebrook, K.D.      210 225 228 229 230 240 249 255 256
Goto, M.      221 231
Gray, K.B.Jr.      231 248
Greenhouse, S.W.      208 221 222 260
Gupta, S.      135
Halperin, M.      208 221 222 260
Hamada, T.      231
Heath, D.C.      viii 182 190
Hellman, M.E.      222 249
Hengartner, W.      63 212 240
Herkenrath, U.      63 210 213 228 230 231 232 235 243 257
Hill, C.      232
Hinderer, K.      212 231 232 240
History of observations      1 4 10 16 62
Horizon      2 9 50 208
Horizon, definition of      24
Horizon, equals two      152
Horowitz, A.D.      216 217 233 244
Improper prior      13 156
Independent arms      65 136 150
Independent increments      179
Infinite horizon      9
Infinitely divisible      179
information      2 5 7 10 11 15 63 154 179 225 241 250
Inter-arrival time      60
Iosifescu, M.      233
Isbell, J.R.      3 7 233 249 251 252
Isoptin      260
Ito's formula      173
Ito, K.      247
Jain, N.C.      viii 49
Job seleCtion      253
Johnson, N.L.      213 252
Johnson, S.M.      2 3 7 28 49 86 88 92 114 134 155 165 207 210 212 213 215 220 223 225 235 237 239 240 244 249 250 251 253 255 256 259
Jones, D.M.      vi viii 6 7 85 86 135 136 137 138 145 149 211 212 213 225 227 228 229 230 234 235 238 239
Jones, P.W.      234 235 237 245
Joshi, V.M.      164 165 235 236
Kadane, J.B.      81 82
Kakigi, R.      212 236
Kalaba, R.E.      236 252
Kalin, D.      63 210 212 213 228 230 231 235 236 237 240 243 257
Kandeel, H.A.      234 235 237 245
Karatzas, I.      174 185 90 211 238
Karlin, S.      2 3 7 28 49 86 88 92 114 134 155 165 207 210 212 213 215 220 223 225 235 237 239 240 248 257 259
Keener, R.W.      238 248 250
Kelley, T.A.      81 82 218 221 223 225 238 239 247 250 260
Kelly, F.P.      210 238 258
Kemperman, J.H.B.      248
Klimko, L.A.      3 8 210 227 242 255
Known arm      4 6 9 38 83 168 186 209 210 212—215 218 220 223 225 227 230 231 234 235 237 239 240 242 245 253 256—259
Known arm, staying with      see “Stopping problem”
Kolonko, M.      212 231 232 239 240
Kono, K.      240
Kotz, S.      213 252
Kumar, P.R.      240
Ladder epochs of random walks      238
Lai, T.L.      208 221 240 241
Lakshmanan, K.B.      241 249
Lakshmivarahan, S.      242 249
Langenberg, P.      221 249
Learning models      224 232 242 251 259
Learning models, linear      217 233
Learning models, nonlinear      233
Least-failures rule      239
Lee, R.D.      208 221 253
Lellouch, J.      221 244 245
Lenstra, J.K.      228
Levin, B.      221 240 241
Levy arm      179 184 186
Levy measure      186—188
Levy process      7 167 168 179—181 183 184 186 189
Likelihood ratio test      226 249
Lindley, D.V.      63 64
Lipster, A.S.      174 190
Locally compact      14 15
Louis, T.A.      221 225 226 242 249
m-Decreasing, definition of      76
m-Decreasing, strictly      76
m-Greater than      73 159
m-Increasing, definition of      76
m-Increasing, strictly      76
Mallows, C.I.      242
Market pricing problem      232 251
Markov environment      257
Markov process      177 228 254
Markov property      173
Maximin strategy      193 199 202 206
Maximizing length of success run      214 215
Maximum principle of Pontryagin      246
McCulloch, R.E.      viii
Medical ethics      208 219 233 244 245
Medical trials      see “Clinical trials”
Meeter, D.A.      221 242 255
Mehta, C.R.      211 219 221 241 246
Metrizable      14 15
Meybodi, M.R.      242 249
Meyer, P.-A.      181 190
Minimax approach      3 7 191 254
Minimax risk      210 211
Minimax strategy      193 196 202 206 210—211 217 224 251 253 254
Minimizing selections on an inferior arm      225 226 245 249 260
Mixed strategy      192 196 207 209 210 224 229 232 233 241 243 247 248 251—253
Monotonicity of A      86
Monotonicity of V      66 69 75 83
Monotonicity, of       36 98
Monotonicity, of $\Delta$       76 86
Monotonicity, of $\Lambda$       10 129
Monte Carlo      226 252
Morrison, D.F.      243
Mosteller, F.      216 217 223 232 233 244 251
Multi-phase design      222 243 256
Murray, F.S.      216 217 243
Myopic strategy      4 34 213—215 223—225 233—235 238 243—245 248 250 252 255 258 259 260
Nakajima, N.      244
Nash, P.      229 244 256
Ney, P.      190 206
Nonparametric arm      125 220
Nordbrock, E.      79 82
Normal arm      10 15 23 221 223 224 242 243 249 258
Normal distribution      10 24 129 175 179 220 224 258
Noshi, T.      244
Number of immediate failures tolerated      218

1 2

О проекте