Текущий выпуск Выпуск 1, 2025 Том 35

Все выпуски

Результыты поиска по 'non-anticipative strategies':

Найдено статей: 5

Авербух Ю.В.
Рандомизированное равновесие по Нэшу в дифференциальных играх, с. 299-308

Работа посвящена исследованию равновесия по Нэшу в неантагонистической детерминированной дифференциальной игре двух лиц в классе рандомизированных стратегий. Предполагается, что игроки информированы об управлении своего партнера, реализовавшегося к текущему времени. Поэтому игра формализуется в классе рандомизированных квазистратегий. В работе получена характеризация множества выигрышей (пар ожидаемых выигрышей игроков) в ситуациях равновесия по Нэшу с использованием вспомогательных антагонистических игр. Показано, что множество выигрышей в ситуациях рандомизированного равновесия по Нэшу является выпуклой оболочкой множества выигрышей в классе детерминированных стратегий. Приведен пример, показывающий дополнительные возможности, которые возникают при переходе к рандомизированным стратегиям.

дифференциальные игры, равновесие по Нэшу, рандомизированные стратегии

Averboukh Y.V.
Randomized Nash equilibrium for differential games, pp. 299-308

The paper is concerned with the randomized Nash equilibrium for a nonzero-sum deterministic differential game of two players. We assume that each player is informed about the control of the partner realized up to the current moment. Therefore, the game is formalized in the class of randomized non-anticipative strategies. The main result of the paper is the characterization of a set of Nash values considered as pairs of expected players' outcomes. The characterization involves the value functions of the auxiliary zero-sum games. As a corollary we get that the set of Nash values in the case when the players use randomized strategies is a convex hull of the set of Nash values in the class of deterministic strategies. Additionally, we present an example showing that the randomized strategies can enhance the outcome of the players.

differential games, Nash equilibrium, randomized strategies
Гомоюнов М.И., Серков Д.А.
Неупреждающие стратегии в задачах оптимизации гарантии при функциональных ограничениях на помехи, с. 553-571

Для динамической системы, управляемой в условиях помех, рассматривается задача оптимизации гарантированного результата. Особенностью задачи является наличие функциональных ограничений на помехи, при которых свойство замкнутости множества допустимых помех относительно операции «склейки» двух его элементов, вообще говоря, отсутствует. Это обстоятельство препятствует непосредственному применению методов теории дифференциальных игр для исследования задачи и тем самым приводит к необходимости их походящей модификации. В работе предложено новое понятие неупреждающей стратегии управления (квазистратегии). Доказано, что соответствующий функционал оптимального гарантированного результата удовлетворяет принципу динамического программирования. Как следствие, установлены так называемые свойства $u$- и $v$-стабильности этого функционала, которые в дальнейшем позволят построить конструктивное решение задачи в позиционных стратегиях.

оптимизация гарантии, функциональные ограничения, неупреждающие стратегии, принцип динамического программирования

Gomoyunov M.I., Serkov D.A.
Non-anticipative strategies in guarantee optimization problems under functional constraints on disturbances, pp. 553-571

For a dynamical system controlled under conditions of disturbances, a problem of optimizing the guaranteed result is considered. A feature of the problem is the presence of functional constraints on disturbances, under which, in general, the set of admissible disturbances is not closed with respect to the operation of “gluing up” of two of its elements. This circumstance does not allow to apply directly the methods developed within the differential games theory for studying the problem and, thus, leads to the necessity of modifying them appropriately. The paper provides a new notion of a non-anticipative control strategy. It is proved that the corresponding functional of the optimal guaranteed result satisfies the dynamic programming principle. As a consequence, so-called properties of $u$- and $v$-stability of this functional are established, which may allow, in the future, to obtain a constructive solution of the problem in the form of feedback (positional) controls.

guarantee optimization, functional constraints, non-anticipative strategies, dynamic programming principle
Ченцов А.Г., Хачай Д.М.
Оператор программного поглощения и релаксация дифференциальной игры сближения-уклонения, с. 64-91

В настоящей работе рассматривается естественная релаксация игровой задачи наведения. А именно, для двух замкнутых множеств - параметров задачи - решается аналогичная задача о наведении для $\varepsilon$-окрестностей данных множеств. Нас интересует наименьший размер таких окрестностей, для которых игрок I может решить задачу наведения в классе обобщенных квазистратегий. Для построения решения используется модификация метода программных итераций. Вышеупомянутый размер окрестностей находится как функция позиции и в дальнейшем определяется путем применения специальной итерационной процедуры. Также в работе показано, что искомая функция является неподвижной точкой оператора, определяющего данную процедуру.

дифференциальная игра сближения-уклонения, метод программных итераций, гарантированный результат

Chentsov A.G., Khachai D.M.
Relaxation of pursuit-evasion differential game and program absorption operator, pp. 64-91

We consider some natural relaxation of pursuit-evasion differential game. For two closed sets, which are parameters, similar guidance problem for $\varepsilon$-neighborhoods is being solved. We are interested in finding a minimal size of such neighborhoods, which allows player I successfully solve his guidance problem in the class of generalized non-anticipating strategies. To resolve above-mentioned differential game, a modification of Program Iterations Method is implemented. Size of the neighborhoods is found as a position function and it's defined by application of special iterative procedure further below. As a corollary, it is shown that desired function is a fixed point of the open-loop operator, which defines the procedure.

pursuit-evasion differential game, program iterations method, guaranteed result
Гомоюнов М.И., Серков Д.А.
Об оптимизации гарантии в задаче управления с конечным множеством помех, с. 613-628

В статье изучается задача управления в условиях помех, которая формулируется как задача оптимизации гарантированного результата. В отличие от классической постановки таких задач предполагается, что множество допустимых помех конечно и состоит из кусочно-непрерывных функций. С учетом этого дополнительного функционального ограничения на помеху определяется подходящий класс неупреждающих стратегий (квазистратегий) управления и рассматривается соответствующая величина оптимального гарантированного результата. При некотором техническом предположении о свойстве различимости допустимых помех доказывается, что этот результат может быть достигнут путем использования стратегий управления с полной памятью. Как следствие, устанавливается неулучшаемость класса стратегий с полной памятью. Ключевым элементом доказательства является процедура распознавания действующих в системе помех, которая позволяет всякой неупреждающей стратегии поставить в соответствие близкую по гарантированному результату стратегию с полной памятью. В заключение статьи приводится иллюстрирующий пример.

управление в условиях помех, оптимальная гарантия, неупреждающая стратегия, стратегия с полной памятью, распознавание помех, неулучшаемость

Gomoyunov M.I., Serkov D.A.
On guarantee optimization in control problem with finite set of disturbances, pp. 613-628

In this paper, we deal with a control problem under conditions of disturbances, which is stated as a problem of optimization of the guaranteed result. Compared to the classical formulation of such problems, we assume that the set of admissible disturbances is finite and consists of piecewise continuous functions. In connection with this additional functional constraint on the disturbance, we introduce an appropriate class of non-anticipative control strategies and consider the corresponding value of the optimal guaranteed result. Under a technical assumption concerning a property of distinguishability of the admissible disturbances, we prove that this result can be achieved by using control strategies with full memory. As a consequence, we establish unimprovability of the class of full-memory strategies. A key element of the proof is a procedure of recovering the disturbance acting in the system, which allows us to associate every non-anticipative strategy with a full-memory strategy providing a close guaranteed result. The paper concludes with an illustrative example.

control problem under disturbances, optimal guaranteed result, non-anticipative strategy, full-memory strategy, recovery of disturbances, unimprovability
Серков Д.А.
О построении частично неупреждающего мультиселектора и его использовании в задачах динамической оптимизации, с. 410-434

В контексте задач гарантированного управления рассматриваются следующие вопросы: связь возможности пошагового (на заданном разбиении $\Delta$) вычисления селектора мультифункции (м/ф) $\alpha$ для неизвестного, восстанавливаемого по шагам $\Delta$, аргумента с существованием у $\alpha$ мультиселектора (м/с) со специальным свойством (названым здесь $\Delta$-неупреждаемостью или частичной неупреждаемостью); второй вопрос — способы построение такого м/с для произвольной пары $(\alpha, \Delta)$; и последний — поиск эффективно проверяемых условий, обеспечивающих совпадение $\Delta$-неупреждающего м/с с неупреждающим.

Мотивом к рассмотрению этих вопросов послужила схема управления, возникающая, например, в методе альтернированного интеграла, при использовании в управлении контрстратегий, или в некоторых задачах при использовании метода управления с поводырём.

В работе показано, что рассматриваемая пошаговая схема управления реализуема тогда и только тогда, когда м/ф $\alpha$ имеет $\Delta$-неупреждающий и непустозначный м/с. Дана конечношаговая процедура построения такого м/с. Указаны эффективно проверяемые условия, обеспечивающие неупреждаемость частично неупреждающего м/с. Рассмотрены иллюстрирующие примеры.

неупреждающие мультиселекторы, многозначные стратегии, оптимизация гарантированного результата

Serkov D.A.
On the construction of partially non-anticipative multiselector and its application to dynamic optimization problems, pp. 410-434

Let sets of functions $Z$ and $\Omega$ on the time interval $T$ be given, let there also be a multifunction (m/f) $\alpha$ acting from $\Omega$ to $Z$ and a finite set $\Delta$ of moments from $T$. The work deals with the following questions: the first one is the connection between the possibility of stepwise construction (specified by $\Delta$) of a selector $z$ of $\alpha(\omega)$ for an unknown step-by-step implemented argument $\omega\in\Omega$ and the existence of a multiselector (m/s) $\beta$ of the m/f $\alpha$ with a non-anticipatory property of special kind (we call it partially or $\Delta$-non-anticipated); the second question is when and how non-anticipated m/s could be expressed by means of partially non-anticipated one; and the last question is how to build the above $\Delta$-non-anticipated m/s $\beta$ for a given pair $(\alpha,\Delta)$.

The consideration of these questions is motivated by the presence of such step-by-step procedures in the differential game theory, for example, in the alternating integral method, in pursuit-evasion problems posed with use of counter-strategies, and in the method of guide control.

It is shown that the step-by-step construction of the value $z\in\alpha(\omega)$ can be carried out for any steps-implemented argument $\omega$ if and only if the above m/s $\beta$ is non-empty-valued. The key point of the work is the description of finite-step procedure for calculation of this $\Delta$-non-anticipated m/s $\beta$. Conditions are given that guarantee the m/s $\beta$ be a non-anticipative one. Illustrative examples are considered that include, in particular, control problems with disturbance.

non-anticipative multi-selectors, set-valued strategies, optimization of guarantee

Журнал индексируется в Web of Science (Emerging Sources Citation Index)

Журнал индексируется в

Журнал входит в базы данных zbMATH, MathSciNet

Журнал включен в базу данных Russian Science Citation Index (RSCI) на платформе Web of Science

Журнал входит в систему Российского индекса научного цитирования.

Журнал включен в перечень ВАК.

Электронная версия журнала на Общероссийском математическом портале Math-Net.Ru.

Журнал включен в