Policy iteration in a two-dimensional ruin problem

Stajic, Stefan

doi:10.34726/hss.2022.83925

Record link:

https://doi.org/10.34726/hss.2022.83925
http://hdl.handle.net/20.500.12708/19305

Title:

Policy iteration in a two-dimensional ruin problem

Citation:

Stajic, S. (2022). Policy iteration in a two-dimensional ruin problem [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2022.83925

reposiTUm DOI:

10.34726/hss.2022.83925

CatalogPlus:

AC16421463

Publication Type:

Thesis - Diplomarbeit

Language:

English

Authors:

Stajic, Stefan

Advisor:

Grandits, Peter

Organisational Unit:

E105 - Institut für Stochastik und Wirtschaftsmathematik

Date (published):

2022

Number of Pages:

137

Keywords:

Ruinprobleme; Kontrolltheorie

ruin problems; control theory

Abstract:

In der vorliegenden Arbeit geht es um die Anwendung und numerische Umsetzung der Policy-Iterationsmethode auf ein zweidimensionales stochastisches Kontrollmodell mit zwei unabhängigen Brownschen Bewegungen, die das Verhalten von zwei Firmen auf einem Markt beschreiben, wie es in dem Artikel von McKean und Shepp [18] vorgestellt wurde. Im Artikel wurde nachgewiesen, dass die gewählte Strategie, anhand derer die Unternehmen durch den Staat in Form von Steuererleichterungen gefördert werden, je nach Optimierungskriterium variiert. Dabei stellt sich heraus, dass, wenn das Ziel darin besteht, die gemeinsame Überlebenswahrscheinlichkeit beider Unternehmen zu maximieren, die Push-Bottom Strategie, d.h. die Strategie, die das schwächere Unternehmen zum betrachteten Zeitpunkt unterstützt, die optimale Strategie ist. Besteht das Kriterium hingegen in der Maximierung der erwarteten Anzahl von Unternehmen, die zahlungsfähig bleiben, ist die Anwendung der Push-Top Strategie teilweise geeigneter als die Push-Bottom Strategie. Nun stellt sich die Frage, ob die optimale Strategie für das jeweilige Optimierungsproblem auch dann gefunden werden kann, wenn die Unternehmen zu Beginn entsprechend einer willkürlichen Strategie durch die Regierung gefördert werden. Da die zugrunde liegende partielle Differentialgleichung für die Wertfunktion nichtlinear ist, können Standardmethoden für lineare partielle Differentialgleichungen nicht verwendet werden. Aus diesem Grund verwenden wir die Policy-Iterationsmethode, bei der in jedem Schritt eine verbesserte Strategie hergeleitet und die Lösung der dabei auftretenden linearen partiellen Differentialgleichung mit Hilfe von Differenzenquotienten und des Überrelaxationsverfahrens, auch SOR-Verfahren genannt, approximiert wird.

This paper is concerned with the application and numerical implementation of the policy iteration method to a two-dimensional stochastic control model with two independent Brownian motions describing the behavior of two firms in a market, as presented in the article by McKean and Shepp [18]. In the article it was shown that the chosen strategy by which the companies are supported by the government in the form of tax breaks varies depending on the optimization criterion. It turns out that if the objective is to maximize the joint survival probability of both firms, the push-bottom strategy, i.e., the strategy that supports the weaker firm at the considered time, is the optimal strategy. If, on the other hand, the criterion is to maximize the expected number of firms that remain solvent, the application of the push-top strategy is partially more suitable than the push-bottom strategy. Now the question arises whether the optimal strategy for the respective optimization problem can be found even if the companies are subsidized by the government at the beginning according to an arbitrary strategy. Since the underlying partial differential equation for the value function is nonlinear, standard methods for linear partial differential equations cannot be used. For this reason, we use the policy iteration method, in which an improved policy is derived at each step and the solution of the thereby observed linear partial differential equation is approximated using difference quotients and the successive over-relaxation method.

Additional information:

Abweichender Titel nach Übersetzung der Verfasserin/des Verfassers

License:

In Copyright

Appears in Collections:

Thesis