An approximate dynamic programming approach for comparing firing policies in a networked air defense environment


Bibliographic details
Published in:	Computers & operations research 2020-05, Vol.117, p.104890-15, Article 104890
Main authors:	Summers, Daniel S., Robbins, Matthew J., Lunday, Brian J.
Format:	Article
Language:	English
Subjects:	Air and missile defense; Air defense; Algorithms; Approximate dynamic programming; Ballistic missiles; Computer Science; Computer Science, Interdisciplinary Applications; Computer simulation; Defense programs; Dynamic programming; Dynamic weapon target assignment problem; Engineering; Engineering, Industrial; Firing; Interceptors; Least squares; Markov decision processes; Markov processes; Military; Operations research; Operations Research & Management Science; Policies; Science & Technology; Technology; Weapons

Description
Summary:
• The threat of theater ballistic missiles remains an important concern for many nations.
• This research examines a networked, defense-in-depth air and missile defense problem.
• Two approximate dynamic programming (ADP) algorithms provide firing policy solutions.
• Designed experiments yield insights about algorithm performance and problem features.
• ADP policies outperform current U.S. policy of firing two interceptors at each missile.

An objective for effective air defense is to identify the firing policy for interceptor allocation to incoming missiles that minimizes the expected total damage to defended assets over a sequence of engagements. We formulate this dynamic weapon target assignment problem as a Markov decision process and utilize a simulation-based, approximate dynamic programming (ADP) approach to solve problem instances based on a representative scenario. Least squares policy evaluation and least squares temporal differences algorithms are developed to determine approximate solutions. A designed experiment investigates problem features such as conflict duration, attacker and defender weapon sophistication, and defended asset values. An empirical comparison of the ADP policies and two baseline policies (i.e., firing either one or two interceptors at each incoming theater ballistic missile (TBM)) yields several insights: the ADP policies outperform both baseline policies when conflict duration is short and attacker weapons are sophisticated; firing one interceptor at each TBM (regardless of inventory status) outperforms the tested ADP policies when conflict duration is long and attacker weapons are less sophisticated; and firing two interceptors at each TBM (regardless of inventory status), which is the United States Army’s currently implemented policy, is never the superlative policy for the test instances investigated.
ISSN:	0305-0548, 1873-765X
DOI:	10.1016/j.cor.2020.104890
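
Illustration:	The trade-off described in the summary, between fixed firing policies and a policy that accounts for the remaining interceptor inventory, can be sketched on a toy problem. The sketch below is not the article's model or its ADP algorithms: it assumes exactly one incoming missile per engagement, independent interceptor shots with a single kill probability, and a fixed damage value, and it solves the toy problem exactly by backward induction rather than by least squares policy evaluation or least squares temporal differences. All names and constants (`P_KILL`, `DAMAGE`, `MAX_SHOTS`, `fixed_policy_damage`, `optimal_damage`) are hypothetical.

```python
from functools import lru_cache

# Illustrative assumptions, not values from the article.
P_KILL = 0.5     # assumed single-shot kill probability of one interceptor
DAMAGE = 1.0     # assumed damage if the incoming missile is not intercepted
MAX_SHOTS = 2    # assumed cap on interceptors fired at one missile


def leak_damage(shots):
    """Expected damage from one missile when `shots` independent interceptors are fired."""
    return (1.0 - P_KILL) ** shots * DAMAGE


def fixed_policy_damage(shots_per_missile, inventory, engagements):
    """Expected total damage when a fixed number of interceptors is fired at
    each missile until the inventory runs out."""
    total = 0.0
    for _ in range(engagements):
        shots = min(shots_per_missile, inventory)
        inventory -= shots
        total += leak_damage(shots)
    return total


@lru_cache(maxsize=None)
def optimal_damage(inventory, engagements):
    """Minimum expected total damage, computed by backward induction over the
    state (remaining inventory, remaining engagements)."""
    if engagements == 0:
        return 0.0
    return min(
        leak_damage(shots) + optimal_damage(inventory - shots, engagements - 1)
        for shots in range(min(MAX_SHOTS, inventory) + 1)
    )


if __name__ == "__main__":
    for engagements in (2, 4):
        print(
            engagements,
            fixed_policy_damage(1, 4, engagements),
            fixed_policy_damage(2, 4, engagements),
            optimal_damage(4, engagements),
        )
```

With an inventory of four interceptors, firing two per missile gives the lower expected damage in this toy setting when there are two engagements (0.5 versus 1.0), while firing one per missile is better when there are four (2.0 versus 2.5). These numbers come only from the assumed constants above and say nothing about the article's test instances.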