Formal methods for risk analysis | Topics and Proposals

For background information on the topic as a whole, scroll to the end of this page.

Available project proposals

Some of the research in Formal methods for risk analysis is closely related to that of quantitative analysis, as it relies on stochastic model-checking techniques. Conversely, many examples and case studies for quantitative analysis come from risk management.

Attack tree metrics (Milan Lopuhaä-Zwakenberg, Reza Soltani)
Supervisor: Milan Lopuhaä-Zwakenberg and Reza Soltani
Contact: Milan Lopuhaä-Zwakenberg
Attack trees are a popular and useful tool to model a system's vulnerability to attackers. They are used in quantitative security analysis: they can be used to calculate security metrics such as the minimal time or resources an attacker needs to successfully compromise the system. As attack trees can grow quite large in practice, there is an ongoing need for algorithms that can calculate security metrics on a feasible timescale.
In this project, the student adds to this field of research by implementing proposed algorithms, inventing new ones, or analyzing the behaviour of existing methods. Some possible avenues for research are:
- Linear programming: many security metrics can be phrased as optimization problems (such as the minimal cost of a succesful attack). Therefore, a student could create new metric calculation algorithms based on optimization techniques such as linear programming.
- Multi-metric analysis: it often makes sense to consider the interplay between metrics: one attack may be more costly to the attacker than another, but do more damage in return. There are some ideas on how to co-analyze metrics, but these have hardly been implemented, and there is also plenty of room for new ideas.
- Attack tree properties: As the problem of calculating security metrics is NP-hard, many existing algorithms are of worst-case exponential complexity. However, it is unclear how these algorithms fare in practice, and what properties of the attack tree have an impact on algorithm complexity.
- Modular analysis: An underused idea in attack tree analysis is to split up the attack tree into so-called modules and analyze each module separately. This idea can be applied to many different algorithms, and it would be interesting to see what difference this makes on both a theoretical and an experimental level.
Of course, there are many other possibilities, and students are welcome to bring their own ideas to the table.
Combining data-driven and expert driven attack trees/graphs to discover “unknown unknowns” (Milan Lopuhaä-Zwakenberg, Azqa Nadeem)
Supervisors: Milan Lopuhaä-Zwakenberg, Azqa Nadeem
Contact: Milan Lopuhaä-Zwakenberg
Attack trees/graphs are models of attacker strategies that show all the possible pathways an attacker can follow to penetrate a network. Traditionally, these attack trees are generated using expert knowledge regarding vulnerabilities and network topologies. Recently, alert-driven attack graphs have been proposed as a novel data-driven approach to generate attack graphs without a priori expert knowledge. In this project, we aim to combine the two approaches in order to find potential missing paths in data-driven attack graphs and/or expert-driven attack trees. We apply this setting to a network of IoT devices. We build expert-driven attack trees from the IoT network specification. Next, we deploy a number of cyber attacks and collect network logs which are used to construct data-driven attack graphs. The final step is to compare the two approaches to find “unknown unknowns” in both approaches.
Combining Fault Trees and Bayesian Networks (Stefano Nicoletti, Reza Soltani)
Supervisors: Reza Soltani, Stefano Maria Nicoletti
Contact: Stefano Maria Nicoletti
Fault Tree (FT) models are formalisms typically used in reliability engineering, safety, and risk analysis. A known drawback of FTs is that they assume statistical independence between basic events. Nevertheless, in real engineering complex systems, it is common to have dependencies not only between components but also with environmental and operational conditions.
Thus, the goal of this project is to investigate to what extend it is possible to deal with this limitation by making an interconnection with a Bayesian Network (BN). BN are probabilistic graphical models widely used to model dependencies between random variables. Useful literature in this direction is [1] and [2].
General research question
To what extend is it possible to account for dependencies of random variables via a Bayesian Network connected to the basic events of a Fault Tree model?
References
[1] Thomas, S., & Groth, K. M. (2021). Toward a hybrid causal framework for autonomous vehicle safety analysis. Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, 1748006X211043310. https://doi.org/10.1177/1748006X211043310
[2] Moradi, R., & Groth, K. M. (2020). Modernizing risk assessment: A systematic integration of PRA and PHM techniques. Reliability Engineering & System Safety, 204, 107194. https://doi.org/10.1016/j.ress.2020.107194
Tasks
- Learn about Bayesian Networks and Fault Trees
- Investigate how to combine these models to account for statistical dependency
- Implement the approach, for example as a Python script.
Requirements
It is advisable to have the following knowledge:
- Knowledge in probability theory.
- Knowing about Bayesian Networks is a plus.
- Familiarity with the Python programming language.
Comparing attack graphs with business process models for modeling attacker strategies (Faiza Bukhsh, Milan Lopuhaä-Zwakenberg, Azqa Nadeem)
Supervisors: Faiza Bukhsh, Milan Lopuhaä-Zwakenberg, Azqa Nadeem
Contact: Milan Lopuhaä-Zwakenberg
Both attack graphs and business process models have been used in the literature to model attacker strategies. Alert-driven attack graphs are extracted from discrete markovian finite state automata that are great for modeling the semantic meaning of symbols, i.e., identical symbols that have different futures/pasts are modeled using different states. On the other hand, business process models are great for modeling concurrency, i.e., modeling simultaneous attacker actions. In this project, we aim to compare the merits of each approach in modeling attacker strategies from a dataset of intrusion alerts collected through a penetration testing competition. We also aim to combine the two approaches to get the best of both worlds, i.e., modeling concurrent attacker actions while also considering their semantic meaning.
Converting Attack-Fault tree to game automata (Milan Lopuhaä-Zwakenberg, Reza Soltani)
Supervisor: Reza Soltani, Milan Lopuhaä-Zwakenberg
Contact: Milan Lopuhaä-Zwakenberg
In the cyber-physical domain, safety and security have a complex relationship that entails trade-offs. Specifically, taking steps to improve safety may weaken security, and vice versa. Security and safety are frequently looked into separately in different studies.
When it comes to safety, Fault Trees (FT) analysis is frequently used to identify potential problems, such as component failures that propagate throughout a system. From a security perspective, we use Attack Trees (AT) to understand how attackers might break things down into smaller steps and elements to achieve their goals.
Attack-Fault Trees (AFT) are utilized for a thorough study that considers elements related to safety and security. AFTs have limitations; for example, it is not possible to fully show the effect of safety and security on each other. One of the solutions is to convert AFTs into game automata.
In earlier work, we have modeled FTs as a 1.5-player game [1]. One could use an analogous approach to model an AFT as a 2-player game. Such a time-game automaton could then be subject to comprehensive quantitative safety-security analysis using model checkers.
Some research questions:
- If the attacker knows about the safety failures, how does this affect the cost and probability of the attack?
- Can attacks increase safety failure?
Reference:
1. Soltani, R., Volk, M., Diamonte, L., Lopuhaä-Zwakenberg, M., Stoelinga, M. (2023). Optimal Spare Management via Statistical Model Checking: A Case Study in Research Reactors. In: Cimatti, A., Titolo, L. (eds) Formal Methods for Industrial Critical Systems. FMICS 2023. Lecture Notes in Computer Science, vol 14290. Springer, Cham. https://doi.org/10.1007/978-3-031-43681-9_12
For more information, please contact r.soltani@utwente.nl or m.a.lopuhaa@utwente.nl.
Converting decision trees into fault trees (Milan Lopuhaä-Zwakenberg)
Supervisor: Milan Lopuhaä-Zwakenberg
Decision trees and fault trees are two models to represent the reliability of complex systems. While the information they contain is mathematically identical, the fact that the models are formed in different ways means that they are used by different people for different reasons. Therefore, it would be interesting to convert decision trees to fault trees, but contrary to the opposite direction this conversion has not been explored so far.
In this project, the student will study and develop algorithms to convert decision trees into fault trees. While it is reasonably straightforward to come up with naïve algorithms, we are particularly interested in how those algorithms can be improved by decreasing their computation time and by outputting smaller fault trees.
Game theory on safety and security (Milan Lopuhaä-Zwakenberg, Reza Soltani)
Supervisor: Reza Soltani, Milan Lopuhaä-Zwakenberg
Contact: Milan Lopuhaä-Zwakenberg
Discover how game theory principles provide invaluable insights into assessing and enhancing safety measures, predicting adversarial behaviors, and fortifying cybersecurity frameworks.
Currently, when it comes to safety, Fault Trees (FT) analysis is frequently used to identify potential problems, such as component failures that propagate throughout a system. From a security perspective, we use Attack Trees (AT) to understand how attackers might break things down into smaller steps and elements to achieve their goals.
Attack-Fault Trees (AFT) are utilized for a thorough study that considers elements related to safety and security. However, AFTs have limitations; for example, it is not possible to fully show the effect of safety and security on each other.
The goal of the project is to investigate the relationship between safety and security from a game theory perspective and how we can implement the player concept for these domains.
Please feel free to contact r.soltani@utwente.nl for more discussion or if you have any questions.
Improving Fault Tree analysis via Binary Decision Diagrams (Milan Lopuhaä-Zwakenberg, Tom van Dijk)
Supervisors: Milan Lopuhaä-Zwakenberg, Tom van Dijk
Fault trees are a common formalism to model how failures occur in system components and how they eventually lead to overall system failure. Analysing fault trees reveals how likely system failures are and whether the reliability of components needs to be improved.
As fault trees can contain thousands of elements, their analysis must be efficient in both time and memory consumption. One common approach is to use binary decision diagrams (BDDs) as a data structure. A BDD is graph which encodes a Boolean function in a compact representation. Translating a fault tree into a BDD yields an efficient representation which allows to analyse large fault trees. However, the memory consumption of the BDD depends on certain factors such as the variable ordering or the order in which smaller BDDs are composed during the translation.
The goal of this research project is to improve the existing fault tree analysis. The fault tree analysis we look at is implemented in the Storm model checker and uses the Sylvan library to create and handle BDDs. Building upon Sylvan and Storm, we aim to extend and optimize the existing implementation of fault tree analysis via BDDs.
Tasks
Possible research tasks are:
- Find heuristics for good variable orderings which yield small BDDs.
- Investigate whether different orders for the composition of sub-BDDs affect the peak memory requirement.
- Investigate whether multiple cores can be further exploited during the analysis.
- Integrate modular approaches which only require parts of the BDD to be stored in memory at a time.
- Bring in own ideas.
Requirements
- Knowledge about fault trees and binary decision diagrams is beneficial but not required.
- Experience with C/C++ is strongly recommended as the implementation uses these languages.
Improving the inference of risk models (Milan Lopuhaä-Zwakenberg)
Supervisor: Milan Lopuhaä-Zwakenberg
Risk models such as fault trees (FTs) are formalisms typically used in reliability engineering, safety, and risk analysis. A known drawback of FTs is the time-consuming manual construction of these models, making them prone to human errors and incompleteness. Thanks to the ever-increasing availability of inspection and monitoring data, the development of algorithms that take care of this task in a data-driven manner are possible.
In [1] this challenge was tackled using multi-objective evolutionary algorithms, yielding compact and reliable FT models. Building upon this work, several interesting research directions are possible.
Possible research directions
- Boolean circuit design: Circuit designs aims to find circuits with a minimal number of logical gates representing a given Boolean function. FT models can be encoded by Boolean formulas. A possible research direction is to find and apply existing algorithms from circuit design for FT inference.
- Guiding the evolutionary algorithms: The current algorithm [1] takes a long time to converge when randomly applying genetic operators. One possible research direction is therefore to investigate to what extent knowledge about the models can be included to guide the genetic operators.
- Noisy and incomplete data sets: Current approaches only consider noise-free and complete failure data sets, which is of course not representative of real applications. Thus, it is interesting to investigate how to effectively deal and account for noise and incompleteness in failure data sets when inferring FT models in a data-driven manner.
References
[1] Jimenez-Roa, L. A., Heskes, T., Tinga, T., & Stoelinga, M. I. A. (2021). Automatic inference of fault tree models via multi-objective evolutionary algorithms. Preprint available online.
Requirements
Most research directions encompass a prototypical implementation. It is therefore advisable to be familiar with the Python programing language.
A long awaited marriage: investigating safety/security interactions in real-world scenarios (Stefano Nicoletti, Reza Soltani)
Supervisors: Stefano Maria Nicoletti, Reza Soltani
Contact: Stefano Maria Nicoletti
The introduction and adoption of new technologies implies the rise of new risks, related both to accidental damage (safety) and to malicious attacks (security). Furthermore, safety and security are often intertwined: passwords protect medical data from unauthorized access but are an obstacle for patients' safety during emergencies, IoT sensors can help to monitor the pressure levels of a pipeline but may provide extended attack surfaces for malicious hackers.
The existence of interdependencies between safety and security calls for a better (formal) understanding of this kind of interactions:
- At what level do safety and security interact? What interacts with what?
- Do these interactions regard requirements, countermeasures, factors (etc.)?
Tasks
The suggested tasks are the following:
- Together with your supervisor, decide on a case study of interest and investigate how safety and security interact in this setting
- Chose a formalism to analyse said case study (by discussing with your supervisors) and understand its modelling capabilities
- Starting from the chosen case study, investigate which of the different interactions between safety and security are captured by the formalism of choice
- Investigate at what level these interactions take place
For this topic, you should have a strong interest in multidisciplinary research.
Polyhedral fuzzy numbers (Milan Lopuhaä-Zwakenberg, Benedikt Peterseim)
Supervisors: Milan Lopuhaä-Zwakenberg, Benedikt Peterseim
Contact: Milan Lopuhaä-Zwakenberg
A constant problem in risk analysis is that many parameters are unknown or only approximately known. One way to deal with this is the mathematical formalism of fuzzy numbers, where a parameter does not have a singular value, but multiple ones with various degrees of certainty. These are then used in numerical analysis, and the final result will be a fuzzy number as well, quantifying the uncertainty of the risk analysis itself.
While fuzzy numbers are very useful from a mathematical perspective, their standard form requires in theory infinite memory to store, making it useless for practical applications. There exist some simplified versions of fuzzy numbers, such as triangular fuzzy numbers, but these are too restrictive: many mathematical operations that are used in risk analysis are not supported by fuzzy numbers.
The aim of this project is to study a new type of fuzzy numbers, called polyhedral fuzzy numbers, that are general enough to be used in risk analysis but specific enough to allow for efficient computation. In the project, the student makes efficient algorithms for mathematical operations on fuzzy numbers. A natural domain of application are attack trees and fault trees, but this can be changed according to the student's interest.
Software failure rates (Stefano Nicoletti, Reza Soltani)
Supervisor: Stefano Maria Nicoletti, Reza Soltani, Mariëlle Stoelinga
Contact: Stefano Maria Nicoletti
Fault trees are an important formalism to assess the reliability of systems. Typically, they have been very successfully applied to complex hardware systems, like railroads or chemical plants. However, software is getting more and more important. Hence, software fault trees have been developed to account for software failures as well. One important research question is how to estimate the failure rate of software components.
Research directions:
The key idea of this project is to find good estimations for failure rates of software. Possible directions are:
- Develop a Bayesian network capturing the dependencies between different influences on the software reliability (used programming language, experience of the team, test coverage, size of project, etc.)
- Integrating existing tools (e.g., SonarQube) for better estimation of software failure rates.
Enhancing TOPAAS for Modern Software Reliability Assessment (Stefano Nicoletti, Mariëlle Stoelinga)
Background
For over a decade, Rijkswaterstaat (RWS) has used the task-oriented probability of abnormalities analysis for software (TOPAAS) as its primary method for quantifying software reliability in critical systems. While TOPAAS has proven valuable, its long-term application has revealed potential shortcomings and areas for improvement. Additionally, technological advancements offer new possibilities to enhance the method's effectiveness and accuracy.
Research Objectives
This master's thesis project aims to investigate and propose improvements to the TOPAAS method, incorporating modern technologies and addressing known limitations. The research will focus on one of the following areas:
1. Integration of Automated Test Environments (ATEs)
·   Explore the use of digital twins for extensive testing scenarios
·   Investigate methods to update TOPAAS results using ATE-generated data
·   Evaluate the potential of Reliability Growth Models based on ATE test data
2. Incorporation of Formal Methods
·   Assess the impact of formal specification techniques (e.g., mCRL2) on software reliability
·   Explore the integration of synthesized software using CIF-models
·   Propose modifications to TOPAAS to account for formal methods' influence on reliability
3. Addressing Information Gaps in Software Renewals
·   Analyze the impact of missing historical data on TOPAAS parameters
·   Develop strategies to mitigate conservative estimates due to lack of information
·   Propose alternative approaches for assessing renewed software reliability
4. Exploration of Emerging Technologies and Methodologies
·   Identify and evaluate new technologies relevant to software reliability assessment
·   Propose additional improvement opportunities for TOPAAS
Methodology
The student will:
1. Conduct a comprehensive literature review on software reliability assessment and/or testing methods and recent advancements
2. Analyze the current TOPAAS framework and its limitations
Depending on the area choosen:
3. Design and execute experiments using ATEs to generate reliability data
4. Investigate the application of formal methods in software development and their impact on reliability
5. Develop and test proposed improvements to the TOPAAS method
For all areas:
6. Validate the enhanced approach using case studies from RWS critical systems
Expected Outcomes
Depending on the scope, the following apply:
1. A detailed analysis of TOPAAS limitations and improvement opportunities
2. A framework for integrating ATE-generated data into TOPAAS assessments
3. Recommendations for incorporating formal methods into software reliability quantification
4. Strategies for addressing information gaps in software renewal scenarios
5. A prototype of an enhanced TOPAAS method incorporating the proposed improvements
6. Validation results demonstrating the effectiveness of the enhanced approach
Required Skills
·   Strong background in software engineering and reliability assessment and/or
·   Familiarity with formal methods and model-based design and/or
·   Experience with statistical analysis and Bayesian methods and/or
·   Programming skills for implementing and testing proposed improvements
This research project offers an exciting opportunity to contribute to the advancement of software reliability assessment methods for critical systems, with potential impacts on infrastructure safety and efficiency.
Contact us!
Feel free to send us an email at <s.m.nicoletti@utwente.nl> and <m.i.a.stoelinga@utwente.nl>.

Contact

Background

Would it not be great to live in a world where computers never crash, trains always ride and banks are never hacked?

This is the ultimate goal of risk management. In FMT, we work on the risk analysis phase, where one identifies, evaluates and prioritizes risks, to come up with (cost-)effective countermeasures. By using mathematical models, we can not only reason about risk, but quantify risk as well. As risk models from industry become very large very quickly, creating algorithms that analyze risk models efficiently is an important area of research.

Prominent mathematical models include graph models such as fault trees (for safety), attack trees (for security), binary decision diagrams, and Bayesian networks. Application areas we have considered so far include cybersecurity, medical devices, and robotics, but we are interested in many applications: if you have one of your own, please come to us.

Projects for Formal methods for risk analysis roughly fall within some or more of the following categories:

Modeling techniques: What types of risk can we model, and what do we want our models to express? For example, how do we model the interplay between safety and security?
Model analyis: Can we find new ways to efficiently calculate risk parameters from models, such as the likelihood and impact of adverse events?
Big data: How do we get from a large dataset to a risk model that is easy to analyze, yet retains as much relevant information as possible?
Industrial case studies: Do one or more of the above for real-world data, in collaboration with companies, government bodies or risk consultancy firms.

If you have an idea related to any of these, please contact us directly. Alternatively, you can work on one of the concrete proposals above.

Available project proposals

General research question

References

Tasks

Requirements

Tasks

Requirements

Possible research directions

References

Requirements

Tasks

Research directions:

Contact

Background