Skip to content

Spring Boot Faults

Introduction

  • It can target random pods with a Spring Boot application and allows configuring the assaults to inject multiple spring boot faults simultaneously on the target pod.
  • It supports app-kill, cpu-stress, memory-stress, latency, and exceptions faults

Scenario: Inject Spring Boot Faults

Spring Boot Faults

Uses

View the uses of the experiment

coming soon

Prerequisites

Verify the prerequisites

  • Ensure that Kubernetes Version > 1.16
  • Ensure that the Litmus Chaos Operator is running by executing kubectl get pods in operator namespace (typically, litmus).If not, install from here
  • Ensure that the spring-boot-faults experiment resource is available in the cluster by executing kubectl get chaosexperiments in the desired namespace. If not, install from here
  • Chaos Monkey Spring Boot dependency should be present in application. It can be enabled by two ways:
    1. Add internal dependency inside the spring boot application
      1. Add Chaos Monkey for Spring Boot as dependency for your project
        <dependency>
            <groupId>de.codecentric</groupId>
            <artifactId>chaos-monkey-spring-boot</artifactId>
            <version>2.6.1</version>
        </dependency>
        
      2. Start your Spring Boot App with the chaos-monkey spring profile enabled
        java -jar your-app.jar --spring.profiles.active=chaos-monkey --chaos.monkey.enabled=true
        
    2. Add as external dependency
      1. You can extend your existing application with the chaos-monkey and add it as an external dependency at startup, for this it is necessary to use the PropertiesLauncher of Spring Boot
        <dependency>
            <groupId>de.codecentric</groupId>
            <artifactId>chaos-monkey-spring-boot</artifactId>
            <classifier>jar-with-dependencies</classifier>
            <version>2.6.1</version>
        </dependency>
        
      2. Start your Spring Boot application, add Chaos Monkey for Spring Boot JAR and properties
        java -cp your-app.jar -Dloader.path=chaos-monkey-spring-boot-2.6.1-jar-with-dependencies.jar org.springframework.boot.loader.PropertiesLauncher --spring.profiles.active=chaos-monkey --spring.config.location=file:./chaos-monkey.properties
        

Default Validations

View the default validations
  • Spring boot pods are healthy before and after chaos injection

Minimal RBAC configuration example (optional)

NOTE

If you are using this experiment as part of a litmus workflow scheduled constructed & executed from chaos-center, then you may be making use of the litmus-admin RBAC, which is pre-installed in the cluster as part of the agent setup.

View the Minimal RBAC permissions

apiVersion: v1
kind: ServiceAccount
metadata:
  name: spring-boot-faults-sa
  namespace: default
  labels:
    name: spring-boot-faults-sa
    app.kubernetes.io/part-of: litmus
---
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: spring-boot-faults-sa
  namespace: default
  labels:
    name: spring-boot-faults-sa
    app.kubernetes.io/part-of: litmus
rules:
  # Create and monitor the experiment & helper pods
  - apiGroups: [""]
    resources: ["pods"]
    verbs: ["create","delete","get","list","patch","update", "deletecollection"]
  # Performs CRUD operations on the events inside chaosengine and chaosresult
  - apiGroups: [""]
    resources: ["events"]
    verbs: ["create","get","list","patch","update"]
  # Track and get the runner, experiment, and helper pods log 
  - apiGroups: [""]
    resources: ["pods/log"]
    verbs: ["get","list","watch"]
  # for creating and managing to execute commands inside target container
  - apiGroups: [""]
    resources: ["pods/exec"]
    verbs: ["get","list","create"]
  # for configuring and monitor the experiment job by the chaos-runner pod
  - apiGroups: ["batch"]
    resources: ["jobs"]
    verbs: ["create","list","get","delete","deletecollection"]
  # for creation, status polling and deletion of litmus chaos resources used within a chaos workflow
  - apiGroups: ["litmuschaos.io"]
    resources: ["chaosengines","chaosexperiments","chaosresults"]
    verbs: ["create","list","get","patch","update","delete"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: spring-boot-faults-sa
  namespace: default
  labels:
    name: spring-boot-faults-sa
    app.kubernetes.io/part-of: litmus
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: spring-boot-faults-sa
subjects:
  - kind: ServiceAccount
    name: spring-boot-faults-sa
    namespace: default
Use this sample RBAC manifest to create a chaosServiceAccount in the desired (app) namespace. This example consists of the minimum necessary role permissions to execute the experiment.

Experiment tunables

check the experiment tunables

Mandatory Fields

Variables Description Notes
CM_PORT It contains port of the spring boot application
CM_KILL_APPLICATION_ACTIVE It enable the app-kill faults It supports boolean values. Default is false
CM_LATENCY_ACTIVE It enable the latency faults It supports boolean values. Default is false
CM_MEMORY_ACTIVE It enable the memory stress faults It supports boolean values. Default is false
CM_CPU_ACTIVE It enable the cpu stress faults It supports boolean values. Default is false
CM_EXCEPTIONS_ACTIVE It enable the exceptions faults It supports boolean values. Default is false
CPU_LOAD_FRACTION It contains fraction of cpu to be stressed, 0.95 equals 95% default value is 0.9. It supports value in range [0.1,1.0]
CM_EXCEPTIONS_TYPE It contains type of raised exception Defaults value: java.lang.IllegalArgumentException
CM_EXCEPTIONS_ARGUMENTS It contains argument of raised exception Defaults value: java.lang.String:custom illegal argument exception
LATENCY It contains network latency to be injected(in ms) default value is 2000
MEMORY_FILL_FRACTION It contains fraction of memory to be stressed, 0.7 equals 70% default value is 0.70. It supports value in range [0.01,0.95]

Optional Fields

Variables Description Notes
CM_LEVEL It contains number of requests are to be attacked, n value means nth request will be affected Defaults value: 1, it lies in [1,10000] range
CM_WATCHED_CUSTOM_SERVICES It limits watched packages/classes/methods, it contains comma seperated list of fully qualified packages(class and/or method names) ByDefault it is empty list, which means it target all services
CM_WATCHERS It contains comma separated list of watchers from the following watchers list [controller, restController, service, repository, component, webClient] ByDefault it is restController
TOTAL_CHAOS_DURATION The time duration for chaos injection (seconds) Defaults to 30
SEQUENCE It defines sequence of chaos execution for multiple target pods Default value: parallel. Supported: serial, parallel
PODS_AFFECTED_PERC The Percentage of total pods to target Defaults to 0% (corresponds to 1 replica)
LIB The chaos lib used to inject the chaos Defaults to litmus. Supported litmus only
RAMP_TIME Period to wait before and after injection of chaos in sec

Experiment Examples

Common Experiment Tunables

Refer the common attributes and Spring Boot specific tunable to tune the common tunables for all experiments and spring-boot specific tunables.

Inject Multiple Faults Simultaneously (CPU, Latency and Exceptions)

It injects cpu, latency, and exceptions faults simultaneously on the target pods

Use the following example to tune this:

apiVersion: litmuschaos.io/v1alpha1
kind: ChaosEngine
metadata:
  name: spring-boot-chaos
  namespace: default
spec:
  appinfo:
    appns: 'default'
    applabel: 'app=spring-boot'
    appkind: 'deployment'
  # It can be active/stop
  engineState: 'active'
  chaosServiceAccount: spring-boot-faults-sa
  experiments:
    - name: spring-boot-faults
      spec:
        components:
          env:
            # set chaos duration (in sec) as desired
            - name: TOTAL_CHAOS_DURATION
              value: '30'

            # port of the spring boot application
            - name: CM_PORT
              value: '8080'

            # it enables spring-boot latency fault
            - name: CM_LATENCY_ACTIVE
              value: 'true'

            # provide the latency (ms)
            # it is applicable when latency is active
            - name: LATENCY
              value: '2000'

            # it enables spring-boot cpu stress fault
            - name: CM_CPU_ACTIVE
              value: 'true'

            # it contains fraction of cpu to be stressed(0.95 equals 95%)
            # it supports value in range [0.1,1.0]
            # it is applicable when cpu is active
            - name: CPU_LOAD_FRACTION
              value: '0.9'

            # it enables spring-boot exceptions fault
            - name: CM_EXCEPTIONS_ACTIVE
              value: 'true'

            # Type of raised exception
            # it is applicable when exceptions is active
            - name: CM_EXCEPTIONS_TYPE
              value: 'java.lang.IllegalArgumentException'

              # Argument of raised exception
              # it is applicable when exceptions is active
            - name: CM_EXCEPTIONS_ARGUMENTS
              value: 'java.lang.String:custom illegal argument exception'

            ## percentage of total pods to target
            - name: PODS_AFFECTED_PERC
              value: ''

Inject Multiple Faults Simultaneously (Appkill and Memory)

It injects appkill and memory stress faults simultaneously on the target pods

Use the following example to tune this:

apiVersion: litmuschaos.io/v1alpha1
kind: ChaosEngine
metadata:
  name: spring-boot-chaos
  namespace: default
spec:
  appinfo:
    appns: 'default'
    applabel: 'app=spring-boot'
    appkind: 'deployment'
  # It can be active/stop
  engineState: 'active'
  chaosServiceAccount: spring-boot-faults-sa
  experiments:
    - name: spring-boot-faults
      spec:
        components:
          env:
            # set chaos duration (in sec) as desired
            - name: TOTAL_CHAOS_DURATION
              value: '30'

            # port of the spring boot application
            - name: CM_PORT
              value: '8080'

            # it enables spring app-kill fault
            - name: CM_KILL_APPLICATION_ACTIVE
              value: 'true'

            # it enables spring-boot memory stress fault
            - name: CM_MEMORY_ACTIVE
              value: ''

            # it contains fraction of memory to be stressed(0.70 equals 70%)
            # it supports value in range [0.01,0.95]
            # it is applicable when memory is active
            - name: MEMORY_FILL_FRACTION
              value: '0.70'

            ## percentage of total pods to target
            - name: PODS_AFFECTED_PERC
              value: ''