| .. This work is licensed under a Creative Commons Attribution 4.0 International License. |
| .. http://creativecommons.org/licenses/by/4.0 |
| .. Copyright 2018 Amdocs, Bell Canada |
| |
| .. Links |
| .. _Helm: https://docs.helm.sh/ |
| .. _Helm Charts: https://github.com/kubernetes/charts |
| .. _Kubernetes: https://Kubernetes.io/ |
| .. _Docker: https://www.docker.com/ |
| .. _Nexus: https://nexus.onap.org/#welcome |
| .. _AWS Elastic Block Store: https://aws.amazon.com/ebs/ |
| .. _Azure File: https://docs.microsoft.com/en-us/azure/storage/files/storage-files-introduction |
| .. _GCE Persistent Disk: https://cloud.google.com/compute/docs/disks/ |
| .. _Gluster FS: https://www.gluster.org/ |
| .. _Kubernetes Storage Class: https://Kubernetes.io/docs/concepts/storage/storage-classes/ |
| .. _Assigning Pods to Nodes: https://Kubernetes.io/docs/concepts/configuration/assign-pod-node/ |
| |
| |
| .. _developer-guide-label: |
| |
| OOM Developer Guide |
| ################### |
| |
| .. figure:: oomLogoV2-medium.png |
| :align: right |
| |
| ONAP consists of a large number of components, each of which are substantial |
| projects within themselves, which results in a high degree of complexity in |
| deployment and management. To cope with this complexity the ONAP Operations |
| Manager (OOM) uses a Helm_ model of ONAP - Helm being the primary management |
| system for Kubernetes_ container systems - to drive all user driven life-cycle |
| management operations. The Helm model of ONAP is composed of a set of |
| hierarchical Helm charts that define the structure of the ONAP components and |
| the configuration of these components. These charts are fully parameterized |
| such that a single environment file defines all of the parameters needed to |
| deploy ONAP. A user of ONAP may maintain several such environment files to |
| control the deployment of ONAP in multiple environments such as development, |
| pre-production, and production. |
| |
| The following sections describe how the ONAP Helm charts are constructed. |
| |
| .. contents:: |
| :depth: 3 |
| :local: |
| .. |
| |
| Container Background |
| ==================== |
| Linux containers allow for an application and all of its operating system |
| dependencies to be packaged and deployed as a single unit without including a |
| guest operating system as done with virtual machines. The most popular |
| container solution is Docker_ which provides tools for container management |
| like the Docker Host (dockerd) which can create, run, stop, move, or delete a |
| container. Docker has a very popular registry of containers images that can be |
| used by any Docker system; however, in the ONAP context, Docker images are |
| built by the standard CI/CD flow and stored in Nexus_ repositories. OOM uses |
| the "standard" ONAP docker containers and three new ones specifically created |
| for OOM. |
| |
| Containers are isolated from each other primarily via name spaces within the |
| Linux kernel without the need for multiple guest operating systems. As such, |
| multiple containers can be deployed with little overhead such as all of ONAP |
| can be deployed on a single host. With some optimization of the ONAP components |
| (e.g. elimination of redundant database instances) it may be possible to deploy |
| ONAP on a single laptop computer. |
| |
| Helm Charts |
| =========== |
| A Helm chart is a collection of files that describe a related set of Kubernetes |
| resources. A simple chart might be used to deploy something simple, like a |
| memcached pod, while a complex chart might contain many micro-service arranged |
| in a hierarchy as found in the `aai` ONAP component. |
| |
| Charts are created as files laid out in a particular directory tree, then they |
| can be packaged into versioned archives to be deployed. There is a public |
| archive of `Helm Charts`_ on GitHub that includes many technologies applicable |
| to ONAP. Some of these charts have been used in ONAP and all of the ONAP charts |
| have been created following the guidelines provided. |
| |
| The top level of the ONAP charts is shown below: |
| |
| .. graphviz:: |
| |
| digraph onap_top_chart { |
| rankdir="LR"; |
| { |
| node [shape=folder] |
| oValues [label="values.yaml"] |
| oChart [label="Chart.yaml"] |
| dev [label="dev.yaml"] |
| prod [label="prod.yaml"] |
| crb [label="clusterrolebindings.yaml"] |
| secrets [label="secrets.yaml"] |
| } |
| { |
| node [style=dashed] |
| vCom [label="component"] |
| } |
| |
| onap -> oValues |
| onap -> oChart |
| onap -> templates |
| onap -> resources |
| oValues -> vCom |
| resources -> environments |
| environments -> dev |
| environments -> prod |
| templates -> crb |
| templates -> secrets |
| } |
| |
| Within the `values.yaml` file at the `onap` level, one will find a set of |
| boolean values that control which of the ONAP components get deployed as shown |
| below: |
| |
| .. code-block:: yaml |
| |
| aaf: # Application Authorization Framework |
| enabled: false |
| <...> |
| so: # Service Orchestrator |
| enabled: true |
| |
| By setting these flags a custom deployment can be created and used during |
| deployment by using the `-f` Helm option as follows:: |
| |
| > helm install local/onap -name development -f dev.yaml |
| |
| Note that there are one or more example deployment files in the |
| `onap/resources/environments/` directory. It is best practice to create a unique |
| deployment file for each environment used to ensure consistent behaviour. |
| |
| To aid in the long term supportability of ONAP, a set of common charts have |
| been created (and will be expanded in subsequent releases of ONAP) that can be |
| used by any of the ONAP components by including the common component in its |
| `requirements.yaml` file. The common components are arranged as follows: |
| |
| .. graphviz:: |
| |
| digraph onap_common_chart { |
| rankdir="LR"; |
| { |
| node [shape=folder] |
| mValues [label="values.yaml"] |
| ccValues [label="values.yaml"] |
| comValues [label="values.yaml"] |
| comChart [label="Chart.yaml"] |
| ccChart [label="Chart.yaml"] |
| mChart [label="Chart.yaml"] |
| |
| mReq [label="requirements.yaml"] |
| mService [label="service.yaml"] |
| mMap [label="configmap.yaml"] |
| ccName [label="_name.tpl"] |
| ccNS [label="_namespace.tpl"] |
| } |
| { |
| cCom [label="common"] |
| mTemp [label="templates"] |
| ccTemp [label="templates"] |
| } |
| { |
| more [label="...",style=dashed] |
| } |
| |
| common -> comValues |
| common -> comChart |
| common -> cCom |
| common -> mysql |
| common -> more |
| |
| cCom -> ccChart |
| cCom -> ccValues |
| cCom -> ccTemp |
| ccTemp -> ccName |
| ccTemp -> ccNS |
| |
| mysql -> mValues |
| mysql -> mChart |
| mysql -> mReq |
| mysql -> mTemp |
| mTemp -> mService |
| mTemp -> mMap |
| } |
| |
| The common section of charts consists of a set of templates that assist with |
| parameter substitution (`_name.tpl` and `_namespace.tpl`) and a set of charts |
| for components used throughout ONAP. Initially `mysql` is in the common area but |
| this will expand to include other databases like `mariadb-galera`, `postgres`, |
| and `cassandra`. Other candidates for common components include `redis` and |
| `kafka`. When the common components are used by other charts they are |
| instantiated each time. In subsequent ONAP releases some of the common |
| components could be a setup as services that are used by multiple ONAP |
| components thus minimizing the deployment and operational costs. |
| |
| All of the ONAP components have charts that follow the pattern shown below: |
| |
| .. graphviz:: |
| |
| digraph onap_component_chart { |
| rankdir="LR"; |
| { |
| node [shape=folder] |
| cValues [label="values.yaml"] |
| cChart [label="Chart.yaml"] |
| cService [label="service.yaml"] |
| cMap [label="configmap.yaml"] |
| cFiles [label="config file(s)"] |
| } |
| { |
| cCharts [label="charts"] |
| cTemp [label="templates"] |
| cRes [label="resources"] |
| |
| } |
| { |
| sCom [label="component",style=dashed] |
| } |
| |
| component -> cValues |
| component -> cChart |
| component -> cCharts |
| component -> cTemp |
| component -> cRes |
| cTemp -> cService |
| cTemp -> cMap |
| cRes -> config |
| config -> cFiles |
| cCharts -> sCom |
| } |
| |
| Note that the component charts may include a hierarchy of components and in |
| themselves can be quite complex. |
| |
| Configuration of the components varies somewhat from component to component but |
| generally follows the pattern of one or more `configmap.yaml` files which can |
| directly provide configuration to the containers in addition to processing |
| configuration files stored in the `config` directory. It is the responsibility |
| of each ONAP component team to update these configuration files when changes |
| are made to the project containers that impact configuration. |
| |
| The following section describes how the hierarchical ONAP configuration system is |
| key to management of such a large system. |
| |
| Configuration Management |
| ======================== |
| |
| ONAP is a large system composed of many components - each of which are complex |
| systems in themselves - that needs to be deployed in a number of different |
| ways. For example, within a single operator's network there may be R&D |
| deployments under active development, pre-production versions undergoing system |
| testing and production systems that are operating live networks. Each of these |
| deployments will differ in significant ways, such as the version of the |
| software images deployed. In addition, there may be a number of application |
| specific configuration differences, such as operating system environment |
| variables. The following describes how the Helm configuration management |
| system is used within the OOM project to manage both ONAP infrastructure |
| configuration as well as ONAP components configuration. |
| |
| One of the artifacts that OOM/Kubernetes uses to deploy ONAP components is the |
| deployment specification, yet another yaml file. Within these deployment specs |
| are a number of parameters as shown in the following mariadb example: |
| |
| .. code-block:: yaml |
| |
| apiVersion: extensions/v1beta1 |
| kind: Deployment |
| metadata: |
| name: mariadb |
| spec: |
| <...> |
| template: |
| <...> |
| spec: |
| hostname: mariadb |
| containers: |
| - args: |
| image: nexus3.onap.org:10001/mariadb:10.1.11 |
| name: "mariadb" |
| env: |
| - name: MYSQL_ROOT_PASSWORD |
| value: password |
| - name: MARIADB_MAJOR |
| value: "10.1" |
| <...> |
| imagePullSecrets: |
| - name: onap-docker-registry-key |
| |
| Note that within the deployment specification, one of the container arguments |
| is the key/value pair image: nexus3.onap.org:10001/mariadb:10.1.11 which |
| specifies the version of the mariadb software to deploy. Although the |
| deployment specifications greatly simplify deployment, maintenance of the |
| deployment specifications themselves become problematic as software versions |
| change over time or as different versions are required for different |
| deployments. For example, if the R&D team needs to deploy a newer version of |
| mariadb than what is currently used in the production environment, they would |
| need to clone the deployment specification and change this value. Fortunately, |
| this problem has been solved with the templating capabilities of Helm. |
| |
| The following example shows how the deployment specifications are modified to |
| incorporate Helm templates such that key/value pairs can be defined outside of |
| the deployment specifications and passed during instantiation of the component. |
| |
| .. code-block:: yaml |
| |
| apiVersion: extensions/v1beta1 |
| kind: Deployment |
| metadata: |
| name: mariadb |
| namespace: "{{ .Values.nsPrefix }}-mso" |
| spec: |
| <...> |
| template: |
| <...> |
| spec: |
| hostname: mariadb |
| containers: |
| - args: |
| image: {{ .Values.image.mariadb }} |
| imagePullPolicy: {{ .Values.pullPolicy }} |
| name: "mariadb" |
| env: |
| - name: MYSQL_ROOT_PASSWORD |
| value: password |
| - name: MARIADB_MAJOR |
| value: "10.1" |
| <...> |
| imagePullSecrets: |
| - name: "{{ .Values.nsPrefix }}-docker-registry-key"apiVersion: extensions/v1beta1 |
| kind: Deployment |
| metadata: |
| name: mariadb |
| namespace: "{{ .Values.nsPrefix }}-mso" |
| spec: |
| <...> |
| template: |
| <...> |
| spec: |
| hostname: mariadb |
| containers: |
| - args: |
| image: {{ .Values.image.mariadb }} |
| imagePullPolicy: {{ .Values.pullPolicy }} |
| name: "mariadb" |
| env: |
| - name: MYSQL_ROOT_PASSWORD |
| value: password |
| - name: MARIADB_MAJOR |
| value: "10.1" |
| <...> |
| imagePullSecrets: |
| - name: "{{ .Values.nsPrefix }}-docker-registry-key" |
| |
| This version of the deployment specification has gone through the process of |
| templating values that are likely to change between deployments. Note that the |
| image is now specified as: image: {{ .Values.image.mariadb }} instead of a |
| string used previously. During the deployment phase, Helm (actually the Helm |
| sub-component Tiller) substitutes the {{ .. }} entries with a variable defined |
| in a values.yaml file. The content of this file is as follows: |
| |
| .. code-block:: yaml |
| |
| nsPrefix: onap |
| pullPolicy: IfNotPresent |
| image: |
| readiness: oomk8s/readiness-check:2.0.0 |
| mso: nexus3.onap.org:10001/openecomp/mso:1.0-STAGING-latest |
| mariadb: nexus3.onap.org:10001/mariadb:10.1.11 |
| |
| Within the values.yaml file there is an image section with the key/value pair |
| mariadb: nexus3.onap.org:10001/mariadb:10.1.11 which is the same value used in |
| the non-templated version. Once all of the substitutions are complete, the |
| resulting deployment specification ready to be used by Kubernetes. |
| |
| Also note that in this example, the namespace key/value pair is specified in |
| the values.yaml file. This key/value pair will be global across the entire |
| ONAP deployment and is therefore a prime example of where configuration |
| hierarchy can be very useful. |
| |
| When creating a deployment template consider the use of default values if |
| appropriate. Helm templating has built in support for DEFAULT values, here is |
| an example: |
| |
| .. code-block:: yaml |
| |
| imagePullSecrets: |
| - name: "{{ .Values.nsPrefix | default "onap" }}-docker-registry-key" |
| |
| The pipeline operator ("|") used here hints at that power of Helm templates in |
| that much like an operating system command line the pipeline operator allow |
| over 60 Helm functions to be embedded directly into the template (note that the |
| Helm template language is a superset of the Go template language). These |
| functions include simple string operations like upper and more complex flow |
| control operations like if/else. |
| |
| |
| ONAP Application Configuration |
| ------------------------------ |
| |
| Dependency Management |
| --------------------- |
| These Helm charts describe the desired state |
| of an ONAP deployment and instruct the Kubernetes container manager as to how |
| to maintain the deployment in this state. These dependencies dictate the order |
| in-which the containers are started for the first time such that such |
| dependencies are always met without arbitrary sleep times between container |
| startups. For example, the SDC back-end container requires the Elastic-Search, |
| Cassandra and Kibana containers within SDC to be ready and is also dependent on |
| DMaaP (or the message-router) to be ready - where ready implies the built-in |
| "readiness" probes succeeded - before becoming fully operational. When an |
| initial deployment of ONAP is requested the current state of the system is NULL |
| so ONAP is deployed by the Kubernetes manager as a set of Docker containers on |
| one or more predetermined hosts. The hosts could be physical machines or |
| virtual machines. When deploying on virtual machines the resulting system will |
| be very similar to "Heat" based deployments, i.e. Docker containers running |
| within a set of VMs, the primary difference being that the allocation of |
| containers to VMs is done dynamically with OOM and statically with "Heat". |
| Example SO deployment descriptor file shows SO's dependency on its mariadb |
| data-base component: |
| |
| SO deployment specification excerpt: |
| |
| .. code-block:: yaml |
| |
| apiVersion: extensions/v1beta1 |
| kind: Deployment |
| metadata: |
| name: {{ include "common.name" . }} |
| namespace: {{ include "common.namespace" . }} |
| labels: |
| app: {{ include "common.name" . }} |
| chart: {{ .Chart.Name }}-{{ .Chart.Version | replace "+" "_" }} |
| release: {{ .Release.Name }} |
| heritage: {{ .Release.Service }} |
| spec: |
| replicas: {{ .Values.replicaCount }} |
| template: |
| metadata: |
| labels: |
| app: {{ include "common.name" . }} |
| release: {{ .Release.Name }} |
| spec: |
| initContainers: |
| - command: |
| - /root/ready.py |
| args: |
| - --container-name |
| - so-mariadb |
| env: |
| ... |
| |
| Kubernetes Container Orchestration |
| ================================== |
| The ONAP components are managed by the Kubernetes_ container management system |
| which maintains the desired state of the container system as described by one |
| or more deployment descriptors - similar in concept to OpenStack HEAT |
| Orchestration Templates. The following sections describe the fundamental |
| objects managed by Kubernetes, the network these components use to communicate |
| with each other and other entities outside of ONAP and the templates that |
| describe the configuration and desired state of the ONAP components. |
| |
| Name Spaces |
| ----------- |
| Within the namespaces are Kubernetes services that provide external connectivity to pods that host Docker containers. |
| |
| ONAP Components to Kubernetes Object Relationships |
| -------------------------------------------------- |
| Kubernetes deployments consist of multiple objects: |
| |
| - **nodes** - a worker machine - either physical or virtual - that hosts |
| multiple containers managed by Kubernetes. |
| - **services** - an abstraction of a logical set of pods that provide a |
| micro-service. |
| - **pods** - one or more (but typically one) container(s) that provide specific |
| application functionality. |
| - **persistent volumes** - One or more permanent volumes need to be established |
| to hold non-ephemeral configuration and state data. |
| |
| The relationship between these objects is shown in the following figure: |
| |
| .. .. uml:: |
| .. |
| .. @startuml |
| .. node PH { |
| .. component Service { |
| .. component Pod0 |
| .. component Pod1 |
| .. } |
| .. } |
| .. |
| .. database PV |
| .. @enduml |
| |
| .. figure:: kubernetes_objects.png |
| |
| OOM uses these Kubernetes objects as described in the following sections. |
| |
| Nodes |
| ~~~~~ |
| OOM works with both physical and virtual worker machines. |
| |
| * Virtual Machine Deployments - If ONAP is to be deployed onto a set of virtual |
| machines, the creation of the VMs is outside of the scope of OOM and could be |
| done in many ways, such as |
| |
| * manually, for example by a user using the OpenStack Horizon dashboard or |
| AWS EC2, or |
| * automatically, for example with the use of a OpenStack Heat Orchestration |
| Template which builds an ONAP stack, Azure ARM template, AWS CloudFormation |
| Template, or |
| * orchestrated, for example with Cloudify creating the VMs from a TOSCA |
| template and controlling their life cycle for the life of the ONAP |
| deployment. |
| |
| * Physical Machine Deployments - If ONAP is to be deployed onto physical |
| machines there are several options but the recommendation is to use Rancher |
| along with Helm to associate hosts with a Kubernetes cluster. |
| |
| Pods |
| ~~~~ |
| A group of containers with shared storage and networking can be grouped |
| together into a Kubernetes pod. All of the containers within a pod are |
| co-located and co-scheduled so they operate as a single unit. Within ONAP |
| Amsterdam release, pods are mapped one-to-one to docker containers although |
| this may change in the future. As explained in the Services section below the |
| use of Pods within each ONAP component is abstracted from other ONAP |
| components. |
| |
| Services |
| ~~~~~~~~ |
| OOM uses the Kubernetes service abstraction to provide a consistent access |
| point for each of the ONAP components independent of the pod or container |
| architecture of that component. For example, the SDNC component may introduce |
| OpenDaylight clustering as some point and change the number of pods in this |
| component to three or more but this change will be isolated from the other ONAP |
| components by the service abstraction. A service can include a load balancer |
| on its ingress to distribute traffic between the pods and even react to dynamic |
| changes in the number of pods if they are part of a replica set. |
| |
| Persistent Volumes |
| ~~~~~~~~~~~~~~~~~~ |
| To enable ONAP to be deployed into a wide variety of cloud infrastructures a |
| flexible persistent storage architecture, built on Kubernetes persistent |
| volumes, provides the ability to define the physical storage in a central |
| location and have all ONAP components securely store their data. |
| |
| When deploying ONAP into a public cloud, available storage services such as |
| `AWS Elastic Block Store`_, `Azure File`_, or `GCE Persistent Disk`_ are |
| options. Alternatively, when deploying into a private cloud the storage |
| architecture might consist of Fiber Channel, `Gluster FS`_, or iSCSI. Many |
| other storage options existing, refer to the `Kubernetes Storage Class`_ |
| documentation for a full list of the options. The storage architecture may vary |
| from deployment to deployment but in all cases a reliable, redundant storage |
| system must be provided to ONAP with which the state information of all ONAP |
| components will be securely stored. The Storage Class for a given deployment is |
| a single parameter listed in the ONAP values.yaml file and therefore is easily |
| customized. Operation of this storage system is outside the scope of the OOM. |
| |
| .. code-block:: yaml |
| |
| Insert values.yaml code block with storage block here |
| |
| Once the storage class is selected and the physical storage is provided, the |
| ONAP deployment step creates a pool of persistent volumes within the given |
| physical storage that is used by all of the ONAP components. ONAP components |
| simply make a claim on these persistent volumes (PV), with a persistent volume |
| claim (PVC), to gain access to their storage. |
| |
| The following figure illustrates the relationships between the persistent |
| volume claims, the persistent volumes, the storage class, and the physical |
| storage. |
| |
| .. graphviz:: |
| |
| digraph PV { |
| label = "Persistance Volume Claim to Physical Storage Mapping" |
| { |
| node [shape=cylinder] |
| D0 [label="Drive0"] |
| D1 [label="Drive1"] |
| Dx [label="Drivex"] |
| } |
| { |
| node [shape=Mrecord label="StorageClass:ceph"] |
| sc |
| } |
| { |
| node [shape=point] |
| p0 p1 p2 |
| p3 p4 p5 |
| } |
| subgraph clusterSDC { |
| label="SDC" |
| PVC0 |
| PVC1 |
| } |
| subgraph clusterSDNC { |
| label="SDNC" |
| PVC2 |
| } |
| subgraph clusterSO { |
| label="SO" |
| PVCn |
| } |
| PV0 -> sc |
| PV1 -> sc |
| PV2 -> sc |
| PVn -> sc |
| |
| sc -> {D0 D1 Dx} |
| PVC0 -> PV0 |
| PVC1 -> PV1 |
| PVC2 -> PV2 |
| PVCn -> PVn |
| |
| # force all of these nodes to the same line in the given order |
| subgraph { |
| rank = same; PV0;PV1;PV2;PVn;p0;p1;p2 |
| PV0->PV1->PV2->p0->p1->p2->PVn [style=invis] |
| } |
| |
| subgraph { |
| rank = same; D0;D1;Dx;p3;p4;p5 |
| D0->D1->p3->p4->p5->Dx [style=invis] |
| } |
| |
| } |
| |
| In-order for an ONAP component to use a persistent volume it must make a claim |
| against a specific persistent volume defined in the ONAP common charts. Note |
| that there is a one-to-one relationship between a PVC and PV. The following is |
| an excerpt from a component chart that defines a PVC: |
| |
| .. code-block:: yaml |
| |
| Insert PVC example here |
| |
| OOM Networking with Kubernetes |
| ------------------------------ |
| |
| - DNS |
| - Ports - Flattening the containers also expose port conflicts between the containers which need to be resolved. |
| |
| Node Ports |
| ~~~~~~~~~~ |
| |
| Pod Placement Rules |
| ------------------- |
| OOM will use the rich set of Kubernetes node and pod affinity / |
| anti-affinity rules to minimize the chance of a single failure resulting in a |
| loss of ONAP service. Node affinity / anti-affinity is used to guide the |
| Kubernetes orchestrator in the placement of pods on nodes (physical or virtual |
| machines). For example: |
| |
| - if a container used Intel DPDK technology the pod may state that it as |
| affinity to an Intel processor based node, or |
| - geographical based node labels (such as the Kubernetes standard zone or |
| region labels) may be used to ensure placement of a DCAE complex close to the |
| VNFs generating high volumes of traffic thus minimizing networking cost. |
| Specifically, if nodes were pre-assigned labels East and West, the pod |
| deployment spec to distribute pods to these nodes would be: |
| |
| .. code-block:: yaml |
| |
| nodeSelector: |
| failure-domain.beta.Kubernetes.io/region: {{ .Values.location }} |
| |
| - "location: West" is specified in the `values.yaml` file used to deploy |
| one DCAE cluster and "location: East" is specified in a second `values.yaml` |
| file (see OOM Configuration Management for more information about |
| configuration files like the `values.yaml` file). |
| |
| Node affinity can also be used to achieve geographic redundancy if pods are |
| assigned to multiple failure domains. For more information refer to `Assigning |
| Pods to Nodes`_. |
| |
| .. note:: |
| One could use Pod to Node assignment to totally constrain Kubernetes when |
| doing initial container assignment to replicate the Amsterdam release |
| OpenStack Heat based deployment. Should one wish to do this, each VM would |
| need a unique node name which would be used to specify a node constaint |
| for every component. These assignment could be specified in an environment |
| specific values.yaml file. Constraining Kubernetes in this way is not |
| recommended. |
| |
| Kubernetes has a comprehensive system called Taints and Tolerations that can be |
| used to force the container orchestrator to repel pods from nodes based on |
| static events (an administrator assigning a taint to a node) or dynamic events |
| (such as a node becoming unreachable or running out of disk space). There are |
| no plans to use taints or tolerations in the ONAP Beijing release. Pod |
| affinity / anti-affinity is the concept of creating a spacial relationship |
| between pods when the Kubernetes orchestrator does assignment (both initially |
| an in operation) to nodes as explained in Inter-pod affinity and anti-affinity. |
| For example, one might choose to co-located all of the ONAP SDC containers on a |
| single node as they are not critical runtime components and co-location |
| minimizes overhead. On the other hand, one might choose to ensure that all of |
| the containers in an ODL cluster (SDNC and APPC) are placed on separate nodes |
| such that a node failure has minimal impact to the operation of the cluster. |
| An example of how pod affinity / anti-affinity is shown below: |
| |
| Pod Affinity / Anti-Affinity |
| |
| .. code-block:: yaml |
| |
| apiVersion: v1 |
| kind: Pod |
| metadata: |
| name: with-pod-affinity |
| spec: |
| affinity: |
| podAffinity: |
| requiredDuringSchedulingIgnoredDuringExecution: |
| - labelSelector: |
| matchExpressions: |
| - key: security |
| operator: In |
| values: |
| - S1 |
| topologyKey: failure-domain.beta.Kubernetes.io/zone |
| podAntiAffinity: |
| preferredDuringSchedulingIgnoredDuringExecution: |
| - weight: 100 |
| podAffinityTerm: |
| labelSelector: |
| matchExpressions: |
| - key: security |
| operator: In |
| values: |
| - S2 |
| topologyKey: Kubernetes.io/hostname |
| containers: |
| - name: with-pod-affinity |
| image: gcr.io/google_containers/pause:2.0 |
| |
| This example contains both podAffinity and podAntiAffinity rules, the first |
| rule is is a must (requiredDuringSchedulingIgnoredDuringExecution) while the |
| second will be met pending other considerations |
| (preferredDuringSchedulingIgnoredDuringExecution). Preemption Another feature |
| that may assist in achieving a repeatable deployment in the presence of faults |
| that may have reduced the capacity of the cloud is assigning priority to the |
| containers such that mission critical components have the ability to evict less |
| critical components. Kubernetes provides this capability with Pod Priority and |
| Preemption. Prior to having more advanced production grade features available, |
| the ability to at least be able to re-deploy ONAP (or a subset of) reliably |
| provides a level of confidence that should an outage occur the system can be |
| brought back on-line predictably. |
| |
| Health Checks |
| ------------- |
| |
| Monitoring of ONAP components is configured in the agents within JSON files and |
| stored in gerrit under the consul-agent-config, here is an example from the AAI |
| model loader (aai-model-loader-health.json): |
| |
| .. code-block:: json |
| |
| { |
| "service": { |
| "name": "A&AI Model Loader", |
| "checks": [ |
| { |
| "id": "model-loader-process", |
| "name": "Model Loader Presence", |
| "script": "/consul/config/scripts/model-loader-script.sh", |
| "interval": "15s", |
| "timeout": "1s" |
| } |
| ] |
| } |
| } |
| |
| Liveness Probes |
| --------------- |
| |
| These liveness probes can simply check that a port is available, that a |
| built-in health check is reporting good health, or that the Consul health check |
| is positive. For example, to monitor the SDNC component has following liveness |
| probe can be found in the SDNC DB deployment specification: |
| |
| .. code-block:: yaml |
| |
| sdnc db liveness probe |
| |
| livenessProbe: |
| exec: |
| command: ["mysqladmin", "ping"] |
| initialDelaySeconds: 30 periodSeconds: 10 |
| timeoutSeconds: 5 |
| |
| The 'initialDelaySeconds' control the period of time between the readiness |
| probe succeeding and the liveness probe starting. 'periodSeconds' and |
| 'timeoutSeconds' control the actual operation of the probe. Note that |
| containers are inherently ephemeral so the healing action destroys failed |
| containers and any state information within it. To avoid a loss of state, a |
| persistent volume should be used to store all data that needs to be persisted |
| over the re-creation of a container. Persistent volumes have been created for |
| the database components of each of the projects and the same technique can be |
| used for all persistent state information. |
| |
| |
| |
| Environment Files |
| ~~~~~~~~~~~~~~~~~ |
| |
| MSB Integration |
| =============== |
| |
| The \ `Microservices Bus |
| Project <https://wiki.onap.org/pages/viewpage.action?pageId=3246982>`__ provides |
| facilities to integrate micro-services into ONAP and therefore needs to |
| integrate into OOM - primarily through Consul which is the backend of |
| MSB service discovery. The following is a brief description of how this |
| integration will be done: |
| |
| A registrator to push the service endpoint info to MSB service |
| discovery. |
| |
| - The needed service endpoint info is put into the kubernetes yaml file |
| as annotation, including service name, Protocol,version, visual |
| range,LB method, IP, Port,etc. |
| |
| - OOM deploy/start/restart/scale in/scale out/upgrade ONAP components |
| |
| - Registrator watch the kubernetes event |
| |
| - When an ONAP component instance has been started/destroyed by OOM, |
| Registrator get the notification from kubernetes |
| |
| - Registrator parse the service endpoint info from annotation and |
| register/update/unregister it to MSB service discovery |
| |
| - MSB API Gateway uses the service endpoint info for service routing |
| and load balancing. |
| |
| Details of the registration service API can be found at \ `Microservice |
| Bus API |
| Documentation <https://wiki.onap.org/display/DW/Microservice+Bus+API+Documentation>`__. |
| |
| ONAP Component Registration to MSB |
| ---------------------------------- |
| The charts of all ONAP components intending to register against MSB must have |
| an annotation in their service(s) template. A `sdc` example follows: |
| |
| .. code-block:: yaml |
| |
| apiVersion: v1 |
| kind: Service |
| metadata: |
| labels: |
| app: sdc-be |
| name: sdc-be |
| namespace: "{{ .Values.nsPrefix }}" |
| annotations: |
| msb.onap.org/service-info: '[ |
| { |
| "serviceName": "sdc", |
| "version": "v1", |
| "url": "/sdc/v1", |
| "protocol": "REST", |
| "port": "8080", |
| "visualRange":"1" |
| }, |
| { |
| "serviceName": "sdc-deprecated", |
| "version": "v1", |
| "url": "/sdc/v1", |
| "protocol": "REST", |
| "port": "8080", |
| "visualRange":"1", |
| "path":"/sdc/v1" |
| } |
| ]' |
| ... |
| |
| |
| MSB Integration with OOM |
| ------------------------ |
| A preliminary view of the OOM-MSB integration is as follows: |
| |
| .. figure:: MSB-OOM-Diagram.png |
| |
| A message sequence chart of the registration process: |
| |
| .. uml:: |
| |
| participant "OOM" as oom |
| participant "ONAP Component" as onap |
| participant "Service Discovery" as sd |
| participant "External API Gateway" as eagw |
| participant "Router (Internal API Gateway)" as iagw |
| |
| box "MSB" #LightBlue |
| participant sd |
| participant eagw |
| participant iagw |
| end box |
| |
| == Deploy Servcie == |
| |
| oom -> onap: Deploy |
| oom -> sd: Register service endpoints |
| sd -> eagw: Services exposed to external system |
| sd -> iagw: Services for internal use |
| |
| == Component Life-cycle Management == |
| |
| oom -> onap: Start/Stop/Scale/Migrate/Upgrade |
| oom -> sd: Update service info |
| sd -> eagw: Update service info |
| sd -> iagw: Update service info |
| |
| == Service Health Check == |
| |
| sd -> onap: Check the health of service |
| sd -> eagw: Update service status |
| sd -> iagw: Update service status |
| |
| |
| MSB Deployment Instructions |
| --------------------------- |
| MSB is helm installable ONAP component which is often automatically deployed. |
| To install it individually enter:: |
| |
| > helm install <repo-name>/msb |
| |
| .. note:: |
| TBD: Vaidate if the following procedure is still required. |
| |
| Please note that Kubernetes authentication token must be set at |
| *kubernetes/kube2msb/values.yaml* so the kube2msb registrator can get the |
| access to watch the kubernetes events and get service annotation by |
| Kubernetes APIs. The token can be found in the kubectl configuration file |
| *~/.kube/config* |
| |
| More details can be found here `MSB installation <http://onap.readthedocs.io/en/latest/submodules/msb/apigateway.git/docs/platform/installation.html>`__. |
| |
| .. MISC |
| .. ==== |
| .. Note that although OOM uses Kubernetes facilities to minimize the effort |
| .. required of the ONAP component owners to implement a successful rolling upgrade |
| .. strategy there are other considerations that must be taken into consideration. |
| .. For example, external APIs - both internal and external to ONAP - should be |
| .. designed to gracefully accept transactions from a peer at a different software |
| .. version to avoid deadlock situations. Embedded version codes in messages may |
| .. facilitate such capabilities. |
| .. |
| .. Within each of the projects a new configuration repository contains all of the |
| .. project specific configuration artifacts. As changes are made within the |
| .. project, it's the responsibility of the project team to make appropriate |
| .. changes to the configuration data. |