blob: 7c8e16b5bf97b76560bf2f97dacf0481d7d797b0 [file] [log] [blame]
Eric Debeau993b77b2020-08-19 15:30:00 +02001.. This work is licensed under a Creative Commons Attribution 4.0
2.. International License.
Roger Maitland953b5f12018-03-22 15:24:04 -04003.. http://creativecommons.org/licenses/by/4.0
Eric Debeau993b77b2020-08-19 15:30:00 +02004.. Copyright 2018-2020 Amdocs, Bell Canada, Orange, Samsung
Jessica Wagantallafb1ead2020-04-09 12:39:10 -07005.. _oom_user_guide:
Roger Maitland953b5f12018-03-22 15:24:04 -04006
7.. Links
8.. _Curated applications for Kubernetes: https://github.com/kubernetes/charts
9.. _Services: https://kubernetes.io/docs/concepts/services-networking/service/
10.. _ReplicaSet: https://kubernetes.io/docs/concepts/workloads/controllers/replicaset/
11.. _StatefulSet: https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/
12.. _Helm Documentation: https://docs.helm.sh/helm/
13.. _Helm: https://docs.helm.sh/
14.. _Kubernetes: https://Kubernetes.io/
Eric Debeauc4e405f2020-12-07 14:49:52 +010015.. _Kubernetes LoadBalancer: https://kubernetes.io/docs/concepts/services-networking/service/#loadbalancer
Pawel Wieczorek96359152020-12-02 14:27:40 +010016.. _`Docker installation guide`: https://docs.docker.com/engine/install/
Roger Maitland953b5f12018-03-22 15:24:04 -040017.. _user-guide-label:
18
19OOM User Guide
20##############
21
22The ONAP Operations Manager (OOM) provide the ability to manage the entire
23life-cycle of an ONAP installation, from the initial deployment to final
24decommissioning. This guide provides instructions for users of ONAP to
25use the Kubernetes_/Helm_ system as a complete ONAP management system.
26
27This guide provides many examples of Helm command line operations. For a
28complete description of these commands please refer to the `Helm
29Documentation`_.
30
31.. figure:: oomLogoV2-medium.png
32 :align: right
33
34The following sections describe the life-cycle operations:
35
36- Deploy_ - with built-in component dependency management
37- Configure_ - unified configuration across all ONAP components
38- Monitor_ - real-time health monitoring feeding to a Consul UI and Kubernetes
39- Heal_- failed ONAP containers are recreated automatically
40- Scale_ - cluster ONAP services to enable seamless scaling
Eric Debeau993b77b2020-08-19 15:30:00 +020041- Upgrade_ - change-out containers or configuration with little or no service
42 impact
Roger Maitland953b5f12018-03-22 15:24:04 -040043- Delete_ - cleanup individual containers or entire deployments
44
45.. figure:: oomLogoV2-Deploy.png
46 :align: right
47
48Deploy
49======
50
51The OOM team with assistance from the ONAP project teams, have built a
52comprehensive set of Helm charts, yaml files very similar to TOSCA files, that
53describe the composition of each of the ONAP components and the relationship
54within and between components. Using this model Helm is able to deploy all of
Roger Maitlandbb8adda2018-04-05 16:18:11 -040055ONAP with a few simple commands.
56
57Pre-requisites
58--------------
Pawel Wieczorek96359152020-12-02 14:27:40 +010059Your environment must have Docker installed as well as both the Kubernetes
60`kubectl` and Helm setup as a one time activity.
61
62Install Docker
63~~~~~~~~~~~~~~
64Follow official `Docker installation guide`_.
Roger Maitlandbb8adda2018-04-05 16:18:11 -040065
66Install Kubectl
67~~~~~~~~~~~~~~~
Sylvain Desbureaux983c7552019-01-28 13:59:43 +010068Enter the following to install kubectl (on Ubuntu, there are slight differences
69on other O/Ss), the Kubernetes command line interface used to manage a
70Kubernetes cluster::
Roger Maitlandbb8adda2018-04-05 16:18:11 -040071
Michael O'Brien42d87d02018-04-18 17:17:54 -040072 > curl -LO https://storage.googleapis.com/kubernetes-release/release/v1.8.10/bin/linux/amd64/kubectl
Roger Maitlandbb8adda2018-04-05 16:18:11 -040073 > chmod +x ./kubectl
74 > sudo mv ./kubectl /usr/local/bin/kubectl
75 > mkdir ~/.kube
76
Sylvain Desbureaux983c7552019-01-28 13:59:43 +010077Paste kubectl config from Rancher (see the :ref:`cloud-setup-guide-label` for
78alternative Kubernetes environment setups) into the `~/.kube/config` file.
Roger Maitlandbb8adda2018-04-05 16:18:11 -040079
80Verify that the Kubernetes config is correct::
81
82 > kubectl get pods --all-namespaces
83
84At this point you should see six Kubernetes pods running.
85
86Install Helm
87~~~~~~~~~~~~
Sylvain Desbureaux983c7552019-01-28 13:59:43 +010088Helm is used by OOM for package and configuration management. To install Helm,
89enter the following::
Roger Maitlandbb8adda2018-04-05 16:18:11 -040090
Michael O'Brien633217a2018-08-15 14:10:43 -040091 > wget http://storage.googleapis.com/kubernetes-helm/helm-v2.9.1-linux-amd64.tar.gz
92 > tar -zxvf helm-v2.9.1-linux-amd64.tar.gz
Roger Maitlandbb8adda2018-04-05 16:18:11 -040093 > sudo mv linux-amd64/helm /usr/local/bin/helm
94
95Verify the Helm version with::
96
97 > helm version
98
99Install the Helm Tiller application and initialize with::
100
101 > helm init
102
103Install the Helm Repo
104---------------------
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100105Once kubectl and Helm are setup, one needs to setup a local Helm server to
106server up the ONAP charts::
Roger Maitland953b5f12018-03-22 15:24:04 -0400107
108 > helm install osn/onap
109
110.. note::
111 The osn repo is not currently available so creation of a local repository is
112 required.
113
114Helm is able to use charts served up from a repository and comes setup with a
115default CNCF provided `Curated applications for Kubernetes`_ repository called
116stable which should be removed to avoid confusion::
117
118 > helm repo remove stable
119
120.. To setup the Open Source Networking Nexus repository for helm enter::
121.. > helm repo add osn 'https://nexus3.onap.org:10001/helm/helm-repo-in-nexus/master/'
122
123To prepare your system for an installation of ONAP, you'll need to::
124
Sylvain Desbureaux34a250b2020-06-08 15:02:11 +0200125 > git clone -b frankfurt --recurse-submodules -j2 http://gerrit.onap.org/r/oom
Roger Maitlandbb8adda2018-04-05 16:18:11 -0400126 > cd oom/kubernetes
127
Roger Maitland953b5f12018-03-22 15:24:04 -0400128
Roger Maitland953b5f12018-03-22 15:24:04 -0400129To setup a local Helm server to server up the ONAP charts::
130
Roger Maitlandbb8adda2018-04-05 16:18:11 -0400131 > helm init
Roger Maitland953b5f12018-03-22 15:24:04 -0400132 > helm serve &
133
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100134Note the port number that is listed and use it in the Helm repo add as
135follows::
Roger Maitland953b5f12018-03-22 15:24:04 -0400136
137 > helm repo add local http://127.0.0.1:8879
138
139To get a list of all of the available Helm chart repositories::
140
141 > helm repo list
142 NAME URL
143 local http://127.0.0.1:8879
144
Roger Maitland9e5067c2018-03-27 10:57:08 -0400145Then build your local Helm repository::
146
Jakub Latusekdb52a6d2020-10-15 15:02:47 +0200147 > make SKIP_LINT=TRUE [HELM_BIN=<HELM_PATH>] all
148
149`HELM_BIN`
150 Sets the helm binary to be used. The default value use helm from PATH. Allow the user to have
151 multiple version of helm in operating system and choose which one to use.
Roger Maitland9e5067c2018-03-27 10:57:08 -0400152
Roger Maitland953b5f12018-03-22 15:24:04 -0400153The Helm search command reads through all of the repositories configured on the
154system, and looks for matches::
155
156 > helm search -l
157 NAME VERSION DESCRIPTION
Sylvain Desbureaux4f9902b2020-11-25 08:34:55 +0100158 local/appc 7.0.0 Application Controller
159 local/clamp 7.0.0 ONAP Clamp
160 local/common 7.0.0 Common templates for inclusion in other charts
161 local/onap 7.0.0 Open Network Automation Platform (ONAP)
162 local/robot 7.0.0 A helm Chart for kubernetes-ONAP Robot
163 local/so 7.0.0 ONAP Service Orchestrator
Roger Maitland953b5f12018-03-22 15:24:04 -0400164
165In any case, setup of the Helm repository is a one time activity.
166
Pawel Wieczorek1d4b96f2019-01-23 16:46:56 +0100167Next, install Helm Plugins required to deploy the ONAP Casablanca release::
168
169 > cp -R helm/plugins/ ~/.helm
170
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100171Once the repo is setup, installation of ONAP can be done with a single
172command::
Roger Maitland953b5f12018-03-22 15:24:04 -0400173
Pawel Wieczorek1d4b96f2019-01-23 16:46:56 +0100174 > helm deploy development local/onap --namespace onap
Roger Maitland953b5f12018-03-22 15:24:04 -0400175
176This will install ONAP from a local repository in a 'development' Helm release.
177As described below, to override the default configuration values provided by
178OOM, an environment file can be provided on the command line as follows::
179
Pawel Wieczorek1d4b96f2019-01-23 16:46:56 +0100180 > helm deploy development local/onap --namespace onap -f overrides.yaml
Roger Maitland953b5f12018-03-22 15:24:04 -0400181
182To get a summary of the status of all of the pods (containers) running in your
183deployment::
184
185 > kubectl get pods --all-namespaces -o=wide
186
187.. note::
188 The Kubernetes namespace concept allows for multiple instances of a component
189 (such as all of ONAP) to co-exist with other components in the same
190 Kubernetes cluster by isolating them entirely. Namespaces share only the
191 hosts that form the cluster thus providing isolation between production and
192 development systems as an example. The OOM deployment of ONAP in Beijing is
193 now done within a single Kubernetes namespace where in Amsterdam a namespace
194 was created for each of the ONAP components.
195
196.. note::
Roger Maitlandd96413f2018-04-09 10:06:07 -0400197 The Helm `--name` option refers to a release name and not a Kubernetes namespace.
Roger Maitland953b5f12018-03-22 15:24:04 -0400198
199
200To install a specific version of a single ONAP component (`so` in this example)
Pawel Wieczorek1d4b96f2019-01-23 16:46:56 +0100201with the given release name enter::
Roger Maitland953b5f12018-03-22 15:24:04 -0400202
Pawel Wieczorek1d4b96f2019-01-23 16:46:56 +0100203 > helm deploy so onap/so --version 3.0.1
Roger Maitland953b5f12018-03-22 15:24:04 -0400204
205To display details of a specific resource or group of resources type::
206
207 > kubectl describe pod so-1071802958-6twbl
208
209where the pod identifier refers to the auto-generated pod identifier.
210
211.. figure:: oomLogoV2-Configure.png
212 :align: right
213
214Configure
215=========
216
217Each project within ONAP has its own configuration data generally consisting
218of: environment variables, configuration files, and database initial values.
219Many technologies are used across the projects resulting in significant
220operational complexity and an inability to apply global parameters across the
221entire ONAP deployment. OOM solves this problem by introducing a common
222configuration technology, Helm charts, that provide a hierarchical
Gildas Lanilis64d17ae2018-05-18 16:58:05 -0700223configuration with the ability to override values with higher
Roger Maitland953b5f12018-03-22 15:24:04 -0400224level charts or command line options.
225
226The structure of the configuration of ONAP is shown in the following diagram.
227Note that key/value pairs of a parent will always take precedence over those
228of a child. Also note that values set on the command line have the highest
229precedence of all.
230
231.. graphviz::
232
233 digraph config {
234 {
235 node [shape=folder]
236 oValues [label="values.yaml"]
237 demo [label="onap-demo.yaml"]
238 prod [label="onap-production.yaml"]
239 oReq [label="requirements.yaml"]
240 soValues [label="values.yaml"]
241 soReq [label="requirements.yaml"]
242 mdValues [label="values.yaml"]
243 }
244 {
245 oResources [label="resources"]
246 }
247 onap -> oResources
248 onap -> oValues
249 oResources -> environments
250 oResources -> oReq
251 oReq -> so
252 environments -> demo
253 environments -> prod
254 so -> soValues
255 so -> soReq
256 so -> charts
257 charts -> mariadb
258 mariadb -> mdValues
259
260 }
261
262The top level onap/values.yaml file contains the values required to be set
263before deploying ONAP. Here is the contents of this file:
264
Pawel Wieczoreka1903d62019-11-14 14:19:59 +0100265.. include:: ../kubernetes/onap/values.yaml
Roger Maitland953b5f12018-03-22 15:24:04 -0400266 :code: yaml
267
268One may wish to create a value file that is specific to a given deployment such
269that it can be differentiated from other deployments. For example, a
270onap-development.yaml file may create a minimal environment for development
271while onap-production.yaml might describe a production deployment that operates
272independently of the developer version.
273
274For example, if the production OpenStack instance was different from a
275developer's instance, the onap-production.yaml file may contain a different
276value for the vnfDeployment/openstack/oam_network_cidr key as shown below.
277
278.. code-block:: yaml
279
280 nsPrefix: onap
281 nodePortPrefix: 302
282 apps: consul msb mso message-router sdnc vid robot portal policy appc aai
283 sdc dcaegen2 log cli multicloud clamp vnfsdk aaf kube2msb
284 dataRootDir: /dockerdata-nfs
285
286 # docker repositories
287 repository:
288 onap: nexus3.onap.org:10001
289 oom: oomk8s
290 aai: aaionap
291 filebeat: docker.elastic.co
292
293 image:
294 pullPolicy: Never
295
296 # vnf deployment environment
297 vnfDeployment:
298 openstack:
299 ubuntu_14_image: "Ubuntu_14.04.5_LTS"
300 public_net_id: "e8f51956-00dd-4425-af36-045716781ffc"
301 oam_network_id: "d4769dfb-c9e4-4f72-b3d6-1d18f4ac4ee6"
302 oam_subnet_id: "191f7580-acf6-4c2b-8ec0-ba7d99b3bc4e"
303 oam_network_cidr: "192.168.30.0/24"
304 <...>
305
306
307To deploy ONAP with this environment file, enter::
308
Sylvain Desbureaux5e19e242020-03-02 14:41:48 +0100309 > helm deploy local/onap -n onap -f environments/onap-production.yaml
Roger Maitland953b5f12018-03-22 15:24:04 -0400310
311.. include:: environments_onap_demo.yaml
312 :code: yaml
313
314When deploying all of ONAP a requirements.yaml file control which and what
315version of the ONAP components are included. Here is an excerpt of this
316file:
317
318.. code-block:: yaml
319
320 # Referencing a named repo called 'local'.
321 # Can add this repo by running commands like:
322 # > helm serve
323 # > helm repo add local http://127.0.0.1:8879
324 dependencies:
325 <...>
326 - name: so
327 version: ~2.0.0
328 repository: '@local'
329 condition: so.enabled
330 <...>
331
332The ~ operator in the `so` version value indicates that the latest "2.X.X"
333version of `so` shall be used thus allowing the chart to allow for minor
334upgrades that don't impact the so API; hence, version 2.0.1 will be installed
335in this case.
336
337The onap/resources/environment/onap-dev.yaml (see the excerpt below) enables
338for fine grained control on what components are included as part of this
339deployment. By changing this `so` line to `enabled: false` the `so` component
340will not be deployed. If this change is part of an upgrade the existing `so`
341component will be shut down. Other `so` parameters and even `so` child values
342can be modified, for example the `so`'s `liveness` probe could be disabled
343(which is not recommended as this change would disable auto-healing of `so`).
344
345.. code-block:: yaml
346
347 #################################################################
348 # Global configuration overrides.
349 #
350 # These overrides will affect all helm charts (ie. applications)
351 # that are listed below and are 'enabled'.
352 #################################################################
353 global:
354 <...>
355
356 #################################################################
357 # Enable/disable and configure helm charts (ie. applications)
358 # to customize the ONAP deployment.
359 #################################################################
360 aaf:
361 enabled: false
362 <...>
363 so: # Service Orchestrator
364 enabled: true
365
366 replicaCount: 1
367
368 liveness:
369 # necessary to disable liveness probe when setting breakpoints
370 # in debugger so K8s doesn't restart unresponsive container
371 enabled: true
372
373 <...>
374
Roger Maitlandda221582018-05-10 13:43:58 -0400375Accessing the ONAP Portal using OOM and a Kubernetes Cluster
376------------------------------------------------------------
377
378The ONAP deployment created by OOM operates in a private IP network that isn't
Eric Debeau993b77b2020-08-19 15:30:00 +0200379publicly accessible (i.e. OpenStack VMs with private internal network) which
Roger Maitlandda221582018-05-10 13:43:58 -0400380blocks access to the ONAP Portal. To enable direct access to this Portal from a
381user's own environment (a laptop etc.) the portal application's port 8989 is
382exposed through a `Kubernetes LoadBalancer`_ object.
383
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100384Typically, to be able to access the Kubernetes nodes publicly a public address
Eric Debeau993b77b2020-08-19 15:30:00 +0200385is assigned. In OpenStack this is a floating IP address.
Roger Maitlandda221582018-05-10 13:43:58 -0400386
387When the `portal-app` chart is deployed a Kubernetes service is created that
388instantiates a load balancer. The LB chooses the private interface of one of
389the nodes as in the example below (10.0.0.4 is private to the K8s cluster only).
390Then to be able to access the portal on port 8989 from outside the K8s &
Eric Debeau993b77b2020-08-19 15:30:00 +0200391OpenStack environment, the user needs to assign/get the floating IP address that
Roger Maitlandda221582018-05-10 13:43:58 -0400392corresponds to the private IP as follows::
393
394 > kubectl -n onap get services|grep "portal-app"
395 portal-app LoadBalancer 10.43.142.201 10.0.0.4 8989:30215/TCP,8006:30213/TCP,8010:30214/TCP 1d app=portal-app,release=dev
396
397
398In this example, use the 10.0.0.4 private address as a key find the
399corresponding public address which in this example is 10.12.6.155. If you're
Eric Debeau993b77b2020-08-19 15:30:00 +0200400using OpenStack you'll do the lookup with the horizon GUI or the OpenStack CLI
Roger Maitlandda221582018-05-10 13:43:58 -0400401for your tenant (openstack server list). That IP is then used in your
402`/etc/hosts` to map the fixed DNS aliases required by the ONAP Portal as shown
403below::
404
405 10.12.6.155 portal.api.simpledemo.onap.org
406 10.12.6.155 vid.api.simpledemo.onap.org
407 10.12.6.155 sdc.api.fe.simpledemo.onap.org
andreasgeissler4a618ba2018-11-30 14:20:46 +0000408 10.12.6.155 sdc.workflow.plugin.simpledemo.onap.org
409 10.12.6.155 sdc.dcae.plugin.simpledemo.onap.org
Roger Maitlandda221582018-05-10 13:43:58 -0400410 10.12.6.155 portal-sdk.simpledemo.onap.org
411 10.12.6.155 policy.api.simpledemo.onap.org
412 10.12.6.155 aai.api.sparky.simpledemo.onap.org
413 10.12.6.155 cli.api.simpledemo.onap.org
414 10.12.6.155 msb.api.discovery.simpledemo.onap.org
andreasgeissler4a618ba2018-11-30 14:20:46 +0000415 10.12.6.155 msb.api.simpledemo.onap.org
416 10.12.6.155 clamp.api.simpledemo.onap.org
417 10.12.6.155 so.api.simpledemo.onap.org
Sylvain Desbureaux1037d752020-04-20 14:17:16 +0200418 10.12.6.155 sdc.workflow.plugin.simpledemo.onap.org
Roger Maitlandda221582018-05-10 13:43:58 -0400419
420Ensure you've disabled any proxy settings the browser you are using to access
andreasgeissler4a618ba2018-11-30 14:20:46 +0000421the portal and then simply access now the new ssl-encrypted URL:
Eric Debeauc4e405f2020-12-07 14:49:52 +0100422``https://portal.api.simpledemo.onap.org:30225/ONAPPORTAL/login.htm``
Roger Maitlandda221582018-05-10 13:43:58 -0400423
andreasgeissler4a618ba2018-11-30 14:20:46 +0000424.. note::
425 Using the HTTPS based Portal URL the Browser needs to be configured to accept
426 unsecure credentials.
427 Additionally when opening an Application inside the Portal, the Browser
428 might block the content, which requires to disable the blocking and reloading
429 of the page
430
431.. note::
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100432 Besides the ONAP Portal the Components can deliver additional user interfaces,
andreasgeissler4a618ba2018-11-30 14:20:46 +0000433 please check the Component specific documentation.
Roger Maitlandda221582018-05-10 13:43:58 -0400434
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100435.. note::
Roger Maitlandda221582018-05-10 13:43:58 -0400436
Hector Anapan-Lavalle55547da2018-07-26 13:33:17 -0400437 | Alternatives Considered:
438
439 - Kubernetes port forwarding was considered but discarded as it would require
440 the end user to run a script that opens up port forwarding tunnels to each of
441 the pods that provides a portal application widget.
442
443 - Reverting to a VNC server similar to what was deployed in the Amsterdam
444 release was also considered but there were many issues with resolution, lack
445 of volume mount, /etc/hosts dynamic update, file upload that were a tall order
446 to solve in time for the Beijing release.
447
448 Observations:
449
450 - If you are not using floating IPs in your Kubernetes deployment and directly attaching
451 a public IP address (i.e. by using your public provider network) to your K8S Node
452 VMs' network interface, then the output of 'kubectl -n onap get services | grep "portal-app"'
453 will show your public IP instead of the private network's IP. Therefore,
454 you can grab this public IP directly (as compared to trying to find the floating
455 IP first) and map this IP in /etc/hosts.
Roger Maitlandda221582018-05-10 13:43:58 -0400456
Roger Maitland953b5f12018-03-22 15:24:04 -0400457.. figure:: oomLogoV2-Monitor.png
458 :align: right
459
460Monitor
461=======
462
463All highly available systems include at least one facility to monitor the
464health of components within the system. Such health monitors are often used as
Eric Debeau993b77b2020-08-19 15:30:00 +0200465inputs to distributed coordination systems (such as etcd, Zookeeper, or Consul)
466and monitoring systems (such as Nagios or Zabbix). OOM provides two mechanisms
Roger Maitland953b5f12018-03-22 15:24:04 -0400467to monitor the real-time health of an ONAP deployment:
468
469- a Consul GUI for a human operator or downstream monitoring systems and
470 Kubernetes liveness probes that enable automatic healing of failed
471 containers, and
472- a set of liveness probes which feed into the Kubernetes manager which
473 are described in the Heal section.
474
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100475Within ONAP, Consul is the monitoring system of choice and deployed by OOM in
476two parts:
Roger Maitland953b5f12018-03-22 15:24:04 -0400477
478- a three-way, centralized Consul server cluster is deployed as a highly
Gildas Lanilis64d17ae2018-05-18 16:58:05 -0700479 available monitor of all of the ONAP components, and
Roger Maitland953b5f12018-03-22 15:24:04 -0400480- a number of Consul agents.
481
482The Consul server provides a user interface that allows a user to graphically
483view the current health status of all of the ONAP components for which agents
484have been created - a sample from the ONAP Integration labs follows:
485
486.. figure:: consulHealth.png
487 :align: center
488
Eric Debeauc4e405f2020-12-07 14:49:52 +0100489To see the real-time health of a deployment go to: ``http://<kubernetes IP>:30270/ui/``
Roger Maitland953b5f12018-03-22 15:24:04 -0400490where a GUI much like the following will be found:
491
492
493.. figure:: oomLogoV2-Heal.png
494 :align: right
495
496Heal
497====
498
499The ONAP deployment is defined by Helm charts as mentioned earlier. These Helm
500charts are also used to implement automatic recoverability of ONAP components
501when individual components fail. Once ONAP is deployed, a "liveness" probe
502starts checking the health of the components after a specified startup time.
503
504Should a liveness probe indicate a failed container it will be terminated and a
505replacement will be started in its place - containers are ephemeral. Should the
506deployment specification indicate that there are one or more dependencies to
507this container or component (for example a dependency on a database) the
508dependency will be satisfied before the replacement container/component is
509started. This mechanism ensures that, after a failure, all of the ONAP
510components restart successfully.
511
512To test healing, the following command can be used to delete a pod::
513
514 > kubectl delete pod [pod name] -n [pod namespace]
515
516One could then use the following command to monitor the pods and observe the
517pod being terminated and the service being automatically healed with the
518creation of a replacement pod::
519
520 > kubectl get pods --all-namespaces -o=wide
521
522.. figure:: oomLogoV2-Scale.png
523 :align: right
524
525Scale
526=====
527
528Many of the ONAP components are horizontally scalable which allows them to
529adapt to expected offered load. During the Beijing release scaling is static,
530that is during deployment or upgrade a cluster size is defined and this cluster
531will be maintained even in the presence of faults. The parameter that controls
532the cluster size of a given component is found in the values.yaml file for that
533component. Here is an excerpt that shows this parameter:
534
535.. code-block:: yaml
536
537 # default number of instances
538 replicaCount: 1
539
540In order to change the size of a cluster, an operator could use a helm upgrade
541(described in detail in the next section) as follows::
542
543 > helm upgrade --set replicaCount=3 onap/so/mariadb
544
545The ONAP components use Kubernetes provided facilities to build clustered,
546highly available systems including: Services_ with load-balancers, ReplicaSet_,
547and StatefulSet_. Some of the open-source projects used by the ONAP components
548directly support clustered configurations, for example ODL and MariaDB Galera.
549
550The Kubernetes Services_ abstraction to provide a consistent access point for
551each of the ONAP components, independent of the pod or container architecture
552of that component. For example, SDN-C uses OpenDaylight clustering with a
553default cluster size of three but uses a Kubernetes service to and change the
554number of pods in this abstract this cluster from the other ONAP components
555such that the cluster could change size and this change is isolated from the
556other ONAP components by the load-balancer implemented in the ODL service
557abstraction.
558
559A ReplicaSet_ is a construct that is used to describe the desired state of the
560cluster. For example 'replicas: 3' indicates to Kubernetes that a cluster of 3
561instances is the desired state. Should one of the members of the cluster fail,
562a new member will be automatically started to replace it.
563
564Some of the ONAP components many need a more deterministic deployment; for
565example to enable intra-cluster communication. For these applications the
566component can be deployed as a Kubernetes StatefulSet_ which will maintain a
567persistent identifier for the pods and thus a stable network id for the pods.
568For example: the pod names might be web-0, web-1, web-{N-1} for N 'web' pods
569with corresponding DNS entries such that intra service communication is simple
570even if the pods are physically distributed across multiple nodes. An example
571of how these capabilities can be used is described in the Running Consul on
572Kubernetes tutorial.
573
574.. figure:: oomLogoV2-Upgrade.png
575 :align: right
576
577Upgrade
578=======
579
580Helm has built-in capabilities to enable the upgrade of pods without causing a
581loss of the service being provided by that pod or pods (if configured as a
582cluster). As described in the OOM Developer's Guide, ONAP components provide
583an abstracted 'service' end point with the pods or containers providing this
584service hidden from other ONAP components by a load balancer. This capability
585is used during upgrades to allow a pod with a new image to be added to the
586service before removing the pod with the old image. This 'make before break'
587capability ensures minimal downtime.
588
589Prior to doing an upgrade, determine of the status of the deployed charts::
590
591 > helm list
592 NAME REVISION UPDATED STATUS CHART NAMESPACE
593 so 1 Mon Feb 5 10:05:22 2018 DEPLOYED so-2.0.1 default
594
595When upgrading a cluster a parameter controls the minimum size of the cluster
596during the upgrade while another parameter controls the maximum number of nodes
597in the cluster. For example, SNDC configured as a 3-way ODL cluster might
598require that during the upgrade no fewer than 2 pods are available at all times
599to provide service while no more than 5 pods are ever deployed across the two
600versions at any one time to avoid depleting the cluster of resources. In this
601scenario, the SDNC cluster would start with 3 old pods then Kubernetes may add
602a new pod (3 old, 1 new), delete one old (2 old, 1 new), add two new pods (2
603old, 3 new) and finally delete the 2 old pods (3 new). During this sequence
604the constraints of the minimum of two pods and maximum of five would be
605maintained while providing service the whole time.
606
607Initiation of an upgrade is triggered by changes in the Helm charts. For
608example, if the image specified for one of the pods in the SDNC deployment
609specification were to change (i.e. point to a new Docker image in the nexus3
610repository - commonly through the change of a deployment variable), the
611sequence of events described in the previous paragraph would be initiated.
612
613For example, to upgrade a container by changing configuration, specifically an
614environment value::
615
Sylvain Desbureaux5e19e242020-03-02 14:41:48 +0100616 > helm deploy onap onap/so --version 2.0.1 --set enableDebug=true
Roger Maitland953b5f12018-03-22 15:24:04 -0400617
618Issuing this command will result in the appropriate container being stopped by
619Kubernetes and replaced with a new container with the new environment value.
620
621To upgrade a component to a new version with a new configuration file enter::
622
Eric Debeau993b77b2020-08-19 15:30:00 +0200623 > helm deploy onap onap/so --version 2.0.2 -f environments/demo.yaml
Roger Maitland953b5f12018-03-22 15:24:04 -0400624
625To fetch release history enter::
626
627 > helm history so
628 REVISION UPDATED STATUS CHART DESCRIPTION
629 1 Mon Feb 5 10:05:22 2018 SUPERSEDED so-2.0.1 Install complete
630 2 Mon Feb 5 10:10:55 2018 DEPLOYED so-2.0.2 Upgrade complete
631
632Unfortunately, not all upgrades are successful. In recognition of this the
633lineup of pods within an ONAP deployment is tagged such that an administrator
634may force the ONAP deployment back to the previously tagged configuration or to
635a specific configuration, say to jump back two steps if an incompatibility
636between two ONAP components is discovered after the two individual upgrades
637succeeded.
638
639This rollback functionality gives the administrator confidence that in the
640unfortunate circumstance of a failed upgrade the system can be rapidly brought
641back to a known good state. This process of rolling upgrades while under
642service is illustrated in this short YouTube video showing a Zero Downtime
643Upgrade of a web application while under a 10 million transaction per second
644load.
645
646For example, to roll-back back to previous system revision enter::
647
648 > helm rollback so 1
649
650 > helm history so
651 REVISION UPDATED STATUS CHART DESCRIPTION
652 1 Mon Feb 5 10:05:22 2018 SUPERSEDED so-2.0.1 Install complete
653 2 Mon Feb 5 10:10:55 2018 SUPERSEDED so-2.0.2 Upgrade complete
654 3 Mon Feb 5 10:14:32 2018 DEPLOYED so-2.0.1 Rollback to 1
655
656.. note::
657
658 The description field can be overridden to document actions taken or include
659 tracking numbers.
660
661Many of the ONAP components contain their own databases which are used to
662record configuration or state information. The schemas of these databases may
663change from version to version in such a way that data stored within the
664database needs to be migrated between versions. If such a migration script is
665available it can be invoked during the upgrade (or rollback) by Container
666Lifecycle Hooks. Two such hooks are available, PostStart and PreStop, which
667containers can access by registering a handler against one or both. Note that
668it is the responsibility of the ONAP component owners to implement the hook
669handlers - which could be a shell script or a call to a specific container HTTP
670endpoint - following the guidelines listed on the Kubernetes site. Lifecycle
671hooks are not restricted to database migration or even upgrades but can be used
672anywhere specific operations need to be taken during lifecycle operations.
673
674OOM uses Helm K8S package manager to deploy ONAP components. Each component is
675arranged in a packaging format called a chart - a collection of files that
676describe a set of k8s resources. Helm allows for rolling upgrades of the ONAP
677component deployed. To upgrade a component Helm release you will need an
678updated Helm chart. The chart might have modified, deleted or added values,
679deployment yamls, and more. To get the release name use::
680
681 > helm ls
682
683To easily upgrade the release use::
684
685 > helm upgrade [RELEASE] [CHART]
686
687To roll back to a previous release version use::
688
689 > helm rollback [flags] [RELEASE] [REVISION]
690
691For example, to upgrade the onap-so helm release to the latest SO container
692release v1.1.2:
693
694- Edit so values.yaml which is part of the chart
695- Change "so: nexus3.onap.org:10001/openecomp/so:v1.1.1" to
696 "so: nexus3.onap.org:10001/openecomp/so:v1.1.2"
697- From the chart location run::
698
699 > helm upgrade onap-so
700
701The previous so pod will be terminated and a new so pod with an updated so
702container will be created.
703
704.. figure:: oomLogoV2-Delete.png
705 :align: right
706
707Delete
708======
709
710Existing deployments can be partially or fully removed once they are no longer
711needed. To minimize errors it is recommended that before deleting components
712from a running deployment the operator perform a 'dry-run' to display exactly
713what will happen with a given command prior to actually deleting anything. For
714example::
715
Sylvain Desbureaux5e19e242020-03-02 14:41:48 +0100716 > helm undeploy onap --dry-run
Roger Maitland953b5f12018-03-22 15:24:04 -0400717
Sylvain Desbureaux5e19e242020-03-02 14:41:48 +0100718will display the outcome of deleting the 'onap' release from the
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100719deployment.
Roger Maitland953b5f12018-03-22 15:24:04 -0400720To completely delete a release and remove it from the internal store enter::
721
Sylvain Desbureaux5e19e242020-03-02 14:41:48 +0100722 > helm undeploy onap --purge
Roger Maitland953b5f12018-03-22 15:24:04 -0400723
724One can also remove individual components from a deployment by changing the
725ONAP configuration values. For example, to remove `so` from a running
726deployment enter::
727
Sylvain Desbureaux5e19e242020-03-02 14:41:48 +0100728 > helm undeploy onap-so --purge
Roger Maitland953b5f12018-03-22 15:24:04 -0400729
730will remove `so` as the configuration indicates it's no longer part of the
731deployment. This might be useful if a one wanted to replace just `so` by
Sylvain Desbureaux983c7552019-01-28 13:59:43 +0100732installing a custom version.