Container-based network simulation

The "make test" framework provides a good way to test individual features. However, when testing several features at once - or validating nontrivial configurations - it may prove difficult or impossible to use the unit-test framework.

This note explains how to set up lxc/lxd, and a 5-container testbed to test a split-tunnel nat + ikev2 + ipsec + ipv6 prefix-delegation scenario.

OS / Distro test results

This setup has been tested on an Ubuntu 18.04 LTS system. If you're feeling adventurous, the same scenario also worked on a recent Ubuntu 20.04 "preview" daily build.

Other distros may work fine, or not at all.

Proxy Server

If you need to use a proxy server e.g. from a lab system, you'll probably need to set HTTP_PROXY, HTTPS_PROXY, http_proxy and https_proxy in /etc/environment. Directly setting variables in the environment doesn't work. The lxd snap daemon needs the proxy settings, not the user interface.

Something like so:

    HTTP_PROXY=http://my.proxy.server:8080
    HTTPS_PROXY=http://my.proxy.server:4333
    http_proxy=http://my.proxy.server:8080
    https_proxy=http://my.proxy.server:4333

Install and configure lxd

Install the lxd snap. The lxd snap is up to date, as opposed to the results of "sudo apt-get install lxd".

    # snap install lxd
    # lxd init

"lxd init" asks several questions. With the exception of the storage pool, take the defaults. To match the configs shown below, create a storage pool named "vpp." Storage pools of type "zfs" and "files" have been tested successfully.

zfs is more space-efficient. "lxc copy" is infinitely faster with zfs. The path for the zfs storage pool is under /var. Do not replace it with a symbolic link, unless you want to rebuild all of your containers from scratch. Ask me how I know that.

Create three network segments

Aka, linux bridges.

    # lxc network create respond
    # lxc network create internet
    # lxc network create initiate

We'll explain the test topology in a bit. Stay tuned.

Set up the default container profile

Execute "lxc profile edit default", and install the following configuration. Note that the "shared" directory should mount your vpp workspaces. With that trick, you can edit code from any of the containers, run vpp without installing it, etc.

    config: {}
    description: Default LXD profile
    devices:
      eth0:
        name: eth0
        network: lxdbr0
        type: nic
      eth1:
        name: eth1
        nictype: bridged
        parent: internet
        type: nic
      eth2:
        name: eth2
        nictype: bridged
        parent: respond
        type: nic
      eth3:
        name: eth3
        nictype: bridged
        parent: initiate
        type: nic
      root:
        path: /
        pool: vpp
        type: disk
      shared:
        path: /scratch
        source: /scratch
        type: disk
    name: default

Set up the network configurations

Edit the fake "internet" backbone:

  # lxc network edit internet

Install the ip addresses shown below, to avoid having to rebuild the vpp and host configuration:

    config:
      ipv4.address: 10.26.68.1/24
      ipv4.dhcp.ranges: 10.26.68.10-10.26.68.50
      ipv4.nat: "true"
      ipv6.address: none
      ipv6.nat: "false"
    description: ""
    name: internet
    type: bridge
    used_by:
    managed: true
    status: Created
    locations:
    - none

Repeat the process with the "respond" and "initiate" networks, using these configurations:

respond network configuration

    config:
      ipv4.address: 10.166.14.1/24
      ipv4.dhcp.ranges: 10.166.14.10-10.166.14.50
      ipv4.nat: "true"
      ipv6.address: none
      ipv6.nat: "false"
    description: ""
    name: respond
    type: bridge
    used_by:
    managed: true
    status: Created
    locations:
    - none

initiate network configuration

    config:
      ipv4.address: 10.219.188.1/24
      ipv4.dhcp.ranges: 10.219.188.10-10.219.188.50
      ipv4.nat: "true"
      ipv6.address: none
      ipv6.nat: "false"
    description: ""
    name: initiate
    type: bridge
    used_by:
    managed: true
    status: Created
    locations:
    - none

Create a "master" container image

The master container image should be set up so that you can build vpp, ssh into the container, edit source code, run gdb, etc.

Make sure that e.g. public key auth ssh works.

    # lxd launch ubuntu:18.04 respond
    <spew>
    # lxc exec respond bash
    respond# cd /scratch/my-vpp-workspace
    respond# apt-get install make ssh
    respond# make install-dep
    respond# exit
    # lxc stop respond

Mark the container image privileged. If you forget this step, you'll trip over a netlink error (-11) aka EAGAIN when you try to roll in the vpp configurations.

    # lxc config set respond security.privileged "true"

Duplicate the "master" container image

To avoid having to configure N containers, be sure that the master container image is fully set up before you help it have children:

    # lxc copy respond respondhost
    # lxc copy respond initiate
    # lxc copy respond initiatehost
    # lxc copy respond dhcpserver    # optional, to test ipv6 prefix delegation

Install handy script

See below for a handly script which executes lxc commands across the current set of running containers. I call it "lxc-foreach," feel free to call the script Ishmael if you like.

Examples:

    $ lxc-foreach start
    <issues "lxc start" for each container in the list>

After a few seconds, use this one to open an ssh connection to each container. The ssh command parses the output of "lxc info," which displays container ip addresses.

    $ lxc-foreach ssh

Here's the script:

    #!/bin/bash

    set -u
    export containers="respond respondhost initiate initiatehost dhcpserver"

    if [ x$1 = "x" ] ; then
        echo missing command
        exit 1
    fi

    if [ $1 = "ssh" ] ; then
        for c in $containers
        do
            inet=`lxc info $c | grep eth0 | grep -v inet6 | head -1 | cut -f 3`
            if [ x$inet = "x" ] ; then
                echo $c not started
            else
                gnome-terminal --command "/usr/bin/ssh $inet"
            fi
        done
    exit 0
    fi

    for c in $containers
    do
        echo lxc $1 $c
        lxc $1 $c
    done

    exit 0

Test topology

Finally, we're ready to describe a test topology. First, a picture:

    ===+======== management lan/bridge lxdbr0 (dhcp) ===========+===
       |                             |                          |
       |                             |                          |
       |                             |                          |
       v                             |                          v
      eth0                           |                         eth0
    +------+ eth1                                       eth1 +------+
    | respond | 10.26.88.100 <= internet bridge => 10.26.88.101 | initiate |
    +------+                                                 +------+
      eth2 / bvi0 10.166.14.2        |       10.219.188.2 eth3 / bvi0
       |                             |                          |
       | ("respond" bridge)             |          ("initiate" bridge) |
       |                             |                          |
       v                             |                          v
      eth2 10.166.14.3               |           eth3 10.219.188.3
    +----------+                     |                   +----------+
    | respondhost |                     |                   | respondhost |
    +----------+                     |                   +----------+
      eth0 (management lan) <========+========> eth0 (management lan)

Test topology discussion

This topology is suitable for testing almost any tunnel encap/decap scenario. The two containers "respondhost" and "initiatehost" are end-stations connected to two vpp instances running on "respond" and "initiate".

We leverage the Linux end-station network stacks to generate traffic of all sorts.

The so-called "internet" bridge models the public internet. The "respond" and "initiate" bridges connect vpp instances to local hosts

End station configs

The end-station Linux configurations set up the eth2 and eth3 ip addresses shown above, and add tunnel routes to the opposite end-station networks.

respondhost configuration

    ifconfig eth2 10.166.14.3/24 up
    route add -net 10.219.188.0/24 gw 10.166.14.2

initiatehost configuration

    sudo ifconfig eth3 10.219.188.3/24 up
    sudo route add -net 10.166.14.0/24 gw 10.219.188.2

VPP configs

Split nat44 / ikev2 + ipsec tunneling, with ipv6 prefix delegation in the "respond" config.

respond configuration

    set term pag off

    comment { "internet" }
    create host-interface name eth1
    set int ip address host-eth1 10.26.68.100/24
    set int ip6 table host-eth1 0
    set int state host-eth1 up

    comment { default route via initiate }
    ip route add 0.0.0.0/0 via 10.26.68.101

    comment { "respond-private-net" }
    create host-interface name eth2
    bvi create instance 0
    set int l2 bridge bvi0 1 bvi
    set int ip address bvi0 10.166.14.2/24
    set int state bvi0 up
    set int l2 bridge host-eth2 1
    set int state host-eth2 up


    nat44 add interface address host-eth1
    set interface nat44 in host-eth2 out host-eth1
    nat44 add identity mapping external host-eth1 udp 500
    nat44 add identity mapping external host-eth1 udp 4500
    comment { nat44 untranslated subnet 10.219.188.0/24 }

    comment { responder profile }
    ikev2 profile add initiate
    ikev2 profile set initiate udp-encap
    ikev2 profile set initiate auth rsa-sig cert-file /scratch/setups/respondcert.pem
    set ikev2 local key /scratch/setups/initiatekey.pem
    ikev2 profile set initiate id local fqdn initiator.my.net
    ikev2 profile set initiate id remote fqdn responder.my.net
    ikev2 profile set initiate traffic-selector remote ip-range 10.219.188.0 - 10.219.188.255 port-range 0 - 65535 protocol 0
    ikev2 profile set initiate traffic-selector local ip-range 10.166.14.0 - 10.166.14.255 port-range 0 - 65535 protocol 0
    create ipip tunnel src 10.26.68.100 dst 10.26.68.101
    ikev2 profile set initiate tunnel ipip0

    comment { ipv6 prefix delegation }
    ip6 nd address autoconfig host-eth1 default-route
    dhcp6 client host-eth1
    dhcp6 pd client host-eth1 prefix group hgw
    set ip6 address bvi0 prefix group hgw ::2/56
    ip6 nd address autoconfig bvi0 default-route
    ip6 nd bvi0 ra-interval 5 3 ra-lifetime 180

    set int mtu packet 1390 ipip0
    set int unnum ipip0 use host-eth1
    ip route add 10.219.188.0/24 via ipip0

initiate configuration

    set term pag off

    comment { "internet" }
    create host-interface name eth1
    comment { set dhcp client intfc host-eth1 hostname initiate }
    set int ip address host-eth1 10.26.68.101/24
    set int state host-eth1 up

    comment { default route via "internet gateway" }
    comment { ip route add 0.0.0.0/0 via 10.26.68.1 }

    comment { "initiate-private-net" }
    create host-interface name eth3
    bvi create instance 0
    set int l2 bridge bvi0 1 bvi
    set int ip address bvi0 10.219.188.2/24
    set int state bvi0 up
    set int l2 bridge host-eth3 1
    set int state host-eth3 up

    nat44 add interface address host-eth1
    set interface nat44 in bvi0 out host-eth1
    nat44 add identity mapping external host-eth1 udp 500
    nat44 add identity mapping external host-eth1 udp 4500
    comment { nat44 untranslated subnet 10.166.14.0/24 }

    comment { initiator profile }
    ikev2 profile add respond
    ikev2 profile set respond udp-encap
    ikev2 profile set respond auth rsa-sig cert-file /scratch/setups/initiatecert.pem
    set ikev2 local key /scratch/setups/respondkey.pem
    ikev2 profile set respond id local fqdn responder.my.net
    ikev2 profile set respond id remote fqdn initiator.my.net

    ikev2 profile set respond traffic-selector remote ip-range 10.166.14.0 - 10.166.14.255 port-range 0 - 65535 protocol 0
    ikev2 profile set respond traffic-selector local ip-range 10.219.188.0 - 10.219.188.255 port-range 0 - 65535 protocol 0

    ikev2 profile set respond responder host-eth1 10.26.68.100
    ikev2 profile set respond ike-crypto-alg aes-cbc 256  ike-integ-alg sha1-96  ike-dh modp-2048
    ikev2 profile set respond esp-crypto-alg aes-cbc 256  esp-integ-alg sha1-96  esp-dh ecp-256
    ikev2 profile set respond sa-lifetime 3600 10 5 0

    create ipip tunnel src 10.26.68.101 dst 10.26.68.100
    ikev2 profile set respond tunnel ipip0
    ikev2 initiate sa-init respond

    set int mtu packet 1390 ipip0
    set int unnum ipip0 use host-eth1
    ip route add 10.166.14.0/24 via ipip0

IKEv2 certificate setup

In both of the vpp configurations, you'll see "/scratch/setups/xxx.pem" mentioned. These certificates are used in the ikev2 key exchange.

Here's how to generate the certificates:

    openssl req -x509 -nodes -newkey rsa:4096 -keyout respondkey.pem -out respondcert.pem -days 3560
    openssl x509 -text -noout -in respondcert.pem
    openssl req -x509 -nodes -newkey rsa:4096 -keyout initiatekey.pem -out initiatecert.pem -days 3560
    openssl x509 -text -noout -in initiatecert.pem

Make sure that the "respond" and "initiate" configurations point to the certificates.

DHCPv6 server setup

If you need an ipv6 dhcp server to test ipv6 prefix delegation, create the "dhcpserver" container as shown above.

Install the "isc-dhcp-server" Debian package:

    sudo apt-get install isc-dhcp-server

/etc/dhcp/dhcpd6.conf

Edit the dhcpv6 configuration and add an ipv6 subnet with prefix delegation. For example:

    subnet6 2001:db01:0:1::/64 {
            range6 2001:db01:0:1::1 2001:db01:0:1::9;
            prefix6 2001:db01:0:100:: 2001:db01:0:200::/56;
    }

Add an ipv6 address on eth1, which is connected to the "internet" bridge, and start the dhcp server. I use the following trivial bash script, which runs the dhcp6 server in the foreground and produces dhcp traffic spew:

    #!/bin/bash
    ifconfig eth1 inet6 add 2001:db01:0:1::10/64 || true
    dhcpd -6 -d -cf /etc/dhcp/dhcpd6.conf

The "|| true" bit keeps going if eth1 already has the indicated ipv6 address.

Container / Host Interoperation

Host / container interoperation is highly desirable. If the host and a set of containers don't run the same distro and distro version, it's reasonably likely that the glibc versions won't match. That, in turn, makes vpp binaries built in one environment fail in the other.

Trying to install multiple versions of glibc - especially at the host level - often ends very badly and is not recommended. It's not just glibc, either. The dynamic loader ld-linux-xxx-so.2 is glibc version specific.

Fortunately, it's reasonable easy to build lxd container images based on specific Ubuntu or Debian versions.

Create a custom root filesystem image

First, install the "debootstrap" tool:

    sudo apt-get install debootstrap

Make a temp directory, and use debootstrap to populate it. In this example, we create an Ubuntu 20.04 (focal fossa) base image:

    # mkdir /tmp/myroot
    # debootstrap focal /tmp/myroot http://archive.ubuntu.com/ubuntu

To tinker with the base image (if desired):

    # chroot /tmp/myroot
    <add packages, etc.>
    # exit

Make a compressed tarball of the base image:

    # tar zcf /tmp/rootfs.tar.gz -C /tmp/myroot .

Create a "metadata.yaml" file which describes the base image:

    architecture: "x86_64"
    # To get current date in Unix time, use `date +%s` command
    creation_date: 1458040200
    properties:
    architecture: "x86_64"
    description: "My custom Focal Fossa image"
    os: "Ubuntu"
    release: "focal"

Make a compressed tarball of metadata.yaml:

    # tar zcf metadata.tar.gz metadata.yaml

Import the image into lxc / lxd:

    $ lxc image import metadata.tar.gz rootfd.tar.gz --alias focal-base

Create a container which uses the customized base image:

    $ lxc launch focal-base focaltest
    $ lxc exec focaltest bash

The next several steps should be executed in the container, in the bash shell spun up by "lxc exec..."

Configure container networking

In the container, create /etc/netplan/50-cloud-init.yaml:

    network:
        version: 2
        ethernets:
            eth0:
                dhcp4: true

Use "cat > /etc/netplan/50-cloud-init.yaml", and cut-'n-paste if your favorite text editor is AWOL.

Apply the configuration:

    # netplan apply

At this point, eth0 should have an ip address, and you should see a default route with "route -n".

Configure apt

Again, in the container, set up /etc/apt/sources.list via cut-'n-paste from a recently update "focal fossa" host. Something like so:

    deb http://us.archive.ubuntu.com/ubuntu/ focal main restricted
    deb http://us.archive.ubuntu.com/ubuntu/ focal-updates main restricted
    deb http://us.archive.ubuntu.com/ubuntu/ focal universe
    deb http://us.archive.ubuntu.com/ubuntu/ focal-updates universe
    deb http://us.archive.ubuntu.com/ubuntu/ focal multiverse
    deb http://us.archive.ubuntu.com/ubuntu/ focal-updates multiverse
    deb http://us.archive.ubuntu.com/ubuntu/ focal-backports main restricted universe multiverse
    deb http://security.ubuntu.com/ubuntu focal-security main restricted
    deb http://security.ubuntu.com/ubuntu focal-security universe
    deb http://security.ubuntu.com/ubuntu focal-security multiverse

"apt-get update" and "apt-install" should produce reasonable results. Suggest "apt-get install make git".

At this point, you can use the "/scratch" sharepoint (or similar) to execute "make install-dep install-ext-deps" to set up the container with the vpp toolchain; proceed as desired.