<?xml version="1.0" encoding="utf-8"?>
<database title="Hardware VTEP Database">
<p>
This schema specifies relations that a VTEP can use to integrate
physical ports into logical switches maintained by a network
virtualization controller such as NSX.
</p>
<p>Glossary:</p>
<dl>
<dt>VTEP</dt>
<dd>
VXLAN Tunnel End Point, an entity which originates and/or terminates
VXLAN tunnels.
</dd>
<dt>HSC</dt>
<dd>
Hardware Switch Controller.
</dd>
<dt>NVC</dt>
<dd>
Network Virtualization Controller, e.g. NSX.
</dd>
<dt>VRF</dt>
<dd>
Virtual Routing and Forwarding instance.
</dd>
</dl>
<table name="Global" title="Top-level configuration.">
Top-level configuration for a hardware VTEP. There must be
exactly one record in the <ref table="Global"/> table.
<column name="switches">
The physical switches managed by the VTEP.
</column>
<group title="Database Configuration">
<p>
These columns primarily configure the database server
(<code>ovsdb-server</code>), not the hardware VTEP itself.
</p>
<column name="managers">
Database clients to which the database server should connect or
to which it should listen, along with options for how these
connections should be configured. See the <ref table="Manager"/>
table for more information.
</column>
</group>
</table>
<table name="Manager" title="OVSDB management connection.">
<p>
Configuration for a database connection to an Open vSwitch Database
(OVSDB) client.
</p>
<p>
The database server can initiate and maintain active connections
to remote clients. It can also listen for database connections.
</p>
<group title="Core Features">
<column name="target">
<p>Connection method for managers.</p>
<p>
The following connection methods are currently supported:
</p>
<dl>
<dt><code>ssl:<var>ip</var></code>[<code>:<var>port</var></code>]</dt>
<dd>
<p>
The specified SSL <var>port</var> (default: 6632) on the host at
the given <var>ip</var>, which must be expressed as an IP address
(not a DNS name).
</p>
<p>
SSL key and certificate configuration happens outside the
database.
</p>
</dd>
<dt><code>tcp:<var>ip</var></code>[<code>:<var>port</var></code>]</dt>
<dd>
The specified TCP <var>port</var> (default: 6632) on the host at
the given <var>ip</var>, which must be expressed as an IP address
(not a DNS name).
</dd>
<dt><code>pssl:</code>[<var>port</var>][<code>:<var>ip</var></code>]</dt>
<dd>
<p>
Listens for SSL connections on the specified TCP <var>port</var>
(default: 6632). If <var>ip</var>, which must be expressed as an
IP address (not a DNS name), is specified, then connections are
restricted to the specified local IP address.
</p>
</dd>
<dt><code>ptcp:</code>[<var>port</var>][<code>:<var>ip</var></code>]</dt>
<dd>
Listens for connections on the specified TCP <var>port</var>
(default: 6632). If <var>ip</var>, which must be expressed as an
IP address (not a DNS name), is specified, then connections are
restricted to the specified local IP address.
</dd>
</dl>
</column>
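<!--
Editorial sketch (not part of the schema): the four target forms listed in
the "target" column can be parsed as shown below. This is a minimal
illustration, assuming IPv4 literals only; the names Target and parse_target
are hypothetical and do not come from ovsdb-server.

```python
from typing import NamedTuple, Optional

DEFAULT_PORT = 6632  # default port given in the column description


class Target(NamedTuple):
    method: str        # "ssl", "tcp", "pssl" or "ptcp"
    ip: Optional[str]  # None means "not restricted to one local address"
    port: int
    passive: bool      # True for the listening methods


def parse_target(target: str) -> Target:
    """Parse an IPv4-only target string (hypothetical helper)."""
    method, _, rest = target.partition(":")
    if method in ("ssl", "tcp"):
        # Active form: method:ip[:port]
        ip, _, port = rest.partition(":")
        if not ip:
            raise ValueError("active targets require an IP address")
        return Target(method, ip, int(port) if port else DEFAULT_PORT, False)
    if method in ("pssl", "ptcp"):
        # Passive form: method:[port][:ip]
        port, _, ip = rest.partition(":")
        return Target(method, ip or None,
                      int(port) if port else DEFAULT_PORT, True)
    raise ValueError(f"unsupported connection method: {method!r}")
```

For example, parse_target("ptcp:") yields a passive listener on port 6632
with no address restriction.
-->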
</group>
<group title="Client Failure Detection and Handling">
<column name="max_backoff">
Maximum number of milliseconds to wait between connection attempts.
Default is implementation-specific.
</column>
<column name="inactivity_probe">
Maximum number of milliseconds of idle time on connection to the
client before sending an inactivity probe message. If the Open
vSwitch database does not communicate with the client for the
specified number of milliseconds, it will send a probe. If a
response is not received for the same additional amount of time,
the database server assumes the connection has been broken
and attempts to reconnect. Default is implementation-specific.
A value of 0 disables inactivity probes.
</column>
</group>
<group title="Status">
<column name="is_connected">
<code>true</code> if currently connected to this manager,
<code>false</code> otherwise.
</column>
<column name="status" key="last_error">
A human-readable description of the last error on the connection
to the manager; i.e. <code>strerror(errno)</code>. This key
will exist only if an error has occurred.
</column>
<column name="status" key="state"
type='{"type": "string", "enum": ["set", ["VOID", "BACKOFF", "CONNECTING", "ACTIVE", "IDLE"]]}'>
<p>
The state of the connection to the manager:
</p>
<dl>
<dt><code>VOID</code></dt>
<dd>Connection is disabled.</dd>
<dt><code>BACKOFF</code></dt>
<dd>Attempting to reconnect at an increasing period.</dd>
<dt><code>CONNECTING</code></dt>
<dd>Attempting to connect.</dd>
<dt><code>ACTIVE</code></dt>
<dd>Connected, remote host responsive.</dd>
<dt><code>IDLE</code></dt>
<dd>Connection is idle. Waiting for response to keep-alive.</dd>
</dl>
<p>
These values may change in the future. They are provided only for
human consumption.
</p>
</column>
<column name="status" key="sec_since_connect"
type='{"type": "integer", "minInteger": 0}'>
The amount of time since this manager last successfully connected
to the database (in seconds). Value is empty if manager has never
successfully connected.
</column>
<column name="status" key="sec_since_disconnect"
type='{"type": "integer", "minInteger": 0}'>
The amount of time since this manager last disconnected from the
database (in seconds). Value is empty if manager has never
disconnected.
</column>
<column name="status" key="locks_held">
Space-separated list of the names of OVSDB locks that the connection
holds. Omitted if the connection does not hold any locks.
</column>
<column name="status" key="locks_waiting">
Space-separated list of the names of OVSDB locks that the connection is
currently waiting to acquire. Omitted if the connection is not waiting
for any locks.
</column>
<column name="status" key="locks_lost">
Space-separated list of the names of OVSDB locks that the connection
has had stolen by another OVSDB client. Omitted if no locks have been
stolen from this connection.
</column>
<column name="status" key="n_connections"
type='{"type": "integer", "minInteger": 2}'>
<p>
When <ref column="target"/> specifies a connection method that
listens for inbound connections (e.g. <code>ptcp:</code> or
<code>pssl:</code>) and more than one connection is actually active,
the value is the number of active connections. Otherwise, this
key-value pair is omitted.
</p>
<p>
When multiple connections are active, status columns and key-value
pairs (other than this one) report the status of one arbitrarily
chosen connection.
</p>
</column>
</group>
<group title="Connection Parameters">
<p>
Additional configuration for a connection between the manager
and the database server.
</p>
<column name="other_config" key="dscp"
type='{"type": "integer"}'>
The Differentiated Service Code Point (DSCP) is specified using 6 bits
in the Type of Service (TOS) field in the IP header. DSCP provides a
mechanism to classify the network traffic and provide Quality of
Service (QoS) on IP networks.
The DSCP value specified here is used when establishing the
connection between the manager and the database server. If no
value is specified, a default value of 48 is chosen. Valid DSCP
values must be in the range 0 to 63.
</column>
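<!--
Editorial sketch (not part of the schema): because DSCP occupies the upper
6 bits of the TOS byte, a DSCP value is shifted left by two bits before it
is written to a socket. The helper names are hypothetical; socket.IP_TOS is
assumed to be available on the platform (it is on Linux).

```python
import socket

DEFAULT_DSCP = 48  # default stated in the column description


def dscp_to_tos(dscp: int = DEFAULT_DSCP) -> int:
    """Return the TOS byte that carries the DSCP in its upper 6 bits."""
    if not 0 <= dscp <= 63:
        raise ValueError("DSCP must be in the range 0 to 63")
    return dscp << 2


def apply_dscp(sock: socket.socket, dscp: int = DEFAULT_DSCP) -> None:
    """Mark an IPv4 socket with the given DSCP (hypothetical helper)."""
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_TOS, dscp_to_tos(dscp))
```

With the default of 48, the resulting TOS byte is 192 (0xC0).
-->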
</group>
</table>
<table name="Physical_Switch" title="A physical switch.">
A physical switch that implements a VTEP.
<column name="ports">
The physical ports within the switch.
</column>
<group title="Network Status">
<column name="management_ips">
IPv4 or IPv6 addresses at which the switch may be contacted
for management purposes.
</column>
<column name="tunnel_ips">
<p>
IPv4 or IPv6 addresses on which the switch may originate or
terminate tunnels.
</p>
<p>
This column is intended to allow a <ref table="Manager"/> to
determine the <ref table="Physical_Switch"/> that terminates
the tunnel represented by a <ref table="Physical_Locator"/>.
</p>
</column>
</group>
<group title="Identification">
<column name="name">
Symbolic name for the switch, such as its hostname.
</column>
<column name="description">
An extended description for the switch, such as its switch login
banner.
</column>
</group>
<group title="Error Notification">
<p>
An entry in this column indicates to the NVC that this switch
has encountered a fault. The switch must clear this column
when the fault has been cleared.
</p>
<column name="switch_fault_status" key="mac_table_exhaustion">
Indicates that the switch has been unable to process MAC
entries requested by the NVC due to lack of table resources.
</column>
<column name="switch_fault_status" key="tunnel_exhaustion">
Indicates that the switch has been unable to create tunnels
requested by the NVC due to lack of resources.
</column>
<column name="switch_fault_status" key="unspecified_fault">
Indicates that an error has occurred in the switch but that no
more specific information is available.
</column>
</group>
</table>
<table name="Physical_Port" title="A port within a physical switch.">
A port within a <ref table="Physical_Switch"/>.
<column name="vlan_bindings">
Identifies how VLANs on the physical port are bound to logical switches.
If, for example, the map contains a (VLAN, logical switch) pair, a packet
that arrives on the port in the VLAN is considered to belong to the
paired logical switch.
</column>
<column name="vlan_stats">
Statistics for VLANs bound to logical switches on the physical port. An
implementation that fully supports such statistics would populate this
column with a mapping for every VLAN that is bound in <ref
column="vlan_bindings"/>. An implementation that does not support such
statistics or only partially supports them would not populate this column
or partially populate it, respectively.
</column>
<group title="Identification">
<column name="name">
Symbolic name for the port. The name ought to be unique within a given
<ref table="Physical_Switch"/>, but the database is not capable of
enforcing this.
</column>
<column name="description">
An extended description for the port.
</column>
</group>
<group title="Error Notification">
<p>
An entry in this column indicates to the NVC that the physical port has
encountered a fault. The switch must clear this column when the error
has been cleared.
</p>
<column name="port_fault_status" key="invalid_vlan_map">
<p>
Indicates that a VLAN-to-logical-switch mapping requested by
the controller could not be instantiated by the switch
because of a conflict with local configuration.
</p>
</column>
<column name="port_fault_status" key="unspecified_fault">
<p>
Indicates that an error has occurred on the port but that no
more specific information is available.
</p>
</column>
</group>
</table>
<table name="Logical_Binding_Stats" title="Statistics for a VLAN on a physical port bound to a logical network.">
Reports statistics for the <ref table="Logical_Switch"/> with which a VLAN
on a <ref table="Physical_Port"/> is associated.
<group title="Statistics">
These statistics count only packets to which the binding applies.
<column name="packets_from_local">
Number of packets sent by the <ref table="Physical_Switch"/>.
</column>
<column name="bytes_from_local">
Number of bytes in packets sent by the <ref table="Physical_Switch"/>.
</column>
<column name="packets_to_local">
Number of packets received by the <ref table="Physical_Switch"/>.
</column>
<column name="bytes_to_local">
Number of bytes in packets received by the <ref
table="Physical_Switch"/>.
</column>
</group>
</table>
<table name="Logical_Switch" title="A layer-2 domain.">
A logical Ethernet switch, whose implementation may span physical and
virtual media, possibly crossing L3 domains via tunnels; a logical layer-2
domain; an Ethernet broadcast domain.
<group title="Per Logical-Switch Tunnel Key">
<p>
Tunnel protocols tend to have a field that allows the tunnel
to be partitioned into sub-tunnels: VXLAN has a VNI, GRE and
STT have a key, CAPWAP has a WSI, and so on. We call these
generically ``tunnel keys.'' Given that one needs to use a
tunnel key at all, there are at least two reasonable ways to
assign their values:
</p>
<ul>
<li>
<p>
Per <ref table="Logical_Switch"/>+<ref table="Physical_Locator"/>
pair. That is, each logical switch may be assigned a different
tunnel key on every <ref table="Physical_Locator"/>. This model is
especially flexible.
</p>
<p>
In this model, <ref table="Physical_Locator"/> carries the tunnel
key. Therefore, one <ref table="Physical_Locator"/> record will
exist for each logical switch carried at a given IP destination.
</p>
</li>
<li>
<p>
Per <ref table="Logical_Switch"/>. That is, every tunnel
associated with a particular logical switch carries the same tunnel
key, regardless of the <ref table="Physical_Locator"/> to which the
tunnel is addressed. This model may ease switch implementation
because it imposes fewer requirements on the hardware datapath.
</p>
<p>
In this model, <ref table="Logical_Switch"/> carries the tunnel
key. Therefore, one <ref table="Physical_Locator"/> record will
exist for each IP destination.
</p>
</li>
</ul>
<column name="tunnel_key">
<p>
This column is used only in the tunnel key per <ref
table="Logical_Switch"/> model (see above), because only in that
model is there a tunnel key associated with a logical switch.
</p>
<p>
For <code>vxlan_over_ipv4</code> encapsulation, this column
is the VXLAN VNI that identifies a logical switch. It must
be in the range 0 to 16,777,215.
</p>
</column>
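<!--
Editorial sketch (not part of the schema): the stated range 0 to 16,777,215
is the 24-bit VNI field of the VXLAN header. The snippet below builds the
8-byte header laid out in RFC 7348; the function name is hypothetical.

```python
import struct

MAX_VNI = (1 << 24) - 1  # 16,777,215, the upper bound stated above
VNI_VALID_FLAG = 0x08    # the "I" flag: the VNI field is valid


def vxlan_header(vni: int) -> bytes:
    """Build a VXLAN header: flags, 24 reserved bits, VNI, 8 reserved bits."""
    if not 0 <= vni <= MAX_VNI:
        raise ValueError("VNI must be in the range 0 to 16,777,215")
    return struct.pack("!II", VNI_VALID_FLAG << 24, vni << 8)
```

For VNI 5000 (0x001388) the header bytes are 08 00 00 00 00 13 88 00.
-->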
</group>
<group title="Identification">
<column name="name">
Symbolic name for the logical switch.
</column>
<column name="description">
An extended description for the logical switch, such as its switch
login banner.
</column>
</group>
</table>
<table name="Ucast_Macs_Local" title="Unicast MACs (local)">
<p>
Mapping of unicast MAC addresses to tunnels (physical
locators). This table is written by the HSC, so it contains the
MAC addresses that have been learned on physical ports by a
VTEP.
</p>
<column name="MAC">
A MAC address that has been learned by the VTEP.
</column>
<column name="logical_switch">
The logical switch to which this mapping applies.
</column>
<column name="locator">
The physical locator to be used to reach this MAC address. In
this table, the physical locator will be one of the tunnel IP
addresses of the appropriate VTEP.
</column>
<column name="ipaddr">
The IP address to which this MAC corresponds. Optional field for
the purpose of ARP suppression.
</column>
</table>
<table name="Ucast_Macs_Remote" title="Unicast MACs (remote)">
<p>
Mapping of unicast MAC addresses to tunnels (physical
locators). This table is written by the NVC, so it contains the
MAC addresses that the NVC has learned. These include VM MAC
addresses, in which case the physical locators will be
hypervisor IP addresses. The NVC will also report MACs that it
has learned from other HSCs in the network, in which case the
physical locators will be tunnel IP addresses of the
corresponding VTEPs.
</p>
<column name="MAC">
A MAC address that has been learned by the NVC.
</column>
<column name="logical_switch">
The logical switch to which this mapping applies.
</column>
<column name="locator">
The physical locator to be used to reach this MAC address. In
this table, the physical locator will be either a hypervisor IP
address or a tunnel IP address of another VTEP.
</column>
<column name="ipaddr">
The IP address to which this MAC corresponds. Optional field for
the purpose of ARP suppression.
</column>
</table>
<table name="Mcast_Macs_Local" title="Multicast MACs (local)">
<p>
Mapping of multicast MAC addresses to tunnels (physical
locators). This table is written by the HSC, so it contains the
MAC addresses that have been learned on physical ports by a
VTEP. These may be learned by IGMP snooping, for example. This
table also specifies how to handle unknown unicast and broadcast packets.
</p>
<column name="MAC">
<p>
A MAC address that has been learned by the VTEP.
</p>
<p>
The keyword <code>unknown-dst</code> is used as a special
``Ethernet address'' that indicates the locations to which
packets in a logical switch whose destination addresses do not
otherwise appear in <ref table="Ucast_Macs_Local"/> (for
unicast addresses) or <ref table="Mcast_Macs_Local"/> (for
multicast addresses) should be sent.
</p>
</column>
<column name="logical_switch">
The logical switch to which this mapping applies.
</column>
<column name="locator_set">
The physical locator set to be used to reach this MAC address. In
this table, the physical locator set will contain one or more tunnel IP
addresses of the appropriate VTEP(s).
</column>
</table>
<table name="Mcast_Macs_Remote" title="Multicast MACs (remote)">
<p>
Mapping of multicast MAC addresses to tunnels (physical
locators). This table is written by the NVC, so it contains the
MAC addresses that the NVC has learned. This
table also specifies how to handle unknown unicast and broadcast
packets.
</p>
<p>
Multicast packet replication may be handled by a service node,
in which case the physical locators will be IP addresses of
service nodes. If the VTEP supports replication onto multiple
tunnels, then this may be used to replicate directly onto
VTEP-hypervisor tunnels.
</p>
<column name="MAC">
<p>
A MAC address that has been learned by the NVC.
</p>
<p>
The keyword <code>unknown-dst</code> is used as a special
``Ethernet address'' that indicates the locations to which
packets in a logical switch whose destination addresses do not
otherwise appear in <ref table="Ucast_Macs_Remote"/> (for
unicast addresses) or <ref table="Mcast_Macs_Remote"/> (for
multicast addresses) should be sent.
</p>
</column>
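<!--
Editorial sketch (not part of the schema): one way to read the unknown-dst
rule for a single logical switch. The tables are modelled as plain
dictionaries keyed by MAC string, which is an assumption made only for this
illustration; lookup_locators is a hypothetical name.

```python
from typing import Dict, FrozenSet

UNKNOWN_DST = "unknown-dst"  # keyword defined in the MAC column


def lookup_locators(mac: str,
                    ucast: Dict[str, str],
                    mcast: Dict[str, FrozenSet[str]]) -> FrozenSet[str]:
    """Return the locators to which a packet for `mac` should be sent."""
    if mac in ucast:
        # Known unicast destination: exactly one locator.
        return frozenset([ucast[mac]])
    if mac in mcast:
        # Known multicast destination: its locator set.
        return mcast[mac]
    # Otherwise fall back to the unknown-dst entry, if one exists.
    return mcast.get(UNKNOWN_DST, frozenset())
```

A destination that appears in neither table is sent to the locator set of
the unknown-dst row, or nowhere if that row is absent.
-->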
<column name="logical_switch">
The logical switch to which this mapping applies.
</column>
<column name="locator_set">
The physical locator set to be used to reach this MAC address. In
this table, the physical locator set will be either a service node IP
address or a set of tunnel IP addresses of hypervisors (and
potentially other VTEPs).
</column>
<column name="ipaddr">
The IP address to which this MAC corresponds. Optional field for
the purpose of ARP suppression.
</column>
</table>
<table name="Logical_Router" title="A logical L3 router.">
<p>
A logical router, or VRF. A logical router may be connected to one or more
logical switches. Subnet addresses and interface addresses may be configured on the
interfaces.
</p>
<column name="switch_binding">
Maps from an IPv4 or IPv6 address prefix in CIDR notation to a
logical switch. Multiple prefixes may map to the same switch. By
writing a 32-bit (or 128-bit for v6) address with a /N prefix
length, both the router's interface address and the subnet
prefix can be configured. For example, 192.68.1.1/24 creates a
/24 subnet for the logical switch attached to the interface and
assigns the address 192.68.1.1 to the router interface.
</column>
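<!--
Editorial sketch (not part of the schema): how a switch_binding key such as
192.68.1.1/24 splits into the router interface address and the subnet
prefix, using the Python standard library. The function name is
hypothetical.

```python
import ipaddress


def split_binding(prefix: str):
    """Return (interface address, subnet) for an address/length string."""
    iface = ipaddress.ip_interface(prefix)
    return iface.ip, iface.network
```

For 192.68.1.1/24 this gives the interface address 192.68.1.1 and the
subnet 192.68.1.0/24; IPv6 prefixes are handled the same way.
-->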
<column name="static_routes">
One or more static routes, mapping IP prefixes to next hop IP addresses.
</column>
<group title="Identification">
<column name="name">
Symbolic name for the logical router.
</column>
<column name="description">
An extended description for the logical router.
</column>
</group>
</table>
<table name="Arp_Sources_Local" title="ARP source addresses for logical routers">
<p>
MAC address to be used when a VTEP issues ARP requests on behalf
of a logical router.
</p>
<p>
A distributed logical router is implemented by a set of VTEPs
(both hardware VTEPs and vswitches). In order for a given VTEP
to populate the local ARP cache for a logical router, it issues
ARP requests with a source MAC address that is unique to the VTEP. A
single per-VTEP MAC can be re-used across all logical
networks. This table contains the MACs that are used by the
VTEPs of a given HSC. The table provides the mapping from MAC to
physical locator for each VTEP so that replies to the ARP
requests can be sent back to the correct VTEP using the
appropriate physical locator.
</p>
<column name="src_mac">
The source MAC to be used by a given VTEP.
</column>
<column name="locator">
The <ref table="Physical_Locator"/> to use for replies to ARP
requests from this MAC address.
</column>
</table>
<table name="Arp_Sources_Remote" title="ARP source addresses for logical routers">
<p>
MAC address to be used when a remote VTEP issues ARP requests on behalf
of a logical router.
</p>
<p>
This table is the remote counterpart of <ref
table="Arp_Sources_Local"/>. The NVC writes this table to notify
the HSC of the MACs that will be used by remote VTEPs when they
issue ARP requests on behalf of a distributed logical router.
</p>
<column name="src_mac">
The source MAC to be used by a given VTEP.
</column>
<column name="locator">
The <ref table="Physical_Locator"/> to use for replies to ARP
requests from this MAC address.
</column>
</table>
<table name="Physical_Locator_Set">
<p>
A set of one or more <ref table="Physical_Locator"/>s.
</p>
<p>
This table exists only because OVSDB does not have a way to
express the type ``map from string to one or more <ref
table="Physical_Locator"/> records.''
</p>
<column name="locators"/>
</table>
<table name="Physical_Locator">
<p>
Identifies an endpoint to which logical switch traffic may be
encapsulated and forwarded.
</p>
<p>
For the <code>vxlan_over_ipv4</code> encapsulation, the only
encapsulation defined so far, all endpoints associated with a given <ref
table="Logical_Switch"/> must use a common tunnel key, which is carried
in the <ref table="Logical_Switch" column="tunnel_key"/> column of <ref
table="Logical_Switch"/>.
</p>
<p>
For some encapsulations yet to be defined, we expect <ref
table="Physical_Locator"/> to identify both an endpoint and a tunnel key.
When the first such encapsulation is defined, we expect to add a
``tunnel_key'' column to <ref table="Physical_Locator"/> to allow the
tunnel key to be defined.
</p>
<p>
See the ``Per Logical-Switch Tunnel Key'' section in the <ref
table="Logical_Switch"/> table for further discussion of the model.
</p>
<column name="encapsulation_type">
The type of tunneling encapsulation.
</column>
<column name="dst_ip">
<p>
For <code>vxlan_over_ipv4</code> encapsulation, the IPv4 address of the
VXLAN tunnel endpoint.
</p>
<p>
We expect that this column could be used for IPv4 or IPv6 addresses in
encapsulations to be introduced later.
</p>
</column>
<group title="Bidirectional Forwarding Detection (BFD)">
<p>
BFD, defined in RFC 5880, allows point to point detection of
connectivity failures by occasional transmission of BFD control
messages. VTEPs are expected to implement BFD.
</p>
<p>
BFD operates by regularly transmitting BFD control messages at a
rate negotiated independently in each direction. Each endpoint
specifies the rate at which it expects to receive control messages,
and the rate at which it's willing to transmit them. An endpoint
which fails to receive BFD control messages for a period of three
times the expected reception interval will signal a connectivity
fault. In the case of a unidirectional connectivity issue, the
system not receiving BFD control messages will signal the problem
to its peer in the messages it transmits.
</p>
<p>
A hardware VTEP is expected to use BFD to determine reachability of
devices at the end of the tunnels with which it exchanges data. This
can enable the VTEP to choose a functioning service node among a set of
service nodes providing high availability. It also enables the NVC to
report the health status of tunnels.
</p>
<p>
In most cases the BFD peer of a hardware VTEP will be an Open vSwitch
instance. The Open vSwitch implementation of BFD aims to comply
faithfully with the requirements put forth in RFC 5880. Open vSwitch
does not implement the optional Authentication or ``Echo Mode''
features.
</p>
<group title="BFD Configuration">
<p>
A controller sets up key-value pairs in the <ref column="bfd"/>
column to enable and configure BFD.
</p>
<column name="bfd" key="enable" type='{"type": "boolean"}'>
True to enable BFD on this <ref table="Physical_Locator"/>.
</column>
<column name="bfd" key="min_rx"
type='{"type": "integer", "minInteger": 1}'>
The shortest interval, in milliseconds, at which this BFD session
offers to receive BFD control messages. The remote endpoint may
choose to send messages at a slower rate. Defaults to
<code>1000</code>.
</column>
<column name="bfd" key="min_tx"
type='{"type": "integer", "minInteger": 1}'>
The shortest interval, in milliseconds, at which this BFD session is
willing to transmit BFD control messages. Messages will actually be
transmitted at a slower rate if the remote endpoint is not willing to
receive as quickly as specified. Defaults to <code>100</code>.
</column>
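<!--
Editorial sketch (not part of the schema): how min_rx and min_tx combine
under the RFC 5880 negotiation rules. Each side transmits no faster than
the peer is willing to receive, and a fault is declared after three missed
intervals. The multiplier of 3 comes from the BFD overview in this file and
is assumed to be fixed; jitter is ignored; the function names are
hypothetical.

```python
DETECT_MULT = 3  # "three times" in the BFD overview; assumed fixed here


def tx_interval_ms(local_min_tx: int, remote_min_rx: int) -> int:
    """Interval at which the local side actually transmits."""
    return max(local_min_tx, remote_min_rx)


def detection_time_ms(local_min_rx: int, remote_min_tx: int,
                      detect_mult: int = DETECT_MULT) -> int:
    """Silence after which the local side signals a connectivity fault."""
    return detect_mult * max(local_min_rx, remote_min_tx)
```

With the defaults on both ends (min_rx 1000, min_tx 100), each side
transmits every 1000 ms and declares a fault after 3000 ms of silence.
-->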
<column name="bfd" key="decay_min_rx" type='{"type": "integer"}'>
An alternate receive interval, in milliseconds, that must be greater
than or equal to <ref column="bfd" key="min_rx"/>. The
implementation switches from <ref column="bfd" key="min_rx"/> to <ref
column="bfd" key="decay_min_rx"/> when there is no obvious incoming
data traffic at the interface, to reduce the CPU and bandwidth cost
of monitoring an idle interface. This feature may be disabled by
setting a value of 0. This feature is reset whenever <ref
column="bfd" key="decay_min_rx"/> or <ref column="bfd" key="min_rx"/>
changes.
</column>
<column name="bfd" key="forwarding_if_rx" type='{"type": "boolean"}'>
True to consider the interface capable of packet I/O as long as it
continues to receive any packets (not just BFD packets). This
prevents the interface from being marked down when link congestion
causes consecutive BFD control packets to be lost.
</column>
<column name="bfd" key="cpath_down" type='{"type": "boolean"}'>
Set to true to notify the remote endpoint that traffic should not be
forwarded to this system for some reason other than a connectivity
failure on the interface being monitored. The typical underlying
reason is ``concatenated path down,'' that is, that connectivity
beyond the local system is down. Defaults to false.
</column>
<column name="bfd" key="check_tnl_key" type='{"type": "boolean"}'>
Set to true to make BFD accept only control messages with a tunnel
key of zero. By default, BFD accepts control messages with any
tunnel key.
</column>
<column name="bfd" key="bfd_dst_mac">
Set to an Ethernet address in the form
<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>:<var>xx</var>
to set the MAC used as destination for transmitted BFD packets and
expected as destination for received BFD packets. The default is
<code>00:23:20:00:00:01</code>.
</column>
</group>
<group title="BFD Status">
<p>
The VTEP sets key-value pairs in the <ref column="bfd_status"/>
column to report the status of BFD on this interface. When BFD is
not enabled, with <ref column="bfd" key="enable"/>, the switch clears
all key-value pairs from <ref column="bfd_status"/>.
</p>
<column name="bfd_status" key="state"
type='{"type": "string",
"enum": ["set", ["admin_down", "down", "init", "up"]]}'>
Reports the state of the BFD session. The BFD session is fully
healthy and negotiated if <code>up</code>.
</column>
<column name="bfd_status" key="forwarding" type='{"type": "boolean"}'>
Reports whether the BFD session believes this <ref
table="Physical_Locator"/> may be used to forward traffic. Typically
this means the local session is signaling <code>up</code>, and the
remote system isn't signaling a problem such as concatenated path
down.
</column>
<column name="bfd_status" key="diagnostic">
In case of a problem, set to a short message that reports what the
local BFD session thinks is wrong.
</column>
<column name="bfd_status" key="remote_state"
type='{"type": "string",
"enum": ["set", ["admin_down", "down", "init", "up"]]}'>
Reports the state of the remote endpoint's BFD session.
</column>
<column name="bfd_status" key="remote_diagnostic">
In case of a problem, set to a short message that reports what the
remote endpoint's BFD session thinks is wrong.
</column>
</group>
</group>
</table>
</database>