]> git.ipfire.org Git - thirdparty/bird.git/blob - doc/bird.sgml
93b701d37d791cbb0702a691f8635a3321ea924f
[thirdparty/bird.git] / doc / bird.sgml
1 <!doctype birddoc system>
2
3 <!--
4 BIRD 2.0 documentation
5
6 This documentation can have 4 forms: sgml (this is master copy), html, ASCII
7 text and dvi/postscript (generated from sgml using sgmltools). You should always
8 edit master copy.
9
10 This is a slightly modified linuxdoc dtd. Anything in <descrip> tags is
11 considered definition of configuration primitives, <cf> is fragment of
12 configuration within normal text, <m> is "meta" information within fragment of
13 configuration - something in config which is not keyword.
14
15 (set-fill-column 80)
16
17 Copyright 1999,2000 Pavel Machek <pavel@ucw.cz>, distribute under GPL version 2 or later.
18
19 -->
20
21 <book>
22
23 <title>BIRD 2.0 User's Guide
24 <author>
25 Ondrej Filip <it/&lt;feela@network.cz&gt;/,
26 Pavel Machek <it/&lt;pavel@ucw.cz&gt;/,
27 Martin Mares <it/&lt;mj@ucw.cz&gt;/,
28 Jan Matejka <it/&lt;mq@jmq.cz&gt;/,
29 Ondrej Zajicek <it/&lt;santiago@crfreenet.org&gt;/
30 </author>
31
32 <abstract>
33 This document contains user documentation for the BIRD Internet Routing Daemon project.
34 </abstract>
35
36 <!-- Table of contents -->
37 <toc>
38
39 <!-- Begin the document -->
40
41
42 <chapt>Introduction
43 <label id="intro">
44
45 <sect>What is BIRD
46 <label id="what-is-bird">
47
48 <p>The name `BIRD' is actually an acronym standing for `BIRD Internet Routing
49 Daemon'. Let's take a closer look at the meaning of the name:
50
51 <p><em/BIRD/: Well, we think we have already explained that. It's an acronym
52 standing for `BIRD Internet Routing Daemon', you remember, don't you? :-)
53
54 <p><em/Internet Routing/: It's a program (well, a daemon, as you are going to
55 discover in a moment) which works as a dynamic router in an Internet type
56 network (that is, in a network running either the IPv4 or the IPv6 protocol).
57 Routers are devices which forward packets between interconnected networks in
58 order to allow hosts not connected directly to the same local area network to
59 communicate with each other. They also communicate with the other routers in the
60 Internet to discover the topology of the network which allows them to find
61 optimal (in terms of some metric) rules for forwarding of packets (which are
62 called routing tables) and to adapt themselves to the changing conditions such
63 as outages of network links, building of new connections and so on. Most of
64 these routers are costly dedicated devices running obscure firmware which is
65 hard to configure and not open to any changes (on the other hand, their special
66 hardware design allows them to keep up with lots of high-speed network
67 interfaces, better than general-purpose computer does). Fortunately, most
68 operating systems of the UNIX family allow an ordinary computer to act as a
69 router and forward packets belonging to the other hosts, but only according to a
70 statically configured table.
71
72 <p>A <em/Routing Daemon/ is in UNIX terminology a non-interactive program
73 running on background which does the dynamic part of Internet routing, that is
74 it communicates with the other routers, calculates routing tables and sends them
75 to the OS kernel which does the actual packet forwarding. There already exist
76 other such routing daemons: routed (RIP only), GateD (non-free),
77 <HTMLURL URL="http://www.zebra.org" name="Zebra"> and
78 <HTMLURL URL="http://sourceforge.net/projects/mrt" name="MRTD">,
79 but their capabilities are limited and they are relatively hard to configure
80 and maintain.
81
82 <p>BIRD is an Internet Routing Daemon designed to avoid all of these shortcomings,
83 to support all the routing technology used in the today's Internet or planned to
84 be used in near future and to have a clean extensible architecture allowing new
85 routing protocols to be incorporated easily. Among other features, BIRD
86 supports:
87
88 <itemize>
89 <item>both IPv4 and IPv6 protocols
90 <item>multiple routing tables
91 <item>the Border Gateway Protocol (BGPv4)
92 <item>the Routing Information Protocol (RIPv2, RIPng)
93 <item>the Open Shortest Path First protocol (OSPFv2, OSPFv3)
94 <item>the Babel Routing Protocol
95 <item>the Router Advertisements for IPv6 hosts
96 <item>a virtual protocol for exchange of routes between different
97 routing tables on a single host
98 <item>a command-line interface allowing on-line control and inspection
99 of status of the daemon
100 <item>soft reconfiguration (no need to use complex online commands to
101 change the configuration, just edit the configuration file and
102 notify BIRD to re-read it and it will smoothly switch itself to
103 the new configuration, not disturbing routing protocols unless
104 they are affected by the configuration changes)
105 <item>a powerful language for route filtering
106 </itemize>
107
108 <p>BIRD has been developed at the Faculty of Math and Physics, Charles
109 University, Prague, Czech Republic as a student project. It can be freely
110 distributed under the terms of the GNU General Public License.
111
112 <p>BIRD has been designed to work on all UNIX-like systems. It has been
113 developed and tested under Linux 2.0 to 2.6, and then ported to FreeBSD, NetBSD
114 and OpenBSD, porting to other systems (even non-UNIX ones) should be relatively
115 easy due to its highly modular architecture.
116
117 <p>BIRD 1.x supported either IPv4 or IPv6 protocol, but had to be compiled separately
118 for each one. BIRD~2 supports both of them with a possibility of further extension.
119 BIRD~2 supports Linux at least 3.16, FreeBSD 10, NetBSD 7.0, and OpenBSD 5.8.
120 Anyway, it will probably work well also on older systems.
121
122 <sect>Installing BIRD
123 <label id="install">
124
125 <p>On a recent UNIX system with GNU development tools (GCC, binutils, m4, make)
126 and Perl, installing BIRD should be as easy as:
127
128 <code>
129 ./configure
130 make
131 make install
132 vi /usr/local/etc/bird.conf
133 bird
134 </code>
135
136 <p>You can use <tt>./configure --help</tt> to get a list of configure
137 options. The most important ones are: <tt/--with-protocols=/ to produce a slightly smaller
138 BIRD executable by configuring out routing protocols you don't use, and
139 <tt/--prefix=/ to install BIRD to a place different from <file>/usr/local</file>.
140
141
142 <sect>Running BIRD
143 <label id="argv">
144
145 <p>You can pass several command-line options to bird:
146
147 <descrip>
148 <tag><label id="argv-config">-c <m/config name/</tag>
149 use given configuration file instead of <it/prefix/<file>/etc/bird.conf</file>.
150
151 <tag><label id="argv-debug">-d</tag>
152 enable debug messages and run bird in foreground.
153
154 <tag><label id="argv-log-file">-D <m/filename of debug log/</tag>
155 log debugging information to given file instead of stderr.
156
157 <tag><label id="argv-foreground">-f</tag>
158 run bird in foreground.
159
160 <tag><label id="argv-group">-g <m/group/</tag>
161 use that group ID, see the next section for details.
162
163 <tag><label id="argv-help">-h, --help</tag>
164 display command-line options to bird.
165
166 <tag><label id="argv-local">-l</tag>
167 look for a configuration file and a communication socket in the current
168 working directory instead of in default system locations. However, paths
169 specified by options <cf/-c/, <cf/-s/ have higher priority.
170
171 <tag><label id="argv-parse">-p</tag>
172 just parse the config file and exit. Return value is zero if the config
173 file is valid, nonzero if there are some errors.
174
175 <tag><label id="argv-pid">-P <m/name of PID file/</tag>
176 create a PID file with given filename.
177
178 <tag><label id="argv-recovery">-R</tag>
179 apply graceful restart recovery after start.
180
181 <tag><label id="argv-socket">-s <m/name of communication socket/</tag>
182 use given filename for a socket for communications with the client,
183 default is <it/prefix/<file>/var/run/bird.ctl</file>.
184
185 <tag><label id="argv-user">-u <m/user/</tag>
186 drop privileges and use that user ID, see the next section for details.
187
188 <tag><label id="argv-version">--version</tag>
189 display bird version.
190 </descrip>
191
192 <p>BIRD writes messages about its work to log files or syslog (according to config).
193
194
195 <sect>Privileges
196 <label id="privileges">
197
198 <p>BIRD, as a routing daemon, uses several privileged operations (like setting
199 routing table and using raw sockets). Traditionally, BIRD is executed and runs
200 with root privileges, which may be prone to security problems. The recommended
201 way is to use a privilege restriction (options <cf/-u/, <cf/-g/). In that case
202 BIRD is executed with root privileges, but it changes its user and group ID to
203 an unprivileged ones, while using Linux capabilities to retain just required
204 privileges (capabilities CAP_NET_*). Note that the control socket is created
205 before the privileges are dropped, but the config file is read after that. The
206 privilege restriction is not implemented in BSD port of BIRD.
207
208 <p>An unprivileged user (as an argument to <cf/-u/ options) may be the user
209 <cf/nobody/, but it is suggested to use a new dedicated user account (like
210 <cf/bird/). The similar considerations apply for the group option, but there is
211 one more condition -- the users in the same group can use <file/birdc/ to
212 control BIRD.
213
214 <p>Finally, there is a possibility to use external tools to run BIRD in an
215 environment with restricted privileges. This may need some configuration, but it
216 is generally easy -- BIRD needs just the standard library, privileges to read
217 the config file and create the control socket and the CAP_NET_* capabilities.
218
219
220 <chapt>Architecture
221 <label id="architecture">
222
223 <sect>Routing tables
224 <label id="routing-tables">
225
226 <p>The heart of BIRD is a routing table. BIRD has several independent routing tables;
227 each of them contains routes of exactly one <m/nettype/ (see below). There are two
228 default tables -- <cf/master4/ for IPv4 routes and <cf/master6/ for IPv6 routes.
229 Other tables must be explicitly configured.
230
231 <p>
232 These routing tables are not kernel forwarding tables. No forwarding is done by
233 BIRD. If you want to forward packets using the routes in BIRD tables, you may
234 use the Kernel protocol (see below) to synchronize them with kernel FIBs.
235
236 <p>
237 Every nettype defines a (kind of) primary key on routes. Every route source can
238 supply one route for every possible primary key; new route announcement replaces
239 the old route from the same source, keeping other routes intact. BIRD always
240 chooses the best route for each primary key among the known routes and keeps the
241 others as suboptimal. When the best route is retracted, BIRD re-runs the best
242 route selection algorithm to find the current best route.
243
244 <p>
245 The global best route selection algorithm is (roughly) as follows:
246
247 <itemize>
248 <item>Preferences of the routes are compared.
249 <item>Source protocol instance preferences are compared.
250 <item>If source protocols are the same (e.g. BGP vs. BGP), the protocol's route selection algorithm is invoked.
251 <item>If source protocols are different (e.g. BGP vs. OSPF), result of the algorithm is undefined.
252 </itemize>
253
254 <p><label id="dsc-table-sorted">Usually, a routing table just chooses a selected
255 route from a list of entries for one network. But if the <cf/sorted/ option is
256 activated, these lists of entries are kept completely sorted (according to
257 preference or some protocol-dependent metric). This is needed for some features
258 of some protocols (e.g. <cf/secondary/ option of BGP protocol, which allows to
259 accept not just a selected route, but the first route (in the sorted list) that
260 is accepted by filters), but it is incompatible with some other features (e.g.
261 <cf/deterministic med/ option of BGP protocol, which activates a way of choosing
262 selected route that cannot be described using comparison and ordering). Minor
263 advantage is that routes are shown sorted in <cf/show route/, minor disadvantage
264 is that it is slightly more computationally expensive.
265
266 <sect>Routes and network types
267 <label id="routes">
268
269 <p>BIRD works with several types of routes. Some of them are typical IP routes,
270 others are better described as forwarding rules. We call them all routes,
271 regardless of this difference.
272
273 <p>Every route consists of several attributes (read more about them in the
274 <ref id="route-attributes" name="Route attributes"> section); the common for all
275 routes are:
276
277 <itemize>
278 <item>IP address of router which told us about this route
279 <item>Source protocol instance
280 <item>Route preference
281 <item>Optional attributes defined by protocols
282 </itemize>
283
284 <p>Other attributes depend on nettypes. Some of them are part of the primary key, these are marked (PK).
285
286 <sect1>IPv4 and IPv6 routes
287 <label id="ip-routes">
288
289 <p>The traditional routes. Configuration keywords are <cf/ipv4/ and <cf/ipv6/.
290
291 <itemize>
292 <item>(PK) Route destination (IP prefix together with its length)
293 <item>Route next hops (see below)
294 </itemize>
295
296 <sect1>IPv6 source-specific routes
297 <label id="ip-sadr-routes">
298
299 <p>The IPv6 routes containing both destination and source prefix. They are used
300 for source-specific routing (SSR), also called source-address dependent routing
301 (SADR), see <rfc id="8043">. Currently limited mostly to the Babel protocol.
302 Configuration keyword is <cf/ipv6 sadr/.
303
304 <itemize>
305 <item>(PK) Route destination (IP prefix together with its length)
306 <item>(PK) Route source (IP prefix together with its length)
307 <item>Route next hops (see below)
308 </itemize>
309
310 <sect1>VPN IPv4 and IPv6 routes
311 <label id="vpn-routes">
312
313 <p>Routes for IPv4 and IPv6 with VPN Route Distinguisher (<rfc id="4364">).
314 Configuration keywords are <cf/vpn4/ and <cf/vpn6/.
315
316 <itemize>
317 <item>(PK) Route destination (IP prefix together with its length)
318 <item>(PK) Route distinguisher (according to <rfc id="4364">)
319 <item>Route next hops
320 </itemize>
321
322 <sect1>Route Origin Authorization for IPv4 and IPv6
323 <label id="roa-routes">
324
325 <p>These entries can be used to validate route origination of BGP routes.
326 A ROA entry specifies prefixes which could be originated by an AS number.
327 Their keywords are <cf/roa4/ and <cf/roa6/.
328
329 <itemize>
330 <item>(PK) IP prefix together with its length
331 <item>(PK) Matching prefix maximal length
332 <item>(PK) AS number
333 </itemize>
334
335 <sect1>Flowspec for IPv4 and IPv6
336 <label id="flow-routes">
337
338 <p>Flowspec rules are a form of firewall and traffic flow control rules
339 distributed mostly via BGP. These rules may help the operators stop various
340 network attacks in the beginning before eating up the whole bandwidth.
341 Configuration keywords are <cf/flow4/ and <cf/flow6/.
342
343 <itemize>
344 <item>(PK) IP prefix together with its length
345 <item>(PK) Flow definition data
346 <item>Flow action (encoded internally as BGP communities according to <rfc id="5575">)
347 </itemize>
348
349 <sect1>MPLS switching rules
350 <label id="mpls-routes">
351
352 <p>This nettype is currently a stub before implementing more support of <rfc id="3031">.
353 BIRD currently does not support any label distribution protocol nor any label assignment method.
354 Only the Kernel, Pipe and Static protocols can use MPLS tables.
355 Configuration keyword is <cf/mpls/.
356
357 <itemize>
358 <item>(PK) MPLS label
359 <item>Route next hops
360 </itemize>
361
362 <sect1>Route next hops
363 <label id="route-next-hop">
364
365 <p>This is not a nettype. The route next hop is a complex attribute common for many
366 nettypes as you can see before. Every next hop has its assigned device
367 (either assumed from its IP address or set explicitly). It may have also
368 an IP address and an MPLS stack (one or both independently).
369 Maximal MPLS stack depth is set (in compile time) to 8 labels.
370
371 <p>Every route (when eligible to have a next hop) can have more than one next hop.
372 In that case, every next hop has also its weight.
373
374 <sect>Protocols and channels
375 <label id="protocols-concept">
376
377 <p>BIRD protocol is an abstract class of producers and consumers of the routes.
378 Each protocol may run in multiple instances and bind on one side to route
379 tables via channels, on the other side to specified listen sockets (BGP),
380 interfaces (Babel, OSPF, RIP), APIs (Kernel, Direct), or nothing (Static, Pipe).
381
382 <p>There are also two protocols that do not have any channels -- BFD and Device.
383 Both of them are kind of service for other protocols.
384
385 <p>Each protocol is connected to a routing table through a channel. Some protocols
386 support only one channel (OSPF, RIP), some protocols support more channels (BGP, Direct).
387 Each channel has two filters which can accept, reject and modify the routes.
388 An <it/export/ filter is applied to routes passed from the routing table to the protocol,
389 an <it/import/ filter is applied to routes in the opposite direction.
390
391 <sect>Graceful restart
392 <label id="graceful-restart">
393
394 <p>When BIRD is started after restart or crash, it repopulates routing tables in
395 an uncoordinated manner, like after clean start. This may be impractical in some
396 cases, because if the forwarding plane (i.e. kernel routing tables) remains
397 intact, then its synchronization with BIRD would temporarily disrupt packet
398 forwarding until protocols converge. Graceful restart is a mechanism that could
399 help with this issue. Generally, it works by starting protocols and letting them
400 repopulate routing tables while deferring route propagation until protocols
401 acknowledge their convergence. Note that graceful restart behavior have to be
402 configured for all relevant protocols and requires protocol-specific support
403 (currently implemented for Kernel and BGP protocols), it is activated for
404 particular boot by option <cf/-R/.
405
406
407 <chapt>Configuration
408 <label id="config">
409
410 <sect>Introduction
411 <label id="config-intro">
412
413 <p>BIRD is configured using a text configuration file. Upon startup, BIRD reads
414 <it/prefix/<file>/etc/bird.conf</file> (unless the <tt/-c/ command line option
415 is given). Configuration may be changed at user's request: if you modify the
416 config file and then signal BIRD with <tt/SIGHUP/, it will adjust to the new
417 config. Then there's the client which allows you to talk with BIRD in an
418 extensive way.
419
420 <p>In the config, everything on a line after <cf/#/ or inside <cf>/* */</cf> is
421 a comment, whitespace characters are treated as a single space. If there's a
422 variable number of options, they are grouped using the <cf/{ }/ brackets. Each
423 option is terminated by a <cf/;/. Configuration is case sensitive. There are two
424 ways how to name symbols (like protocol names, filter names, constants etc.).
425 You can either use a simple string starting with a letter followed by any
426 combination of letters and numbers (e.g. <cf/R123/, <cf/myfilter/, <cf/bgp5/) or
427 you can enclose the name into apostrophes (<cf/'/) and than you can use any
428 combination of numbers, letters. hyphens, dots and colons (e.g.
429 <cf/'1:strange-name'/, <cf/'-NAME-'/, <cf/'cool::name'/).
430
431 <p>Here is an example of a simple config file. It enables synchronization of
432 routing tables with OS kernel, learns network interfaces and runs RIP on all
433 network interfaces found.
434
435 <code>
436 protocol kernel {
437 ipv4 {
438 export all; # Default is export none
439 };
440 persist; # Don't remove routes on BIRD shutdown
441 }
442
443 protocol device {
444 }
445
446 protocol rip {
447 ipv4 {
448 import all;
449 export all;
450 };
451 interface "*";
452 }
453 </code>
454
455
456 <sect>Global options
457 <label id="global-opts">
458
459 <p><descrip>
460 <tag><label id="opt-include">include "<m/filename/";</tag>
461 This statement causes inclusion of a new file. The <m/filename/ could
462 also be a wildcard, in that case matching files are included in
463 alphabetic order. The maximal depth is 8. Note that this statement can
464 be used anywhere in the config file, even inside other options, but
465 always on the beginning of line. In the following example, the first
466 semicolon belongs to the <cf/include/, the second to <cf/ipv6 table/.
467 If the <file/tablename.conf/ contains exactly one token (the name of the
468 table), this construction is correct:
469 <code>
470 ipv6 table
471 include "tablename.conf";;
472 </code>
473
474 <tag><label id="opt-log">log "<m/filename/"|syslog [name <m/name/]|stderr all|{ <m/list of classes/ }</tag>
475 Set logging of messages having the given class (either <cf/all/ or
476 <cf/{ error|trace [, <m/.../] }/ etc.) into selected destination (a file specified
477 as a filename string, syslog with optional name argument, or the stderr
478 output). Classes are:
479 <cf/info/, <cf/warning/, <cf/error/ and <cf/fatal/ for messages about local problems,
480 <cf/debug/ for debugging messages,
481 <cf/trace/ when you want to know what happens in the network,
482 <cf/remote/ for messages about misbehavior of remote machines,
483 <cf/auth/ about authentication failures,
484 <cf/bug/ for internal BIRD bugs.
485 You may specify more than one <cf/log/ line to establish logging to
486 multiple destinations. Default: log everything to the system log.
487
488 <tag><label id="opt-debug-protocols">debug protocols all|off|{ states|routes|filters|interfaces|events|packets [, <m/.../] }</tag>
489 Set global defaults of protocol debugging options. See <cf/debug/ in the
490 following section. Default: off.
491
492 <tag><label id="opt-debug-commands">debug commands <m/number/</tag>
493 Control logging of client connections (0 for no logging, 1 for logging
494 of connects and disconnects, 2 and higher for logging of all client
495 commands). Default: 0.
496
497 <tag><label id="opt-debug-latency">debug latency <m/switch/</tag>
498 Activate tracking of elapsed time for internal events. Recent events
499 could be examined using <cf/dump events/ command. Default: off.
500
501 <tag><label id="opt-debug-latency-limit">debug latency limit <m/time/</tag>
502 If <cf/debug latency/ is enabled, this option allows to specify a limit
503 for elapsed time. Events exceeding the limit are logged. Default: 1 s.
504
505 <tag><label id="opt-watchdog-warn">watchdog warning <m/time/</tag>
506 Set time limit for I/O loop cycle. If one iteration took more time to
507 complete, a warning is logged. Default: 5 s.
508
509 <tag><label id="opt-watchdog-timeout">watchdog timeout <m/time/</tag>
510 Set time limit for I/O loop cycle. If the limit is breached, BIRD is
511 killed by abort signal. The timeout has effective granularity of
512 seconds, zero means disabled. Default: disabled (0).
513
514 <tag><label id="opt-mrtdump">mrtdump "<m/filename/"</tag>
515 Set MRTdump file name. This option must be specified to allow MRTdump
516 feature. Default: no dump file.
517
518 <tag><label id="opt-mrtdump-protocols">mrtdump protocols all|off|{ states|messages [, <m/.../] }</tag>
519 Set global defaults of MRTdump options. See <cf/mrtdump/ in the
520 following section. Default: off.
521
522 <tag><label id="opt-filter">filter <m/name local variables/{ <m/commands/ }</tag>
523 Define a filter. You can learn more about filters in the following
524 chapter.
525
526 <tag><label id="opt-function">function <m/name/ (<m/parameters/) <m/local variables/ { <m/commands/ }</tag>
527 Define a function. You can learn more about functions in the following chapter.
528
529 <tag><label id="opt-protocol">protocol rip|ospf|bgp|<m/.../ [<m/name/ [from <m/name2/]] { <m>protocol options</m> }</tag>
530 Define a protocol instance called <cf><m/name/</cf> (or with a name like
531 "rip5" generated automatically if you don't specify any
532 <cf><m/name/</cf>). You can learn more about configuring protocols in
533 their own chapters. When <cf>from <m/name2/</cf> expression is used,
534 initial protocol options are taken from protocol or template
535 <cf><m/name2/</cf> You can run more than one instance of most protocols
536 (like RIP or BGP). By default, no instances are configured.
537
538 <tag><label id="opt-template">template rip|ospf|bgp|<m/.../ [<m/name/ [from <m/name2/]] { <m>protocol options</m> }</tag>
539 Define a protocol template instance called <m/name/ (or with a name like
540 "bgp1" generated automatically if you don't specify any <m/name/).
541 Protocol templates can be used to group common options when many
542 similarly configured protocol instances are to be defined. Protocol
543 instances (and other templates) can use templates by using <cf/from/
544 expression and the name of the template. At the moment templates (and
545 <cf/from/ expression) are not implemented for OSPF protocol.
546
547 <tag><label id="opt-define">define <m/constant/ = <m/expression/</tag>
548 Define a constant. You can use it later in every place you could use a
549 value of the same type. Besides, there are some predefined numeric
550 constants based on /etc/iproute2/rt_* files. A list of defined constants
551 can be seen (together with other symbols) using 'show symbols' command.
552
553 <tag><label id="opt-router-id">router id <m/IPv4 address/</tag>
554 Set BIRD's router ID. It's a world-wide unique identification of your
555 router, usually one of router's IPv4 addresses. Default: the lowest
556 IPv4 address of a non-loopback interface.
557
558 <tag><label id="opt-router-id-from">router id from [-] [ "<m/mask/" ] [ <m/prefix/ ] [, <m/.../]</tag>
559 Set BIRD's router ID based on an IPv4 address of an interface specified by
560 an interface pattern.
561 See <ref id="proto-iface" name="interface"> section for detailed
562 description of interface patterns with extended clauses.
563
564 <tag><label id="opt-graceful-restart">graceful restart wait <m/number/</tag>
565 During graceful restart recovery, BIRD waits for convergence of routing
566 protocols. This option allows to specify a timeout for the recovery to
567 prevent waiting indefinitely if some protocols cannot converge. Default:
568 240 seconds.
569
570 <tag><label id="opt-timeformat">timeformat route|protocol|base|log "<m/format1/" [<m/limit/ "<m/format2/"]</tag>
571 This option allows to specify a format of date/time used by BIRD. The
572 first argument specifies for which purpose such format is used.
573 <cf/route/ is a format used in 'show route' command output,
574 <cf/protocol/ is used in 'show protocols' command output, <cf/base/ is
575 used for other commands and <cf/log/ is used in a log file.
576
577 "<m/format1/" is a format string using <it/strftime(3)/ notation (see
578 <it/man strftime/ for details). It is extended to support sub-second
579 time part with variable precision (up to microseconds) using "%f"
580 conversion code (e.g., "%T.%3f" is hh:mm:ss.sss time). <m/limit/ and
581 "<m/format2/" allow to specify the second format string for times in
582 past deeper than <m/limit/ seconds.
583
584 There are several shorthands: <cf/iso long/ is a ISO 8601 date/time
585 format (YYYY-MM-DD hh:mm:ss) that can be also specified using <cf/"%F
586 %T"/. Similarly, <cf/iso long ms/ and <cf/iso long us/ are ISO 8601
587 date/time formats with millisecond or microsecond precision.
588 <cf/iso short/ is a variant of ISO 8601 that uses just the time format
589 (hh:mm:ss) for near times (up to 20 hours in the past) and the date
590 format (YYYY-MM-DD) for far times. This is a shorthand for <cf/"%T"
591 72000 "%F"/. And there are also <cf/iso short ms/ and <cf/iso short us/
592 high-precision variants of that.
593
594 By default, BIRD uses the <cf/iso short ms/ format for <cf/route/ and
595 <cf/protocol/ times, and the <cf/iso long ms/ format for <cf/base/ and
596 <cf/log/ times.
597
598 <tag><label id="opt-table"><m/nettype/ table <m/name/ [sorted]</tag>
599 Create a new routing table. The default routing tables <cf/master4/ and
600 <cf/master6/ are created implicitly, other routing tables have to be
601 added by this command. Option <cf/sorted/ can be used to enable sorting
602 of routes, see <ref id="dsc-table-sorted" name="sorted table">
603 description for details.
604
605 <tag><label id="opt-eval">eval <m/expr/</tag>
606 Evaluates given filter expression. It is used by the developers for testing of filters.
607 </descrip>
608
609
610 <sect>Protocol options
611 <label id="protocol-opts">
612
613 <p>For each protocol instance, you can configure a bunch of options. Some of
614 them (those described in this section) are generic, some are specific to the
615 protocol (see sections talking about the protocols).
616
617 <p>Several options use a <m/switch/ argument. It can be either <cf/on/,
618 <cf/yes/ or a numeric expression with a non-zero value for the option to be
619 enabled or <cf/off/, <cf/no/ or a numeric expression evaluating to zero to
620 disable it. An empty <m/switch/ is equivalent to <cf/on/ ("silence means
621 agreement").
622
623 <descrip>
624 <tag><label id="proto-disabled">disabled <m/switch/</tag>
625 Disables the protocol. You can change the disable/enable status from the
626 command line interface without needing to touch the configuration.
627 Disabled protocols are not activated. Default: protocol is enabled.
628
629 <tag><label id="proto-debug">debug all|off|{ states|routes|filters|interfaces|events|packets [, <m/.../] }</tag>
630 Set protocol debugging options. If asked, each protocol is capable of
631 writing trace messages about its work to the log (with category
632 <cf/trace/). You can either request printing of <cf/all/ trace messages
633 or only of the types selected: <cf/states/ for protocol state changes
634 (protocol going up, down, starting, stopping etc.), <cf/routes/ for
635 routes exchanged with the routing table, <cf/filters/ for details on
636 route filtering, <cf/interfaces/ for interface change events sent to the
637 protocol, <cf/events/ for events internal to the protocol and <cf/packets/
638 for packets sent and received by the protocol. Default: off.
639
640 <tag><label id="proto-mrtdump">mrtdump all|off|{ states|messages [, <m/.../] }</tag>
641 Set protocol MRTdump flags. MRTdump is a standard binary format for
642 logging information from routing protocols and daemons. These flags
643 control what kind of information is logged from the protocol to the
644 MRTdump file (which must be specified by global <cf/mrtdump/ option, see
645 the previous section). Although these flags are similar to flags of
646 <cf/debug/ option, their meaning is different and protocol-specific. For
647 BGP protocol, <cf/states/ logs BGP state changes and <cf/messages/ logs
648 received BGP messages. Other protocols does not support MRTdump yet.
649
650 <tag><label id="proto-router-id">router id <m/IPv4 address/</tag>
651 This option can be used to override global router id for a given
652 protocol. Default: uses global router id.
653
654 <tag><label id="proto-description">description "<m/text/"</tag>
655 This is an optional description of the protocol. It is displayed as a
656 part of the output of 'show route all' command.
657
658 <tag><label id="proto-vrf">vrf "<m/text/"</tag>
659 Associate the protocol with specific VRF. The protocol will be
660 restricted to interfaces assigned to the VRF and will use sockets bound
661 to the VRF. Appropriate VRF interface must exist on OS level. For kernel
662 protocol, an appropriate table still must be explicitly selected by
663 <cf/table/ option. Note that for proper VRF support it is necessary to
664 use Linux kernel version at least 4.14, older versions have limited VRF
665 implementation.
666
667 <tag><label id="proto-channel"><m/channel name/ [{<m/channel config/}]</tag>
668 Every channel must be explicitly stated. See the protocol-specific
669 configuration for the list of supported channel names. See the
670 <ref id="channel-opts" name="channel configuration section"> for channel
671 definition.
672 </descrip>
673
674 <p>There are several options that give sense only with certain protocols:
675
676 <descrip>
677 <tag><label id="proto-iface">interface [-] [ "<m/mask/" ] [ <m/prefix/ ] [, <m/.../] [ { <m/option/; [<m/.../] } ]</tag>
678 Specifies a set of interfaces on which the protocol is activated with
679 given interface-specific options. A set of interfaces specified by one
680 interface option is described using an interface pattern. The interface
681 pattern consists of a sequence of clauses (separated by commas), each
682 clause is a mask specified as a shell-like pattern. Interfaces are
683 matched by their name.
684
685 An interface matches the pattern if it matches any of its clauses. If
686 the clause begins with <cf/-/, matching interfaces are excluded. Patterns
687 are processed left-to-right, thus <cf/interface "eth0", -"eth*", "*";/
688 means eth0 and all non-ethernets.
689
690 Some protocols (namely OSPFv2 and Direct) support extended clauses that
691 may contain a mask, a prefix, or both of them. An interface matches such
692 clause if its name matches the mask (if specified) and its address
693 matches the prefix (if specified). Extended clauses are used when the
694 protocol handles multiple addresses on an interface independently.
695
696 An interface option can be used more times with different interface-specific
697 options, in that case for given interface the first matching interface
698 option is used.
699
700 This option is allowed in Babel, BFD, Device, Direct, OSPF, RAdv and RIP
701 protocols. In OSPF protocol it is used in the <cf/area/ subsection.
702
703 Default: none.
704
705 Examples:
706
707 <cf>interface "*" { type broadcast; };</cf> - start the protocol on all
708 interfaces with <cf>type broadcast</cf> option.
709
710 <cf>interface "eth1", "eth4", "eth5" { type ptp; };</cf> - start the
711 protocol on enumerated interfaces with <cf>type ptp</cf> option.
712
713 <cf>interface -192.168.1.0/24, 192.168.0.0/16;</cf> - start the protocol
714 on all interfaces that have address from 192.168.0.0/16, but not from
715 192.168.1.0/24.
716
717 <cf>interface -192.168.1.0/24, 192.168.0.0/16;</cf> - start the protocol
718 on all interfaces that have address from 192.168.0.0/16, but not from
719 192.168.1.0/24.
720
721 <cf>interface "eth*" 192.168.1.0/24;</cf> - start the protocol on all
722 ethernet interfaces that have address from 192.168.1.0/24.
723
724 <tag><label id="proto-tx-class">tx class|dscp <m/num/</tag>
725 This option specifies the value of ToS/DS/Class field in IP headers of
726 the outgoing protocol packets. This may affect how the protocol packets
727 are processed by the network relative to the other network traffic. With
728 <cf/class/ keyword, the value (0-255) is used for the whole ToS/Class
729 octet (but two bits reserved for ECN are ignored). With <cf/dscp/
730 keyword, the value (0-63) is used just for the DS field in the octet.
731 Default value is 0xc0 (DSCP 0x30 - CS6).
732
733 <tag><label id="proto-tx-priority">tx priority <m/num/</tag>
734 This option specifies the local packet priority. This may affect how the
735 protocol packets are processed in the local TX queues. This option is
736 Linux specific. Default value is 7 (highest priority, privileged traffic).
737
738 <tag><label id="proto-pass">password "<m/password/" [ { <m>password options</m> } ]</tag>
739 Specifies a password that can be used by the protocol as a shared secret
740 key. Password option can be used more times to specify more passwords.
741 If more passwords are specified, it is a protocol-dependent decision
742 which one is really used. Specifying passwords does not mean that
743 authentication is enabled, authentication can be enabled by separate,
744 protocol-dependent <cf/authentication/ option.
745
746 This option is allowed in BFD, OSPF and RIP protocols. BGP has also
747 <cf/password/ option, but it is slightly different and described
748 separately.
749 Default: none.
750 </descrip>
751
752 <p>Password option can contain section with some (not necessary all) password sub-options:
753
754 <descrip>
755 <tag><label id="proto-pass-id">id <M>num</M></tag>
756 ID of the password, (1-255). If it is not used, BIRD will choose ID based
757 on an order of the password item in the interface. For example, second
758 password item in one interface will have default ID 2. ID is used by
759 some routing protocols to identify which password was used to
760 authenticate protocol packets.
761
762 <tag><label id="proto-pass-gen-from">generate from "<m/time/"</tag>
763 The start time of the usage of the password for packet signing.
764 The format of <cf><m/time/</cf> is <tt>dd-mm-yyyy HH:MM:SS</tt>.
765
766 <tag><label id="proto-pass-gen-to">generate to "<m/time/"</tag>
767 The last time of the usage of the password for packet signing.
768
769 <tag><label id="proto-pass-accept-from">accept from "<m/time/"</tag>
770 The start time of the usage of the password for packet verification.
771
772 <tag><label id="proto-pass-accept-to">accept to "<m/time/"</tag>
773 The last time of the usage of the password for packet verification.
774
775 <tag><label id="proto-pass-from">from "<m/time/"</tag>
776 Shorthand for setting both <cf/generate from/ and <cf/accept from/.
777
778 <tag><label id="proto-pass-to">to "<m/time/"</tag>
779 Shorthand for setting both <cf/generate to/ and <cf/accept to/.
780
781 <tag><label id="proto-pass-algorithm">algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 )</tag>
782 The message authentication algorithm for the password when cryptographic
783 authentication is enabled. The default value depends on the protocol.
784 For RIP and OSPFv2 it is Keyed-MD5 (for compatibility), for OSPFv3
785 protocol it is HMAC-SHA-256.
786
787 </descrip>
788
789
790 <sect>Channel options
791 <label id="channel-opts">
792
793 <p>Every channel belongs to a protocol and is configured inside its block. The
794 minimal channel config is empty, then it uses default values. The name of the
795 channel implies its nettype. Channel definitions can be inherited from protocol
796 templates. Multiple definitions of the same channel are forbidden, but channels
797 inherited from templates can be updated by new definitions.
798
799 <descrip>
800 <tag><label id="proto-table">table <m/name/</tag>
801 Specify a table to which the channel is connected. Default: the first
802 table of given nettype.
803
804 <tag><label id="proto-preference">preference <m/expr/</tag>
805 Sets the preference of routes generated by the protocol and imported
806 through this channel. Default: protocol dependent.
807
808 <tag><label id="proto-import">import all | none | filter <m/name/ | filter { <m/filter commands/ } | where <m/boolean filter expression/</tag>
809 Specify a filter to be used for filtering routes coming from the
810 protocol to the routing table. <cf/all/ is for keeping all routes,
811 <cf/none/ is for dropping all routes. Default: <cf/all/ (except for
812 EBGP).
813
814 <tag><label id="proto-export">export <m/filter/</tag>
815 This is similar to the <cf>import</cf> keyword, except that it works in
816 the direction from the routing table to the protocol. Default: <cf/none/
817 (except for EBGP).
818
819 <tag><label id="proto-import-keep-filtered">import keep filtered <m/switch/</tag>
820 Usually, if an import filter rejects a route, the route is forgotten.
821 When this option is active, these routes are kept in the routing table,
822 but they are hidden and not propagated to other protocols. But it is
823 possible to show them using <cf/show route filtered/. Note that this
824 option does not work for the pipe protocol. Default: off.
825
826 <tag><label id="proto-import-limit">import limit [<m/number/ | off ] [action warn | block | restart | disable]</tag>
827 Specify an import route limit (a maximum number of routes imported from
828 the protocol) and optionally the action to be taken when the limit is
829 hit. Warn action just prints warning log message. Block action discards
830 new routes coming from the protocol. Restart and disable actions shut
831 the protocol down like appropriate commands. Disable is the default
832 action if an action is not explicitly specified. Note that limits are
833 reset during protocol reconfigure, reload or restart. Default: <cf/off/.
834
835 <tag><label id="proto-receive-limit">receive limit [<m/number/ | off ] [action warn | block | restart | disable]</tag>
836 Specify an receive route limit (a maximum number of routes received from
837 the protocol and remembered). It works almost identically to <cf>import
838 limit</cf> option, the only difference is that if <cf/import keep
839 filtered/ option is active, filtered routes are counted towards the
840 limit and blocked routes are forgotten, as the main purpose of the
841 receive limit is to protect routing tables from overflow. Import limit,
842 on the contrary, counts accepted routes only and routes blocked by the
843 limit are handled like filtered routes. Default: <cf/off/.
844
845 <tag><label id="proto-export-limit">export limit [ <m/number/ | off ] [action warn | block | restart | disable]</tag>
846 Specify an export route limit, works similarly to the <cf>import
847 limit</cf> option, but for the routes exported to the protocol. This
848 option is experimental, there are some problems in details of its
849 behavior -- the number of exported routes can temporarily exceed the
850 limit without triggering it during protocol reload, exported routes
851 counter ignores route blocking and block action also blocks route
852 updates of already accepted routes -- and these details will probably
853 change in the future. Default: <cf/off/.
854 </descrip>
855
856 <p>This is a trivial example of RIP configured for IPv6 on all interfaces:
857 <code>
858 protocol rip ng {
859 ipv6;
860 interface "*";
861 }
862 </code>
863
864 <p>This is a non-trivial example.
865 <code>
866 protocol rip ng {
867 ipv6 {
868 table mytable6;
869 import filter { ... };
870 export filter { ... };
871 import limit 50;
872 };
873 interface "*";
874 }
875 </code>
876
877 <p>And this is even more complicated example using templates.
878 <code>
879 template bgp {
880 local 198.51.100.14 as 65000;
881
882 ipv4 {
883 table mytable4;
884 import filter { ... };
885 export none;
886 };
887 ipv6 {
888 table mytable6;
889 import filter { ... };
890 export none;
891 };
892 }
893
894 protocol bgp from {
895 neighbor 198.51.100.130 as 64496;
896
897 # IPv4 channel is inherited as-is, while IPv6
898 # channel is adjusted by export filter option
899 ipv6 {
900 export filter { ... };
901 };
902 }
903 </code>
904
905
906 <chapt>Remote control
907 <label id="remote-control">
908
909 <p>You can use the command-line client <file>birdc</file> to talk with a running
910 BIRD. Communication is done using a <file/bird.ctl/ UNIX domain socket (unless
911 changed with the <tt/-s/ option given to both the server and the client). The
912 commands can perform simple actions such as enabling/disabling of protocols,
913 telling BIRD to show various information, telling it to show routing table
914 filtered by filter, or asking BIRD to reconfigure. Press <tt/?/ at any time to
915 get online help. Option <tt/-r/ can be used to enable a restricted mode of BIRD
916 client, which allows just read-only commands (<cf/show .../). Option <tt/-v/ can
917 be passed to the client, to make it dump numeric return codes along with the
918 messages. You do not necessarily need to use <file/birdc/ to talk to BIRD, your
919 own applications could do that, too -- the format of communication between BIRD
920 and <file/birdc/ is stable (see the programmer's documentation).
921
922 <p>There is also lightweight variant of BIRD client called <file/birdcl/, which
923 does not support command line editing and history and has minimal dependencies.
924 This is useful for running BIRD in resource constrained environments, where
925 Readline library (required for regular BIRD client) is not available.
926
927 <p>Many commands have the <m/name/ of the protocol instance as an argument.
928 This argument can be omitted if there exists only a single instance.
929
930 <p>Here is a brief list of supported functions:
931
932 <descrip>
933 <tag><label id="cli-show-status">show status</tag>
934 Show router status, that is BIRD version, uptime and time from last
935 reconfiguration.
936
937 <tag><label id="cli-show-interfaces">show interfaces [summary]</tag>
938 Show the list of interfaces. For each interface, print its type, state,
939 MTU and addresses assigned.
940
941 <tag><label id="cli-show-protocols">show protocols [all]</tag>
942 Show list of protocol instances along with tables they are connected to
943 and protocol status, possibly giving verbose information, if <cf/all/ is
944 specified.
945
946 <!-- TODO: Move these protocol-specific remote control commands to the protocol sections -->
947 <tag><label id="cli-show-ospf-iface">show ospf interface [<m/name/] ["<m/interface/"]</tag>
948 Show detailed information about OSPF interfaces.
949
950 <tag><label id="cli-show-ospf-neighbors">show ospf neighbors [<m/name/] ["<m/interface/"]</tag>
951 Show a list of OSPF neighbors and a state of adjacency to them.
952
953 <tag><label id="cli-show-ospf-state">show ospf state [all] [<m/name/]</tag>
954 Show detailed information about OSPF areas based on a content of the
955 link-state database. It shows network topology, stub networks,
956 aggregated networks and routers from other areas and external routes.
957 The command shows information about reachable network nodes, use option
958 <cf/all/ to show information about all network nodes in the link-state
959 database.
960
961 <tag><label id="cli-show-ospf-topology">show ospf topology [all] [<m/name/]</tag>
962 Show a topology of OSPF areas based on a content of the link-state
963 database. It is just a stripped-down version of 'show ospf state'.
964
965 <tag><label id="cli-show-ospf-lsadb">show ospf lsadb [global | area <m/id/ | link] [type <m/num/] [lsid <m/id/] [self | router <m/id/] [<m/name/] </tag>
966 Show contents of an OSPF LSA database. Options could be used to filter
967 entries.
968
969 <tag><label id="cli-show-rip-interfaces">show rip interfaces [<m/name/] ["<m/interface/"]</tag>
970 Show detailed information about RIP interfaces.
971
972 <tag><label id="cli-show-rip-neighbors">show rip neighbors [<m/name/] ["<m/interface/"]</tag>
973 Show a list of RIP neighbors and associated state.
974
975 <tag><label id="cli-show-static">show static [<m/name/]</tag>
976 Show detailed information about static routes.
977
978 <tag><label id="cli-show-bfd-sessions">show bfd sessions [<m/name/]</tag>
979 Show information about BFD sessions.
980
981 <tag><label id="cli-show-symbols">show symbols [table|filter|function|protocol|template|roa|<m/symbol/]</tag>
982 Show the list of symbols defined in the configuration (names of
983 protocols, routing tables etc.).
984
985 <tag><label id="cli-show-route">show route [[for] <m/prefix/|<m/IP/] [table (<m/t/ | all)] [filter <m/f/|where <m/c/] [(export|preexport|noexport) <m/p/] [protocol <m/p/] [(stats|count)] [<m/options/]</tag>
986 Show contents of specified routing tables, that is routes, their metrics
987 and (in case the <cf/all/ switch is given) all their attributes.
988
989 <p>You can specify a <m/prefix/ if you want to print routes for a
990 specific network. If you use <cf>for <m/prefix or IP/</cf>, you'll get
991 the entry which will be used for forwarding of packets to the given
992 destination. By default, all routes for each network are printed with
993 the selected one at the top, unless <cf/primary/ is given in which case
994 only the selected route is shown.
995
996 <p>The <cf/show route/ command can process one or multiple routing
997 tables. The set of selected tables is determined on three levels: First,
998 tables can be explicitly selected by <cf/table/ switch, which could be
999 used multiple times, all tables are specified by <cf/table all/. Second,
1000 tables can be implicitly selected by channels or protocols that are
1001 arguments of several other switches (e.g., <cf/export/, <cf/protocol/).
1002 Last, the set of default tables is used: <cf/master4/, <cf/master6/ and
1003 each first table of any other network type.
1004
1005 <p>You can also ask for printing only routes processed and accepted by
1006 a given filter (<cf>filter <m/name/</cf> or <cf>filter { <m/filter/ }
1007 </cf> or matching a given condition (<cf>where <m/condition/</cf>).
1008
1009 The <cf/export/, <cf/preexport/ and <cf/noexport/ switches ask for
1010 printing of routes that are exported to the specified protocol or
1011 channel. With <cf/preexport/, the export filter of the channel is
1012 skipped. With <cf/noexport/, routes rejected by the export filter are
1013 printed instead. Note that routes not exported for other reasons
1014 (e.g. secondary routes or routes imported from that protocol) are not
1015 printed even with <cf/noexport/. These switches also imply that
1016 associated routing tables are selected instead of default ones.
1017
1018 <p>You can also select just routes added by a specific protocol.
1019 <cf>protocol <m/p/</cf>. This switch also implies that associated
1020 routing tables are selected instead of default ones.
1021
1022 <p>If BIRD is configured to keep filtered routes (see <cf/import keep
1023 filtered/ option), you can show them instead of routes by using
1024 <cf/filtered/ switch.
1025
1026 <p>The <cf/stats/ switch requests showing of route statistics (the
1027 number of networks, number of routes before and after filtering). If
1028 you use <cf/count/ instead, only the statistics will be printed.
1029
1030 <tag><label id="cli-configure">configure [soft] ["<m/config file/"] [timeout [<m/num/]]</tag>
1031 Reload configuration from a given file. BIRD will smoothly switch itself
1032 to the new configuration, protocols are reconfigured if possible,
1033 restarted otherwise. Changes in filters usually lead to restart of
1034 affected protocols.
1035
1036 If <cf/soft/ option is used, changes in filters does not cause BIRD to
1037 restart affected protocols, therefore already accepted routes (according
1038 to old filters) would be still propagated, but new routes would be
1039 processed according to the new filters.
1040
1041 If <cf/timeout/ option is used, config timer is activated. The new
1042 configuration could be either confirmed using <cf/configure confirm/
1043 command, or it will be reverted to the old one when the config timer
1044 expires. This is useful for cases when reconfiguration breaks current
1045 routing and a router becomes inaccessible for an administrator. The
1046 config timeout expiration is equivalent to <cf/configure undo/
1047 command. The timeout duration could be specified, default is 300 s.
1048
1049 <tag><label id="cli-configure-confirm">configure confirm</tag>
1050 Deactivate the config undo timer and therefore confirm the current
1051 configuration.
1052
1053 <tag><label id="cli-configure-undo">configure undo</tag>
1054 Undo the last configuration change and smoothly switch back to the
1055 previous (stored) configuration. If the last configuration change was
1056 soft, the undo change is also soft. There is only one level of undo, but
1057 in some specific cases when several reconfiguration requests are given
1058 immediately in a row and the intermediate ones are skipped then the undo
1059 also skips them back.
1060
1061 <tag><label id="cli-configure-check">configure check ["<m/config file/"]</tag>
1062 Read and parse given config file, but do not use it. useful for checking
1063 syntactic and some semantic validity of an config file.
1064
1065 <tag><label id="cli-enable-disable-restart">enable|disable|restart <m/name/|"<m/pattern/"|all</tag>
1066 Enable, disable or restart a given protocol instance, instances matching
1067 the <cf><m/pattern/</cf> or <cf/all/ instances.
1068
1069 <tag><label id="cli-reload">reload [in|out] <m/name/|"<m/pattern/"|all</tag>
1070 Reload a given protocol instance, that means re-import routes from the
1071 protocol instance and re-export preferred routes to the instance. If
1072 <cf/in/ or <cf/out/ options are used, the command is restricted to one
1073 direction (re-import or re-export).
1074
1075 This command is useful if appropriate filters have changed but the
1076 protocol instance was not restarted (or reloaded), therefore it still
1077 propagates the old set of routes. For example when <cf/configure soft/
1078 command was used to change filters.
1079
1080 Re-export always succeeds, but re-import is protocol-dependent and might
1081 fail (for example, if BGP neighbor does not support route-refresh
1082 extension). In that case, re-export is also skipped. Note that for the
1083 pipe protocol, both directions are always reloaded together (<cf/in/ or
1084 <cf/out/ options are ignored in that case).
1085
1086 <tag><label id="cli-down">down</tag>
1087 Shut BIRD down.
1088
1089 <tag><label id="cli-debug">debug <m/protocol/|<m/pattern/|all all|off|{ states|routes|filters|events|packets [, <m/.../] }</tag>
1090 Control protocol debugging.
1091
1092 <tag><label id="cli-dump">dump resources|sockets|interfaces|neighbors|attributes|routes|protocols</tag>
1093 Dump contents of internal data structures to the debugging output.
1094
1095 <tag><label id="cli-echo">echo all|off|{ <m/list of log classes/ } [ <m/buffer-size/ ]</tag>
1096 Control echoing of log messages to the command-line output.
1097 See <ref id="opt-log" name="log option"> for a list of log classes.
1098
1099 <tag><label id="cli-eval">eval <m/expr/</tag>
1100 Evaluate given expression.
1101 </descrip>
1102
1103
1104 <chapt>Filters
1105 <label id="filters">
1106
1107 <sect>Introduction
1108 <label id="filters-intro">
1109
1110 <p>BIRD contains a simple programming language. (No, it can't yet read mail :-).
1111 There are two objects in this language: filters and functions. Filters are
1112 interpreted by BIRD core when a route is being passed between protocols and
1113 routing tables. The filter language contains control structures such as if's and
1114 switches, but it allows no loops. An example of a filter using many features can
1115 be found in <file>filter/test.conf</file>.
1116
1117 <p>Filter gets the route, looks at its attributes and modifies some of them if
1118 it wishes. At the end, it decides whether to pass the changed route through
1119 (using <cf/accept/) or whether to <cf/reject/ it. A simple filter looks like
1120 this:
1121
1122 <code>
1123 filter not_too_far
1124 int var;
1125 {
1126 if defined( rip_metric ) then
1127 var = rip_metric;
1128 else {
1129 var = 1;
1130 rip_metric = 1;
1131 }
1132 if rip_metric &gt; 10 then
1133 reject "RIP metric is too big";
1134 else
1135 accept "ok";
1136 }
1137 </code>
1138
1139 <p>As you can see, a filter has a header, a list of local variables, and a body.
1140 The header consists of the <cf/filter/ keyword followed by a (unique) name of
1141 filter. The list of local variables consists of <cf><M>type name</M>;</cf>
1142 pairs where each pair defines one local variable. The body consists of <cf>
1143 { <M>statements</M> }</cf>. Each <m/statement/ is terminated by a <cf/;/. You
1144 can group several statements to a single compound statement by using braces
1145 (<cf>{ <M>statements</M> }</cf>) which is useful if you want to make a bigger
1146 block of code conditional.
1147
1148 <p>BIRD supports functions, so that you don't have to repeat the same blocks of
1149 code over and over. Functions can have zero or more parameters and they can have
1150 local variables. Recursion is not allowed. Function definitions look like this:
1151
1152 <code>
1153 function name ()
1154 int local_variable;
1155 {
1156 local_variable = 5;
1157 }
1158
1159 function with_parameters (int parameter)
1160 {
1161 print parameter;
1162 }
1163 </code>
1164
1165 <p>Unlike in C, variables are declared after the <cf/function/ line, but before
1166 the first <cf/{/. You can't declare variables in nested blocks. Functions are
1167 called like in C: <cf>name(); with_parameters(5);</cf>. Function may return
1168 values using the <cf>return <m/[expr]/</cf> command. Returning a value exits
1169 from current function (this is similar to C).
1170
1171 <p>Filters are declared in a way similar to functions except they can't have
1172 explicit parameters. They get a route table entry as an implicit parameter, it
1173 is also passed automatically to any functions called. The filter must terminate
1174 with either <cf/accept/ or <cf/reject/ statement. If there's a runtime error in
1175 filter, the route is rejected.
1176
1177 <p>A nice trick to debug filters is to use <cf>show route filter <m/name/</cf>
1178 from the command line client. An example session might look like:
1179
1180 <code>
1181 pavel@bug:~/bird$ ./birdc -s bird.ctl
1182 BIRD 0.0.0 ready.
1183 bird> show route
1184 10.0.0.0/8 dev eth0 [direct1 23:21] (240)
1185 195.113.30.2/32 dev tunl1 [direct1 23:21] (240)
1186 127.0.0.0/8 dev lo [direct1 23:21] (240)
1187 bird> show route ?
1188 show route [<prefix>] [table <t>] [filter <f>] [all] [primary]...
1189 bird> show route filter { if 127.0.0.5 &tilde; net then accept; }
1190 127.0.0.0/8 dev lo [direct1 23:21] (240)
1191 bird>
1192 </code>
1193
1194
1195 <sect>Data types
1196 <label id="data-types">
1197
1198 <p>Each variable and each value has certain type. Booleans, integers and enums
1199 are incompatible with each other (that is to prevent you from shooting in the
1200 foot).
1201
1202 <descrip>
1203 <tag><label id="type-bool">bool</tag>
1204 This is a boolean type, it can have only two values, <cf/true/ and
1205 <cf/false/. Boolean is the only type you can use in <cf/if/ statements.
1206
1207 <tag><label id="type-int">int</tag>
1208 This is a general integer type. It is an unsigned 32bit type; i.e., you
1209 can expect it to store values from 0 to 4294967295. Overflows are not
1210 checked. You can use <cf/0x1234/ syntax to write hexadecimal values.
1211
1212 <tag><label id="type-pair">pair</tag>
1213 This is a pair of two short integers. Each component can have values
1214 from 0 to 65535. Literals of this type are written as <cf/(1234,5678)/.
1215 The same syntax can also be used to construct a pair from two arbitrary
1216 integer expressions (for example <cf/(1+2,a)/).
1217
1218 <tag><label id="type-quad">quad</tag>
1219 This is a dotted quad of numbers used to represent router IDs (and
1220 others). Each component can have a value from 0 to 255. Literals of
1221 this type are written like IPv4 addresses.
1222
1223 <tag><label id="type-string">string</tag>
1224 This is a string of characters. There are no ways to modify strings in
1225 filters. You can pass them between functions, assign them to variables
1226 of type <cf/string/, print such variables, use standard string
1227 comparison operations (e.g. <cf/=, !=, &lt;, &gt;, &lt;=, &gt;=/), but
1228 you can't concatenate two strings. String literals are written as
1229 <cf/"This is a string constant"/. Additionally matching (<cf/&tilde;,
1230 !&tilde;/) operators could be used to match a string value against
1231 a shell pattern (represented also as a string).
1232
1233 <tag><label id="type-ip">ip</tag>
1234 This type can hold a single IP address. The IPv4 addresses are stored as
1235 IPv4-Mapped IPv6 addresses so one data type for both of them is used.
1236 Whether the address is IPv4 or not may be checked by <cf>.is_ip4</cf>
1237 which returns <cf/bool/. IP addresses are written in the standard
1238 notation (<cf/10.20.30.40/ or <cf/fec0:3:4::1/). You can apply special
1239 operator <cf>.mask(<M>num</M>)</cf> on values of type ip. It masks out
1240 all but first <cf><M>num</M></cf> bits from the IP address. So
1241 <cf/1.2.3.4.mask(8) = 1.0.0.0/ is true.
1242
1243 <tag><label id="type-prefix">prefix</tag>
1244 This type can hold a network prefix consisting of IP address, prefix
1245 length and several other values. This is the key in route tables.
1246
1247 Prefixes may be of several types, which can be determined by the special
1248 operator <cf/.type/. The type may be:
1249
1250 <cf/NET_IP4/ and <cf/NET_IP6/ prefixes hold an IP prefix. The literals
1251 are written as <cf><m/ipaddress//<m/pxlen/</cf>. There are two special
1252 operators on these: <cf/.ip/ which extracts the IP address from the
1253 pair, and <cf/.len/, which separates prefix length from the pair.
1254 So <cf>1.2.0.0/16.len = 16</cf> is true.
1255
1256 <cf/NET_IP6_SADR/ nettype holds both destination and source IPv6
1257 prefix. The literals are written as <cf><m/ipaddress//<m/pxlen/ from
1258 <m/ipaddress//<m/pxlen/</cf>, where the first part is the destination
1259 prefix and the second art is the source prefix. They support the same
1260 operators as IP prefixes, but just for the destination part.
1261
1262 <cf/NET_VPN4/ and <cf/NET_VPN6/ prefixes hold an IP prefix with VPN
1263 Route Distinguisher (<rfc id="4364">). They support the same special
1264 operators as IP prefixes, and also <cf/.rd/ which extracts the Route
1265 Distinguisher. Their literals are written
1266 as <cf><m/vpnrd/ <m/ipprefix/</cf>
1267
1268 <cf/NET_ROA4/ and <cf/NET_ROA6/ prefixes hold an IP prefix range
1269 together with an ASN. They support the same special operators as IP
1270 prefixes, and also <cf/.maxlen/ which extracts maximal prefix length,
1271 and <cf/.asn/ which extracts the ASN.
1272
1273 <cf/NET_FLOW4/ and <cf/NET_FLOW6/ hold an IP prefix together with a
1274 flowspec rule. Filters currently don't support flowspec parsing.
1275
1276 <cf/NET_MPLS/ holds a single MPLS label and its handling is currently
1277 not implemented.
1278
1279 <tag><label id="type-vpnrd">vpnrd</tag>
1280 This is a route distinguisher according to <rfc id="4364">. There are
1281 three kinds of RD's: <cf><m/asn/:<m/32bit int/</cf>, <cf><m/asn4/:<m/16bit int/</cf>
1282 and <cf><m/IPv4 address/:<m/32bit int/</cf>
1283
1284 <tag><label id="type-ec">ec</tag>
1285 This is a specialized type used to represent BGP extended community
1286 values. It is essentially a 64bit value, literals of this type are
1287 usually written as <cf>(<m/kind/, <m/key/, <m/value/)</cf>, where
1288 <cf/kind/ is a kind of extended community (e.g. <cf/rt/ / <cf/ro/ for a
1289 route target / route origin communities), the format and possible values
1290 of <cf/key/ and <cf/value/ are usually integers, but it depends on the
1291 used kind. Similarly to pairs, ECs can be constructed using expressions
1292 for <cf/key/ and <cf/value/ parts, (e.g. <cf/(ro, myas, 3*10)/, where
1293 <cf/myas/ is an integer variable).
1294
1295 <tag><label id="type-lc">lc</tag>
1296 This is a specialized type used to represent BGP large community
1297 values. It is essentially a triplet of 32bit values, where the first
1298 value is reserved for the AS number of the issuer, while meaning of
1299 remaining parts is defined by the issuer. Literals of this type are
1300 written as <cf/(123, 456, 789)/, with any integer values. Similarly to
1301 pairs, LCs can be constructed using expressions for its parts, (e.g.
1302 <cf/(myas, 10+20, 3*10)/, where <cf/myas/ is an integer variable).
1303
1304 <tag><label id="type-set">int|pair|quad|ip|prefix|ec|lc|enum set</tag>
1305 Filters recognize four types of sets. Sets are similar to strings: you
1306 can pass them around but you can't modify them. Literals of type <cf>int
1307 set</cf> look like <cf> [ 1, 2, 5..7 ]</cf>. As you can see, both simple
1308 values and ranges are permitted in sets.
1309
1310 For pair sets, expressions like <cf/(123,*)/ can be used to denote
1311 ranges (in that case <cf/(123,0)..(123,65535)/). You can also use
1312 <cf/(123,5..100)/ for range <cf/(123,5)..(123,100)/. You can also use
1313 <cf/*/ and <cf/a..b/ expressions in the first part of a pair, note that
1314 such expressions are translated to a set of intervals, which may be
1315 memory intensive. E.g. <cf/(*,4..20)/ is translated to <cf/(0,4..20),
1316 (1,4..20), (2,4..20), ... (65535, 4..20)/.
1317
1318 EC sets use similar expressions like pair sets, e.g. <cf/(rt, 123,
1319 10..20)/ or <cf/(ro, 123, *)/. Expressions requiring the translation
1320 (like <cf/(rt, *, 3)/) are not allowed (as they usually have 4B range
1321 for ASNs).
1322
1323 Also LC sets use similar expressions like pair sets. You can use ranges
1324 and wildcards, but if one field uses that, more specific (later) fields
1325 must be wildcards. E.g., <cf/(10, 20..30, *)/ or <cf/(10, 20, 30..40)/
1326 is valid, while <cf/(10, *, 20..30)/ or <cf/(10, 20..30, 40)/ is not
1327 valid.
1328
1329 You can also use expressions for int, pair, EC and LC set values.
1330 However, it must be possible to evaluate these expressions before daemon
1331 boots. So you can use only constants inside them. E.g.
1332
1333 <code>
1334 define one=1;
1335 define myas=64500;
1336 int set odds;
1337 pair set ps;
1338 ec set es;
1339
1340 odds = [ one, 2+1, 6-one, 2*2*2-1, 9, 11 ];
1341 ps = [ (1,one+one), (3,4)..(4,8), (5,*), (6,3..6), (7..9,*) ];
1342 es = [ (rt, myas, 3*10), (rt, myas+one, 0..16*16*16-1), (ro, myas+2, *) ];
1343 </code>
1344
1345 Sets of prefixes are special: their literals does not allow ranges, but
1346 allows prefix patterns that are written
1347 as <cf><M>ipaddress</M>/<M>pxlen</M>{<M>low</M>,<M>high</M>}</cf>.
1348 Prefix <cf><m>ip1</m>/<m>len1</m></cf> matches prefix
1349 pattern <cf><m>ip2</m>/<m>len2</m>{<m>l</m>,<m>h</m>}</cf> if the
1350 first <cf>min(len1, len2)</cf> bits of <cf/ip1/ and <cf/ip2/ are
1351 identical and <cf>len1 &lt;= ip1 &lt;= len2</cf>. A valid prefix pattern
1352 has to satisfy <cf>low &lt;= high</cf>, but <cf/pxlen/ is not
1353 constrained by <cf/low/ or <cf/high/. Obviously, a prefix matches a
1354 prefix set literal if it matches any prefix pattern in the prefix set
1355 literal.
1356
1357 There are also two shorthands for prefix patterns: <cf><m/address//<m/len/+</cf>
1358 is a shorthand for <cf><m/address//<m/len/{<m/len/,<m/maxlen/}</cf>
1359 (where <cf><m/maxlen/</cf> is 32 for IPv4 and 128 for IPv6), that means
1360 network prefix <cf><m/address//<m/len/</cf> and all its subnets.
1361 <cf><m/address//<m/len/-</cf> is a shorthand for
1362 <cf><m/address//<m/len/{0,<m/len/}</cf>, that means network prefix
1363 <cf><m/address//<m/len/</cf> and all its supernets (network prefixes
1364 that contain it).
1365
1366 For example, <cf>[ 1.0.0.0/8, 2.0.0.0/8+, 3.0.0.0/8-, 4.0.0.0/8{16,24}
1367 ]</cf> matches prefix <cf>1.0.0.0/8</cf>, all subprefixes of
1368 <cf>2.0.0.0/8</cf>, all superprefixes of <cf>3.0.0.0/8</cf> and prefixes
1369 <cf/4.X.X.X/ whose prefix length is 16 to 24. <cf>[ 0.0.0.0/0{20,24} ]</cf>
1370 matches all prefixes (regardless of IP address) whose prefix length is
1371 20 to 24, <cf>[ 1.2.3.4/32- ]</cf> matches any prefix that contains IP
1372 address <cf>1.2.3.4</cf>. <cf>1.2.0.0/16 &tilde; [ 1.0.0.0/8{15,17} ]</cf>
1373 is true, but <cf>1.0.0.0/16 &tilde; [ 1.0.0.0/8- ]</cf> is false.
1374
1375 Cisco-style patterns like <cf>10.0.0.0/8 ge 16 le 24</cf> can be expressed
1376 in BIRD as <cf>10.0.0.0/8{16,24}</cf>, <cf>192.168.0.0/16 le 24</cf> as
1377 <cf>192.168.0.0/16{16,24}</cf> and <cf>192.168.0.0/16 ge 24</cf> as
1378 <cf>192.168.0.0/16{24,32}</cf>.
1379
1380 It is possible to mix IPv4 and IPv6 prefixes/addresses in a prefix/ip set
1381 but its behavior may change between versions without any warning; don't do
1382 it unless you are more than sure what you are doing. (Really, don't do it.)
1383
1384 <tag><label id="type-enum">enum</tag>
1385 Enumeration types are fixed sets of possibilities. You can't define your
1386 own variables of such type, but some route attributes are of enumeration
1387 type. Enumeration types are incompatible with each other.
1388
1389 <tag><label id="type-bgppath">bgppath</tag>
1390 BGP path is a list of autonomous system numbers. You can't write
1391 literals of this type. There are several special operators on bgppaths:
1392
1393 <cf><m/P/.first</cf> returns the first ASN (the neighbor ASN) in path <m/P/.
1394
1395 <cf><m/P/.last</cf> returns the last ASN (the source ASN) in path <m/P/.
1396
1397 <cf><m/P/.last_nonaggregated</cf> returns the last ASN in the non-aggregated part of the path <m/P/.
1398
1399 Both <cf/first/ and <cf/last/ return zero if there is no appropriate
1400 ASN, for example if the path contains an AS set element as the first (or
1401 the last) part. If the path ends with an AS set, <cf/last_nonaggregated/
1402 may be used to get last ASN before any AS set.
1403
1404 <cf><m/P/.len</cf> returns the length of path <m/P/.
1405
1406 <cf><m/P/.empty</cf> makes the path <m/P/ empty.
1407
1408 <cf>prepend(<m/P/,<m/A/)</cf> prepends ASN <m/A/ to path <m/P/ and
1409 returns the result.
1410
1411 <cf>delete(<m/P/,<m/A/)</cf> deletes all instances of ASN <m/A/ from
1412 from path <m/P/ and returns the result. <m/A/ may also be an integer
1413 set, in that case the operator deletes all ASNs from path <m/P/ that are
1414 also members of set <m/A/.
1415
1416 <cf>filter(<m/P/,<m/A/)</cf> deletes all ASNs from path <m/P/ that are
1417 not members of integer set <m/A/. I.e., <cf/filter/ do the same as
1418 <cf/delete/ with inverted set <m/A/.
1419
1420 Statement <cf><m/P/ = prepend(<m/P/, <m/A/);</cf> can be shortened to
1421 <cf><m/P/.prepend(<m/A/);</cf> if <m/P/ is appropriate route attribute
1422 (for example <cf/bgp_path/). Similarly for <cf/delete/ and <cf/filter/.
1423
1424 <tag><label id="type-bgpmask">bgpmask</tag>
1425 BGP masks are patterns used for BGP path matching (using <cf>path
1426 &tilde; [= 2 3 5 * =]</cf> syntax). The masks resemble wildcard patterns
1427 as used by UNIX shells. Autonomous system numbers match themselves,
1428 <cf/*/ matches any (even empty) sequence of arbitrary AS numbers and
1429 <cf/?/ matches one arbitrary AS number. For example, if <cf>bgp_path</cf>
1430 is 4 3 2 1, then: <tt>bgp_path &tilde; [= * 4 3 * =]</tt> is true,
1431 but <tt>bgp_path &tilde; [= * 4 5 * =]</tt> is false. BGP mask
1432 expressions can also contain integer expressions enclosed in parenthesis
1433 and integer variables, for example <tt>[= * 4 (1+2) a =]</tt>. You can
1434 also use ranges, for example <tt>[= * 3..5 2 100..200 * =]</tt>.
1435
1436 <tag><label id="type-clist">clist</tag>
1437 Clist is similar to a set, except that unlike other sets, it can be
1438 modified. The type is used for community list (a set of pairs) and for
1439 cluster list (a set of quads). There exist no literals of this type.
1440 There are three special operators on clists:
1441
1442 <cf><m/C/.len</cf> returns the length of clist <m/C/.
1443
1444 <cf><m/C/.empty</cf> makes the list <m/C/ empty.
1445
1446 <cf>add(<m/C/,<m/P/)</cf> adds pair (or quad) <m/P/ to clist <m/C/ and
1447 returns the result. If item <m/P/ is already in clist <m/C/, it does
1448 nothing. <m/P/ may also be a clist, in that case all its members are
1449 added; i.e., it works as clist union.
1450
1451 <cf>delete(<m/C/,<m/P/)</cf> deletes pair (or quad) <m/P/ from clist
1452 <m/C/ and returns the result. If clist <m/C/ does not contain item
1453 <m/P/, it does nothing. <m/P/ may also be a pair (or quad) set, in that
1454 case the operator deletes all items from clist <m/C/ that are also
1455 members of set <m/P/. Moreover, <m/P/ may also be a clist, which works
1456 analogously; i.e., it works as clist difference.
1457
1458 <cf>filter(<m/C/,<m/P/)</cf> deletes all items from clist <m/C/ that are
1459 not members of pair (or quad) set <m/P/. I.e., <cf/filter/ do the same
1460 as <cf/delete/ with inverted set <m/P/. <m/P/ may also be a clist, which
1461 works analogously; i.e., it works as clist intersection.
1462
1463 Statement <cf><m/C/ = add(<m/C/, <m/P/);</cf> can be shortened to
1464 <cf><m/C/.add(<m/P/);</cf> if <m/C/ is appropriate route attribute (for
1465 example <cf/bgp_community/). Similarly for <cf/delete/ and <cf/filter/.
1466
1467 <tag><label id="type-eclist">eclist</tag>
1468 Eclist is a data type used for BGP extended community lists. Eclists
1469 are very similar to clists, but they are sets of ECs instead of pairs.
1470 The same operations (like <cf/add/, <cf/delete/ or <cf/&tilde;/ and
1471 <cf/!&tilde;/ membership operators) can be used to modify or test
1472 eclists, with ECs instead of pairs as arguments.
1473
1474 <tag><label id="type-lclist">lclist/</tag>
1475 Lclist is a data type used for BGP large community lists. Like eclists,
1476 lclists are very similar to clists, but they are sets of LCs instead of
1477 pairs. The same operations (like <cf/add/, <cf/delete/ or <cf/&tilde;/
1478 and <cf/!&tilde;/ membership operators) can be used to modify or test
1479 lclists, with LCs instead of pairs as arguments.
1480 </descrip>
1481
1482
1483 <sect>Operators
1484 <label id="operators">
1485
1486 <p>The filter language supports common integer operators <cf>(+,-,*,/)</cf>,
1487 parentheses <cf/(a*(b+c))/, comparison <cf/(a=b, a!=b, a&lt;b, a&gt;=b)/.
1488 Logical operations include unary not (<cf/!/), and (<cf/&amp;&amp;/), and or
1489 (<cf/&verbar;&verbar;/). Special operators include (<cf/&tilde;/,
1490 <cf/!&tilde;/) for "is (not) element of a set" operation - it can be used on
1491 element and set of elements of the same type (returning true if element is
1492 contained in the given set), or on two strings (returning true if first string
1493 matches a shell-like pattern stored in second string) or on IP and prefix
1494 (returning true if IP is within the range defined by that prefix), or on prefix
1495 and prefix (returning true if first prefix is more specific than second one) or
1496 on bgppath and bgpmask (returning true if the path matches the mask) or on
1497 number and bgppath (returning true if the number is in the path) or on bgppath
1498 and int (number) set (returning true if any ASN from the path is in the set) or
1499 on pair/quad and clist (returning true if the pair/quad is element of the
1500 clist) or on clist and pair/quad set (returning true if there is an element of
1501 the clist that is also a member of the pair/quad set).
1502
1503 <p>There is one operator related to ROA infrastructure - <cf/roa_check()/. It
1504 examines a ROA table and does <rfc id="6483"> route origin validation for a
1505 given network prefix. The basic usage is <cf>roa_check(<m/table/)</cf>, which
1506 checks current route (which should be from BGP to have AS_PATH argument) in the
1507 specified ROA table and returns ROA_UNKNOWN if there is no relevant ROA,
1508 ROA_VALID if there is a matching ROA, or ROA_INVALID if there are some relevant
1509 ROAs but none of them match. There is also an extended variant
1510 <cf>roa_check(<m/table/, <m/prefix/, <m/asn/)</cf>, which allows to specify a
1511 prefix and an ASN as arguments.
1512
1513
1514 <sect>Control structures
1515 <label id="control-structures">
1516
1517 <p>Filters support two control structures: conditions and case switches.
1518
1519 <p>Syntax of a condition is: <cf>if <M>boolean expression</M> then <m/commandT/;
1520 else <m/commandF/;</cf> and you can use <cf>{ <m/command1/; <m/command2/;
1521 <M>...</M> }</cf> instead of either command. The <cf>else</cf> clause may be
1522 omitted. If the <cf><m>boolean expression</m></cf> is true, <m/commandT/ is
1523 executed, otherwise <m/commandF/ is executed.
1524
1525 <p>The <cf>case</cf> is similar to case from Pascal. Syntax is <cf>case
1526 <m/expr/ { else: | <m/num_or_prefix [ .. num_or_prefix]/: <m/statement/ ; [
1527 ... ] }</cf>. The expression after <cf>case</cf> can be of any type which can be
1528 on the left side of the &tilde; operator and anything that could be a member of
1529 a set is allowed before <cf/:/. Multiple commands are allowed without <cf/{}/
1530 grouping. If <cf><m/expr/</cf> matches one of the <cf/:/ clauses, statements
1531 between it and next <cf/:/ statement are executed. If <cf><m/expr/</cf> matches
1532 neither of the <cf/:/ clauses, the statements after <cf/else:/ are executed.
1533
1534 <p>Here is example that uses <cf/if/ and <cf/case/ structures:
1535
1536 <code>
1537 case arg1 {
1538 2: print "two"; print "I can do more commands without {}";
1539 3 .. 5: print "three to five";
1540 else: print "something else";
1541 }
1542
1543 if 1234 = i then printn "."; else {
1544 print "not 1234";
1545 print "You need {} around multiple commands";
1546 }
1547 </code>
1548
1549
1550 <sect>Route attributes
1551 <label id="route-attributes">
1552
1553 <p>A filter is implicitly passed a route, and it can access its attributes just
1554 like it accesses variables. Attempts to access undefined attribute result in a
1555 runtime error; you can check if an attribute is defined by using the
1556 <cf>defined( <m>attribute</m> )</cf> operator. One notable exception to this
1557 rule are attributes of bgppath and *clist types, where undefined value is
1558 regarded as empty bgppath/*clist for most purposes.
1559
1560 <descrip>
1561 <tag><label id="rta-net"><m/prefix/ net</tag>
1562 The network prefix or anything else the route is talking about. The
1563 primary key of the routing table. Read-only. (See the <ref id="routes"
1564 name="chapter about routes">.)
1565
1566 <tag><label id="rta-scope"><m/enum/ scope</tag>
1567 The scope of the route. Possible values: <cf/SCOPE_HOST/ for routes
1568 local to this host, <cf/SCOPE_LINK/ for those specific for a physical
1569 link, <cf/SCOPE_SITE/ and <cf/SCOPE_ORGANIZATION/ for private routes and
1570 <cf/SCOPE_UNIVERSE/ for globally visible routes. This attribute is not
1571 interpreted by BIRD and can be used to mark routes in filters. The
1572 default value for new routes is <cf/SCOPE_UNIVERSE/.
1573
1574 <tag><label id="rta-preference"><m/int/ preference</tag>
1575 Preference of the route. Valid values are 0-65535. (See the chapter
1576 about routing tables.)
1577
1578 <tag><label id="rta-from"><m/ip/ from</tag>
1579 The router which the route has originated from.
1580
1581 <tag><label id="rta-gw"><m/ip/ gw</tag>
1582 Next hop packets routed using this route should be forwarded to.
1583
1584 <tag><label id="rta-proto"><m/string/ proto</tag>
1585 The name of the protocol which the route has been imported from.
1586 Read-only.
1587
1588 <tag><label id="rta-source"><m/enum/ source</tag>
1589 what protocol has told me about this route. Possible values:
1590 <cf/RTS_DUMMY/, <cf/RTS_STATIC/, <cf/RTS_INHERIT/, <cf/RTS_DEVICE/,
1591 <cf/RTS_STATIC_DEVICE/, <cf/RTS_REDIRECT/, <cf/RTS_RIP/, <cf/RTS_OSPF/,
1592 <cf/RTS_OSPF_IA/, <cf/RTS_OSPF_EXT1/, <cf/RTS_OSPF_EXT2/, <cf/RTS_BGP/,
1593 <cf/RTS_PIPE/, <cf/RTS_BABEL/.
1594
1595 <tag><label id="rta-dest"><m/enum/ dest</tag>
1596 Type of destination the packets should be sent to
1597 (<cf/RTD_ROUTER/ for forwarding to a neighboring router,
1598 <cf/RTD_DEVICE/ for routing to a directly-connected network,
1599 <cf/RTD_MULTIPATH/ for multipath destinations,
1600 <cf/RTD_BLACKHOLE/ for packets to be silently discarded,
1601 <cf/RTD_UNREACHABLE/, <cf/RTD_PROHIBIT/ for packets that should be
1602 returned with ICMP host unreachable / ICMP administratively prohibited
1603 messages). Can be changed, but only to <cf/RTD_BLACKHOLE/,
1604 <cf/RTD_UNREACHABLE/ or <cf/RTD_PROHIBIT/.
1605
1606 <tag><label id="rta-ifname"><m/string/ ifname</tag>
1607 Name of the outgoing interface. Sink routes (like blackhole, unreachable
1608 or prohibit) and multipath routes have no interface associated with
1609 them, so <cf/ifname/ returns an empty string for such routes. Read-only.
1610
1611 <tag><label id="rta-ifindex"><m/int/ ifindex</tag>
1612 Index of the outgoing interface. System wide index of the interface. May
1613 be used for interface matching, however indexes might change on interface
1614 creation/removal. Zero is returned for routes with undefined outgoing
1615 interfaces. Read-only.
1616
1617 <tag><label id="rta-igp-metric"><m/int/ igp_metric</tag>
1618 The optional attribute that can be used to specify a distance to the
1619 network for routes that do not have a native protocol metric attribute
1620 (like <cf/ospf_metric1/ for OSPF routes). It is used mainly by BGP to
1621 compare internal distances to boundary routers (see below).
1622 </descrip>
1623
1624 <p>There also exist protocol-specific attributes which are described in the
1625 corresponding protocol sections.
1626
1627
1628 <sect>Other statements
1629 <label id="other-statements">
1630
1631 <p>The following statements are available:
1632
1633 <descrip>
1634 <tag><label id="assignment"><m/variable/ = <m/expr/</tag>
1635 Set variable to a given value.
1636
1637 <tag><label id="filter-accept-reject">accept|reject [ <m/expr/ ]</tag>
1638 Accept or reject the route, possibly printing <cf><m>expr</m></cf>.
1639
1640 <tag><label id="return">return <m/expr/</tag>
1641 Return <cf><m>expr</m></cf> from the current function, the function ends
1642 at this point.
1643
1644 <tag><label id="print">print|printn <m/expr/ [<m/, expr.../]</tag>
1645 Prints given expressions; useful mainly while debugging filters. The
1646 <cf/printn/ variant does not terminate the line.
1647
1648 <tag><label id="quitbird">quitbird</tag>
1649 Terminates BIRD. Useful when debugging the filter interpreter.
1650 </descrip>
1651
1652
1653 <chapt>Protocols
1654 <label id="protocols">
1655
1656 <sect>Babel
1657 <label id="babel">
1658
1659 <sect1>Introduction
1660 <label id="babel-intro">
1661
1662 <p>The Babel protocol
1663 (<rfc id="6126">) is a loop-avoiding distance-vector routing protocol that is
1664 robust and efficient both in ordinary wired networks and in wireless mesh
1665 networks. Babel is conceptually very simple in its operation and "just works"
1666 in its default configuration, though some configuration is possible and in some
1667 cases desirable.
1668
1669 <p>The Babel protocol is dual stack; i.e., it can carry both IPv4 and IPv6
1670 routes over the same IPv6 transport. For sending and receiving Babel packets,
1671 only a link-local IPv6 address is needed.
1672
1673 <p>BIRD implements an extension for IPv6 source-specific routing (SSR or SADR),
1674 but must be configured accordingly to use it. SADR-enabled Babel router can
1675 interoperate with non-SADR Babel router, but the later would ignore routes
1676 with specific (non-zero) source prefix.
1677
1678 <sect1>Configuration
1679 <label id="babel-config">
1680
1681 <p>The Babel protocol support both IPv4 and IPv6 channels; both can be
1682 configured simultaneously. It can also be configured with <ref
1683 id="ip-sadr-routes" name="IPv6 SADR"> channel instead of regular IPv6
1684 channel, in such case SADR support is enabled. Babel supports no global
1685 configuration options apart from those common to all other protocols, but
1686 supports the following per-interface configuration options:
1687
1688 <code>
1689 protocol babel [<name>] {
1690 ipv4 { <channel config> };
1691 ipv6 [sadr] { <channel config> };
1692 randomize router id <switch>;
1693 interface <interface pattern> {
1694 type <wired|wireless>;
1695 rxcost <number>;
1696 limit <number>;
1697 hello interval <time>;
1698 update interval <time>;
1699 port <number>;
1700 tx class|dscp <number>;
1701 tx priority <number>;
1702 rx buffer <number>;
1703 tx length <number>;
1704 check link <switch>;
1705 next hop ipv4 <address>;
1706 next hop ipv6 <address>;
1707 };
1708 }
1709 </code>
1710
1711 <descrip>
1712 <tag><label id="babel-channel">ipv4 | ipv6 [sadr] <m/channel config/</tag>
1713 The supported channels are IPv4, IPv6, and IPv6 SADR.
1714
1715 <tag><label id="babel-random-router-id">randomize router id <m/switch/</tag>
1716 If enabled, Bird will randomize the top 32 bits of its router ID whenever
1717 the protocol instance starts up. If a Babel node restarts, it loses its
1718 sequence number, which can cause its routes to be rejected by peers until
1719 the state is cleared out by other nodes in the network (which can take on
1720 the order of minutes). Enabling this option causes Bird to pick a random
1721 router ID every time it starts up, which avoids this problem at the cost
1722 of not having stable router IDs in the network. Default: no.
1723
1724 <tag><label id="babel-type">type wired|wireless </tag>
1725 This option specifies the interface type: Wired or wireless. On wired
1726 interfaces a neighbor is considered unreachable after a small number of
1727 Hello packets are lost, as described by <cf/limit/ option. On wireless
1728 interfaces the ETX link quality estimation technique is used to compute
1729 the metrics of routes discovered over this interface. This technique will
1730 gradually degrade the metric of routes when packets are lost rather than
1731 the more binary up/down mechanism of wired type links. Default:
1732 <cf/wired/.
1733
1734 <tag><label id="babel-rxcost">rxcost <m/num/</tag>
1735 This option specifies the nominal RX cost of the interface. The effective
1736 neighbor costs for route metrics will be computed from this value with a
1737 mechanism determined by the interface <cf/type/. Note that in contrast to
1738 other routing protocols like RIP or OSPF, the <cf/rxcost/ specifies the
1739 cost of RX instead of TX, so it affects primarily neighbors' route
1740 selection and not local route selection. Default: 96 for wired interfaces,
1741 256 for wireless.
1742
1743 <tag><label id="babel-limit">limit <m/num/</tag>
1744 BIRD keeps track of received Hello messages from each neighbor to
1745 establish neighbor reachability. For wired type interfaces, this option
1746 specifies how many of last 16 hellos have to be correctly received in
1747 order to neighbor is assumed to be up. The option is ignored on wireless
1748 type interfaces, where gradual cost degradation is used instead of sharp
1749 limit. Default: 12.
1750
1751 <tag><label id="babel-hello">hello interval <m/time/ s|ms</tag>
1752 Interval at which periodic Hello messages are sent on this interface,
1753 with time units. Default: 4 seconds.
1754
1755 <tag><label id="babel-update">update interval <m/time/ s|ms</tag>
1756 Interval at which periodic (full) updates are sent, with time
1757 units. Default: 4 times the hello interval.
1758
1759 <tag><label id="babel-port">port <m/number/</tag>
1760 This option selects an UDP port to operate on. The default is to operate
1761 on port 6696 as specified in the Babel RFC.
1762
1763 <tag><label id="babel-tx-class">tx class|dscp|priority <m/number/</tag>
1764 These options specify the ToS/DiffServ/Traffic class/Priority of the
1765 outgoing Babel packets. See <ref id="proto-tx-class" name="tx class"> common
1766 option for detailed description.
1767
1768 <tag><label id="babel-rx-buffer">rx buffer <m/number/</tag>
1769 This option specifies the size of buffers used for packet processing.
1770 The buffer size should be bigger than maximal size of received packets.
1771 The default value is the interface MTU, and the value will be clamped to a
1772 minimum of 512 bytes + IP packet overhead.
1773
1774 <tag><label id="babel-tx-length">tx length <m/number/</tag>
1775 This option specifies the maximum length of generated Babel packets. To
1776 avoid IP fragmentation, it should not exceed the interface MTU value.
1777 The default value is the interface MTU value, and the value will be
1778 clamped to a minimum of 512 bytes + IP packet overhead.
1779
1780 <tag><label id="babel-check-link">check link <m/switch/</tag>
1781 If set, the hardware link state (as reported by OS) is taken into
1782 consideration. When the link disappears (e.g. an ethernet cable is
1783 unplugged), neighbors are immediately considered unreachable and all
1784 routes received from them are withdrawn. It is possible that some
1785 hardware drivers or platforms do not implement this feature. Default:
1786 yes.
1787
1788 <tag><label id="babel-next-hop-ipv4">next hop ipv4 <m/address/</tag>
1789 Set the next hop address advertised for IPv4 routes advertised on this
1790 interface. Default: the preferred IPv4 address of the interface.
1791
1792 <tag><label id="babel-next-hop-ipv6">next hop ipv6 <m/address/</tag>
1793 Set the next hop address advertised for IPv6 routes advertised on this
1794 interface. If not set, the same link-local address that is used as the
1795 source for Babel packets will be used. In normal operation, it should not
1796 be necessary to set this option.
1797 </descrip>
1798
1799 <sect1>Attributes
1800 <label id="babel-attr">
1801
1802 <p>Babel defines just one attribute: the internal babel metric of the route. It
1803 is exposed as the <cf/babel_metric/ attribute and has range from 1 to infinity
1804 (65535).
1805
1806 <sect1>Example
1807 <label id="babel-exam">
1808
1809 <p><code>
1810 protocol babel {
1811 interface "eth*" {
1812 type wired;
1813 };
1814 interface "wlan0", "wlan1" {
1815 type wireless;
1816 hello interval 1;
1817 rxcost 512;
1818 };
1819 interface "tap0";
1820
1821 # This matches the default of babeld: redistribute all addresses
1822 # configured on local interfaces, plus re-distribute all routes received
1823 # from other babel peers.
1824
1825 ipv4 {
1826 export where (source = RTS_DEVICE) || (source = RTS_BABEL);
1827 };
1828 ipv6 {
1829 export where (source = RTS_DEVICE) || (source = RTS_BABEL);
1830 };
1831 }
1832 </code>
1833
1834 <sect1>Known issues
1835 <label id="babel-issues">
1836
1837 <p>When retracting a route, Babel generates an unreachable route for a little
1838 while (according to RFC). The interaction of this behavior with other protocols
1839 is not well tested and strange things may happen.
1840
1841
1842 <sect>BFD
1843 <label id="bfd">
1844
1845 <sect1>Introduction
1846 <label id="bfd-intro">
1847
1848 <p>Bidirectional Forwarding Detection (BFD) is not a routing protocol itself, it
1849 is an independent tool providing liveness and failure detection. Routing
1850 protocols like OSPF and BGP use integrated periodic "hello" messages to monitor
1851 liveness of neighbors, but detection times of these mechanisms are high (e.g. 40
1852 seconds by default in OSPF, could be set down to several seconds). BFD offers
1853 universal, fast and low-overhead mechanism for failure detection, which could be
1854 attached to any routing protocol in an advisory role.
1855
1856 <p>BFD consists of mostly independent BFD sessions. Each session monitors an
1857 unicast bidirectional path between two BFD-enabled routers. This is done by
1858 periodically sending control packets in both directions. BFD does not handle
1859 neighbor discovery, BFD sessions are created on demand by request of other
1860 protocols (like OSPF or BGP), which supply appropriate information like IP
1861 addresses and associated interfaces. When a session changes its state, these
1862 protocols are notified and act accordingly (e.g. break an OSPF adjacency when
1863 the BFD session went down).
1864
1865 <p>BIRD implements basic BFD behavior as defined in <rfc id="5880"> (some
1866 advanced features like the echo mode or authentication are not implemented), IP
1867 transport for BFD as defined in <rfc id="5881"> and <rfc id="5883"> and
1868 interaction with client protocols as defined in <rfc id="5882">.
1869 We currently support at most one protocol instance.
1870
1871 <p>BFD packets are sent with a dynamic source port number. Linux systems use by
1872 default a bit different dynamic port range than the IANA approved one
1873 (49152-65535). If you experience problems with compatibility, please adjust
1874 <cf>/proc/sys/net/ipv4/ip_local_port_range</cf>
1875
1876 <sect1>Configuration
1877 <label id="bfd-config">
1878
1879 <p>BFD configuration consists mainly of multiple definitions of interfaces.
1880 Most BFD config options are session specific. When a new session is requested
1881 and dynamically created, it is configured from one of these definitions. For
1882 sessions to directly connected neighbors, <cf/interface/ definitions are chosen
1883 based on the interface associated with the session, while <cf/multihop/
1884 definition is used for multihop sessions. If no definition is relevant, the
1885 session is just created with the default configuration. Therefore, an empty BFD
1886 configuration is often sufficient.
1887
1888 <p>Note that to use BFD for other protocols like OSPF or BGP, these protocols
1889 also have to be configured to request BFD sessions, usually by <cf/bfd/ option.
1890
1891 <p>Some of BFD session options require <m/time/ value, which has to be specified
1892 with the appropriate unit: <m/num/ <cf/s/|<cf/ms/|<cf/us/. Although microseconds
1893 are allowed as units, practical minimum values are usually in order of tens of
1894 milliseconds.
1895
1896 <code>
1897 protocol bfd [&lt;name&gt;] {
1898 interface &lt;interface pattern&gt; {
1899 interval &lt;time&gt;;
1900 min rx interval &lt;time&gt;;
1901 min tx interval &lt;time&gt;;
1902 idle tx interval &lt;time&gt;;
1903 multiplier &lt;num&gt;;
1904 passive &lt;switch&gt;;
1905 authentication none;
1906 authentication simple;
1907 authentication [meticulous] keyed md5|sha1;
1908 password "&lt;text&gt;";
1909 password "&lt;text&gt;" {
1910 id &lt;num&gt;;
1911 generate from "&lt;date&gt;";
1912 generate to "&lt;date&gt;";
1913 accept from "&lt;date&gt;";
1914 accept to "&lt;date&gt;";
1915 from "&lt;date&gt;";
1916 to "&lt;date&gt;";
1917 };
1918 };
1919 multihop {
1920 interval &lt;time&gt;;
1921 min rx interval &lt;time&gt;;
1922 min tx interval &lt;time&gt;;
1923 idle tx interval &lt;time&gt;;
1924 multiplier &lt;num&gt;;
1925 passive &lt;switch&gt;;
1926 };
1927 neighbor &lt;ip&gt; [dev "&lt;interface&gt;"] [local &lt;ip&gt;] [multihop &lt;switch&gt;];
1928 }
1929 </code>
1930
1931 <descrip>
1932 <tag><label id="bfd-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
1933 Interface definitions allow to specify options for sessions associated
1934 with such interfaces and also may contain interface specific options.
1935 See <ref id="proto-iface" name="interface"> common option for a detailed
1936 description of interface patterns. Note that contrary to the behavior of
1937 <cf/interface/ definitions of other protocols, BFD protocol would accept
1938 sessions (in default configuration) even on interfaces not covered by
1939 such definitions.
1940
1941 <tag><label id="bfd-multihop">multihop { <m/options/ }</tag>
1942 Multihop definitions allow to specify options for multihop BFD sessions,
1943 in the same manner as <cf/interface/ definitions are used for directly
1944 connected sessions. Currently only one such definition (for all multihop
1945 sessions) could be used.
1946
1947 <tag><label id="bfd-neighbor">neighbor <m/ip/ [dev "<m/interface/"] [local <m/ip/] [multihop <m/switch/]</tag>
1948 BFD sessions are usually created on demand as requested by other
1949 protocols (like OSPF or BGP). This option allows to explicitly add
1950 a BFD session to the specified neighbor regardless of such requests.
1951
1952 The session is identified by the IP address of the neighbor, with
1953 optional specification of used interface and local IP. By default
1954 the neighbor must be directly connected, unless the session is
1955 configured as multihop. Note that local IP must be specified for
1956 multihop sessions.
1957 </descrip>
1958
1959 <p>Session specific options (part of <cf/interface/ and <cf/multihop/ definitions):
1960
1961 <descrip>
1962 <tag><label id="bfd-interval">interval <m/time/</tag>
1963 BFD ensures availability of the forwarding path associated with the
1964 session by periodically sending BFD control packets in both
1965 directions. The rate of such packets is controlled by two options,
1966 <cf/min rx interval/ and <cf/min tx interval/ (see below). This option
1967 is just a shorthand to set both of these options together.
1968
1969 <tag><label id="bfd-min-rx-interval">min rx interval <m/time/</tag>
1970 This option specifies the minimum RX interval, which is announced to the
1971 neighbor and used there to limit the neighbor's rate of generated BFD
1972 control packets. Default: 10 ms.
1973
1974 <tag><label id="bfd-min-tx-interval">min tx interval <m/time/</tag>
1975 This option specifies the desired TX interval, which controls the rate
1976 of generated BFD control packets (together with <cf/min rx interval/
1977 announced by the neighbor). Note that this value is used only if the BFD
1978 session is up, otherwise the value of <cf/idle tx interval/ is used
1979 instead. Default: 100 ms.
1980
1981 <tag><label id="bfd-idle-tx-interval">idle tx interval <m/time/</tag>
1982 In order to limit unnecessary traffic in cases where a neighbor is not
1983 available or not running BFD, the rate of generated BFD control packets
1984 is lower when the BFD session is not up. This option specifies the
1985 desired TX interval in such cases instead of <cf/min tx interval/.
1986 Default: 1 s.
1987
1988 <tag><label id="bfd-multiplier">multiplier <m/num/</tag>
1989 Failure detection time for BFD sessions is based on established rate of
1990 BFD control packets (<cf>min rx/tx interval</cf>) multiplied by this
1991 multiplier, which is essentially (ignoring jitter) a number of missed
1992 packets after which the session is declared down. Note that rates and
1993 multipliers could be different in each direction of a BFD session.
1994 Default: 5.
1995
1996 <tag><label id="bfd-passive">passive <m/switch/</tag>
1997 Generally, both BFD session endpoints try to establish the session by
1998 sending control packets to the other side. This option allows to enable
1999 passive mode, which means that the router does not send BFD packets
2000 until it has received one from the other side. Default: disabled.
2001
2002 <tag>authentication none</tag>
2003 No passwords are sent in BFD packets. This is the default value.
2004
2005 <tag>authentication simple</tag>
2006 Every packet carries 16 bytes of password. Received packets lacking this
2007 password are ignored. This authentication mechanism is very weak.
2008
2009 <tag>authentication [meticulous] keyed md5|sha1</tag>
2010 An authentication code is appended to each packet. The cryptographic
2011 algorithm is keyed MD5 or keyed SHA-1. Note that the algorithm is common
2012 for all keys (on one interface), in contrast to OSPF or RIP, where it
2013 is a per-key option. Passwords (keys) are not sent open via network.
2014
2015 The <cf/meticulous/ variant means that cryptographic sequence numbers
2016 are increased for each sent packet, while in the basic variant they are
2017 increased about once per second. Generally, the <cf/meticulous/ variant
2018 offers better resistance to replay attacks but may require more
2019 computation.
2020
2021 <tag>password "<M>text</M>"</tag>
2022 Specifies a password used for authentication. See <ref id="proto-pass"
2023 name="password"> common option for detailed description. Note that
2024 password option <cf/algorithm/ is not available in BFD protocol. The
2025 algorithm is selected by <cf/authentication/ option for all passwords.
2026
2027 </descrip>
2028
2029 <sect1>Example
2030 <label id="bfd-exam">
2031
2032 <p><code>
2033 protocol bfd {
2034 interface "eth*" {
2035 min rx interval 20 ms;
2036 min tx interval 50 ms;
2037 idle tx interval 300 ms;
2038 };
2039 interface "gre*" {
2040 interval 200 ms;
2041 multiplier 10;
2042 passive;
2043 };
2044 multihop {
2045 interval 200 ms;
2046 multiplier 10;
2047 };
2048
2049 neighbor 192.168.1.10;
2050 neighbor 192.168.2.2 dev "eth2";
2051 neighbor 192.168.10.1 local 192.168.1.1 multihop;
2052 }
2053 </code>
2054
2055
2056 <sect>BGP
2057 <label id="bgp">
2058
2059 <p>The Border Gateway Protocol is the routing protocol used for backbone level
2060 routing in the today's Internet. Contrary to other protocols, its convergence
2061 does not rely on all routers following the same rules for route selection,
2062 making it possible to implement any routing policy at any router in the network,
2063 the only restriction being that if a router advertises a route, it must accept
2064 and forward packets according to it.
2065
2066 <p>BGP works in terms of autonomous systems (often abbreviated as AS). Each AS
2067 is a part of the network with common management and common routing policy. It is
2068 identified by a unique 16-bit number (ASN). Routers within each AS usually
2069 exchange AS-internal routing information with each other using an interior
2070 gateway protocol (IGP, such as OSPF or RIP). Boundary routers at the border of
2071 the AS communicate global (inter-AS) network reachability information with their
2072 neighbors in the neighboring AS'es via exterior BGP (eBGP) and redistribute
2073 received information to other routers in the AS via interior BGP (iBGP).
2074
2075 <p>Each BGP router sends to its neighbors updates of the parts of its routing
2076 table it wishes to export along with complete path information (a list of AS'es
2077 the packet will travel through if it uses the particular route) in order to
2078 avoid routing loops.
2079
2080 <sect1>Supported standards
2081 <label id="bgp-standards">
2082
2083 <p>
2084 <itemize>
2085 <item> <rfc id="4271"> - Border Gateway Protocol 4 (BGP)
2086 <item> <rfc id="1997"> - BGP Communities Attribute
2087 <item> <rfc id="2385"> - Protection of BGP Sessions via TCP MD5 Signature
2088 <item> <rfc id="2545"> - Use of BGP Multiprotocol Extensions for IPv6
2089 <item> <rfc id="2918"> - Route Refresh Capability
2090 <item> <rfc id="3107"> - Carrying Label Information in BGP
2091 <item> <rfc id="4360"> - BGP Extended Communities Attribute
2092 <item> <rfc id="4364"> - BGP/MPLS IPv4 Virtual Private Networks
2093 <item> <rfc id="4456"> - BGP Route Reflection
2094 <item> <rfc id="4486"> - Subcodes for BGP Cease Notification Message
2095 <item> <rfc id="4659"> - BGP/MPLS IPv6 Virtual Private Networks
2096 <item> <rfc id="4724"> - Graceful Restart Mechanism for BGP
2097 <item> <rfc id="4760"> - Multiprotocol extensions for BGP
2098 <item> <rfc id="4798"> - Connecting IPv6 Islands over IPv4 MPLS
2099 <item> <rfc id="5065"> - AS confederations for BGP
2100 <item> <rfc id="5082"> - Generalized TTL Security Mechanism
2101 <item> <rfc id="5492"> - Capabilities Advertisement with BGP
2102 <item> <rfc id="5549"> - Advertising IPv4 NLRI with an IPv6 Next Hop
2103 <item> <rfc id="5575"> - Dissemination of Flow Specification Rules
2104 <item> <rfc id="5668"> - 4-Octet AS Specific BGP Extended Community
2105 <item> <rfc id="6286"> - AS-Wide Unique BGP Identifier
2106 <item> <rfc id="6608"> - Subcodes for BGP Finite State Machine Error
2107 <item> <rfc id="6793"> - BGP Support for 4-Octet AS Numbers
2108 <item> <rfc id="7313"> - Enhanced Route Refresh Capability for BGP
2109 <item> <rfc id="7606"> - Revised Error Handling for BGP UPDATE Messages
2110 <item> <rfc id="7911"> - Advertisement of Multiple Paths in BGP
2111 <item> <rfc id="7947"> - Internet Exchange BGP Route Server
2112 <item> <rfc id="8092"> - BGP Large Communities Attribute
2113 <item> <rfc id="8203"> - BGP Administrative Shutdown Communication
2114 <item> <rfc id="8212"> - Default EBGP Route Propagation Behavior without Policies
2115 </itemize>
2116
2117 <sect1>Route selection rules
2118 <label id="bgp-route-select-rules">
2119
2120 <p>BGP doesn't have any simple metric, so the rules for selection of an optimal
2121 route among multiple BGP routes with the same preference are a bit more complex
2122 and they are implemented according to the following algorithm. It starts the
2123 first rule, if there are more "best" routes, then it uses the second rule to
2124 choose among them and so on.
2125
2126 <itemize>
2127 <item>Prefer route with the highest Local Preference attribute.
2128 <item>Prefer route with the shortest AS path.
2129 <item>Prefer IGP origin over EGP and EGP origin over incomplete.
2130 <item>Prefer the lowest value of the Multiple Exit Discriminator.
2131 <item>Prefer routes received via eBGP over ones received via iBGP.
2132 <item>Prefer routes with lower internal distance to a boundary router.
2133 <item>Prefer the route with the lowest value of router ID of the
2134 advertising router.
2135 </itemize>
2136
2137 <sect1>IGP routing table
2138 <label id="bgp-igp-routing-table">
2139
2140 <p>BGP is mainly concerned with global network reachability and with routes to
2141 other autonomous systems. When such routes are redistributed to routers in the
2142 AS via BGP, they contain IP addresses of a boundary routers (in route attribute
2143 NEXT_HOP). BGP depends on existing IGP routing table with AS-internal routes to
2144 determine immediate next hops for routes and to know their internal distances to
2145 boundary routers for the purpose of BGP route selection. In BIRD, there is
2146 usually one routing table used for both IGP routes and BGP routes.
2147
2148 <sect1>Protocol configuration
2149 <label id="bgp-proto-config">
2150
2151 <p>Each instance of the BGP corresponds to one neighboring router. This allows
2152 to set routing policy and all the other parameters differently for each neighbor
2153 using the following configuration parameters:
2154
2155 <descrip>
2156 <tag><label id="bgp-local">local [<m/ip/] [port <m/number/] [as <m/number/]</tag>
2157 Define which AS we are part of. (Note that contrary to other IP routers,
2158 BIRD is able to act as a router located in multiple AS'es simultaneously,
2159 but in such cases you need to tweak the BGP paths manually in the filters
2160 to get consistent behavior.) Optional <cf/ip/ argument specifies a source
2161 address, equivalent to the <cf/source address/ option (see below).
2162 Optional <cf/port/ argument specifies the local BGP port instead of
2163 standard port 179. The parameter may be used multiple times with
2164 different sub-options (e.g., both <cf/local 10.0.0.1 as 65000;/ and
2165 <cf/local 10.0.0.1; local as 65000;/ are valid). This parameter is
2166 mandatory.
2167
2168 <tag><label id="bgp-neighbor">neighbor [<m/ip/] [port <m/number/] [as <m/number/]</tag>
2169 Define neighboring router this instance will be talking to and what AS
2170 it is located in. In case the neighbor is in the same AS as we are, we
2171 automatically switch to iBGP. Optionally, the remote port may also be
2172 specified. Like <cf/local/ parameter, this parameter may also be used
2173 multiple times with different sub-options. This parameter is mandatory.
2174
2175 <tag><label id="bgp-iface">interface <m/string/</tag>
2176 Define interface we should use for link-local BGP IPv6 sessions.
2177 Interface can also be specified as a part of <cf/neighbor address/
2178 (e.g., <cf/neighbor fe80::1234%eth0 as 65000;/). The option may also be
2179 used for non link-local sessions when it is necessary to explicitly
2180 specify an interface, but only for direct (not multihop) sessions.
2181
2182 <tag><label id="bgp-direct">direct</tag>
2183 Specify that the neighbor is directly connected. The IP address of the
2184 neighbor must be from a directly reachable IP range (i.e. associated
2185 with one of your router's interfaces), otherwise the BGP session
2186 wouldn't start but it would wait for such interface to appear. The
2187 alternative is the <cf/multihop/ option. Default: enabled for eBGP.
2188
2189 <tag><label id="bgp-multihop">multihop [<m/number/]</tag>
2190 Configure multihop BGP session to a neighbor that isn't directly
2191 connected. Accurately, this option should be used if the configured
2192 neighbor IP address does not match with any local network subnets. Such
2193 IP address have to be reachable through system routing table. The
2194 alternative is the <cf/direct/ option. For multihop BGP it is
2195 recommended to explicitly configure the source address to have it
2196 stable. Optional <cf/number/ argument can be used to specify the number
2197 of hops (used for TTL). Note that the number of networks (edges) in a
2198 path is counted; i.e., if two BGP speakers are separated by one router,
2199 the number of hops is 2. Default: enabled for iBGP.
2200
2201 <tag><label id="bgp-source-address">source address <m/ip/</tag>
2202 Define local address we should use for next hop calculation and as a
2203 source address for the BGP session. Default: the address of the local
2204 end of the interface our neighbor is connected to.
2205
2206 <tag><label id="bgp-strict-bind">strict bind <m/switch/</tag>
2207 Specify whether BGP listening socket should be bound to a specific local
2208 address (the same as the <cf/source address/) and associated interface,
2209 or to all addresses. Binding to a specific address could be useful in
2210 cases like running multiple BIRD instances on a machine, each using its
2211 IP address. Note that listening sockets bound to a specific address and
2212 to all addresses collide, therefore either all BGP protocols (of the
2213 same address family and using the same local port) should have set
2214 <cf/strict bind/, or none of them. Default: disabled.
2215
2216 <tag><label id="bgp-check-link">check link <M>switch</M></tag>
2217 BGP could use hardware link state into consideration. If enabled,
2218 BIRD tracks the link state of the associated interface and when link
2219 disappears (e.g. an ethernet cable is unplugged), the BGP session is
2220 immediately shut down. Note that this option cannot be used with
2221 multihop BGP. Default: enabled for direct BGP, disabled otherwise.
2222
2223 <tag><label id="bgp-bfd">bfd <M>switch</M>|graceful</tag>
2224 BGP could use BFD protocol as an advisory mechanism for neighbor
2225 liveness and failure detection. If enabled, BIRD setups a BFD session
2226 for the BGP neighbor and tracks its liveness by it. This has an
2227 advantage of an order of magnitude lower detection times in case of
2228 failure. When a neighbor failure is detected, the BGP session is
2229 restarted. Optionally, it can be configured (by <cf/graceful/ argument)
2230 to trigger graceful restart instead of regular restart. Note that BFD
2231 protocol also has to be configured, see <ref id="bfd" name="BFD">
2232 section for details. Default: disabled.
2233
2234 <tag><label id="bgp-ttl-security">ttl security <m/switch/</tag>
2235 Use GTSM (<rfc id="5082"> - the generalized TTL security mechanism). GTSM
2236 protects against spoofed packets by ignoring received packets with a
2237 smaller than expected TTL. To work properly, GTSM have to be enabled on
2238 both sides of a BGP session. If both <cf/ttl security/ and
2239 <cf/multihop/ options are enabled, <cf/multihop/ option should specify
2240 proper hop value to compute expected TTL. Kernel support required:
2241 Linux: 2.6.34+ (IPv4), 2.6.35+ (IPv6), BSD: since long ago, IPv4 only.
2242 Note that full (ICMP protection, for example) <rfc id="5082"> support is
2243 provided by Linux only. Default: disabled.
2244
2245 <tag><label id="bgp-password">password <m/string/</tag>
2246 Use this password for MD5 authentication of BGP sessions (<rfc id="2385">). When
2247 used on BSD systems, see also <cf/setkey/ option below. Default: no
2248 authentication.
2249
2250 <tag><label id="bgp-setkey">setkey <m/switch/</tag>
2251 On BSD systems, keys for TCP MD5 authentication are stored in the global
2252 SA/SP database, which can be accessed by external utilities (e.g.
2253 setkey(8)). BIRD configures security associations in the SA/SP database
2254 automatically based on <cf/password/ options (see above), this option
2255 allows to disable automatic updates by BIRD when manual configuration by
2256 external utilities is preferred. Note that automatic SA/SP database
2257 updates are currently implemented only for FreeBSD. Passwords have to be
2258 set manually by an external utility on NetBSD and OpenBSD. Default:
2259 enabled (ignored on non-FreeBSD).
2260
2261 <tag><label id="bgp-passive">passive <m/switch/</tag>
2262 Standard BGP behavior is both initiating outgoing connections and
2263 accepting incoming connections. In passive mode, outgoing connections
2264 are not initiated. Default: off.
2265
2266 <tag><label id="bgp-confederation">confederation <m/number/</tag>
2267 BGP confederations (<rfc id="5065">) are collections of autonomous
2268 systems that act as one entity to external systems, represented by one
2269 confederation identifier (instead of AS numbers). This option allows to
2270 enable BGP confederation behavior and to specify the local confederation
2271 identifier. When BGP confederations are used, all BGP speakers that are
2272 members of the BGP confederation should have the same confederation
2273 identifier configured. Default: 0 (no confederation).
2274
2275 <tag><label id="bgp-confederation-member">confederation member <m/switch/</tag>
2276 When BGP confederations are used, this option allows to specify whether
2277 the BGP neighbor is a member of the same confederation as the local BGP
2278 speaker. The option is unnecessary (and ignored) for IBGP sessions, as
2279 the same AS number implies the same confederation. Default: no.
2280
2281 <tag><label id="bgp-rr-client">rr client</tag>
2282 Be a route reflector and treat the neighbor as a route reflection
2283 client. Default: disabled.
2284
2285 <tag><label id="bgp-rr-cluster-id">rr cluster id <m/IPv4 address/</tag>
2286 Route reflectors use cluster id to avoid route reflection loops. When
2287 there is one route reflector in a cluster it usually uses its router id
2288 as a cluster id, but when there are more route reflectors in a cluster,
2289 these need to be configured (using this option) to use a common cluster
2290 id. Clients in a cluster need not know their cluster id and this option
2291 is not allowed for them. Default: the same as router id.
2292
2293 <tag><label id="bgp-rs-client">rs client</tag>
2294 Be a route server and treat the neighbor as a route server client.
2295 A route server is used as a replacement for full mesh EBGP routing in
2296 Internet exchange points in a similar way to route reflectors used in
2297 IBGP routing. BIRD does not implement obsoleted <rfc id="1863">, but
2298 uses ad-hoc implementation, which behaves like plain EBGP but reduces
2299 modifications to advertised route attributes to be transparent (for
2300 example does not prepend its AS number to AS PATH attribute and
2301 keeps MED attribute). Default: disabled.
2302
2303 <tag><label id="bgp-allow-local-pref">allow bgp_local_pref <m/switch/</tag>
2304 A standard BGP implementation do not send the Local Preference attribute
2305 to eBGP neighbors and ignore this attribute if received from eBGP
2306 neighbors, as per <rfc id="4271">. When this option is enabled on an
2307 eBGP session, this attribute will be sent to and accepted from the peer,
2308 which is useful for example if you have a setup like in <rfc id="7938">.
2309 The option does not affect iBGP sessions. Default: off.
2310
2311 <tag><label id="bgp-allow-local-as">allow local as [<m/number/]</tag>
2312 BGP prevents routing loops by rejecting received routes with the local
2313 AS number in the AS path. This option allows to loose or disable the
2314 check. Optional <cf/number/ argument can be used to specify the maximum
2315 number of local ASNs in the AS path that is allowed for received
2316 routes. When the option is used without the argument, the check is
2317 completely disabled and you should ensure loop-free behavior by some
2318 other means. Default: 0 (no local AS number allowed).
2319
2320 <tag><label id="bgp-enable-route-refresh">enable route refresh <m/switch/</tag>
2321 After the initial route exchange, BGP protocol uses incremental updates
2322 to keep BGP speakers synchronized. Sometimes (e.g., if BGP speaker
2323 changes its import filter, or if there is suspicion of inconsistency) it
2324 is necessary to do a new complete route exchange. BGP protocol extension
2325 Route Refresh (<rfc id="2918">) allows BGP speaker to request
2326 re-advertisement of all routes from its neighbor. BGP protocol
2327 extension Enhanced Route Refresh (<rfc id="7313">) specifies explicit
2328 begin and end for such exchanges, therefore the receiver can remove
2329 stale routes that were not advertised during the exchange. This option
2330 specifies whether BIRD advertises these capabilities and supports
2331 related procedures. Note that even when disabled, BIRD can send route
2332 refresh requests. Default: on.
2333
2334 <tag><label id="bgp-graceful-restart">graceful restart <m/switch/|aware</tag>
2335 When a BGP speaker restarts or crashes, neighbors will discard all
2336 received paths from the speaker, which disrupts packet forwarding even
2337 when the forwarding plane of the speaker remains intact. <rfc id="4724">
2338 specifies an optional graceful restart mechanism to alleviate this
2339 issue. This option controls the mechanism. It has three states:
2340 Disabled, when no support is provided. Aware, when the graceful restart
2341 support is announced and the support for restarting neighbors is
2342 provided, but no local graceful restart is allowed (i.e. receiving-only
2343 role). Enabled, when the full graceful restart support is provided
2344 (i.e. both restarting and receiving role). Restarting role could be also
2345 configured per-channel. Note that proper support for local graceful
2346 restart requires also configuration of other protocols. Default: aware.
2347
2348 <tag><label id="bgp-graceful-restart-time">graceful restart time <m/number/</tag>
2349 The restart time is announced in the BGP graceful restart capability
2350 and specifies how long the neighbor would wait for the BGP session to
2351 re-establish after a restart before deleting stale routes. Default:
2352 120 seconds.
2353
2354 <tag><label id="bgp-long-lived-graceful-restart">long lived graceful restart <m/switch/|aware</tag>
2355 The long-lived graceful restart is an extension of the traditional
2356 <ref id="bgp-graceful-restart" name="BGP graceful restart">, where stale
2357 routes are kept even after the <ref id="bgp-graceful-restart-time"
2358 name="restart time"> expires for additional long-lived stale time, but
2359 they are marked with the LLGR_STALE community, depreferenced, and
2360 withdrawn from routers not supporting LLGR. Like traditional BGP
2361 graceful restart, it has three states: disabled, aware (receiving-only),
2362 and enabled. Note that long-lived graceful restart requires at least
2363 aware level of traditional BGP graceful restart. Default: aware, unless
2364 graceful restart is disabled.
2365
2366 <tag><label id="bgp-long-lived-stale-time">long lived stale time <m/number/</tag>
2367 The long-lived stale time is announced in the BGP long-lived graceful
2368 restart capability and specifies how long the neighbor would keep stale
2369 routes depreferenced during long-lived graceful restart until either the
2370 session is re-stablished and synchronized or the stale time expires and
2371 routes are removed. Default: 3600 seconds.
2372
2373 <tag><label id="bgp-interpret-communities">interpret communities <m/switch/</tag>
2374 <rfc id="1997"> demands that BGP speaker should process well-known
2375 communities like no-export (65535, 65281) or no-advertise (65535,
2376 65282). For example, received route carrying a no-adverise community
2377 should not be advertised to any of its neighbors. If this option is
2378 enabled (which is by default), BIRD has such behavior automatically (it
2379 is evaluated when a route is exported to the BGP protocol just before
2380 the export filter). Otherwise, this integrated processing of
2381 well-known communities is disabled. In that case, similar behavior can
2382 be implemented in the export filter. Default: on.
2383
2384 <tag><label id="bgp-enable-as4">enable as4 <m/switch/</tag>
2385 BGP protocol was designed to use 2B AS numbers and was extended later to
2386 allow 4B AS number. BIRD supports 4B AS extension, but by disabling this
2387 option it can be persuaded not to advertise it and to maintain old-style
2388 sessions with its neighbors. This might be useful for circumventing bugs
2389 in neighbor's implementation of 4B AS extension. Even when disabled
2390 (off), BIRD behaves internally as AS4-aware BGP router. Default: on.
2391
2392 <tag><label id="bgp-enable-extended-messages">enable extended messages <m/switch/</tag>
2393 The BGP protocol uses maximum message length of 4096 bytes. This option
2394 provides an extension to allow extended messages with length up
2395 to 65535 bytes. Default: off.
2396
2397 <tag><label id="bgp-capabilities">capabilities <m/switch/</tag>
2398 Use capability advertisement to advertise optional capabilities. This is
2399 standard behavior for newer BGP implementations, but there might be some
2400 older BGP implementations that reject such connection attempts. When
2401 disabled (off), features that request it (4B AS support) are also
2402 disabled. Default: on, with automatic fallback to off when received
2403 capability-related error.
2404
2405 <tag><label id="bgp-advertise-ipv4">advertise ipv4 <m/switch/</tag>
2406 Advertise IPv4 multiprotocol capability. This is not a correct behavior
2407 according to the strict interpretation of <rfc id="4760">, but it is
2408 widespread and required by some BGP implementations (Cisco and Quagga).
2409 This option is relevant to IPv4 mode with enabled capability
2410 advertisement only. Default: on.
2411
2412 <tag><label id="bgp-disable-after-error">disable after error <m/switch/</tag>
2413 When an error is encountered (either locally or by the other side),
2414 disable the instance automatically and wait for an administrator to fix
2415 the problem manually. Default: off.
2416
2417 <tag><label id="bgp-disable-after-cease">disable after cease <m/switch/|<m/set-of-flags/</tag>
2418 When a Cease notification is received, disable the instance
2419 automatically and wait for an administrator to fix the problem manually.
2420 When used with <m/switch/ argument, it means handle every Cease subtype
2421 with the exception of <cf/connection collision/. Default: off.
2422
2423 The <m/set-of-flags/ allows to narrow down relevant Cease subtypes. The
2424 syntax is <cf>{<m/flag/ [, <m/.../] }</cf>, where flags are: <cf/cease/,
2425 <cf/prefix limit hit/, <cf/administrative shutdown/,
2426 <cf/peer deconfigured/, <cf/administrative reset/,
2427 <cf/connection rejected/, <cf/configuration change/,
2428 <cf/connection collision/, <cf/out of resources/.
2429
2430 <tag><label id="bgp-hold-time">hold time <m/number/</tag>
2431 Time in seconds to wait for a Keepalive message from the other side
2432 before considering the connection stale. Default: depends on agreement
2433 with the neighboring router, we prefer 240 seconds if the other side is
2434 willing to accept it.
2435
2436 <tag><label id="bgp-startup-hold-time">startup hold time <m/number/</tag>
2437 Value of the hold timer used before the routers have a chance to exchange
2438 open messages and agree on the real value. Default: 240 seconds.
2439
2440 <tag><label id="bgp-keepalive-time">keepalive time <m/number/</tag>
2441 Delay in seconds between sending of two consecutive Keepalive messages.
2442 Default: One third of the hold time.
2443
2444 <tag><label id="bgp-connect-delay-time">connect delay time <m/number/</tag>
2445 Delay in seconds between protocol startup and the first attempt to
2446 connect. Default: 5 seconds.
2447
2448 <tag><label id="bgp-connect-retry-time">connect retry time <m/number/</tag>
2449 Time in seconds to wait before retrying a failed attempt to connect.
2450 Default: 120 seconds.
2451
2452 <tag><label id="bgp-error-wait-time">error wait time <m/number/,<m/number/</tag>
2453 Minimum and maximum delay in seconds between a protocol failure (either
2454 local or reported by the peer) and automatic restart. Doesn't apply
2455 when <cf/disable after error/ is configured. If consecutive errors
2456 happen, the delay is increased exponentially until it reaches the
2457 maximum. Default: 60, 300.
2458
2459 <tag><label id="bgp-error-forget-time">error forget time <m/number/</tag>
2460 Maximum time in seconds between two protocol failures to treat them as a
2461 error sequence which makes <cf/error wait time/ increase exponentially.
2462 Default: 300 seconds.
2463
2464 <tag><label id="bgp-path-metric">path metric <m/switch/</tag>
2465 Enable comparison of path lengths when deciding which BGP route is the
2466 best one. Default: on.
2467
2468 <tag><label id="bgp-med-metric">med metric <m/switch/</tag>
2469 Enable comparison of MED attributes (during best route selection) even
2470 between routes received from different ASes. This may be useful if all
2471 MED attributes contain some consistent metric, perhaps enforced in
2472 import filters of AS boundary routers. If this option is disabled, MED
2473 attributes are compared only if routes are received from the same AS
2474 (which is the standard behavior). Default: off.
2475
2476 <tag><label id="bgp-deterministic-med">deterministic med <m/switch/</tag>
2477 BGP route selection algorithm is often viewed as a comparison between
2478 individual routes (e.g. if a new route appears and is better than the
2479 current best one, it is chosen as the new best one). But the proper
2480 route selection, as specified by <rfc id="4271">, cannot be fully
2481 implemented in that way. The problem is mainly in handling the MED
2482 attribute. BIRD, by default, uses an simplification based on individual
2483 route comparison, which in some cases may lead to temporally dependent
2484 behavior (i.e. the selection is dependent on the order in which routes
2485 appeared). This option enables a different (and slower) algorithm
2486 implementing proper <rfc id="4271"> route selection, which is
2487 deterministic. Alternative way how to get deterministic behavior is to
2488 use <cf/med metric/ option. This option is incompatible with <ref
2489 id="dsc-table-sorted" name="sorted tables">. Default: off.
2490
2491 <tag><label id="bgp-igp-metric">igp metric <m/switch/</tag>
2492 Enable comparison of internal distances to boundary routers during best
2493 route selection. Default: on.
2494
2495 <tag><label id="bgp-prefer-older">prefer older <m/switch/</tag>
2496 Standard route selection algorithm breaks ties by comparing router IDs.
2497 This changes the behavior to prefer older routes (when both are external
2498 and from different peer). For details, see <rfc id="5004">. Default: off.
2499
2500 <tag><label id="bgp-default-med">default bgp_med <m/number/</tag>
2501 Value of the Multiple Exit Discriminator to be used during route
2502 selection when the MED attribute is missing. Default: 0.
2503
2504 <tag><label id="bgp-default-local-pref">default bgp_local_pref <m/number/</tag>
2505 A default value for the Local Preference attribute. It is used when
2506 a new Local Preference attribute is attached to a route by the BGP
2507 protocol itself (for example, if a route is received through eBGP and
2508 therefore does not have such attribute). Default: 100 (0 in pre-1.2.0
2509 versions of BIRD).
2510 </descrip>
2511
2512 <sect1>Channel configuration
2513 <label id="bgp-channel-config">
2514
2515 <p>BGP supports several AFIs and SAFIs over one connection. Every AFI/SAFI
2516 announced to the peer corresponds to one channel. The table of supported AFI/SAFIs
2517 together with their appropriate channels follows.
2518
2519 <table loc="h">
2520 <tabular ca="l|l|l|r|r">
2521 <bf/Channel name/ | <bf/Table nettype/ | <bf/IGP table allowed/ | <bf/AFI/ | <bf/SAFI/
2522 @<hline>
2523 <cf/ipv4/ | <cf/ipv4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 1
2524 @ <cf/ipv6/ | <cf/ipv6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 1
2525 @ <cf/ipv4 multicast/ | <cf/ipv4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 2
2526 @ <cf/ipv6 multicast/ | <cf/ipv6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 2
2527 @ <cf/ipv4 mpls/ | <cf/ipv4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 4
2528 @ <cf/ipv6 mpls/ | <cf/ipv6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 4
2529 @ <cf/vpn4 mpls/ | <cf/vpn4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 128
2530 @ <cf/vpn6 mpls/ | <cf/vpn6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 128
2531 @ <cf/vpn4 multicast/ | <cf/vpn4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 129
2532 @ <cf/vpn6 multicast/ | <cf/vpn6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 129
2533 @ <cf/flow4/ | <cf/flow4/ | --- | 1 | 133
2534 @ <cf/flow6/ | <cf/flow6/ | --- | 2 | 133
2535 </tabular>
2536 </table>
2537
2538 <p>Due to <rfc id="8212">, external BGP protocol requires explicit configuration
2539 of import and export policies (in contrast to other protocols, where default
2540 policies of <cf/import all/ and <cf/export none/ are used in absence of explicit
2541 configuration). Note that blanket policies like <cf/all/ or <cf/none/ can still
2542 be used in explicit configuration.
2543
2544 <p>BGP channels have additional config options (together with the common ones):
2545
2546 <descrip>
2547 <tag><label id="bgp-next-hop-keep">next hop keep</tag>
2548 Forward the received Next Hop attribute even in situations where the
2549 local address should be used instead, like when the route is sent to an
2550 interface with a different subnet. Default: disabled.
2551
2552 <tag><label id="bgp-next-hop-self">next hop self</tag>
2553 Avoid calculation of the Next Hop attribute and always advertise our own
2554 source address as a next hop. This needs to be used only occasionally to
2555 circumvent misconfigurations of other routers. Default: disabled.
2556
2557 <tag><label id="bgp-next-hop-address">next hop address <m/ip/</tag>
2558 Avoid calculation of the Next Hop attribute and always advertise this address
2559 as a next hop.
2560
2561 <tag><label id="bgp-missing-lladdr">missing lladdr self|drop|ignore</tag>
2562 Next Hop attribute in BGP-IPv6 sometimes contains just the global IPv6
2563 address, but sometimes it has to contain both global and link-local IPv6
2564 addresses. This option specifies what to do if BIRD have to send both
2565 addresses but does not know link-local address. This situation might
2566 happen when routes from other protocols are exported to BGP, or when
2567 improper updates are received from BGP peers. <cf/self/ means that BIRD
2568 advertises its own local address instead. <cf/drop/ means that BIRD
2569 skips that prefixes and logs error. <cf/ignore/ means that BIRD ignores
2570 the problem and sends just the global address (and therefore forms
2571 improper BGP update). Default: <cf/self/, unless BIRD is configured as a
2572 route server (option <cf/rs client/), in that case default is <cf/ignore/,
2573 because route servers usually do not forward packets themselves.
2574
2575 <tag><label id="bgp-gateway">gateway direct|recursive</tag>
2576 For received routes, their <cf/gw/ (immediate next hop) attribute is
2577 computed from received <cf/bgp_next_hop/ attribute. This option
2578 specifies how it is computed. Direct mode means that the IP address from
2579 <cf/bgp_next_hop/ is used if it is directly reachable, otherwise the
2580 neighbor IP address is used. Recursive mode means that the gateway is
2581 computed by an IGP routing table lookup for the IP address from
2582 <cf/bgp_next_hop/. Note that there is just one level of indirection in
2583 recursive mode - the route obtained by the lookup must not be recursive
2584 itself, to prevent mutually recursive routes.
2585
2586 Recursive mode is the behavior specified by the BGP
2587 standard. Direct mode is simpler, does not require any routes in a
2588 routing table, and was used in older versions of BIRD, but does not
2589 handle well nontrivial iBGP setups and multihop. Recursive mode is
2590 incompatible with <ref id="dsc-table-sorted" name="sorted tables">. Default:
2591 <cf/direct/ for direct sessions, <cf/recursive/ for multihop sessions.
2592
2593 <tag><label id="bgp-igp-table">igp table <m/name/</tag>
2594 Specifies a table that is used as an IGP routing table. The type of this
2595 table must be as allowed in the table above. This option is allowed once
2596 for every allowed table type. Default: the same as the main table
2597 the channel is connected to (if eligible).
2598
2599 <tag><label id="bgp-secondary">secondary <m/switch/</tag>
2600 Usually, if an export filter rejects a selected route, no other route is
2601 propagated for that network. This option allows to try the next route in
2602 order until one that is accepted is found or all routes for that network
2603 are rejected. This can be used for route servers that need to propagate
2604 different tables to each client but do not want to have these tables
2605 explicitly (to conserve memory). This option requires that the connected
2606 routing table is <ref id="dsc-table-sorted" name="sorted">. Default: off.
2607
2608 <tag><label id="bgp-extended-next-hop">extended next hop <m/switch/</tag>
2609 BGP expects that announced next hops have the same address family as
2610 associated network prefixes. This option provides an extension to use
2611 IPv4 next hops with IPv6 prefixes and vice versa. For IPv4 / VPNv4
2612 channels, the behavior is controlled by the Extended Next Hop Encoding
2613 capability, as described in <rfc id="5549">. For IPv6 / VPNv6 channels,
2614 just IPv4-mapped IPv6 addresses are used, as described in
2615 <rfc id="4798"> and <rfc id="4659">. Default: off.
2616
2617 <tag><label id="bgp-add-paths">add paths <m/switch/|rx|tx</tag>
2618 Standard BGP can propagate only one path (route) per destination network
2619 (usually the selected one). This option controls the add-path protocol
2620 extension, which allows to advertise any number of paths to a
2621 destination. Note that to be active, add-path has to be enabled on both
2622 sides of the BGP session, but it could be enabled separately for RX and
2623 TX direction. When active, all available routes accepted by the export
2624 filter are advertised to the neighbor. Default: off.
2625
2626 <tag><label id="bgp-graceful-restart-c">graceful restart <m/switch/</tag>
2627 Although BGP graceful restart is configured mainly by protocol-wide
2628 <ref id="bgp-graceful-restart" name="options">, it is possible to
2629 configure restarting role per AFI/SAFI pair by this channel option.
2630 The option is ignored if graceful restart is disabled by protocol-wide
2631 option. Default: off in aware mode, on in full mode.
2632
2633 <tag><label id="bgp-long-lived-graceful-restart-c">long lived graceful restart <m/switch/</tag>
2634 BGP long-lived graceful restart is configured mainly by protocol-wide
2635 <ref id="bgp-long-lived-graceful-restart" name="options">, but the
2636 restarting role can be set per AFI/SAFI pair by this channel option.
2637 The option is ignored if long-lived graceful restart is disabled by
2638 protocol-wide option. Default: off in aware mode, on in full mode.
2639
2640 <tag><label id="bgp-long-lived-stale-time-c">long lived stale time <m/number/</tag>
2641 Like previous graceful restart channel options, this option allows to
2642 set <ref id="bgp-long-lived-stale-time" name="long lived stale time">
2643 per AFI/SAFI pair instead of per protocol. Default: set by protocol-wide
2644 option.
2645 </descrip>
2646
2647 <sect1>Attributes
2648 <label id="bgp-attr">
2649
2650 <p>BGP defines several route attributes. Some of them (those marked with
2651 `<tt/I/' in the table below) are available on internal BGP connections only,
2652 some of them (marked with `<tt/O/') are optional.
2653
2654 <descrip>
2655 <tag><label id="rta-bgp-path">bgppath bgp_path</tag>
2656 Sequence of AS numbers describing the AS path the packet will travel
2657 through when forwarded according to the particular route. In case of
2658 internal BGP it doesn't contain the number of the local AS.
2659
2660 <tag><label id="rta-bgp-local-pref">int bgp_local_pref [I]</tag>
2661 Local preference value used for selection among multiple BGP routes (see
2662 the selection rules above). It's used as an additional metric which is
2663 propagated through the whole local AS.
2664
2665 <tag><label id="rta-bgp-med">int bgp_med [O]</tag>
2666 The Multiple Exit Discriminator of the route is an optional attribute
2667 which is used on external (inter-AS) links to convey to an adjacent AS
2668 the optimal entry point into the local AS. The received attribute is
2669 also propagated over internal BGP links. The attribute value is zeroed
2670 when a route is exported to an external BGP instance to ensure that the
2671 attribute received from a neighboring AS is not propagated to other
2672 neighboring ASes. A new value might be set in the export filter of an
2673 external BGP instance. See <rfc id="4451"> for further discussion of
2674 BGP MED attribute.
2675
2676 <tag><label id="rta-bgp-origin">enum bgp_origin</tag>
2677 Origin of the route: either <cf/ORIGIN_IGP/ if the route has originated
2678 in an interior routing protocol or <cf/ORIGIN_EGP/ if it's been imported
2679 from the <tt>EGP</tt> protocol (nowadays it seems to be obsolete) or
2680 <cf/ORIGIN_INCOMPLETE/ if the origin is unknown.
2681
2682 <tag><label id="rta-bgp-next-hop">ip bgp_next_hop</tag>
2683 Next hop to be used for forwarding of packets to this destination. On
2684 internal BGP connections, it's an address of the originating router if
2685 it's inside the local AS or a boundary router the packet will leave the
2686 AS through if it's an exterior route, so each BGP speaker within the AS
2687 has a chance to use the shortest interior path possible to this point.
2688
2689 <tag><label id="rta-bgp-atomic-aggr">void bgp_atomic_aggr [O]</tag>
2690 This is an optional attribute which carries no value, but the sole
2691 presence of which indicates that the route has been aggregated from
2692 multiple routes by some router on the path from the originator.
2693
2694 <!-- we don't handle aggregators right since they are of a very obscure type
2695 <tag>bgp_aggregator</tag>
2696 -->
2697 <tag><label id="rta-bgp-community">clist bgp_community [O]</tag>
2698 List of community values associated with the route. Each such value is a
2699 pair (represented as a <cf/pair/ data type inside the filters) of 16-bit
2700 integers, the first of them containing the number of the AS which
2701 defines the community and the second one being a per-AS identifier.
2702 There are lots of uses of the community mechanism, but generally they
2703 are used to carry policy information like "don't export to USA peers".
2704 As each AS can define its own routing policy, it also has a complete
2705 freedom about which community attributes it defines and what will their
2706 semantics be.
2707
2708 <tag><label id="rta-bgp-ext-community">eclist bgp_ext_community [O]</tag>
2709 List of extended community values associated with the route. Extended
2710 communities have similar usage as plain communities, but they have an
2711 extended range (to allow 4B ASNs) and a nontrivial structure with a type
2712 field. Individual community values are represented using an <cf/ec/ data
2713 type inside the filters.
2714
2715 <tag><label id="rta-bgp-large-community">lclist bgp_large_community [O]</tag>
2716 List of large community values associated with the route. Large BGP
2717 communities is another variant of communities, but contrary to extended
2718 communities they behave very much the same way as regular communities,
2719 just larger -- they are uniform untyped triplets of 32bit numbers.
2720 Individual community values are represented using an <cf/lc/ data type
2721 inside the filters.
2722
2723 <tag><label id="rta-bgp-originator-id">quad bgp_originator_id [I, O]</tag>
2724 This attribute is created by the route reflector when reflecting the
2725 route and contains the router ID of the originator of the route in the
2726 local AS.
2727
2728 <tag><label id="rta-bgp-cluster-list">clist bgp_cluster_list [I, O]</tag>
2729 This attribute contains a list of cluster IDs of route reflectors. Each
2730 route reflector prepends its cluster ID when reflecting the route.
2731 </descrip>
2732
2733 <sect1>Example
2734 <label id="bgp-exam">
2735
2736 <p><code>
2737 protocol bgp {
2738 local 198.51.100.14 as 65000; # Use a private AS number
2739 neighbor 198.51.100.130 as 64496; # Our neighbor ...
2740 multihop; # ... which is connected indirectly
2741 ipv4 {
2742 export filter { # We use non-trivial export rules
2743 if source = RTS_STATIC then { # Export only static routes
2744 # Assign our community
2745 bgp_community.add((65000,64501));
2746 # Artificially increase path length
2747 # by advertising local AS number twice
2748 if bgp_path ~ [= 65000 =] then
2749 bgp_path.prepend(65000);
2750 accept;
2751 }
2752 reject;
2753 };
2754 import all;
2755 next hop self; # advertise this router as next hop
2756 igp table myigptable4; # IGP table for routes with IPv4 nexthops
2757 igp table myigptable6; # IGP table for routes with IPv6 nexthops
2758 };
2759 ipv6 {
2760 export filter mylargefilter; # We use a named filter
2761 import all;
2762 missing lladdr self;
2763 igp table myigptable4; # IGP table for routes with IPv4 nexthops
2764 igp table myigptable6; # IGP table for routes with IPv6 nexthops
2765 };
2766 ipv4 multicast {
2767 import all;
2768 export filter someotherfilter;
2769 table mymulticasttable4; # Another IPv4 table, dedicated for multicast
2770 igp table myigptable4;
2771 };
2772 }
2773 </code>
2774
2775
2776 <sect>Device
2777 <label id="device">
2778
2779 <p>The Device protocol is not a real routing protocol. It doesn't generate any
2780 routes and it only serves as a module for getting information about network
2781 interfaces from the kernel. This protocol supports no channel.
2782
2783 <p>Except for very unusual circumstances, you probably should include this
2784 protocol in the configuration since almost all other protocols require network
2785 interfaces to be defined for them to work with.
2786
2787 <sect1>Configuration
2788 <label id="device-config">
2789
2790 <p><descrip>
2791 <tag><label id="device-scan-time">scan time <m/number/</tag>
2792 Time in seconds between two scans of the network interface list. On
2793 systems where we are notified about interface status changes
2794 asynchronously (such as newer versions of Linux), we need to scan the
2795 list only in order to avoid confusion by lost notification messages,
2796 so the default time is set to a large value.
2797
2798 <tag><label id="device-iface">interface <m/pattern/ [, <m/.../]</tag>
2799 By default, the Device protocol handles all interfaces without any
2800 configuration. Interface definitions allow to specify optional
2801 parameters for specific interfaces. See <ref id="proto-iface"
2802 name="interface"> common option for detailed description. Currently only
2803 one interface option is available:
2804
2805 <tag><label id="device-preferred">preferred <m/ip/</tag>
2806 If a network interface has more than one IP address, BIRD chooses one of
2807 them as a preferred one. Preferred IP address is used as source address
2808 for packets or announced next hop by routing protocols. Precisely, BIRD
2809 chooses one preferred IPv4 address, one preferred IPv6 address and one
2810 preferred link-local IPv6 address. By default, BIRD chooses the first
2811 found IP address as the preferred one.
2812
2813 This option allows to specify which IP address should be preferred. May
2814 be used multiple times for different address classes (IPv4, IPv6, IPv6
2815 link-local). In all cases, an address marked by operating system as
2816 secondary cannot be chosen as the primary one.
2817 </descrip>
2818
2819 <p>As the Device protocol doesn't generate any routes, it cannot have
2820 any attributes. Example configuration looks like this:
2821
2822 <p><code>
2823 protocol device {
2824 scan time 10; # Scan the interfaces often
2825 interface "eth0" {
2826 preferred 192.168.1.1;
2827 preferred 2001:db8:1:10::1;
2828 };
2829 }
2830 </code>
2831
2832
2833 <sect>Direct
2834 <label id="direct">
2835
2836 <p>The Direct protocol is a simple generator of device routes for all the
2837 directly connected networks according to the list of interfaces provided by the
2838 kernel via the Device protocol. The Direct protocol supports both IPv4 and IPv6
2839 channels; both can be configured simultaneously. It can also be configured with
2840 <ref id="ip-sadr-routes" name="IPv6 SADR"> channel instead of regular IPv6
2841 channel in order to be used together with SADR-enabled Babel protocol.
2842
2843 <p>The question is whether it is a good idea to have such device routes in BIRD
2844 routing table. OS kernel usually handles device routes for directly connected
2845 networks by itself so we don't need (and don't want) to export these routes to
2846 the kernel protocol. OSPF protocol creates device routes for its interfaces
2847 itself and BGP protocol is usually used for exporting aggregate routes. But the
2848 Direct protocol is necessary for distance-vector protocols like RIP or Babel to
2849 announce local networks.
2850
2851 <p>There are just few configuration options for the Direct protocol:
2852
2853 <p><descrip>
2854 <tag><label id="direct-iface">interface <m/pattern/ [, <m/.../]</tag>
2855 By default, the Direct protocol will generate device routes for all the
2856 interfaces available. If you want to restrict it to some subset of
2857 interfaces or addresses (e.g. if you're using multiple routing tables
2858 for policy routing and some of the policy domains don't contain all
2859 interfaces), just use this clause. See <ref id="proto-iface" name="interface">
2860 common option for detailed description. The Direct protocol uses
2861 extended interface clauses.
2862
2863 <tag><label id="direct-check-link">check link <m/switch/</tag>
2864 If enabled, a hardware link state (reported by OS) is taken into
2865 consideration. Routes for directly connected networks are generated only
2866 if link up is reported and they are withdrawn when link disappears
2867 (e.g., an ethernet cable is unplugged). Default value is no.
2868 </descrip>
2869
2870 <p>Direct device routes don't contain any specific attributes.
2871
2872 <p>Example config might look like this:
2873
2874 <p><code>
2875 protocol direct {
2876 ipv4;
2877 ipv6;
2878 interface "-arc*", "*"; # Exclude the ARCnets
2879 }
2880 </code>
2881
2882
2883 <sect>Kernel
2884 <label id="krt">
2885
2886 <p>The Kernel protocol is not a real routing protocol. Instead of communicating
2887 with other routers in the network, it performs synchronization of BIRD's routing
2888 tables with the OS kernel. Basically, it sends all routing table updates to the
2889 kernel and from time to time it scans the kernel tables to see whether some
2890 routes have disappeared (for example due to unnoticed up/down transition of an
2891 interface) or whether an `alien' route has been added by someone else (depending
2892 on the <cf/learn/ switch, such routes are either ignored or accepted to our
2893 table).
2894
2895 <p>Note that routes created by OS kernel itself, namely direct routes
2896 representing IP subnets of associated interfaces, are not imported even with
2897 <cf/learn/ enabled. You can use <ref id="direct" name="Direct protocol"> to
2898 generate these direct routes.
2899
2900 <p>If your OS supports only a single routing table, you can configure only one
2901 instance of the Kernel protocol. If it supports multiple tables (in order to
2902 allow policy routing; such an OS is for example Linux), you can run as many
2903 instances as you want, but each of them must be connected to a different BIRD
2904 routing table and to a different kernel table.
2905
2906 <p>Because the kernel protocol is partially integrated with the connected
2907 routing table, there are two limitations - it is not possible to connect more
2908 kernel protocols to the same routing table and changing route destination
2909 (gateway) in an export filter of a kernel protocol does not work. Both
2910 limitations can be overcome using another routing table and the pipe protocol.
2911
2912 <p>The Kernel protocol supports both IPv4 and IPv6 channels; only one channel
2913 can be configured in each protocol instance. On Linux, it also supports <ref
2914 id="ip-sadr-routes" name="IPv6 SADR"> and <ref id="mpls-routes" name="MPLS">
2915 channels.
2916
2917 <sect1>Configuration
2918 <label id="krt-config">
2919
2920 <p><descrip>
2921 <tag><label id="krt-persist">persist <m/switch/</tag>
2922 Tell BIRD to leave all its routes in the routing tables when it exits
2923 (instead of cleaning them up).
2924
2925 <tag><label id="krt-scan-time">scan time <m/number/</tag>
2926 Time in seconds between two consecutive scans of the kernel routing
2927 table.
2928
2929 <tag><label id="krt-learn">learn <m/switch/</tag>
2930 Enable learning of routes added to the kernel routing tables by other
2931 routing daemons or by the system administrator. This is possible only on
2932 systems which support identification of route authorship.
2933
2934 <tag><label id="krt-kernel-table">kernel table <m/number/</tag>
2935 Select which kernel table should this particular instance of the Kernel
2936 protocol work with. Available only on systems supporting multiple
2937 routing tables.
2938
2939 <tag><label id="krt-metric">metric <m/number/</tag> (Linux)
2940 Use specified value as a kernel metric (priority) for all routes sent to
2941 the kernel. When multiple routes for the same network are in the kernel
2942 routing table, the Linux kernel chooses one with lower metric. Also,
2943 routes with different metrics do not clash with each other, therefore
2944 using dedicated metric value is a reliable way to avoid overwriting
2945 routes from other sources (e.g. kernel device routes). Metric 0 has a
2946 special meaning of undefined metric, in which either OS default is used,
2947 or per-route metric can be set using <cf/krt_metric/ attribute. Default:
2948 32.
2949
2950 <tag><label id="krt-graceful-restart">graceful restart <m/switch/</tag>
2951 Participate in graceful restart recovery. If this option is enabled and
2952 a graceful restart recovery is active, the Kernel protocol will defer
2953 synchronization of routing tables until the end of the recovery. Note
2954 that import of kernel routes to BIRD is not affected.
2955
2956 <tag><label id="krt-merge-paths">merge paths <M>switch</M> [limit <M>number</M>]</tag>
2957 Usually, only best routes are exported to the kernel protocol. With path
2958 merging enabled, both best routes and equivalent non-best routes are
2959 merged during export to generate one ECMP (equal-cost multipath) route
2960 for each network. This is useful e.g. for BGP multipath. Note that best
2961 routes are still pivotal for route export (responsible for most
2962 properties of resulting ECMP routes), while exported non-best routes are
2963 responsible just for additional multipath next hops. This option also
2964 allows to specify a limit on maximal number of nexthops in one route. By
2965 default, multipath merging is disabled. If enabled, default value of the
2966 limit is 16.
2967 </descrip>
2968
2969 <sect1>Attributes
2970 <label id="krt-attr">
2971
2972 <p>The Kernel protocol defines several attributes. These attributes are
2973 translated to appropriate system (and OS-specific) route attributes. We support
2974 these attributes:
2975
2976 <descrip>
2977 <tag><label id="rta-krt-source">int krt_source</tag>
2978 The original source of the imported kernel route. The value is
2979 system-dependent. On Linux, it is a value of the protocol field of the
2980 route. See /etc/iproute2/rt_protos for common values. On BSD, it is
2981 based on STATIC and PROTOx flags. The attribute is read-only.
2982
2983 <tag><label id="rta-krt-metric">int krt_metric</tag> (Linux)
2984 The kernel metric of the route. When multiple same routes are in a
2985 kernel routing table, the Linux kernel chooses one with lower metric.
2986 Note that preferred way to set kernel metric is to use protocol option
2987 <cf/metric/, unless per-route metric values are needed.
2988
2989 <tag><label id="rta-krt-prefsrc">ip krt_prefsrc</tag> (Linux)
2990 The preferred source address. Used in source address selection for
2991 outgoing packets. Has to be one of the IP addresses of the router.
2992
2993 <tag><label id="rta-krt-realm">int krt_realm</tag> (Linux)
2994 The realm of the route. Can be used for traffic classification.
2995
2996 <tag><label id="rta-krt-scope">int krt_scope</tag> (Linux IPv4)
2997 The scope of the route. Valid values are 0-254, although Linux kernel
2998 may reject some values depending on route type and nexthop. It is
2999 supposed to represent `indirectness' of the route, where nexthops of
3000 routes are resolved through routes with a higher scope, but in current
3001 kernels anything below <it/link/ (253) is treated as <it/global/ (0).
3002 When not present, global scope is implied for all routes except device
3003 routes, where link scope is used by default.
3004 </descrip>
3005
3006 <p>In Linux, there is also a plenty of obscure route attributes mostly focused
3007 on tuning TCP performance of local connections. BIRD supports most of these
3008 attributes, see Linux or iproute2 documentation for their meaning. Attributes
3009 <cf/krt_lock_*/ and <cf/krt_feature_*/ have type bool, others have type int.
3010 Supported attributes are:
3011
3012 <cf/krt_mtu/, <cf/krt_lock_mtu/, <cf/krt_window/, <cf/krt_lock_window/,
3013 <cf/krt_rtt/, <cf/krt_lock_rtt/, <cf/krt_rttvar/, <cf/krt_lock_rttvar/,
3014 <cf/krt_sstresh/, <cf/krt_lock_sstresh/, <cf/krt_cwnd/, <cf/krt_lock_cwnd/,
3015 <cf/krt_advmss/, <cf/krt_lock_advmss/, <cf/krt_reordering/, <cf/krt_lock_reordering/,
3016 <cf/krt_hoplimit/, <cf/krt_lock_hoplimit/, <cf/krt_rto_min/, <cf/krt_lock_rto_min/,
3017 <cf/krt_initcwnd/, <cf/krt_initrwnd/, <cf/krt_quickack/,
3018 <cf/krt_feature_ecn/, <cf/krt_feature_allfrag/
3019
3020 <sect1>Example
3021 <label id="krt-exam">
3022
3023 <p>A simple configuration can look this way:
3024
3025 <p><code>
3026 protocol kernel {
3027 export all;
3028 }
3029 </code>
3030
3031 <p>Or for a system with two routing tables:
3032
3033 <p><code>
3034 protocol kernel { # Primary routing table
3035 learn; # Learn alien routes from the kernel
3036 persist; # Don't remove routes on bird shutdown
3037 scan time 10; # Scan kernel routing table every 10 seconds
3038 ipv4 {
3039 import all;
3040 export all;
3041 };
3042 }
3043
3044 protocol kernel { # Secondary routing table
3045 kernel table 100;
3046 ipv4 {
3047 table auxtable;
3048 export all;
3049 };
3050 }
3051 </code>
3052
3053
3054 <sect>OSPF
3055 <label id="ospf">
3056
3057 <sect1>Introduction
3058 <label id="ospf-intro">
3059
3060 <p>Open Shortest Path First (OSPF) is a quite complex interior gateway
3061 protocol. The current IPv4 version (OSPFv2) is defined in <rfc id="2328"> and
3062 the current IPv6 version (OSPFv3) is defined in <rfc id="5340"> It's a link
3063 state (a.k.a. shortest path first) protocol -- each router maintains a database
3064 describing the autonomous system's topology. Each participating router has an
3065 identical copy of the database and all routers run the same algorithm
3066 calculating a shortest path tree with themselves as a root. OSPF chooses the
3067 least cost path as the best path.
3068
3069 <p>In OSPF, the autonomous system can be split to several areas in order to
3070 reduce the amount of resources consumed for exchanging the routing information
3071 and to protect the other areas from incorrect routing data. Topology of the area
3072 is hidden to the rest of the autonomous system.
3073
3074 <p>Another very important feature of OSPF is that it can keep routing information
3075 from other protocols (like Static or BGP) in its link state database as external
3076 routes. Each external route can be tagged by the advertising router, making it
3077 possible to pass additional information between routers on the boundary of the
3078 autonomous system.
3079
3080 <p>OSPF quickly detects topological changes in the autonomous system (such as
3081 router interface failures) and calculates new loop-free routes after a short
3082 period of convergence. Only a minimal amount of routing traffic is involved.
3083
3084 <p>Each router participating in OSPF routing periodically sends Hello messages
3085 to all its interfaces. This allows neighbors to be discovered dynamically. Then
3086 the neighbors exchange theirs parts of the link state database and keep it
3087 identical by flooding updates. The flooding process is reliable and ensures that
3088 each router detects all changes.
3089
3090 <sect1>Configuration
3091 <label id="ospf-config">
3092
3093 <p>First, the desired OSPF version can be specified by using <cf/ospf v2/ or
3094 <cf/ospf v3/ as a protocol type. By default, OSPFv2 is used. In the main part of
3095 configuration, there can be multiple definitions of OSPF areas, each with a
3096 different id. These definitions includes many other switches and multiple
3097 definitions of interfaces. Definition of interface may contain many switches and
3098 constant definitions and list of neighbors on nonbroadcast networks.
3099
3100 <p>OSPFv2 needs one IPv4 channel. OSPFv3 needs either one IPv6 channel, or one
3101 IPv4 channel (<rfc id="5838">). Therefore, it is possible to use OSPFv3 for both
3102 IPv4 and Pv6 routing, but it is necessary to have two protocol instances anyway.
3103 If no channel is configured, appropriate channel is defined with default
3104 parameters.
3105
3106 <code>
3107 protocol ospf [v2|v3] &lt;name&gt; {
3108 rfc1583compat &lt;switch&gt;;
3109 rfc5838 &lt;switch&gt;;
3110 instance id &lt;num&gt;;
3111 stub router &lt;switch&gt;;
3112 tick &lt;num&gt;;
3113 ecmp &lt;switch&gt; [limit &lt;num&gt;];
3114 merge external &lt;switch&gt;;
3115 area &lt;id&gt; {
3116 stub;
3117 nssa;
3118 summary &lt;switch&gt;;
3119 default nssa &lt;switch&gt;;
3120 default cost &lt;num&gt;;
3121 default cost2 &lt;num&gt;;
3122 translator &lt;switch&gt;;
3123 translator stability &lt;num&gt;;
3124
3125 networks {
3126 &lt;prefix&gt;;
3127 &lt;prefix&gt; hidden;
3128 }
3129 external {
3130 &lt;prefix&gt;;
3131 &lt;prefix&gt; hidden;
3132 &lt;prefix&gt; tag &lt;num&gt;;
3133 }
3134 stubnet &lt;prefix&gt;;
3135 stubnet &lt;prefix&gt; {
3136 hidden &lt;switch&gt;;
3137 summary &lt;switch&gt;;
3138 cost &lt;num&gt;;
3139 }
3140 interface &lt;interface pattern&gt; [instance &lt;num&gt;] {
3141 cost &lt;num&gt;;
3142 stub &lt;switch&gt;;
3143 hello &lt;num&gt;;
3144 poll &lt;num&gt;;
3145 retransmit &lt;num&gt;;
3146 priority &lt;num&gt;;
3147 wait &lt;num&gt;;
3148 dead count &lt;num&gt;;
3149 dead &lt;num&gt;;
3150 secondary &lt;switch&gt;;
3151 rx buffer [normal|large|&lt;num&gt;];
3152 tx length &lt;num&gt;;
3153 type [broadcast|bcast|pointopoint|ptp|
3154 nonbroadcast|nbma|pointomultipoint|ptmp];
3155 link lsa suppression &lt;switch&gt;;
3156 strict nonbroadcast &lt;switch&gt;;
3157 real broadcast &lt;switch&gt;;
3158 ptp netmask &lt;switch&gt;;
3159 check link &lt;switch&gt;;
3160 bfd &lt;switch&gt;;
3161 ecmp weight &lt;num&gt;;
3162 ttl security [&lt;switch&gt;; | tx only]
3163 tx class|dscp &lt;num&gt;;
3164 tx priority &lt;num&gt;;
3165 authentication none|simple|cryptographic;
3166 password "&lt;text&gt;";
3167 password "&lt;text&gt;" {
3168 id &lt;num&gt;;
3169 generate from "&lt;date&gt;";
3170 generate to "&lt;date&gt;";
3171 accept from "&lt;date&gt;";
3172 accept to "&lt;date&gt;";
3173 from "&lt;date&gt;";
3174 to "&lt;date&gt;";
3175 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
3176 };
3177 neighbors {
3178 &lt;ip&gt;;
3179 &lt;ip&gt; eligible;
3180 };
3181 };
3182 virtual link &lt;id&gt; [instance &lt;num&gt;] {
3183 hello &lt;num&gt;;
3184 retransmit &lt;num&gt;;
3185 wait &lt;num&gt;;
3186 dead count &lt;num&gt;;
3187 dead &lt;num&gt;;
3188 authentication none|simple|cryptographic;
3189 password "&lt;text&gt;";
3190 password "&lt;text&gt;" {
3191 id &lt;num&gt;;
3192 generate from "&lt;date&gt;";
3193 generate to "&lt;date&gt;";
3194 accept from "&lt;date&gt;";
3195 accept to "&lt;date&gt;";
3196 from "&lt;date&gt;";
3197 to "&lt;date&gt;";
3198 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
3199 };
3200 };
3201 };
3202 }
3203 </code>
3204
3205 <descrip>
3206 <tag><label id="ospf-rfc1583compat">rfc1583compat <M>switch</M></tag>
3207 This option controls compatibility of routing table calculation with
3208 <rfc id="1583">. Default value is no.
3209
3210 <tag><label id="ospf-rfc5838">rfc5838 <m/switch/</tag>
3211 Basic OSPFv3 is limited to IPv6 unicast routing. The <rfc id="5838">
3212 extension defines support for more address families (IPv4, IPv6, both
3213 unicast and multicast). The extension is enabled by default, but can be
3214 disabled if necessary, as it restricts the range of available instance
3215 IDs. Default value is yes.
3216
3217 <tag><label id="ospf-instance-id">instance id <m/num/</tag>
3218 When multiple OSPF protocol instances are active on the same links, they
3219 should use different instance IDs to distinguish their packets. Although
3220 it could be done on per-interface basis, it is often preferred to set
3221 one instance ID to whole OSPF domain/topology (e.g., when multiple
3222 instances are used to represent separate logical topologies on the same
3223 physical network). This option specifies the instance ID for all
3224 interfaces of the OSPF instance, but can be overridden by
3225 <cf/interface/ option. Default value is 0 unless OSPFv3-AF extended
3226 address families are used, see <rfc id="5838"> for that case.
3227
3228 <tag><label id="ospf-stub-router">stub router <M>switch</M></tag>
3229 This option configures the router to be a stub router, i.e., a router
3230 that participates in the OSPF topology but does not allow transit
3231 traffic. In OSPFv2, this is implemented by advertising maximum metric
3232 for outgoing links. In OSPFv3, the stub router behavior is announced by
3233 clearing the R-bit in the router LSA. See <rfc id="6987"> for details.
3234 Default value is no.
3235
3236 <tag><label id="ospf-tick">tick <M>num</M></tag>
3237 The routing table calculation and clean-up of areas' databases is not
3238 performed when a single link state change arrives. To lower the CPU
3239 utilization, it's processed later at periodical intervals of <m/num/
3240 seconds. The default value is 1.
3241
3242 <tag><label id="ospf-ecmp">ecmp <M>switch</M> [limit <M>number</M>]</tag>
3243 This option specifies whether OSPF is allowed to generate ECMP
3244 (equal-cost multipath) routes. Such routes are used when there are
3245 several directions to the destination, each with the same (computed)
3246 cost. This option also allows to specify a limit on maximum number of
3247 nexthops in one route. By default, ECMP is enabled if supported by
3248 Kernel. Default value of the limit is 16.
3249
3250 <tag><label id="ospf-merge-external">merge external <M>switch</M></tag>
3251 This option specifies whether OSPF should merge external routes from
3252 different routers/LSAs for the same destination. When enabled together
3253 with <cf/ecmp/, equal-cost external routes will be combined to multipath
3254 routes in the same way as regular routes. When disabled, external routes
3255 from different LSAs are treated as separate even if they represents the
3256 same destination. Default value is no.
3257
3258 <tag><label id="ospf-area">area <M>id</M></tag>
3259 This defines an OSPF area with given area ID (an integer or an IPv4
3260 address, similarly to a router ID). The most important area is the
3261 backbone (ID 0) to which every other area must be connected.
3262
3263 <tag><label id="ospf-stub">stub</tag>
3264 This option configures the area to be a stub area. External routes are
3265 not flooded into stub areas. Also summary LSAs can be limited in stub
3266 areas (see option <cf/summary/). By default, the area is not a stub
3267 area.
3268
3269 <tag><label id="ospf-nssa">nssa</tag>
3270 This option configures the area to be a NSSA (Not-So-Stubby Area). NSSA
3271 is a variant of a stub area which allows a limited way of external route
3272 propagation. Global external routes are not propagated into a NSSA, but
3273 an external route can be imported into NSSA as a (area-wide) NSSA-LSA
3274 (and possibly translated and/or aggregated on area boundary). By
3275 default, the area is not NSSA.
3276
3277 <tag><label id="ospf-summary">summary <M>switch</M></tag>
3278 This option controls propagation of summary LSAs into stub or NSSA
3279 areas. If enabled, summary LSAs are propagated as usual, otherwise just
3280 the default summary route (0.0.0.0/0) is propagated (this is sometimes
3281 called totally stubby area). If a stub area has more area boundary
3282 routers, propagating summary LSAs could lead to more efficient routing
3283 at the cost of larger link state database. Default value is no.
3284
3285 <tag><label id="ospf-default-nssa">default nssa <M>switch</M></tag>
3286 When <cf/summary/ option is enabled, default summary route is no longer
3287 propagated to the NSSA. In that case, this option allows to originate
3288 default route as NSSA-LSA to the NSSA. Default value is no.
3289
3290 <tag><label id="ospf-default-cost">default cost <M>num</M></tag>
3291 This option controls the cost of a default route propagated to stub and
3292 NSSA areas. Default value is 1000.
3293
3294 <tag><label id="ospf-default-cost2">default cost2 <M>num</M></tag>
3295 When a default route is originated as NSSA-LSA, its cost can use either
3296 type 1 or type 2 metric. This option allows to specify the cost of a
3297 default route in type 2 metric. By default, type 1 metric (option
3298 <cf/default cost/) is used.
3299
3300 <tag><label id="ospf-translator">translator <M>switch</M></tag>
3301 This option controls translation of NSSA-LSAs into external LSAs. By
3302 default, one translator per NSSA is automatically elected from area
3303 boundary routers. If enabled, this area boundary router would
3304 unconditionally translate all NSSA-LSAs regardless of translator
3305 election. Default value is no.
3306
3307 <tag><label id="ospf-translator-stability">translator stability <M>num</M></tag>
3308 This option controls the translator stability interval (in seconds).
3309 When the new translator is elected, the old one keeps translating until
3310 the interval is over. Default value is 40.
3311
3312 <tag><label id="ospf-networks">networks { <m/set/ }</tag>
3313 Definition of area IP ranges. This is used in summary LSA origination.
3314 Hidden networks are not propagated into other areas.
3315
3316 <tag><label id="ospf-external">external { <m/set/ }</tag>
3317 Definition of external area IP ranges for NSSAs. This is used for
3318 NSSA-LSA translation. Hidden networks are not translated into external
3319 LSAs. Networks can have configured route tag.
3320
3321 <tag><label id="ospf-stubnet">stubnet <m/prefix/ { <m/options/ }</tag>
3322 Stub networks are networks that are not transit networks between OSPF
3323 routers. They are also propagated through an OSPF area as a part of a
3324 link state database. By default, BIRD generates a stub network record
3325 for each primary network address on each OSPF interface that does not
3326 have any OSPF neighbors, and also for each non-primary network address
3327 on each OSPF interface. This option allows to alter a set of stub
3328 networks propagated by this router.
3329
3330 Each instance of this option adds a stub network with given network
3331 prefix to the set of propagated stub network, unless option <cf/hidden/
3332 is used. It also suppresses default stub networks for given network
3333 prefix. When option <cf/summary/ is used, also default stub networks
3334 that are subnetworks of given stub network are suppressed. This might be
3335 used, for example, to aggregate generated stub networks.
3336
3337 <tag><label id="ospf-iface">interface <M>pattern</M> [instance <m/num/]</tag>
3338 Defines that the specified interfaces belong to the area being defined.
3339 See <ref id="proto-iface" name="interface"> common option for detailed
3340 description. In OSPFv2, extended interface clauses are used, because
3341 each network prefix is handled as a separate virtual interface.
3342
3343 You can specify alternative instance ID for the interface definition,
3344 therefore it is possible to have several instances of that interface
3345 with different options or even in different areas. For OSPFv2, instance
3346 ID support is an extension (<rfc id="6549">) and is supposed to be set
3347 per-protocol. For OSPFv3, it is an integral feature.
3348
3349 <tag><label id="ospf-virtual-link">virtual link <M>id</M> [instance <m/num/]</tag>
3350 Virtual link to router with the router id. Virtual link acts as a
3351 point-to-point interface belonging to backbone. The actual area is used
3352 as a transport area. This item cannot be in the backbone. Like with
3353 <cf/interface/ option, you could also use several virtual links to one
3354 destination with different instance IDs.
3355
3356 <tag><label id="ospf-cost">cost <M>num</M></tag>
3357 Specifies output cost (metric) of an interface. Default value is 10.
3358
3359 <tag><label id="ospf-stub-iface">stub <M>switch</M></tag>
3360 If set to interface it does not listen to any packet and does not send
3361 any hello. Default value is no.
3362
3363 <tag><label id="ospf-hello">hello <M>num</M></tag>
3364 Specifies interval in seconds between sending of Hello messages. Beware,
3365 all routers on the same network need to have the same hello interval.
3366 Default value is 10.
3367
3368 <tag><label id="ospf-poll">poll <M>num</M></tag>
3369 Specifies interval in seconds between sending of Hello messages for some
3370 neighbors on NBMA network. Default value is 20.
3371
3372 <tag><label id="ospf-retransmit">retransmit <M>num</M></tag>
3373 Specifies interval in seconds between retransmissions of unacknowledged
3374 updates. Default value is 5.
3375
3376 <tag><label id="ospf-priority">priority <M>num</M></tag>
3377 On every multiple access network (e.g., the Ethernet) Designated Router
3378 and Backup Designated router are elected. These routers have some special
3379 functions in the flooding process. Higher priority increases preferences
3380 in this election. Routers with priority 0 are not eligible. Default
3381 value is 1.
3382
3383 <tag><label id="ospf-wait">wait <M>num</M></tag>
3384 After start, router waits for the specified number of seconds between
3385 starting election and building adjacency. Default value is 4*<m/hello/.
3386
3387 <tag><label id="ospf-dead-count">dead count <M>num</M></tag>
3388 When the router does not receive any messages from a neighbor in
3389 <m/dead count/*<m/hello/ seconds, it will consider the neighbor down.
3390
3391 <tag><label id="ospf-dead">dead <M>num</M></tag>
3392 When the router does not receive any messages from a neighbor in
3393 <m/dead/ seconds, it will consider the neighbor down. If both directives
3394 <cf/dead count/ and <cf/dead/ are used, <cf/dead/ has precedence.
3395
3396 <tag><label id="ospf-secondary">secondary <M>switch</M></tag>
3397 On BSD systems, older versions of BIRD supported OSPFv2 only for the
3398 primary IP address of an interface, other IP ranges on the interface
3399 were handled as stub networks. Since v1.4.1, regular operation on
3400 secondary IP addresses is supported, but disabled by default for
3401 compatibility. This option allows to enable it. The option is a
3402 transitional measure, will be removed in the next major release as the
3403 behavior will be changed. On Linux systems, the option is irrelevant, as
3404 operation on non-primary addresses is already the regular behavior.
3405
3406 <tag><label id="ospf-rx-buffer">rx buffer <M>num</M></tag>
3407 This option allows to specify the size of buffers used for packet
3408 processing. The buffer size should be bigger than maximal size of any
3409 packets. By default, buffers are dynamically resized as needed, but a
3410 fixed value could be specified. Value <cf/large/ means maximal allowed
3411 packet size - 65535.
3412
3413 <tag><label id="ospf-tx-length">tx length <M>num</M></tag>
3414 Transmitted OSPF messages that contain large amount of information are
3415 segmented to separate OSPF packets to avoid IP fragmentation. This
3416 option specifies the soft ceiling for the length of generated OSPF
3417 packets. Default value is the MTU of the network interface. Note that
3418 larger OSPF packets may still be generated if underlying OSPF messages
3419 cannot be splitted (e.g. when one large LSA is propagated).
3420
3421 <tag><label id="ospf-type-bcast">type broadcast|bcast</tag>
3422 BIRD detects a type of a connected network automatically, but sometimes
3423 it's convenient to force use of a different type manually. On broadcast
3424 networks (like ethernet), flooding and Hello messages are sent using
3425 multicasts (a single packet for all the neighbors). A designated router
3426 is elected and it is responsible for synchronizing the link-state
3427 databases and originating network LSAs. This network type cannot be used
3428 on physically NBMA networks and on unnumbered networks (networks without
3429 proper IP prefix).
3430
3431 <tag><label id="ospf-type-ptp">type pointopoint|ptp</tag>
3432 Point-to-point networks connect just 2 routers together. No election is
3433 performed and no network LSA is originated, which makes it simpler and
3434 faster to establish. This network type is useful not only for physically
3435 PtP ifaces (like PPP or tunnels), but also for broadcast networks used
3436 as PtP links. This network type cannot be used on physically NBMA
3437 networks.
3438
3439 <tag><label id="ospf-type-nbma">type nonbroadcast|nbma</tag>
3440 On NBMA networks, the packets are sent to each neighbor separately
3441 because of lack of multicast capabilities. Like on broadcast networks,
3442 a designated router is elected, which plays a central role in propagation
3443 of LSAs. This network type cannot be used on unnumbered networks.
3444
3445 <tag><label id="ospf-type-ptmp">type pointomultipoint|ptmp</tag>
3446 This is another network type designed to handle NBMA networks. In this
3447 case the NBMA network is treated as a collection of PtP links. This is
3448 useful if not every pair of routers on the NBMA network has direct
3449 communication, or if the NBMA network is used as an (possibly
3450 unnumbered) PtP link.
3451
3452 <tag><label id="ospf-link-lsa-suppression">link lsa suppression <m/switch/</tag>
3453 In OSPFv3, link LSAs are generated for each link, announcing link-local
3454 IPv6 address of the router to its local neighbors. These are useless on
3455 PtP or PtMP networks and this option allows to suppress the link LSA
3456 origination for such interfaces. The option is ignored on other than PtP
3457 or PtMP interfaces. Default value is no.
3458
3459 <tag><label id="ospf-strict-nonbroadcast">strict nonbroadcast <m/switch/</tag>
3460 If set, don't send hello to any undefined neighbor. This switch is
3461 ignored on other than NBMA or PtMP interfaces. Default value is no.
3462
3463 <tag><label id="ospf-real-broadcast">real broadcast <m/switch/</tag>
3464 In <cf/type broadcast/ or <cf/type ptp/ network configuration, OSPF
3465 packets are sent as IP multicast packets. This option changes the
3466 behavior to using old-fashioned IP broadcast packets. This may be useful
3467 as a workaround if IP multicast for some reason does not work or does
3468 not work reliably. This is a non-standard option and probably is not
3469 interoperable with other OSPF implementations. Default value is no.
3470
3471 <tag><label id="ospf-ptp-netmask">ptp netmask <m/switch/</tag>
3472 In <cf/type ptp/ network configurations, OSPFv2 implementations should
3473 ignore received netmask field in hello packets and should send hello
3474 packets with zero netmask field on unnumbered PtP links. But some OSPFv2
3475 implementations perform netmask checking even for PtP links. This option
3476 specifies whether real netmask will be used in hello packets on <cf/type
3477 ptp/ interfaces. You should ignore this option unless you meet some
3478 compatibility problems related to this issue. Default value is no for
3479 unnumbered PtP links, yes otherwise.
3480
3481 <tag><label id="ospf-check-link">check link <M>switch</M></tag>
3482 If set, a hardware link state (reported by OS) is taken into consideration.
3483 When a link disappears (e.g. an ethernet cable is unplugged), neighbors
3484 are immediately considered unreachable and only the address of the iface
3485 (instead of whole network prefix) is propagated. It is possible that
3486 some hardware drivers or platforms do not implement this feature.
3487 Default value is yes.
3488
3489 <tag><label id="ospf-bfd">bfd <M>switch</M></tag>
3490 OSPF could use BFD protocol as an advisory mechanism for neighbor
3491 liveness and failure detection. If enabled, BIRD setups a BFD session
3492 for each OSPF neighbor and tracks its liveness by it. This has an
3493 advantage of an order of magnitude lower detection times in case of
3494 failure. Note that BFD protocol also has to be configured, see
3495 <ref id="bfd" name="BFD"> section for details. Default value is no.
3496
3497 <tag><label id="ospf-ttl-security">ttl security [<m/switch/ | tx only]</tag>
3498 TTL security is a feature that protects routing protocols from remote
3499 spoofed packets by using TTL 255 instead of TTL 1 for protocol packets
3500 destined to neighbors. Because TTL is decremented when packets are
3501 forwarded, it is non-trivial to spoof packets with TTL 255 from remote
3502 locations. Note that this option would interfere with OSPF virtual
3503 links.
3504
3505 If this option is enabled, the router will send OSPF packets with TTL
3506 255 and drop received packets with TTL less than 255. If this option si
3507 set to <cf/tx only/, TTL 255 is used for sent packets, but is not
3508 checked for received packets. Default value is no.
3509
3510 <tag><label id="ospf-tx-class">tx class|dscp|priority <m/num/</tag>
3511 These options specify the ToS/DiffServ/Traffic class/Priority of the
3512 outgoing OSPF packets. See <ref id="proto-tx-class" name="tx class"> common
3513 option for detailed description.
3514
3515 <tag><label id="ospf-ecmp-weight">ecmp weight <M>num</M></tag>
3516 When ECMP (multipath) routes are allowed, this value specifies a
3517 relative weight used for nexthops going through the iface. Allowed
3518 values are 1-256. Default value is 1.
3519
3520 <tag><label id="ospf-auth-none">authentication none</tag>
3521 No passwords are sent in OSPF packets. This is the default value.
3522
3523 <tag><label id="ospf-auth-simple">authentication simple</tag>
3524 Every packet carries 8 bytes of password. Received packets lacking this
3525 password are ignored. This authentication mechanism is very weak.
3526 This option is not available in OSPFv3.
3527
3528 <tag><label id="ospf-auth-cryptographic">authentication cryptographic</tag>
3529 An authentication code is appended to every packet. The specific
3530 cryptographic algorithm is selected by option <cf/algorithm/ for each
3531 key. The default cryptographic algorithm for OSPFv2 keys is Keyed-MD5
3532 and for OSPFv3 keys is HMAC-SHA-256. Passwords are not sent open via
3533 network, so this mechanism is quite secure. Packets can still be read by
3534 an attacker.
3535
3536 <tag><label id="ospf-pass">password "<M>text</M>"</tag>
3537 Specifies a password used for authentication. See
3538 <ref id="proto-pass" name="password"> common option for detailed
3539 description.
3540
3541 <tag><label id="ospf-neighbors">neighbors { <m/set/ } </tag>
3542 A set of neighbors to which Hello messages on NBMA or PtMP networks are
3543 to be sent. For NBMA networks, some of them could be marked as eligible.
3544 In OSPFv3, link-local addresses should be used, using global ones is
3545 possible, but it is nonstandard and might be problematic. And definitely,
3546 link-local and global addresses should not be mixed.
3547 </descrip>
3548
3549 <sect1>Attributes
3550 <label id="ospf-attr">
3551
3552 <p>OSPF defines four route attributes. Each internal route has a <cf/metric/.
3553
3554 <p>Metric is ranging from 1 to infinity (65535). External routes use
3555 <cf/metric type 1/ or <cf/metric type 2/. A <cf/metric of type 1/ is comparable
3556 with internal <cf/metric/, a <cf/metric of type 2/ is always longer than any
3557 <cf/metric of type 1/ or any <cf/internal metric/. <cf/Internal metric/ or
3558 <cf/metric of type 1/ is stored in attribute <cf/ospf_metric1/, <cf/metric type
3559 2/ is stored in attribute <cf/ospf_metric2/.
3560
3561 When both metrics are specified then <cf/metric of type 2/ is used. This is
3562 relevant e.g. when a type 2 external route is propagated from one OSPF domain to
3563 another and <cf/ospf_metric1/ is an internal distance to the original ASBR,
3564 while <cf/ospf_metric2/ stores the type 2 metric. Note that in such cases if
3565 <cf/ospf_metric1/ is non-zero then <cf/ospf_metric2/ is increased by one to
3566 ensure monotonicity of metric, as internal distance is reset to zero when an
3567 external route is announced.
3568
3569 <p>Each external route can also carry attribute <cf/ospf_tag/ which is a 32-bit
3570 integer which is used when exporting routes to other protocols; otherwise, it
3571 doesn't affect routing inside the OSPF domain at all. The fourth attribute
3572 <cf/ospf_router_id/ is a router ID of the router advertising that route /
3573 network. This attribute is read-only. Default is <cf/ospf_metric2 = 10000/ and
3574 <cf/ospf_tag = 0/.
3575
3576 <sect1>Example
3577 <label id="ospf-exam">
3578
3579 <p><code>
3580 protocol ospf MyOSPF {
3581 ipv4 {
3582 export filter {
3583 if source = RTS_BGP then {
3584 ospf_metric1 = 100;
3585 accept;
3586 }
3587 reject;
3588 };
3589 };
3590 area 0.0.0.0 {
3591 interface "eth*" {
3592 cost 11;
3593 hello 15;
3594 priority 100;
3595 retransmit 7;
3596 authentication simple;
3597 password "aaa";
3598 };
3599 interface "ppp*" {
3600 cost 100;
3601 authentication cryptographic;
3602 password "abc" {
3603 id 1;
3604 generate to "22-04-2003 11:00:06";
3605 accept from "17-01-2001 12:01:05";
3606 algorithm hmac sha384;
3607 };
3608 password "def" {
3609 id 2;
3610 generate to "22-07-2005 17:03:21";
3611 accept from "22-02-2001 11:34:06";
3612 algorithm hmac sha512;
3613 };
3614 };
3615 interface "arc0" {
3616 cost 10;
3617 stub yes;
3618 };
3619 interface "arc1";
3620 };
3621 area 120 {
3622 stub yes;
3623 networks {
3624 172.16.1.0/24;
3625 172.16.2.0/24 hidden;
3626 }
3627 interface "-arc0" , "arc*" {
3628 type nonbroadcast;
3629 authentication none;
3630 strict nonbroadcast yes;
3631 wait 120;
3632 poll 40;
3633 dead count 8;
3634 neighbors {
3635 192.168.120.1 eligible;
3636 192.168.120.2;
3637 192.168.120.10;
3638 };
3639 };
3640 };
3641 }
3642 </code>
3643
3644
3645 <sect>Pipe
3646 <label id="pipe">
3647
3648 <sect1>Introduction
3649 <label id="pipe-intro">
3650
3651 <p>The Pipe protocol serves as a link between two routing tables, allowing
3652 routes to be passed from a table declared as primary (i.e., the one the pipe is
3653 connected to using the <cf/table/ configuration keyword) to the secondary one
3654 (declared using <cf/peer table/) and vice versa, depending on what's allowed by
3655 the filters. Export filters control export of routes from the primary table to
3656 the secondary one, import filters control the opposite direction. Both tables
3657 must be of the same nettype.
3658
3659 <p>The Pipe protocol may work in the transparent mode mode or in the opaque
3660 mode. In the transparent mode, the Pipe protocol retransmits all routes from
3661 one table to the other table, retaining their original source and attributes.
3662 If import and export filters are set to accept, then both tables would have
3663 the same content. The transparent mode is the default mode.
3664
3665 <p>In the opaque mode, the Pipe protocol retransmits optimal route from one
3666 table to the other table in a similar way like other protocols send and receive
3667 routes. Retransmitted route will have the source set to the Pipe protocol, which
3668 may limit access to protocol specific route attributes. This mode is mainly for
3669 compatibility, it is not suggested for new configs. The mode can be changed by
3670 <tt/mode/ option.
3671
3672 <p>The primary use of multiple routing tables and the Pipe protocol is for
3673 policy routing, where handling of a single packet doesn't depend only on its
3674 destination address, but also on its source address, source interface, protocol
3675 type and other similar parameters. In many systems (Linux being a good example),
3676 the kernel allows to enforce routing policies by defining routing rules which
3677 choose one of several routing tables to be used for a packet according to its
3678 parameters. Setting of these rules is outside the scope of BIRD's work (on
3679 Linux, you can use the <tt/ip/ command), but you can create several routing
3680 tables in BIRD, connect them to the kernel ones, use filters to control which
3681 routes appear in which tables and also you can employ the Pipe protocol for
3682 exporting a selected subset of one table to another one.
3683
3684 <sect1>Configuration
3685 <label id="pipe-config">
3686
3687 <p>Essentially, the Pipe protocol is just a channel connected to a table on both
3688 sides. Therefore, the configuration block for <cf/protocol pipe/ shall directly
3689 include standard channel config options; see the example below.
3690
3691 <p><descrip>
3692 <tag><label id="pipe-peer-table">peer table <m/table/</tag>
3693 Defines secondary routing table to connect to. The primary one is
3694 selected by the <cf/table/ keyword.
3695 </descrip>
3696
3697 <sect1>Attributes
3698 <label id="pipe-attr">
3699
3700 <p>The Pipe protocol doesn't define any route attributes.
3701
3702 <sect1>Example
3703 <label id="pipe-exam">
3704
3705 <p>Let's consider a router which serves as a boundary router of two different
3706 autonomous systems, each of them connected to a subset of interfaces of the
3707 router, having its own exterior connectivity and wishing to use the other AS as
3708 a backup connectivity in case of outage of its own exterior line.
3709
3710 <p>Probably the simplest solution to this situation is to use two routing tables
3711 (we'll call them <cf/as1/ and <cf/as2/) and set up kernel routing rules, so that
3712 packets having arrived from interfaces belonging to the first AS will be routed
3713 according to <cf/as1/ and similarly for the second AS. Thus we have split our
3714 router to two logical routers, each one acting on its own routing table, having
3715 its own routing protocols on its own interfaces. In order to use the other AS's
3716 routes for backup purposes, we can pass the routes between the tables through a
3717 Pipe protocol while decreasing their preferences and correcting their BGP paths
3718 to reflect the AS boundary crossing.
3719
3720 <code>
3721 ipv4 table as1; # Define the tables
3722 ipv4 table as2;
3723
3724 protocol kernel kern1 { # Synchronize them with the kernel
3725 ipv4 { table as1; export all; };
3726 kernel table 1;
3727 }
3728
3729 protocol kernel kern2 {
3730 ipv4 { table as2; export all; };
3731 kernel table 2;
3732 }
3733
3734 protocol bgp bgp1 { # The outside connections
3735 ipv4 { table as1; import all; export all; };
3736 local as 1;
3737 neighbor 192.168.0.1 as 1001;
3738 }
3739
3740 protocol bgp bgp2 {
3741 ipv4 { table as2; import all; export all; };
3742 local as 2;
3743 neighbor 10.0.0.1 as 1002;
3744 }
3745
3746 protocol pipe { # The Pipe
3747 table as1;
3748 peer table as2;
3749 export filter {
3750 if net ~ [ 1.0.0.0/8+] then { # Only AS1 networks
3751 if preference>10 then preference = preference-10;
3752 if source=RTS_BGP then bgp_path.prepend(1);
3753 accept;
3754 }
3755 reject;
3756 };
3757 import filter {
3758 if net ~ [ 2.0.0.0/8+] then { # Only AS2 networks
3759 if preference>10 then preference = preference-10;
3760 if source=RTS_BGP then bgp_path.prepend(2);
3761 accept;
3762 }
3763 reject;
3764 };
3765 }
3766 </code>
3767
3768
3769 <sect>RAdv
3770 <label id="radv">
3771
3772 <sect1>Introduction
3773 <label id="radv-intro">
3774
3775 <p>The RAdv protocol is an implementation of Router Advertisements, which are
3776 used in the IPv6 stateless autoconfiguration. IPv6 routers send (in irregular
3777 time intervals or as an answer to a request) advertisement packets to connected
3778 networks. These packets contain basic information about a local network (e.g. a
3779 list of network prefixes), which allows network hosts to autoconfigure network
3780 addresses and choose a default route. BIRD implements router behavior as defined
3781 in <rfc id="4861">, router preferences and specific routes (<rfc id="4191">),
3782 and DNS extensions (<rfc id="6106">).
3783
3784 <p>The RAdv protocols supports just IPv6 channel.
3785
3786 <sect1>Configuration
3787 <label id="radv-config">
3788
3789 <p>There are several classes of definitions in RAdv configuration -- interface
3790 definitions, prefix definitions and DNS definitions:
3791
3792 <descrip>
3793 <tag><label id="radv-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
3794 Interface definitions specify a set of interfaces on which the
3795 protocol is activated and contain interface specific options.
3796 See <ref id="proto-iface" name="interface"> common options for
3797 detailed description.
3798
3799 <tag><label id="radv-prefix">prefix <m/prefix/ { <m/options/ }</tag>
3800 Prefix definitions allow to modify a list of advertised prefixes. By
3801 default, the advertised prefixes are the same as the network prefixes
3802 assigned to the interface. For each network prefix, the matching prefix
3803 definition is found and its options are used. If no matching prefix
3804 definition is found, the prefix is used with default options.
3805
3806 Prefix definitions can be either global or interface-specific. The
3807 second ones are part of interface options. The prefix definition
3808 matching is done in the first-match style, when interface-specific
3809 definitions are processed before global definitions. As expected, the
3810 prefix definition is matching if the network prefix is a subnet of the
3811 prefix in prefix definition.
3812
3813 <tag><label id="radv-rdnss">rdnss { <m/options/ }</tag>
3814 RDNSS definitions allow to specify a list of advertised recursive DNS
3815 servers together with their options. As options are seldom necessary,
3816 there is also a short variant <cf>rdnss <m/address/</cf> that just
3817 specifies one DNS server. Multiple definitions are cumulative. RDNSS
3818 definitions may also be interface-specific when used inside interface
3819 options. By default, interface uses both global and interface-specific
3820 options, but that can be changed by <cf/rdnss local/ option.
3821
3822 <tag><label id="radv-dnssl">dnssl { <m/options/ }</tag>
3823 DNSSL definitions allow to specify a list of advertised DNS search
3824 domains together with their options. Like <cf/rdnss/ above, multiple
3825 definitions are cumulative, they can be used also as interface-specific
3826 options and there is a short variant <cf>dnssl <m/domain/</cf> that just
3827 specifies one DNS search domain.
3828
3829 <tag><label id="radv-trigger">trigger <m/prefix/</tag>
3830 RAdv protocol could be configured to change its behavior based on
3831 availability of routes. When this option is used, the protocol waits in
3832 suppressed state until a <it/trigger route/ (for the specified network)
3833 is exported to the protocol, the protocol also returnsd to suppressed
3834 state if the <it/trigger route/ disappears. Note that route export
3835 depends on specified export filter, as usual. This option could be used,
3836 e.g., for handling failover in multihoming scenarios.
3837
3838 During suppressed state, router advertisements are generated, but with
3839 some fields zeroed. Exact behavior depends on which fields are zeroed,
3840 this can be configured by <cf/sensitive/ option for appropriate
3841 fields. By default, just <cf/default lifetime/ (also called <cf/router
3842 lifetime/) is zeroed, which means hosts cannot use the router as a
3843 default router. <cf/preferred lifetime/ and <cf/valid lifetime/ could
3844 also be configured as <cf/sensitive/ for a prefix, which would cause
3845 autoconfigured IPs to be deprecated or even removed.
3846
3847 <tag><label id="radv-propagate-routes">propagate routes <m/switch/</tag>
3848 This option controls propagation of more specific routes, as defined in
3849 <rfc id="4191">. If enabled, all routes exported to the RAdv protocol,
3850 with the exception of the trigger prefix, are added to advertisments as
3851 additional options. The lifetime and preference of advertised routes can
3852 be set individually by <cf/ra_lifetime/ and <cf/ra_preference/ route
3853 attributes, or per interface by <cf/route lifetime/ and
3854 <cf/route preference/ options. Default: disabled.
3855
3856 Note that the RFC discourages from sending more than 17 routes and
3857 recommends the routes to be configured manually.
3858 </descrip>
3859
3860 <p>Interface specific options:
3861
3862 <descrip>
3863 <tag><label id="radv-iface-max-ra-interval">max ra interval <m/expr/</tag>
3864 Unsolicited router advertisements are sent in irregular time intervals.
3865 This option specifies the maximum length of these intervals, in seconds.
3866 Valid values are 4-1800. Default: 600
3867
3868 <tag><label id="radv-iface-min-ra-interval">min ra interval <m/expr/</tag>
3869 This option specifies the minimum length of that intervals, in seconds.
3870 Must be at least 3 and at most 3/4 * <cf/max ra interval/. Default:
3871 about 1/3 * <cf/max ra interval/.
3872
3873 <tag><label id="radv-iface-min-delay">min delay <m/expr/</tag>
3874 The minimum delay between two consecutive router advertisements, in
3875 seconds. Default: 3
3876
3877 <tag><label id="radv-iface-managed">managed <m/switch/</tag>
3878 This option specifies whether hosts should use DHCPv6 for IP address
3879 configuration. Default: no
3880
3881 <tag><label id="radv-iface-other-config">other config <m/switch/</tag>
3882 This option specifies whether hosts should use DHCPv6 to receive other
3883 configuration information. Default: no
3884
3885 <tag><label id="radv-iface-link-mtu">link mtu <m/expr/</tag>
3886 This option specifies which value of MTU should be used by hosts. 0
3887 means unspecified. Default: 0
3888
3889 <tag><label id="radv-iface-reachable-time">reachable time <m/expr/</tag>
3890 This option specifies the time (in milliseconds) how long hosts should
3891 assume a neighbor is reachable (from the last confirmation). Maximum is
3892 3600000, 0 means unspecified. Default 0.
3893
3894 <tag><label id="radv-iface-retrans-timer">retrans timer <m/expr/</tag>
3895 This option specifies the time (in milliseconds) how long hosts should
3896 wait before retransmitting Neighbor Solicitation messages. 0 means
3897 unspecified. Default 0.
3898
3899 <tag><label id="radv-iface-current-hop-limit">current hop limit <m/expr/</tag>
3900 This option specifies which value of Hop Limit should be used by
3901 hosts. Valid values are 0-255, 0 means unspecified. Default: 64
3902
3903 <tag><label id="radv-iface-default-lifetime">default lifetime <m/expr/ [sensitive <m/switch/]</tag>
3904 This option specifies the time (in seconds) how long (since the receipt
3905 of RA) hosts may use the router as a default router. 0 means do not use
3906 as a default router. For <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
3907 Default: 3 * <cf/max ra interval/, <cf/sensitive/ yes.
3908
3909 <tag><label id="radv-iface-default-preference">default preference low|medium|high</tag>
3910 This option specifies the Default Router Preference value to advertise
3911 to hosts. Default: medium.
3912
3913 <tag><label id="radv-iface-route-lifetime">route lifetime <m/expr/ [sensitive <m/switch/]</tag>
3914 This option specifies the default value of advertised lifetime for
3915 specific routes; i.e., the time (in seconds) for how long (since the
3916 receipt of RA) hosts should consider these routes valid. A special value
3917 0xffffffff represents infinity. The lifetime can be overriden on a per
3918 route basis by the <ref id="rta-ra-lifetime" name="ra_lifetime"> route
3919 attribute. Default: 3 * <cf/max ra interval/, <cf/sensitive/ no.
3920
3921 For the <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
3922 If <cf/sensitive/ is enabled, even the routes with the <cf/ra_lifetime/
3923 attribute become sensitive to the trigger.
3924
3925 <tag><label id="radv-iface-route-preference">route preference low|medium|high</tag>
3926 This option specifies the default value of advertised route preference
3927 for specific routes. The value can be overriden on a per route basis by
3928 the <ref id="rta-ra-preference" name="ra_preference"> route attribute.
3929 Default: medium.
3930
3931 <tag><label id="radv-prefix-linger-time">prefix linger time <m/expr/</tag>
3932 When a prefix or a route disappears, it is advertised for some time with
3933 zero lifetime, to inform clients it is no longer valid. This option
3934 specifies the time (in seconds) for how long prefixes are advertised
3935 that way. Default: 3 * <cf/max ra interval/.
3936
3937 <tag><label id="radv-route-linger-time">route linger time <m/expr/</tag>
3938 When a prefix or a route disappears, it is advertised for some time with
3939 zero lifetime, to inform clients it is no longer valid. This option
3940 specifies the time (in seconds) for how long routes are advertised
3941 that way. Default: 3 * <cf/max ra interval/.
3942
3943 <tag><label id="radv-iface-rdnss-local">rdnss local <m/switch/</tag>
3944 Use only local (interface-specific) RDNSS definitions for this
3945 interface. Otherwise, both global and local definitions are used. Could
3946 also be used to disable RDNSS for given interface if no local definitons
3947 are specified. Default: no.
3948
3949 <tag><label id="radv-iface-dnssl-local">dnssl local <m/switch/</tag>
3950 Use only local DNSSL definitions for this interface. See <cf/rdnss local/
3951 option above. Default: no.
3952 </descrip>
3953
3954 <p>Prefix specific options
3955
3956 <descrip>
3957 <tag><label id="radv-prefix-skip">skip <m/switch/</tag>
3958 This option allows to specify that given prefix should not be
3959 advertised. This is useful for making exceptions from a default policy
3960 of advertising all prefixes. Note that for withdrawing an already
3961 advertised prefix it is more useful to advertise it with zero valid
3962 lifetime. Default: no
3963
3964 <tag><label id="radv-prefix-onlink">onlink <m/switch/</tag>
3965 This option specifies whether hosts may use the advertised prefix for
3966 onlink determination. Default: yes
3967
3968 <tag><label id="radv-prefix-autonomous">autonomous <m/switch/</tag>
3969 This option specifies whether hosts may use the advertised prefix for
3970 stateless autoconfiguration. Default: yes
3971
3972 <tag><label id="radv-prefix-valid-lifetime">valid lifetime <m/expr/ [sensitive <m/switch/]</tag>
3973 This option specifies the time (in seconds) how long (after the
3974 receipt of RA) the prefix information is valid, i.e., autoconfigured
3975 IP addresses can be assigned and hosts with that IP addresses are
3976 considered directly reachable. 0 means the prefix is no longer
3977 valid. For <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
3978 Default: 86400 (1 day), <cf/sensitive/ no.
3979
3980 <tag><label id="radv-prefix-preferred-lifetime">preferred lifetime <m/expr/ [sensitive <m/switch/]</tag>
3981 This option specifies the time (in seconds) how long (after the
3982 receipt of RA) IP addresses generated from the prefix using stateless
3983 autoconfiguration remain preferred. For <cf/sensitive/ option,
3984 see <ref id="radv-trigger" name="trigger">. Default: 14400 (4 hours),
3985 <cf/sensitive/ no.
3986 </descrip>
3987
3988 <p>RDNSS specific options:
3989
3990 <descrip>
3991 <tag><label id="radv-rdnss-ns">ns <m/address/</tag>
3992 This option specifies one recursive DNS server. Can be used multiple
3993 times for multiple servers. It is mandatory to have at least one
3994 <cf/ns/ option in <cf/rdnss/ definition.
3995
3996 <tag><label id="radv-rdnss-lifetime">lifetime [mult] <m/expr/</tag>
3997 This option specifies the time how long the RDNSS information may be
3998 used by clients after the receipt of RA. It is expressed either in
3999 seconds or (when <cf/mult/ is used) in multiples of <cf/max ra
4000 interval/. Note that RDNSS information is also invalidated when
4001 <cf/default lifetime/ expires. 0 means these addresses are no longer
4002 valid DNS servers. Default: 3 * <cf/max ra interval/.
4003 </descrip>
4004
4005 <p>DNSSL specific options:
4006
4007 <descrip>
4008 <tag><label id="radv-dnssl-domain">domain <m/address/</tag>
4009 This option specifies one DNS search domain. Can be used multiple times
4010 for multiple domains. It is mandatory to have at least one <cf/domain/
4011 option in <cf/dnssl/ definition.
4012
4013 <tag><label id="radv-dnssl-lifetime">lifetime [mult] <m/expr/</tag>
4014 This option specifies the time how long the DNSSL information may be
4015 used by clients after the receipt of RA. Details are the same as for
4016 RDNSS <cf/lifetime/ option above. Default: 3 * <cf/max ra interval/.
4017 </descrip>
4018
4019 <sect1>Attributes
4020 <label id="radv-attr">
4021
4022 <p>RAdv defines two route attributes:
4023
4024 <descrip>
4025 <tag><label id="rta-ra-preference">enum ra_preference</tag>
4026 The preference of the route. The value can be <it/RA_PREF_LOW/,
4027 <it/RA_PREF_MEDIUM/ or <it/RA_PREF_HIGH/. If the attribute is not set,
4028 the <ref id="radv-iface-route-preference" name="route preference">
4029 option is used.
4030
4031 <tag><label id="rta-ra-lifetime">int ra_lifetime</tag>
4032 The advertised lifetime of the route, in seconds. The special value of
4033 0xffffffff represents infinity. If the attribute is not set, the
4034 <ref id="radv-iface-route-lifetime" name="route lifetime">
4035 option is used.
4036 </descrip>
4037
4038 <sect1>Example
4039 <label id="radv-exam">
4040
4041 <p><code>
4042 ipv6 table radv_routes; # Manually configured routes go here
4043
4044 protocol static {
4045 ipv6 { table radv_routes; };
4046
4047 route 2001:0DB8:4000::/48 unreachable;
4048 route 2001:0DB8:4010::/48 unreachable;
4049
4050 route 2001:0DB8:4020::/48 unreachable {
4051 ra_preference = RA_PREF_HIGH;
4052 ra_lifetime = 3600;
4053 };
4054 }
4055
4056 protocol radv {
4057 propagate routes yes; # Propagate the routes from the radv_routes table
4058 ipv6 { table radv_routes; export all; };
4059
4060 interface "eth2" {
4061 max ra interval 5; # Fast failover with more routers
4062 managed yes; # Using DHCPv6 on eth2
4063 prefix ::/0 {
4064 autonomous off; # So do not autoconfigure any IP
4065 };
4066 };
4067
4068 interface "eth*"; # No need for any other options
4069
4070 prefix 2001:0DB8:1234::/48 {
4071 preferred lifetime 0; # Deprecated address range
4072 };
4073
4074 prefix 2001:0DB8:2000::/48 {
4075 autonomous off; # Do not autoconfigure
4076 };
4077
4078 rdnss 2001:0DB8:1234::10; # Short form of RDNSS
4079
4080 rdnss {
4081 lifetime mult 10;
4082 ns 2001:0DB8:1234::11;
4083 ns 2001:0DB8:1234::12;
4084 };
4085
4086 dnssl {
4087 lifetime 3600;
4088 domain "abc.com";
4089 domain "xyz.com";
4090 };
4091 }
4092 </code>
4093
4094
4095 <sect>RIP
4096 <label id="rip">
4097
4098 <sect1>Introduction
4099 <label id="rip-intro">
4100
4101 <p>The RIP protocol (also sometimes called Rest In Pieces) is a simple protocol,
4102 where each router broadcasts (to all its neighbors) distances to all networks it
4103 can reach. When a router hears distance to another network, it increments it and
4104 broadcasts it back. Broadcasts are done in regular intervals. Therefore, if some
4105 network goes unreachable, routers keep telling each other that its distance is
4106 the original distance plus 1 (actually, plus interface metric, which is usually
4107 one). After some time, the distance reaches infinity (that's 15 in RIP) and all
4108 routers know that network is unreachable. RIP tries to minimize situations where
4109 counting to infinity is necessary, because it is slow. Due to infinity being 16,
4110 you can't use RIP on networks where maximal distance is higher than 15
4111 hosts.
4112
4113 <p>BIRD supports RIPv1 (<rfc id="1058">), RIPv2 (<rfc id="2453">), RIPng (<rfc
4114 id="2080">), and RIP cryptographic authentication (<rfc id="4822">).
4115
4116 <p>RIP is a very simple protocol, and it has a lot of shortcomings. Slow
4117 convergence, big network load and inability to handle larger networks makes it
4118 pretty much obsolete. It is still usable on very small networks.
4119
4120 <sect1>Configuration
4121 <label id="rip-config">
4122
4123 <p>RIP configuration consists mainly of common protocol options and interface
4124 definitions, most RIP options are interface specific. RIPng (RIP for IPv6)
4125 protocol instance can be configured by using <cf/rip ng/ instead of just
4126 <cf/rip/ as a protocol type.
4127
4128 <p>RIP needs one IPv4 channel. RIPng needs one IPv6 channel. If no channel is
4129 configured, appropriate channel is defined with default parameters.
4130
4131 <code>
4132 protocol rip [ng] [&lt;name&gt;] {
4133 infinity &lt;number&gt;;
4134 ecmp &lt;switch&gt; [limit &lt;number&gt;];
4135 interface &lt;interface pattern&gt; {
4136 metric &lt;number&gt;;
4137 mode multicast|broadcast;
4138 passive &lt;switch&gt;;
4139 address &lt;ip&gt;;
4140 port &lt;number&gt;;
4141 version 1|2;
4142 split horizon &lt;switch&gt;;
4143 poison reverse &lt;switch&gt;;
4144 check zero &lt;switch&gt;;
4145 update time &lt;number&gt;;
4146 timeout time &lt;number&gt;;
4147 garbage time &lt;number&gt;;
4148 ecmp weight &lt;number&gt;;
4149 ttl security &lt;switch&gt;; | tx only;
4150 tx class|dscp &lt;number&gt;;
4151 tx priority &lt;number&gt;;
4152 rx buffer &lt;number&gt;;
4153 tx length &lt;number&gt;;
4154 check link &lt;switch&gt;;
4155 authentication none|plaintext|cryptographic;
4156 password "&lt;text&gt;";
4157 password "&lt;text&gt;" {
4158 id &lt;num&gt;;
4159 generate from "&lt;date&gt;";
4160 generate to "&lt;date&gt;";
4161 accept from "&lt;date&gt;";
4162 accept to "&lt;date&gt;";
4163 from "&lt;date&gt;";
4164 to "&lt;date&gt;";
4165 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
4166 };
4167 };
4168 }
4169 </code>
4170
4171 <descrip>
4172 <tag><label id="rip-infinity">infinity <M>number</M></tag>
4173 Selects the distance of infinity. Bigger values will make
4174 protocol convergence even slower. The default value is 16.
4175
4176 <tag><label id="rip-ecmp">ecmp <M>switch</M> [limit <M>number</M>]</tag>
4177 This option specifies whether RIP is allowed to generate ECMP
4178 (equal-cost multipath) routes. Such routes are used when there are
4179 several directions to the destination, each with the same (computed)
4180 cost. This option also allows to specify a limit on maximum number of
4181 nexthops in one route. By default, ECMP is enabled if supported by
4182 Kernel. Default value of the limit is 16.
4183
4184 <tag><label id="rip-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
4185 Interface definitions specify a set of interfaces on which the
4186 protocol is activated and contain interface specific options.
4187 See <ref id="proto-iface" name="interface"> common options for
4188 detailed description.
4189 </descrip>
4190
4191 <p>Interface specific options:
4192
4193 <descrip>
4194 <tag><label id="rip-iface-metric">metric <m/num/</tag>
4195 This option specifies the metric of the interface. When a route is
4196 received from the interface, its metric is increased by this value
4197 before further processing. Valid values are 1-255, but values higher
4198 than infinity has no further meaning. Default: 1.
4199
4200 <tag><label id="rip-iface-mode">mode multicast|broadcast</tag>
4201 This option selects the mode for RIP to use on the interface. The
4202 default is multicast mode for RIPv2 and broadcast mode for RIPv1.
4203 RIPng always uses the multicast mode.
4204
4205 <tag><label id="rip-iface-passive">passive <m/switch/</tag>
4206 Passive interfaces receive routing updates but do not transmit any
4207 messages. Default: no.
4208
4209 <tag><label id="rip-iface-address">address <m/ip/</tag>
4210 This option specifies a destination address used for multicast or
4211 broadcast messages, the default is the official RIP (224.0.0.9) or RIPng
4212 (ff02::9) multicast address, or an appropriate broadcast address in the
4213 broadcast mode.
4214
4215 <tag><label id="rip-iface-port">port <m/number/</tag>
4216 This option selects an UDP port to operate on, the default is the
4217 official RIP (520) or RIPng (521) port.
4218
4219 <tag><label id="rip-iface-version">version 1|2</tag>
4220 This option selects the version of RIP used on the interface. For RIPv1,
4221 automatic subnet aggregation is not implemented, only classful network
4222 routes and host routes are propagated. Note that BIRD allows RIPv1 to be
4223 configured with features that are defined for RIPv2 only, like
4224 authentication or using multicast sockets. The default is RIPv2 for IPv4
4225 RIP, the option is not supported for RIPng, as no further versions are
4226 defined.
4227
4228 <tag><label id="rip-iface-version-only">version only <m/switch/</tag>
4229 Regardless of RIP version configured for the interface, BIRD accepts
4230 incoming packets of any RIP version. This option restrict accepted
4231 packets to the configured version. Default: no.
4232
4233 <tag><label id="rip-iface-split-horizon">split horizon <m/switch/</tag>
4234 Split horizon is a scheme for preventing routing loops. When split
4235 horizon is active, routes are not regularly propagated back to the
4236 interface from which they were received. They are either not propagated
4237 back at all (plain split horizon) or propagated back with an infinity
4238 metric (split horizon with poisoned reverse). Therefore, other routers
4239 on the interface will not consider the router as a part of an
4240 independent path to the destination of the route. Default: yes.
4241
4242 <tag><label id="rip-iface-poison-reverse">poison reverse <m/switch/</tag>
4243 When split horizon is active, this option specifies whether the poisoned
4244 reverse variant (propagating routes back with an infinity metric) is
4245 used. The poisoned reverse has some advantages in faster convergence,
4246 but uses more network traffic. Default: yes.
4247
4248 <tag><label id="rip-iface-check-zero">check zero <m/switch/</tag>
4249 Received RIPv1 packets with non-zero values in reserved fields should
4250 be discarded. This option specifies whether the check is performed or
4251 such packets are just processed as usual. Default: yes.
4252
4253 <tag><label id="rip-iface-update-time">update time <m/number/</tag>
4254 Specifies the number of seconds between periodic updates. A lower number
4255 will mean faster convergence but bigger network load. Default: 30.
4256
4257 <tag><label id="rip-iface-timeout-time">timeout time <m/number/</tag>
4258 Specifies the time interval (in seconds) between the last received route
4259 announcement and the route expiration. After that, the network is
4260 considered unreachable, but still is propagated with infinity distance.
4261 Default: 180.
4262
4263 <tag><label id="rip-iface-garbage-time">garbage time <m/number/</tag>
4264 Specifies the time interval (in seconds) between the route expiration
4265 and the removal of the unreachable network entry. The garbage interval,
4266 when a route with infinity metric is propagated, is used for both
4267 internal (after expiration) and external (after withdrawal) routes.
4268 Default: 120.
4269
4270 <tag><label id="rip-iface-ecmp-weight">ecmp weight <m/number/</tag>
4271 When ECMP (multipath) routes are allowed, this value specifies a
4272 relative weight used for nexthops going through the iface. Valid
4273 values are 1-256. Default value is 1.
4274
4275 <tag><label id="rip-iface-auth">authentication none|plaintext|cryptographic</tag>
4276 Selects authentication method to be used. <cf/none/ means that packets
4277 are not authenticated at all, <cf/plaintext/ means that a plaintext
4278 password is embedded into each packet, and <cf/cryptographic/ means that
4279 packets are authenticated using some cryptographic hash function
4280 selected by option <cf/algorithm/ for each key. The default
4281 cryptographic algorithm for RIP keys is Keyed-MD5. If you set
4282 authentication to not-none, it is a good idea to add <cf>password</cf>
4283 section. Default: none.
4284
4285 <tag><label id="rip-iface-pass">password "<m/text/"</tag>
4286 Specifies a password used for authentication. See <ref id="proto-pass"
4287 name="password"> common option for detailed description.
4288
4289 <tag><label id="rip-iface-ttl-security">ttl security [<m/switch/ | tx only]</tag>
4290 TTL security is a feature that protects routing protocols from remote
4291 spoofed packets by using TTL 255 instead of TTL 1 for protocol packets
4292 destined to neighbors. Because TTL is decremented when packets are
4293 forwarded, it is non-trivial to spoof packets with TTL 255 from remote
4294 locations.
4295
4296 If this option is enabled, the router will send RIP packets with TTL 255
4297 and drop received packets with TTL less than 255. If this option si set
4298 to <cf/tx only/, TTL 255 is used for sent packets, but is not checked
4299 for received packets. Such setting does not offer protection, but offers
4300 compatibility with neighbors regardless of whether they use ttl
4301 security.
4302
4303 For RIPng, TTL security is a standard behavior (required by <rfc
4304 id="2080">) and therefore default value is yes. For IPv4 RIP, default
4305 value is no.
4306
4307 <tag><label id="rip-iface-tx-class">tx class|dscp|priority <m/number/</tag>
4308 These options specify the ToS/DiffServ/Traffic class/Priority of the
4309 outgoing RIP packets. See <ref id="proto-tx-class" name="tx class"> common
4310 option for detailed description.
4311
4312 <tag><label id="rip-iface-rx-buffer">rx buffer <m/number/</tag>
4313 This option specifies the size of buffers used for packet processing.
4314 The buffer size should be bigger than maximal size of received packets.
4315 The default value is 532 for IPv4 RIP and interface MTU value for RIPng.
4316
4317 <tag><label id="rip-iface-tx-length">tx length <m/number/</tag>
4318 This option specifies the maximum length of generated RIP packets. To
4319 avoid IP fragmentation, it should not exceed the interface MTU value.
4320 The default value is 532 for IPv4 RIP and interface MTU value for RIPng.
4321
4322 <tag><label id="rip-iface-check-link">check link <m/switch/</tag>
4323 If set, the hardware link state (as reported by OS) is taken into
4324 consideration. When the link disappears (e.g. an ethernet cable is
4325 unplugged), neighbors are immediately considered unreachable and all
4326 routes received from them are withdrawn. It is possible that some
4327 hardware drivers or platforms do not implement this feature.
4328 Default: yes.
4329 </descrip>
4330
4331 <sect1>Attributes
4332 <label id="rip-attr">
4333
4334 <p>RIP defines two route attributes:
4335
4336 <descrip>
4337 <tag><label id="rta-rip-metric">int rip_metric</tag>
4338 RIP metric of the route (ranging from 0 to <cf/infinity/). When routes
4339 from different RIP instances are available and all of them have the same
4340 preference, BIRD prefers the route with lowest <cf/rip_metric/. When a
4341 non-RIP route is exported to RIP, the default metric is 1.
4342
4343 <tag><label id="rta-rip-tag">int rip_tag</tag>
4344 RIP route tag: a 16-bit number which can be used to carry additional
4345 information with the route (for example, an originating AS number in
4346 case of external routes). When a non-RIP route is exported to RIP, the
4347 default tag is 0.
4348 </descrip>
4349
4350 <sect1>Example
4351 <label id="rip-exam">
4352
4353 <p><code>
4354 protocol rip {
4355 ipv4 {
4356 import all;
4357 export all;
4358 };
4359 interface "eth*" {
4360 metric 2;
4361 port 1520;
4362 mode multicast;
4363 update time 12;
4364 timeout time 60;
4365 authentication cryptographic;
4366 password "secret" { algorithm hmac sha256; };
4367 };
4368 }
4369 </code>
4370
4371
4372 <sect>RPKI
4373 <label id="rpki">
4374
4375 <sect1>Introduction
4376
4377 <p>The Resource Public Key Infrastructure (RPKI) is mechanism for origin
4378 validation of BGP routes (RFC 6480). BIRD supports only so-called RPKI-based
4379 origin validation. There is implemented RPKI to Router (RPKI-RTR) protocol (RFC
4380 6810). It uses some of the RPKI data to allow a router to verify that the
4381 autonomous system announcing an IP address prefix is in fact authorized to do
4382 so. This is not crypto checked so can be violated. But it should prevent the
4383 vast majority of accidental hijackings on the Internet today, e.g. the famous
4384 Pakastani accidental announcement of YouTube's address space.
4385
4386 <p>The RPKI-RTR protocol receives and maintains a set of ROAs from a cache
4387 server (also called validator). You can validate routes (RFC 6483) using
4388 function <cf/roa_check()/ in filter and set it as import filter at the BGP
4389 protocol. BIRD should re-validate all of affected routes after RPKI update by
4390 RFC 6811, but we don't support it yet! You can use a BIRD's client command
4391 <cf>reload in <m/bgp_protocol_name/</cf> for manual call of revalidation of all
4392 routes.
4393
4394 <sect1>Supported transports
4395 <p>
4396 <itemize>
4397 <item>Unprotected transport over TCP uses a port 323. The cache server
4398 and BIRD router should be on the same trusted and controlled network
4399 for security reasons.
4400 <item>SSHv2 encrypted transport connection uses the normal SSH port
4401 22.
4402 </itemize>
4403
4404 <sect1>Configuration
4405
4406 <p>We currently support just one cache server per protocol. However you can
4407 define more RPKI protocols generally.
4408
4409 <code>
4410 protocol rpki [&lt;name&gt;] {
4411 roa4 { table &lt;tab&gt;; };
4412 roa6 { table &lt;tab&gt;; };
4413 remote &lt;ip&gt; | "&lt;domain&gt;" [port &lt;num&gt;];
4414 port &lt;num&gt;;
4415 refresh [keep] &lt;num&gt;;
4416 retry [keep] &lt;num&gt;;
4417 expire [keep] &lt;num&gt;;
4418 transport tcp;
4419 transport ssh {
4420 bird private key "&lt;/path/to/id_rsa&gt;";
4421 remote public key "&lt;/path/to/known_host&gt;";
4422 user "&lt;name&gt;";
4423 };
4424 }
4425 </code>
4426
4427 <p>Alse note that you have to specify the ROA channel. If you want to import
4428 only IPv4 prefixes you have to specify only roa4 channel. Similarly with IPv6
4429 prefixes only. If you want to fetch both IPv4 and even IPv6 ROAs you have to
4430 specify both channels.
4431
4432 <sect2>RPKI protocol options
4433 <p>
4434 <descrip>
4435 <tag>remote <m/ip/ | "<m/hostname/" [port <m/num/]</tag> Specifies
4436 a destination address of the cache server. Can be specified by an IP
4437 address or by full domain name string. Only one cache can be specified
4438 per protocol. This option is required.
4439
4440 <tag>port <m/num/</tag> Specifies the port number. The default port
4441 number is 323 for transport without any encryption and 22 for transport
4442 with SSH encryption.
4443
4444 <tag>refresh [keep] <m/num/</tag> Time period in seconds. Tells how
4445 long to wait before next attempting to poll the cache using a Serial
4446 Query or a Reset Query packet. Must be lower than 86400 seconds (one
4447 day). Too low value can caused a false positive detection of
4448 network connection problems. A keyword <cf/keep/ suppresses updating
4449 this value by a cache server.
4450 Default: 3600 seconds
4451
4452 <tag>retry [keep] <m/num/</tag> Time period in seconds between a failed
4453 Serial/Reset Query and a next attempt. Maximum allowed value is 7200
4454 seconds (two hours). Too low value can caused a false positive
4455 detection of network connection problems. A keyword <cf/keep/
4456 suppresses updating this value by a cache server.
4457 Default: 600 seconds
4458
4459 <tag>expire [keep] <m/num/</tag> Time period in seconds. Received
4460 records are deleted if the client was unable to successfully refresh
4461 data for this time period. Must be in range from 600 seconds (ten
4462 minutes) to 172800 seconds (two days). A keyword <cf/keep/
4463 suppresses updating this value by a cache server.
4464 Default: 7200 seconds
4465
4466 <tag>transport tcp</tag> Unprotected transport over TCP. It's a default
4467 transport. Should be used only on secure private networks.
4468 Default: tcp
4469
4470 <tag>transport ssh { <m/SSH transport options.../ }</tag> It enables a
4471 SSHv2 transport encryption. Cannot be combined with a TCP transport.
4472 Default: off
4473 </descrip>
4474
4475 <sect3>SSH transport options
4476 <p>
4477 <descrip>
4478 <tag>bird private key "<m>/path/to/id_rsa</m>"</tag>
4479 A path to the BIRD's private SSH key for authentication.
4480 It can be a <cf><m>id_rsa</m></cf> file.
4481
4482 <tag>remote public key "<m>/path/to/known_host</m>"</tag>
4483 A path to the cache's public SSH key for verification identity
4484 of the cache server. It could be a path to <cf><m>known_host</m></cf> file.
4485
4486 <tag>user "<m/name/"</tag>
4487 A SSH user name for authentication. This option is a required.
4488 </descrip>
4489
4490 <sect1>Examples
4491 <sect2>BGP origin validation
4492 <p>Policy: Don't import <cf/ROA_INVALID/ routes.
4493 <code>
4494 roa4 table r4;
4495 roa6 table r6;
4496
4497 protocol rpki {
4498 debug all;
4499
4500 roa4 { table r4; };
4501 roa6 { table r6; };
4502
4503 # Please, do not use rpki-validator.realmv6.org in production
4504 remote "rpki-validator.realmv6.org" port 8282;
4505
4506 retry keep 5;
4507 refresh keep 30;
4508 expire 600;
4509 }
4510
4511 filter peer_in_v4 {
4512 if (roa_check(r4, net, bgp_path.last) = ROA_INVALID) then
4513 {
4514 print "Ignore invalid ROA ", net, " for ASN ", bgp_path.last;
4515 reject;
4516 }
4517 accept;
4518 }
4519
4520 protocol bgp {
4521 debug all;
4522 local as 65000;
4523 neighbor 192.168.2.1 as 65001;
4524 ipv4 {
4525 import filter peer_in_v4;
4526 export none;
4527 };
4528 }
4529 </code>
4530
4531 <sect2>SSHv2 transport encryption
4532 <p>
4533 <code>
4534 roa4 table r4;
4535 roa6 table r6;
4536
4537 protocol rpki {
4538 debug all;
4539
4540 roa4 { table r4; };
4541 roa6 { table r6; };
4542
4543 remote 127.0.0.1 port 2345;
4544 transport ssh {
4545 bird private key "/home/birdgeek/.ssh/id_rsa";
4546 remote public key "/home/birdgeek/.ssh/known_hosts";
4547 user "birdgeek";
4548 };
4549
4550 # Default interval values
4551 }
4552 </code>
4553
4554
4555 <sect>Static
4556 <label id="static">
4557
4558 <p>The Static protocol doesn't communicate with other routers in the network,
4559 but instead it allows you to define routes manually. This is often used for
4560 specifying how to forward packets to parts of the network which don't use
4561 dynamic routing at all and also for defining sink routes (i.e., those telling to
4562 return packets as undeliverable if they are in your IP block, you don't have any
4563 specific destination for them and you don't want to send them out through the
4564 default route to prevent routing loops).
4565
4566 <p>There are three classes of definitions in Static protocol configuration --
4567 global options, static route definitions, and per-route options. Usually, the
4568 definition of the protocol contains mainly a list of static routes.
4569 Static routes have no specific attributes.
4570
4571 <p>Global options:
4572
4573 <descrip>
4574 <tag><label id="static-check-link">check link <m/switch/</tag>
4575 If set, hardware link states of network interfaces are taken into
4576 consideration. When link disappears (e.g. ethernet cable is unplugged),
4577 static routes directing to that interface are removed. It is possible
4578 that some hardware drivers or platforms do not implement this feature.
4579 Default: off.
4580
4581 <tag><label id="static-igp-table">igp table <m/name/</tag>
4582 Specifies a table that is used for route table lookups of recursive
4583 routes. Default: the same table as the protocol is connected to.
4584 </descrip>
4585
4586 <p>Route definitions (each may also contain a block of per-route options):
4587
4588 <sect1>Regular routes; MPLS switching rules
4589
4590 <p>There exist several types of routes; keep in mind that <m/prefix/ syntax is
4591 <ref id="type-prefix" name="dependent on network type">.
4592
4593 <descrip>
4594 <tag>route <m/prefix/ via <m/ip/|<m/"interface"/ [mpls <m/num/[/<m/num/[/<m/num/[...]]]]</tag>
4595 Next hop routes may bear one or more <ref id="route-next-hop" name="next hops">.
4596 Every next hop is preceded by <cf/via/ and configured as shown.
4597
4598 <tag>route <m/prefix/ recursive <m/ip/ [mpls <m/num/[/<m/num/[/<m/num/[...]]]]</tag>
4599 Recursive nexthop resolves the given IP in the configured IGP table and
4600 uses that route's next hop. The MPLS stacks are concatenated; on top is
4601 the IGP's nexthop stack and on bottom is this route's stack.
4602
4603 <tag>route <m/prefix/ blackhole|unreachable|prohibit</tag>
4604 Special routes specifying to silently drop the packet, return it as
4605 unreachable or return it as administratively prohibited. First two
4606 targets are also known as <cf/drop/ and <cf/reject/.
4607 </descrip>
4608
4609 <p>When the particular destination is not available (the interface is down or
4610 the next hop of the route is not a neighbor at the moment), Static just
4611 uninstalls the route from the table it is connected to and adds it again as soon
4612 as the destination becomes adjacent again.
4613
4614 <sect1>Route Origin Authorization
4615
4616 <p>The ROA config is just <cf>route <m/prefix/ max <m/int/ as <m/int/</cf> with no nexthop.
4617
4618 <sect1>Flowspec
4619 <label id="flowspec-network-type">
4620
4621 <p>The flow specification are rules for routers and firewalls for filtering
4622 purpose. It is described by <rfc id="5575">. There are 3 types of arguments:
4623 <m/inet4/ or <m/inet6/ prefixes, bitmasks matching expressions and numbers
4624 matching expressions.
4625
4626 Bitmasks matching is written using <m/value/<cf>/</cf><m/mask/ or
4627 <cf/!/<m/value/<cf>/</cf><m/mask/ pairs. It means that <cf/(/<m/data/ <cf/&/
4628 <m/mask/<cf/)/ is or is not equal to <m/value/.
4629
4630 Numbers matching is a matching sequence of numbers and ranges separeted by a
4631 commas (<cf/,/) (e.g. <cf/10,20,30/). Ranges can be written using double dots
4632 <cf/../ notation (e.g. <cf/80..90,120..124/). An alternative notation are
4633 sequence of one or more pairs of relational operators and values separated by
4634 logical operators <cf/&&/ or <cf/||/. Allowed relational operators are <cf/=/,
4635 <cf/!=/, <cf/</, <cf/<=/, <cf/>/, <cf/>=/, <cf/true/ and <cf/false/.
4636
4637 <sect2>IPv4 Flowspec
4638
4639 <p><descrip>
4640 <tag><label id="flow-dst">dst <m/inet4/</tag>
4641 Set a matching destination prefix (e.g. <cf>dst 192.168.0.0/16</cf>).
4642 Only this option is mandatory in IPv4 Flowspec.
4643
4644 <tag><label id="flow-src">src <m/inet4/</tag>
4645 Set a matching source prefix (e.g. <cf>src 10.0.0.0/8</cf>).
4646
4647 <tag><label id="flow-proto">proto <m/numbers-match/</tag>
4648 Set a matching IP protocol numbers (e.g. <cf/proto 6/).
4649
4650 <tag><label id="flow-port">port <m/numbers-match/</tag>
4651 Set a matching source or destination TCP/UDP port numbers (e.g.
4652 <cf>port 1..1023,1194,3306</cf>).
4653
4654 <tag><label id="flow-dport">dport <m/numbers-match/</tag>
4655 Set a mating destination port numbers (e.g. <cf>dport 49151</cf>).
4656
4657 <tag><label id="flow-sport">sport <m/numbers-match/</tag>
4658 Set a matching source port numbers (e.g. <cf>sport = 0</cf>).
4659
4660 <tag><label id="flow-icmp-type">icmp type <m/numbers-match/</tag>
4661 Set a matching type field number of an ICMP packet (e.g. <cf>icmp type
4662 3</cf>)
4663
4664 <tag><label id="flow-icmp-code">icmp code <m/numbers-match/</tag>
4665 Set a matching code field number of an ICMP packet (e.g. <cf>icmp code
4666 1</cf>)
4667
4668 <tag><label id="flow-tcp-flags">tcp flags <m/bitmask-match/</tag>
4669 Set a matching bitmask for TCP header flags (aka control bits) (e.g.
4670 <cf>tcp flags 0x03/0x0f;</cf>). The maximum length of mask is 12 bits
4671 (0xfff).
4672
4673 <tag><label id="flow-length">length <m/numbers-match/</tag>
4674 Set a matching packet length (e.g. <cf>length > 1500;</cf>)
4675
4676 <tag><label id="flow-dscp">dscp <m/numbers-match/</tag>
4677 Set a matching DiffServ Code Point number (e.g. <cf>length > 1500;</cf>).
4678
4679 <tag><label id="flow-fragment">fragment <m/fragmentation-type/</tag>
4680 Set a matching type of packet fragmentation. Allowed fragmentation
4681 types are <cf/dont_fragment/, <cf/is_fragment/, <cf/first_fragment/,
4682 <cf/last_fragment/ (e.g. <cf>fragment is_fragment &&
4683 !dont_fragment</cf>).
4684 </descrip>
4685
4686 <p><code>
4687 protocol static {
4688 flow4;
4689
4690 route flow4 {
4691 dst 10.0.0.0/8;
4692 port > 24 && < 30 || 40..50,60..70,80 && >= 90;
4693 tcp flags 0x03/0x0f;
4694 length > 1024;
4695 dscp = 63;
4696 fragment dont_fragment, is_fragment || !first_fragment;
4697 };
4698 }
4699 </code>
4700
4701 <sect2>Differences for IPv6 Flowspec
4702
4703 <p>Flowspec IPv6 are same as Flowspec IPv4 with a few exceptions.
4704 <itemize>
4705 <item>Prefixes <m/inet6/ can be specified not only with prefix length,
4706 but with prefix <cf/offset/ <m/num/ too (e.g.
4707 <cf>::1234:5678:9800:0000/101 offset 64</cf>). Offset means to don't
4708 care of <m/num/ first bits.
4709 <item>IPv6 Flowspec hasn't mandatory any flowspec component.
4710 <item>In IPv6 packets, there is a matching the last next header value
4711 for a matching IP protocol number (e.g. <cf>next header 6</cf>).
4712 <item>It is not possible to set <cf>dont_fragment</cf> as a type of
4713 packet fragmentation.
4714 </itemize>
4715
4716 <p><descrip>
4717 <tag><label id="flow6-dst">dst <m/inet6/ [offset <m/num/]</tag>
4718 Set a matching destination IPv6 prefix (e.g. <cf>dst
4719 ::1c77:3769:27ad:a11a/128 offset 64</cf>).
4720
4721 <tag><label id="flow6-src">src <m/inet6/ [offset <m/num/]</tag>
4722 Set a matching source IPv6 prefix (e.g. <cf>src fe80::/64</cf>).
4723
4724 <tag><label id="flow6-next-header">next header <m/numbers-match/</tag>
4725 Set a matching IP protocol numbers (e.g. <cf>next header != 6</cf>).
4726
4727 <tag><label id="flow6-label">label <m/bitmask-match/</tag>
4728 Set a 20-bit bitmask for matching Flow Label field in IPv6 packets
4729 (e.g. <cf>label 0x8e5/0x8e5</cf>).
4730 </descrip>
4731
4732 <p><code>
4733 protocol static {
4734 flow6 { table myflow6; };
4735
4736 route flow6 {
4737 dst fec0:1122:3344:5566:7788:99aa:bbcc:ddee/128;
4738 src 0000:0000:0000:0001:1234:5678:9800:0000/101 offset 63;
4739 next header = 23;
4740 sport > 24 && < 30 || = 40 || 50,60,70..80;
4741 dport = 50;
4742 tcp flags 0x03/0x0f, !0/0xff || 0x33/0x33;
4743 fragment !is_fragment || !first_fragment;
4744 label 0xaaaa/0xaaaa && 0x33/0x33;
4745 };
4746 }
4747 </code>
4748
4749 <sect1>Per-route options
4750 <p>
4751 <descrip>
4752 <tag><label id="static-route-bfd">bfd <m/switch/</tag>
4753 The Static protocol could use BFD protocol for next hop liveness
4754 detection. If enabled, a BFD session to the route next hop is created
4755 and the static route is BFD-controlled -- the static route is announced
4756 only if the next hop liveness is confirmed by BFD. If the BFD session
4757 fails, the static route is removed. Note that this is a bit different
4758 compared to other protocols, which may use BFD as an advisory mechanism
4759 for fast failure detection but ignores it if a BFD session is not even
4760 established.
4761
4762 This option can be used for static routes with a direct next hop, or
4763 also for for individual next hops in a static multipath route (see
4764 above). Note that BFD protocol also has to be configured, see
4765 <ref id="bfd" name="BFD"> section for details. Default value is no.
4766
4767 <tag><label id="static-route-filter"><m/filter expression/</tag>
4768 This is a special option that allows filter expressions to be configured
4769 on per-route basis. Can be used multiple times. These expressions are
4770 evaluated when the route is originated, similarly to the import filter
4771 of the static protocol. This is especially useful for configuring route
4772 attributes, e.g., <cf/ospf_metric1 = 100;/ for a route that will be
4773 exported to the OSPF protocol.
4774 </descrip>
4775
4776 <sect1>Example static config
4777
4778 <p><code>
4779 protocol static {
4780 ipv4 { table testable; }; # Connect to a non-default routing table
4781 check link; # Advertise routes only if link is up
4782 route 0.0.0.0/0 via 198.51.100.130; # Default route
4783 route 10.0.0.0/8 # Multipath route
4784 via 198.51.100.10 weight 2
4785 via 198.51.100.20 bfd # BFD-controlled next hop
4786 via 192.0.2.1;
4787 route 203.0.113.0/24 unreachable; # Sink route
4788 route 10.2.0.0/24 via "arc0"; # Secondary network
4789 route 192.168.10.0/24 via 198.51.100.100 {
4790 ospf_metric1 = 20; # Set extended attribute
4791 }
4792 route 192.168.10.0/24 via 198.51.100.100 {
4793 ospf_metric2 = 100; # Set extended attribute
4794 ospf_tag = 2; # Set extended attribute
4795 bfd; # BFD-controlled route
4796 }
4797 }
4798 </code>
4799
4800
4801 <chapt>Conclusions
4802 <label id="conclusion">
4803
4804 <sect>Future work
4805 <label id="future-work">
4806
4807 <p>Although BIRD supports all the commonly used routing protocols, there are
4808 still some features which would surely deserve to be implemented in future
4809 versions of BIRD:
4810
4811 <itemize>
4812 <item>Opaque LSA's
4813 <item>Route aggregation and flap dampening
4814 <item>Multicast routing protocols
4815 <item>Ports to other systems
4816 </itemize>
4817
4818
4819 <sect>Getting more help
4820 <label id="help">
4821
4822 <p>If you use BIRD, you're welcome to join the bird-users mailing list
4823 (<HTMLURL URL="mailto:bird-users@network.cz" name="bird-users@network.cz">)
4824 where you can share your experiences with the other users and consult
4825 your problems with the authors. To subscribe to the list, visit
4826 <HTMLURL URL="http://bird.network.cz/?m_list" name="http://bird.network.cz/?m_list">.
4827 The home page of BIRD can be found at <HTMLURL URL="http://bird.network.cz/" name="http://bird.network.cz/">.
4828
4829 <p>BIRD is a relatively young system and it probably contains some bugs. You can
4830 report any problems to the bird-users list and the authors will be glad to solve
4831 them, but before you do so, please make sure you have read the available
4832 documentation and that you are running the latest version (available at
4833 <HTMLURL URL="ftp://bird.network.cz/pub/bird" name="bird.network.cz:/pub/bird">).
4834 (Of course, a patch which fixes the bug is always welcome as an attachment.)
4835
4836 <p>If you want to understand what is going inside, Internet standards are a good
4837 and interesting reading. You can get them from
4838 <HTMLURL URL="ftp://ftp.rfc-editor.org/" name="ftp.rfc-editor.org"> (or a
4839 nicely sorted version from <HTMLURL URL="ftp://atrey.karlin.mff.cuni.cz/pub/rfc"
4840 name="atrey.karlin.mff.cuni.cz:/pub/rfc">).
4841
4842 <p><it/Good luck!/
4843
4844 </book>
4845
4846 <!--
4847 LocalWords: GPL IPv GateD BGPv RIPv OSPFv Linux sgml html dvi sgmltools Pavel
4848 LocalWords: linuxdoc dtd descrip config conf syslog stderr auth ospf bgp Mbps
4849 LocalWords: router's eval expr num birdc ctl UNIX if's enums bool int ip GCC
4850 LocalWords: len ipaddress pxlen netmask enum bgppath bgpmask clist gw md eth
4851 LocalWords: RTS printn quitbird iBGP AS'es eBGP RFC multiprotocol IGP Machek
4852 LocalWords: EGP misconfigurations keepalive pref aggr aggregator BIRD's RTC
4853 LocalWords: OS'es AS's multicast nolisten misconfigured UID blackhole MRTD MTU
4854 LocalWords: uninstalls ethernets IP binutils ANYCAST anycast dest RTD ICMP rfc
4855 LocalWords: compat multicasts nonbroadcast pointopoint loopback sym stats
4856 LocalWords: Perl SIGHUP dd mm yy HH MM SS EXT IA UNICAST multihop Discriminator txt
4857 LocalWords: proto wildcard Ondrej Filip
4858 -->