]> git.ipfire.org Git - thirdparty/bird.git/blob - doc/bird.sgml
c549e3c3db2ef43845bf5617f365924cfe675b80
[thirdparty/bird.git] / doc / bird.sgml
1 <!doctype birddoc system>
2
3 <!--
4 BIRD documentation
5
6 This documentation can have 4 forms: sgml (this is master copy), html, ASCII
7 text and dvi/postscript (generated from sgml using sgmltools). You should always
8 edit master copy.
9
10 This is a slightly modified linuxdoc dtd. Anything in <descrip> tags is
11 considered definition of configuration primitives, <cf> is fragment of
12 configuration within normal text, <m> is "meta" information within fragment of
13 configuration - something in config which is not keyword.
14
15 (set-fill-column 80)
16
17 Copyright 1999,2000 Pavel Machek <pavel@ucw.cz>, distribute under GPL version 2 or later.
18
19 -->
20
21 <book>
22
23 <title>BIRD User's Guide
24 <author>
25 Ondrej Filip <it/&lt;feela@network.cz&gt;/,
26 Pavel Machek <it/&lt;pavel@ucw.cz&gt;/,
27 Martin Mares <it/&lt;mj@ucw.cz&gt;/,
28 Ondrej Zajicek <it/&lt;santiago@crfreenet.org&gt;/
29 </author>
30
31 <abstract>
32 This document contains user documentation for the BIRD Internet Routing Daemon project.
33 </abstract>
34
35 <!-- Table of contents -->
36 <toc>
37
38 <!-- Begin the document -->
39
40
41 <chapt>Introduction
42 <label id="intro">
43
44 <sect>What is BIRD
45 <label id="what-is-bird">
46
47 <p>The name `BIRD' is actually an acronym standing for `BIRD Internet Routing
48 Daemon'. Let's take a closer look at the meaning of the name:
49
50 <p><em/BIRD/: Well, we think we have already explained that. It's an acronym
51 standing for `BIRD Internet Routing Daemon', you remember, don't you? :-)
52
53 <p><em/Internet Routing/: It's a program (well, a daemon, as you are going to
54 discover in a moment) which works as a dynamic router in an Internet type
55 network (that is, in a network running either the IPv4 or the IPv6 protocol).
56 Routers are devices which forward packets between interconnected networks in
57 order to allow hosts not connected directly to the same local area network to
58 communicate with each other. They also communicate with the other routers in the
59 Internet to discover the topology of the network which allows them to find
60 optimal (in terms of some metric) rules for forwarding of packets (which are
61 called routing tables) and to adapt themselves to the changing conditions such
62 as outages of network links, building of new connections and so on. Most of
63 these routers are costly dedicated devices running obscure firmware which is
64 hard to configure and not open to any changes (on the other hand, their special
65 hardware design allows them to keep up with lots of high-speed network
66 interfaces, better than general-purpose computer does). Fortunately, most
67 operating systems of the UNIX family allow an ordinary computer to act as a
68 router and forward packets belonging to the other hosts, but only according to a
69 statically configured table.
70
71 <p>A <em/Routing Daemon/ is in UNIX terminology a non-interactive program
72 running on background which does the dynamic part of Internet routing, that is
73 it communicates with the other routers, calculates routing tables and sends them
74 to the OS kernel which does the actual packet forwarding. There already exist
75 other such routing daemons: routed (RIP only), GateD (non-free),
76 <HTMLURL URL="http://www.zebra.org" name="Zebra"> and
77 <HTMLURL URL="http://sourceforge.net/projects/mrt" name="MRTD">,
78 but their capabilities are limited and they are relatively hard to configure
79 and maintain.
80
81 <p>BIRD is an Internet Routing Daemon designed to avoid all of these shortcomings,
82 to support all the routing technology used in the today's Internet or planned to
83 be used in near future and to have a clean extensible architecture allowing new
84 routing protocols to be incorporated easily. Among other features, BIRD
85 supports:
86
87 <itemize>
88 <item>both IPv4 and IPv6 protocols
89 <item>multiple routing tables
90 <item>the Border Gateway Protocol (BGPv4)
91 <item>the Routing Information Protocol (RIPv2)
92 <item>the Open Shortest Path First protocol (OSPFv2, OSPFv3)
93 <item>the Router Advertisements for IPv6 hosts
94 <item>a virtual protocol for exchange of routes between different
95 routing tables on a single host
96 <item>a command-line interface allowing on-line control and inspection
97 of status of the daemon
98 <item>soft reconfiguration (no need to use complex online commands to
99 change the configuration, just edit the configuration file and
100 notify BIRD to re-read it and it will smoothly switch itself to
101 the new configuration, not disturbing routing protocols unless
102 they are affected by the configuration changes)
103 <item>a powerful language for route filtering
104 </itemize>
105
106 <p>BIRD has been developed at the Faculty of Math and Physics, Charles
107 University, Prague, Czech Republic as a student project. It can be freely
108 distributed under the terms of the GNU General Public License.
109
110 <p>BIRD has been designed to work on all UNIX-like systems. It has been
111 developed and tested under Linux 2.0 to 2.6, and then ported to FreeBSD, NetBSD
112 and OpenBSD, porting to other systems (even non-UNIX ones) should be relatively
113 easy due to its highly modular architecture.
114
115 <p>BIRD supports either IPv4 or IPv6 protocol, but have to be compiled separately
116 for each one. Therefore, a dualstack router would run two instances of BIRD (one
117 for IPv4 and one for IPv6), with completely separate setups (configuration
118 files, tools ...).
119
120
121 <sect>Installing BIRD
122 <label id="install">
123
124 <p>On a recent UNIX system with GNU development tools (GCC, binutils, m4, make)
125 and Perl, installing BIRD should be as easy as:
126
127 <code>
128 ./configure
129 make
130 make install
131 vi /usr/local/etc/bird.conf
132 bird
133 </code>
134
135 <p>You can use <tt>./configure --help</tt> to get a list of configure
136 options. The most important ones are: <tt/--enable-ipv6/ which enables building
137 of an IPv6 version of BIRD, <tt/--with-protocols=/ to produce a slightly smaller
138 BIRD executable by configuring out routing protocols you don't use, and
139 <tt/--prefix=/ to install BIRD to a place different from <file>/usr/local</file>.
140
141
142 <sect>Running BIRD
143 <label id="argv">
144
145 <p>You can pass several command-line options to bird:
146
147 <descrip>
148 <tag><label id="argv-config">-c <m/config name/</tag>
149 use given configuration file instead of <it/prefix/<file>/etc/bird.conf</file>.
150
151 <tag><label id="argv-debug">-d</tag>
152 enable debug messages and run bird in foreground.
153
154 <tag><label id="argv-log-file">-D <m/filename of debug log/</tag>
155 log debugging information to given file instead of stderr.
156
157 <tag><label id="argv-foreground">-f</tag>
158 run bird in foreground.
159
160 <tag><label id="argv-group">-g <m/group/</tag>
161 use that group ID, see the next section for details.
162
163 <tag><label id="argv-help">-h, --help</tag>
164 display command-line options to bird.
165
166 <tag><label id="argv-local">-l</tag>
167 look for a configuration file and a communication socket in the current
168 working directory instead of in default system locations. However, paths
169 specified by options <cf/-c/, <cf/-s/ have higher priority.
170
171 <tag><label id="argv-parse">-p</tag>
172 just parse the config file and exit. Return value is zero if the config
173 file is valid, nonzero if there are some errors.
174
175 <tag><label id="argv-pid">-P <m/name of PID file/</tag>
176 create a PID file with given filename.
177
178 <tag><label id="argv-recovery">-R</tag>
179 apply graceful restart recovery after start.
180
181 <tag><label id="argv-socket">-s <m/name of communication socket/</tag>
182 use given filename for a socket for communications with the client,
183 default is <it/prefix/<file>/var/run/bird.ctl</file>.
184
185 <tag><label id="argv-user">-u <m/user/</tag>
186 drop privileges and use that user ID, see the next section for details.
187
188 <tag><label id="argv-version">--version</tag>
189 display bird version.
190 </descrip>
191
192 <p>BIRD writes messages about its work to log files or syslog (according to config).
193
194
195 <sect>Privileges
196 <label id="privileges">
197
198 <p>BIRD, as a routing daemon, uses several privileged operations (like setting
199 routing table and using raw sockets). Traditionally, BIRD is executed and runs
200 with root privileges, which may be prone to security problems. The recommended
201 way is to use a privilege restriction (options <cf/-u/, <cf/-g/). In that case
202 BIRD is executed with root privileges, but it changes its user and group ID to
203 an unprivileged ones, while using Linux capabilities to retain just required
204 privileges (capabilities CAP_NET_*). Note that the control socket is created
205 before the privileges are dropped, but the config file is read after that. The
206 privilege restriction is not implemented in BSD port of BIRD.
207
208 <p>An unprivileged user (as an argument to <cf/-u/ options) may be the user
209 <cf/nobody/, but it is suggested to use a new dedicated user account (like
210 <cf/bird/). The similar considerations apply for the group option, but there is
211 one more condition -- the users in the same group can use <file/birdc/ to
212 control BIRD.
213
214 <p>Finally, there is a possibility to use external tools to run BIRD in an
215 environment with restricted privileges. This may need some configuration, but it
216 is generally easy -- BIRD needs just the standard library, privileges to read
217 the config file and create the control socket and the CAP_NET_* capabilities.
218
219
220 <chapt>About routing tables
221 <label id="routing-tables">
222
223 <p>BIRD has one or more routing tables which may or may not be synchronized with
224 OS kernel and which may or may not be synchronized with each other (see the Pipe
225 protocol). Each routing table contains a list of known routes. Each route
226 consists of:
227
228 <itemize>
229 <item>network prefix this route is for (network address and prefix
230 length -- the number of bits forming the network part of the
231 address; also known as a netmask)
232 <item>preference of this route
233 <item>IP address of router which told us about this route
234 <item>IP address of router we should forward the packets to using this
235 route
236 <item>other attributes common to all routes
237 <item>dynamic attributes defined by protocols which may or may not be
238 present (typically protocol metrics)
239 </itemize>
240
241 Routing table maintains multiple entries for a network, but at most one entry
242 for one network and one protocol. The entry with the highest preference is used
243 for routing (we will call such an entry the <it/selected route/). If there are
244 more entries with the same preference and they are from the same protocol, the
245 protocol decides (typically according to metrics). If they aren't, an internal
246 ordering is used to break the tie. You can get the list of route attributes in
247 the Route attributes section.
248
249 <p>Each protocol is connected to a routing table through two filters which can
250 accept, reject and modify the routes. An <it/export/ filter checks routes passed
251 from the routing table to the protocol, an <it/import/ filter checks routes in
252 the opposite direction. When the routing table gets a route from a protocol, it
253 recalculates the selected route and broadcasts it to all protocols connected to
254 the table. The protocols typically send the update to other routers in the
255 network. Note that although most protocols are interested in receiving just
256 selected routes, some protocols (e.g. the <cf/Pipe/ protocol) receive and
257 process all entries in routing tables (accepted by filters).
258
259 <p><label id="dsc-table-sorted">Usually, a routing table just chooses a selected route
260 from a list of entries for one network. But if the <cf/sorted/ option is
261 activated, these lists of entries are kept completely sorted (according to
262 preference or some protocol-dependent metric). This is needed for some features
263 of some protocols (e.g. <cf/secondary/ option of BGP protocol, which allows to
264 accept not just a selected route, but the first route (in the sorted list) that
265 is accepted by filters), but it is incompatible with some other features (e.g.
266 <cf/deterministic med/ option of BGP protocol, which activates a way of choosing
267 selected route that cannot be described using comparison and ordering). Minor
268 advantage is that routes are shown sorted in <cf/show route/, minor disadvantage
269 is that it is slightly more computationally expensive.
270
271
272 <sect>Graceful restart
273 <label id="graceful-restart">
274
275 <p>When BIRD is started after restart or crash, it repopulates routing tables in
276 an uncoordinated manner, like after clean start. This may be impractical in some
277 cases, because if the forwarding plane (i.e. kernel routing tables) remains
278 intact, then its synchronization with BIRD would temporarily disrupt packet
279 forwarding until protocols converge. Graceful restart is a mechanism that could
280 help with this issue. Generally, it works by starting protocols and letting them
281 repopulate routing tables while deferring route propagation until protocols
282 acknowledge their convergence. Note that graceful restart behavior have to be
283 configured for all relevant protocols and requires protocol-specific support
284 (currently implemented for Kernel and BGP protocols), it is activated for
285 particular boot by option <cf/-R/.
286
287
288 <chapt>Configuration
289 <label id="config">
290
291 <sect>Introduction
292 <label id="config-intro">
293
294 <p>BIRD is configured using a text configuration file. Upon startup, BIRD reads
295 <it/prefix/<file>/etc/bird.conf</file> (unless the <tt/-c/ command line option
296 is given). Configuration may be changed at user's request: if you modify the
297 config file and then signal BIRD with <tt/SIGHUP/, it will adjust to the new
298 config. Then there's the client which allows you to talk with BIRD in an
299 extensive way.
300
301 <p>In the config, everything on a line after <cf/#/ or inside <cf>/* */</cf> is
302 a comment, whitespace characters are treated as a single space. If there's a
303 variable number of options, they are grouped using the <cf/{ }/ brackets. Each
304 option is terminated by a <cf/;/. Configuration is case sensitive. There are two
305 ways how to name symbols (like protocol names, filter names, constants etc.). You
306 can either use a simple string starting with a letter followed by any
307 combination of letters and numbers (e.g. "R123", "myfilter", "bgp5") or you can
308 enclose the name into apostrophes (<cf/'/) and than you can use any combination
309 of numbers, letters. hyphens, dots and colons (e.g. "'1:strange-name'",
310 "'-NAME-'", "'cool::name'").
311
312 <p>Here is an example of a simple config file. It enables synchronization of
313 routing tables with OS kernel, scans for new network interfaces every 10 seconds
314 and runs RIP on all network interfaces found.
315
316 <code>
317 protocol kernel {
318 persist; # Don't remove routes on BIRD shutdown
319 scan time 20; # Scan kernel routing table every 20 seconds
320 export all; # Default is export none
321 }
322
323 protocol device {
324 scan time 10; # Scan interfaces every 10 seconds
325 }
326
327 protocol rip {
328 export all;
329 import all;
330 interface "*";
331 }
332 </code>
333
334
335 <sect>Global options
336 <label id="global-opts">
337
338 <p><descrip>
339 <tag><label id="opt-include">include "<m/filename/"</tag>
340 This statement causes inclusion of a new file. <m/Filename/ could also
341 be a wildcard, in that case matching files are included in alphabetic
342 order. The maximal depth is 8. Note that this statement could be used
343 anywhere in the config file, not just as a top-level option.
344
345 <tag><label id="opt-log">log "<m/filename/"|syslog [name <m/name/]|stderr all|{ <m/list of classes/ }</tag>
346 Set logging of messages having the given class (either <cf/all/ or
347 <cf/{ error|trace [, <m/.../] }/ etc.) into selected destination (a file specified
348 as a filename string, syslog with optional name argument, or the stderr
349 output). Classes are:
350 <cf/info/, <cf/warning/, <cf/error/ and <cf/fatal/ for messages about local problems,
351 <cf/debug/ for debugging messages,
352 <cf/trace/ when you want to know what happens in the network,
353 <cf/remote/ for messages about misbehavior of remote machines,
354 <cf/auth/ about authentication failures,
355 <cf/bug/ for internal BIRD bugs.
356 You may specify more than one <cf/log/ line to establish logging to
357 multiple destinations. Default: log everything to the system log.
358
359 <tag><label id="opt-debug-protocols">debug protocols all|off|{ states|routes|filters|interfaces|events|packets [, <m/.../] }</tag>
360 Set global defaults of protocol debugging options. See <cf/debug/ in the
361 following section. Default: off.
362
363 <tag><label id="opt-debug-commands">debug commands <m/number/</tag>
364 Control logging of client connections (0 for no logging, 1 for logging
365 of connects and disconnects, 2 and higher for logging of all client
366 commands). Default: 0.
367
368 <tag><label id="opt-debug-latency">debug latency <m/switch/</tag>
369 Activate tracking of elapsed time for internal events. Recent events
370 could be examined using <cf/dump events/ command. Default: off.
371
372 <tag><label id="opt-debug-latency-limit">debug latency limit <m/time/</tag>
373 If <cf/debug latency/ is enabled, this option allows to specify a limit
374 for elapsed time. Events exceeding the limit are logged. Default: 1 s.
375
376 <tag><label id="opt-watchdog-warn">watchdog warning <m/time/</tag>
377 Set time limit for I/O loop cycle. If one iteration took more time to
378 complete, a warning is logged. Default: 5 s.
379
380 <tag><label id="opt-watchdog-timeout">watchdog timeout <m/time/</tag>
381 Set time limit for I/O loop cycle. If the limit is breached, BIRD is
382 killed by abort signal. The timeout has effective granularity of
383 seconds, zero means disabled. Default: disabled (0).
384
385 <tag><label id="opt-mrtdump">mrtdump "<m/filename/"</tag>
386 Set MRTdump file name. This option must be specified to allow MRTdump
387 feature. Default: no dump file.
388
389 <tag><label id="opt-mrtdump-protocols">mrtdump protocols all|off|{ states|messages [, <m/.../] }</tag>
390 Set global defaults of MRTdump options. See <cf/mrtdump/ in the
391 following section. Default: off.
392
393 <tag><label id="opt-filter">filter <m/name local variables/{ <m/commands/ }</tag>
394 Define a filter. You can learn more about filters in the following
395 chapter.
396
397 <tag><label id="opt-function">function <m/name/ (<m/parameters/) <m/local variables/ { <m/commands/ }</tag>
398 Define a function. You can learn more about functions in the following chapter.
399
400 <tag><label id="opt-protocol">protocol rip|ospf|bgp|<m/.../ [<m/name/ [from <m/name2/]] { <m>protocol options</m> }</tag>
401 Define a protocol instance called <cf><m/name/</cf> (or with a name like
402 "rip5" generated automatically if you don't specify any
403 <cf><m/name/</cf>). You can learn more about configuring protocols in
404 their own chapters. When <cf>from <m/name2/</cf> expression is used,
405 initial protocol options are taken from protocol or template
406 <cf><m/name2/</cf> You can run more than one instance of most protocols
407 (like RIP or BGP). By default, no instances are configured.
408
409 <tag><label id="opt-template">template rip|bgp|<m/.../ [<m/name/ [from <m/name2/]] { <m>protocol options</m> }</tag>
410 Define a protocol template instance called <m/name/ (or with a name like
411 "bgp1" generated automatically if you don't specify any <m/name/).
412 Protocol templates can be used to group common options when many
413 similarly configured protocol instances are to be defined. Protocol
414 instances (and other templates) can use templates by using <cf/from/
415 expression and the name of the template. At the moment templates (and
416 <cf/from/ expression) are not implemented for OSPF protocol.
417
418 <tag><label id="opt-define">define <m/constant/ = <m/expression/</tag>
419 Define a constant. You can use it later in every place you could use a
420 value of the same type. Besides, there are some predefined numeric
421 constants based on /etc/iproute2/rt_* files. A list of defined constants
422 can be seen (together with other symbols) using 'show symbols' command.
423
424 <tag><label id="opt-router-id">router id <m/IPv4 address/</tag>
425 Set BIRD's router ID. It's a world-wide unique identification of your
426 router, usually one of router's IPv4 addresses. Default: in IPv4
427 version, the lowest IP address of a non-loopback interface. In IPv6
428 version, this option is mandatory.
429
430 <tag><label id="opt-router-id-from">router id from [-] [ "<m/mask/" ] [ <m/prefix/ ] [, <m/.../]</tag>
431 Set BIRD's router ID based on an IP address of an interface specified by
432 an interface pattern. The option is applicable for IPv4 version only.
433 See <ref id="proto-iface" name="interface"> section for detailed
434 description of interface patterns with extended clauses.
435
436 <tag><label id="opt-listen-bgp">listen bgp [address <m/address/] [port <m/port/] [dual]</tag>
437 This option allows to specify address and port where BGP protocol should
438 listen. It is global option as listening socket is common to all BGP
439 instances. Default is to listen on all addresses (0.0.0.0) and port 179.
440 In IPv6 mode, option <cf/dual/ can be used to specify that BGP socket
441 should accept both IPv4 and IPv6 connections (but even in that case,
442 BIRD would accept IPv6 routes only). Such behavior was default in older
443 versions of BIRD.
444
445 <tag><label id="opt-graceful-restart">graceful restart wait <m/number/</tag>
446 During graceful restart recovery, BIRD waits for convergence of routing
447 protocols. This option allows to specify a timeout for the recovery to
448 prevent waiting indefinitely if some protocols cannot converge. Default:
449 240 seconds.
450
451 <tag><label id="opt-timeformat">timeformat route|protocol|base|log "<m/format1/" [<m/limit/ "<m/format2/"]</tag>
452 This option allows to specify a format of date/time used by BIRD. The
453 first argument specifies for which purpose such format is used.
454 <cf/route/ is a format used in 'show route' command output,
455 <cf/protocol/ is used in 'show protocols' command output, <cf/base/ is
456 used for other commands and <cf/log/ is used in a log file.
457
458 "<m/format1/" is a format string using <it/strftime(3)/ notation (see
459 <it/man strftime/ for details). <m/limit> and "<m/format2/" allow to
460 specify the second format string for times in past deeper than <m/limit/
461 seconds. There are few shorthands: <cf/iso long/ is a ISO 8601 date/time
462 format (YYYY-MM-DD hh:mm:ss) that can be also specified using <cf/"%F %T"/.
463 <cf/iso short/ is a variant of ISO 8601 that uses just the time format
464 (hh:mm:ss) for near times (up to 20 hours in the past) and the date
465 format (YYYY-MM-DD) for far times. This is a shorthand for
466 <cf/"%T" 72000 "%F"/.
467
468 By default, BIRD uses the <cf/iso short/ format for <cf/route/ and
469 <cf/protocol/ times, and the <cf/iso long/ format for <cf/base/ and
470 <cf/log/ times.
471
472 In pre-1.4.0 versions, BIRD used an short, ad-hoc format for <cf/route/
473 and <cf/protocol/ times, and a <cf/iso long/ similar format (DD-MM-YYYY
474 hh:mm:ss) for <cf/base/ and <cf/log/. These timeformats could be set by
475 <cf/old short/ and <cf/old long/ compatibility shorthands.
476
477 <tag><label id="opt-table">table <m/name/ [sorted]</tag>
478 Create a new routing table. The default routing table is created
479 implicitly, other routing tables have to be added by this command.
480 Option <cf/sorted/ can be used to enable sorting of routes, see
481 <ref id="dsc-table-sorted" name="sorted table"> description for details.
482
483 <tag><label id="opt-roa-table">roa table <m/name/ [ { <m/roa table options .../ } ]</tag>
484 Create a new ROA (Route Origin Authorization) table. ROA tables can be
485 used to validate route origination of BGP routes. A ROA table contains
486 ROA entries, each consist of a network prefix, a max prefix length and
487 an AS number. A ROA entry specifies prefixes which could be originated
488 by that AS number. ROA tables could be filled with data from RPKI (<rfc
489 id="6480">) or from public databases like Whois. ROA tables are
490 examined by <cf/roa_check()/ operator in filters.
491
492 Currently, there is just one option, <cf>roa <m/prefix/ max <m/num/ as
493 <m/num/</cf>, which can be used to populate the ROA table with static
494 ROA entries. The option may be used multiple times. Other entries can be
495 added dynamically by <cf/add roa/ command.
496
497 <tag><label id="opt-eval">eval <m/expr/</tag>
498 Evaluates given filter expression. It is used by us for testing of filters.
499 </descrip>
500
501
502 <sect>Protocol options
503 <label id="protocol-opts">
504
505 <p>For each protocol instance, you can configure a bunch of options. Some of
506 them (those described in this section) are generic, some are specific to the
507 protocol (see sections talking about the protocols).
508
509 <p>Several options use a <m/switch/ argument. It can be either <cf/on/,
510 <cf/yes/ or a numeric expression with a non-zero value for the option to be
511 enabled or <cf/off/, <cf/no/ or a numeric expression evaluating to zero to
512 disable it. An empty <m/switch/ is equivalent to <cf/on/ ("silence means
513 agreement").
514
515 <descrip>
516 <tag><label id="proto-preference">preference <m/expr/</tag>
517 Sets the preference of routes generated by this protocol. Default:
518 protocol dependent.
519
520 <tag><label id="proto-disabled">disabled <m/switch/</tag>
521 Disables the protocol. You can change the disable/enable status from the
522 command line interface without needing to touch the configuration.
523 Disabled protocols are not activated. Default: protocol is enabled.
524
525 <tag><label id="proto-debug">debug all|off|{ states|routes|filters|interfaces|events|packets [, <m/.../] }</tag>
526 Set protocol debugging options. If asked, each protocol is capable of
527 writing trace messages about its work to the log (with category
528 <cf/trace/). You can either request printing of <cf/all/ trace messages
529 or only of the types selected: <cf/states/ for protocol state changes
530 (protocol going up, down, starting, stopping etc.), <cf/routes/ for
531 routes exchanged with the routing table, <cf/filters/ for details on
532 route filtering, <cf/interfaces/ for interface change events sent to the
533 protocol, <cf/events/ for events internal to the protocol and <cf/packets/
534 for packets sent and received by the protocol. Default: off.
535
536 <tag><label id="proto-mrtdump">mrtdump all|off|{ states|messages [, <m/.../] }</tag>
537 Set protocol MRTdump flags. MRTdump is a standard binary format for
538 logging information from routing protocols and daemons. These flags
539 control what kind of information is logged from the protocol to the
540 MRTdump file (which must be specified by global <cf/mrtdump/ option, see
541 the previous section). Although these flags are similar to flags of
542 <cf/debug/ option, their meaning is different and protocol-specific. For
543 BGP protocol, <cf/states/ logs BGP state changes and <cf/messages/ logs
544 received BGP messages. Other protocols does not support MRTdump yet.
545
546 <tag><label id="proto-router-id">router id <m/IPv4 address/</tag>
547 This option can be used to override global router id for a given
548 protocol. Default: uses global router id.
549
550 <tag><label id="proto-import">import all | none | filter <m/name/ | filter { <m/filter commands/ } | where <m/filter expression/</tag>
551 Specify a filter to be used for filtering routes coming from the
552 protocol to the routing table. <cf/all/ is shorthand for <cf/where true/
553 and <cf/none/ is shorthand for <cf/where false/. Default: <cf/all/.
554
555 <tag><label id="proto-export">export <m/filter/</tag>
556 This is similar to the <cf>import</cf> keyword, except that it works in
557 the direction from the routing table to the protocol. Default: <cf/none/.
558
559 <tag><label id="proto-import-keep-filtered">import keep filtered <m/switch/</tag>
560 Usually, if an import filter rejects a route, the route is forgotten.
561 When this option is active, these routes are kept in the routing table,
562 but they are hidden and not propagated to other protocols. But it is
563 possible to show them using <cf/show route filtered/. Note that this
564 option does not work for the pipe protocol. Default: off.
565
566 <tag><label id="proto-import-limit">import limit [<m/number/ | off ] [action warn | block | restart | disable]</tag>
567 Specify an import route limit (a maximum number of routes imported from
568 the protocol) and optionally the action to be taken when the limit is
569 hit. Warn action just prints warning log message. Block action discards
570 new routes coming from the protocol. Restart and disable actions shut
571 the protocol down like appropriate commands. Disable is the default
572 action if an action is not explicitly specified. Note that limits are
573 reset during protocol reconfigure, reload or restart. Default: <cf/off/.
574
575 <tag><label id="proto-receive-limit">receive limit [<m/number/ | off ] [action warn | block | restart | disable]</tag>
576 Specify an receive route limit (a maximum number of routes received from
577 the protocol and remembered). It works almost identically to <cf>import
578 limit</cf> option, the only difference is that if <cf/import keep
579 filtered/ option is active, filtered routes are counted towards the
580 limit and blocked routes are forgotten, as the main purpose of the
581 receive limit is to protect routing tables from overflow. Import limit,
582 on the contrary, counts accepted routes only and routes blocked by the
583 limit are handled like filtered routes. Default: <cf/off/.
584
585 <tag><label id="proto-export-limit">export limit [ <m/number/ | off ] [action warn | block | restart | disable]</tag>
586 Specify an export route limit, works similarly to the <cf>import
587 limit</cf> option, but for the routes exported to the protocol. This
588 option is experimental, there are some problems in details of its
589 behavior -- the number of exported routes can temporarily exceed the
590 limit without triggering it during protocol reload, exported routes
591 counter ignores route blocking and block action also blocks route
592 updates of already accepted routes -- and these details will probably
593 change in the future. Default: <cf/off/.
594
595 <tag><label id="proto-description">description "<m/text/"</tag>
596 This is an optional description of the protocol. It is displayed as a
597 part of the output of 'show route all' command.
598
599 <tag><label id="proto-table">table <m/name/</tag>
600 Connect this protocol to a non-default routing table.
601
602 <tag><label id="proto-vrf">vrf "<m/text/"</tag>
603 Associate the protocol with specific VRF. The protocol will be
604 restricted to interfaces assigned to the VRF and will use sockets bound
605 to the VRF. Appropriate VRF interface must exist on OS level. For kernel
606 protocol, an appropriate table still must be explicitly selected by
607 <cf/table/ option. Note that the VRF support in BIRD and Linux kernel
608 (4.11) is still in development and is currently problematic outside of
609 multihop BGP.
610 </descrip>
611
612 <p>There are several options that give sense only with certain protocols:
613
614 <descrip>
615 <tag><label id="proto-iface">interface [-] [ "<m/mask/" ] [ <m/prefix/ ] [, <m/.../] [ { <m/option/; [<m/.../] } ]</tag>
616 Specifies a set of interfaces on which the protocol is activated with
617 given interface-specific options. A set of interfaces specified by one
618 interface option is described using an interface pattern. The interface
619 pattern consists of a sequence of clauses (separated by commas), each
620 clause is a mask specified as a shell-like pattern. Interfaces are
621 matched by their name.
622
623 An interface matches the pattern if it matches any of its clauses. If
624 the clause begins with <cf/-/, matching interfaces are excluded. Patterns
625 are processed left-to-right, thus <cf/interface "eth0", -"eth*", "*";/
626 means eth0 and all non-ethernets.
627
628 Some protocols (namely OSPFv2 and Direct) support extended clauses that
629 may contain a mask, a prefix, or both of them. An interface matches such
630 clause if its name matches the mask (if specified) and its address
631 matches the prefix (if specified). Extended clauses are used when the
632 protocol handles multiple addresses on an interface independently.
633
634 An interface option can be used more times with different interface-specific
635 options, in that case for given interface the first matching interface
636 option is used.
637
638 This option is allowed in Babel, BFD, Direct, OSPF, RAdv and RIP
639 protocols, but in OSPF protocol it is used in the <cf/area/ subsection.
640
641 Default: none.
642
643 Examples:
644
645 <cf>interface "*" { type broadcast; };</cf> - start the protocol on all
646 interfaces with <cf>type broadcast</cf> option.
647
648 <cf>interface "eth1", "eth4", "eth5" { type ptp; };</cf> - start the
649 protocol on enumerated interfaces with <cf>type ptp</cf> option.
650
651 <cf>interface -192.168.1.0/24, 192.168.0.0/16;</cf> - start the protocol
652 on all interfaces that have address from 192.168.0.0/16, but not from
653 192.168.1.0/24.
654
655 <cf>interface -192.168.1.0/24, 192.168.0.0/16;</cf> - start the protocol
656 on all interfaces that have address from 192.168.0.0/16, but not from
657 192.168.1.0/24.
658
659 <cf>interface "eth*" 192.168.1.0/24;</cf> - start the protocol on all
660 ethernet interfaces that have address from 192.168.1.0/24.
661
662 <tag><label id="proto-tx-class">tx class|dscp <m/num/</tag>
663 This option specifies the value of ToS/DS/Class field in IP headers of
664 the outgoing protocol packets. This may affect how the protocol packets
665 are processed by the network relative to the other network traffic. With
666 <cf/class/ keyword, the value (0-255) is used for the whole ToS/Class
667 octet (but two bits reserved for ECN are ignored). With <cf/dscp/
668 keyword, the value (0-63) is used just for the DS field in the octet.
669 Default value is 0xc0 (DSCP 0x30 - CS6).
670
671 <tag><label id="proto-tx-priority">tx priority <m/num/</tag>
672 This option specifies the local packet priority. This may affect how the
673 protocol packets are processed in the local TX queues. This option is
674 Linux specific. Default value is 7 (highest priority, privileged traffic).
675
676 <tag><label id="proto-pass">password "<m/password/" [ { <m>password options</m> } ]</tag>
677 Specifies a password that can be used by the protocol as a shared secret
678 key. Password option can be used more times to specify more passwords.
679 If more passwords are specified, it is a protocol-dependent decision
680 which one is really used. Specifying passwords does not mean that
681 authentication is enabled, authentication can be enabled by separate,
682 protocol-dependent <cf/authentication/ option.
683
684 This option is allowed in BFD, OSPF and RIP protocols. BGP has also
685 <cf/password/ option, but it is slightly different and described
686 separately.
687 Default: none.
688 </descrip>
689
690 <p>Password option can contain section with some (not necessary all) password sub-options:
691
692 <descrip>
693 <tag><label id="proto-pass-id">id <M>num</M></tag>
694 ID of the password, (1-255). If it is not used, BIRD will choose ID based
695 on an order of the password item in the interface. For example, second
696 password item in one interface will have default ID 2. ID is used by
697 some routing protocols to identify which password was used to
698 authenticate protocol packets.
699
700 <tag><label id="proto-pass-gen-from">generate from "<m/time/"</tag>
701 The start time of the usage of the password for packet signing.
702 The format of <cf><m/time/</cf> is <tt>dd-mm-yyyy HH:MM:SS</tt>.
703
704 <tag><label id="proto-pass-gen-to">generate to "<m/time/"</tag>
705 The last time of the usage of the password for packet signing.
706
707 <tag><label id="proto-pass-accept-from">accept from "<m/time/"</tag>
708 The start time of the usage of the password for packet verification.
709
710 <tag><label id="proto-pass-accept-to">accept to "<m/time/"</tag>
711 The last time of the usage of the password for packet verification.
712
713 <tag><label id="proto-pass-from">from "<m/time/"</tag>
714 Shorthand for setting both <cf/generate from/ and <cf/accept from/.
715
716 <tag><label id="proto-pass-to">to "<m/time/"</tag>
717 Shorthand for setting both <cf/generate to/ and <cf/accept to/.
718
719 <tag><label id="proto-pass-algorithm">algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 )</tag>
720 The message authentication algorithm for the password when cryptographic
721 authentication is enabled. The default value depends on the protocol.
722 For RIP and OSPFv2 it is Keyed-MD5 (for compatibility), for OSPFv3
723 protocol it is HMAC-SHA-256.
724
725 </descrip>
726
727 <chapt>Remote control
728 <label id="remote-control">
729
730 <p>You can use the command-line client <file>birdc</file> to talk with a running
731 BIRD. Communication is done using a <file/bird.ctl/ UNIX domain socket (unless
732 changed with the <tt/-s/ option given to both the server and the client). The
733 commands can perform simple actions such as enabling/disabling of protocols,
734 telling BIRD to show various information, telling it to show routing table
735 filtered by filter, or asking BIRD to reconfigure. Press <tt/?/ at any time to
736 get online help. Option <tt/-r/ can be used to enable a restricted mode of BIRD
737 client, which allows just read-only commands (<cf/show .../). Option <tt/-v/ can
738 be passed to the client, to make it dump numeric return codes along with the
739 messages. You do not necessarily need to use <file/birdc/ to talk to BIRD, your
740 own applications could do that, too -- the format of communication between BIRD
741 and <file/birdc/ is stable (see the programmer's documentation).
742
743 <p>There is also lightweight variant of BIRD client called <file/birdcl/, which
744 does not support command line editing and history and has minimal dependencies.
745 This is useful for running BIRD in resource constrained environments, where
746 Readline library (required for regular BIRD client) is not available.
747
748 <p>Many commands have the <m/name/ of the protocol instance as an argument.
749 This argument can be omitted if there exists only a single instance.
750
751 <p>Here is a brief list of supported functions:
752
753 <descrip>
754 <tag><label id="cli-show-status">show status</tag>
755 Show router status, that is BIRD version, uptime and time from last
756 reconfiguration.
757
758 <tag><label id="cli-show-interfaces">show interfaces [summary]</tag>
759 Show the list of interfaces. For each interface, print its type, state,
760 MTU and addresses assigned.
761
762 <tag><label id="cli-show-protocols">show protocols [all]</tag>
763 Show list of protocol instances along with tables they are connected to
764 and protocol status, possibly giving verbose information, if <cf/all/ is
765 specified.
766
767 <tag><label id="cli-show-ospf-iface">show ospf interface [<m/name/] ["<m/interface/"]</tag>
768 Show detailed information about OSPF interfaces.
769
770 <tag><label id="cli-show-ospf-neighbors">show ospf neighbors [<m/name/] ["<m/interface/"]</tag>
771 Show a list of OSPF neighbors and a state of adjacency to them.
772
773 <tag><label id="cli-show-ospf-state">show ospf state [all] [<m/name/]</tag>
774 Show detailed information about OSPF areas based on a content of the
775 link-state database. It shows network topology, stub networks,
776 aggregated networks and routers from other areas and external routes.
777 The command shows information about reachable network nodes, use option
778 <cf/all/ to show information about all network nodes in the link-state
779 database.
780
781 <tag><label id="cli-show-ospf-topology">show ospf topology [all] [<m/name/]</tag>
782 Show a topology of OSPF areas based on a content of the link-state
783 database. It is just a stripped-down version of 'show ospf state'.
784
785 <tag><label id="cli-show-ospf-lsadb">show ospf lsadb [global | area <m/id/ | link] [type <m/num/] [lsid <m/id/] [self | router <m/id/] [<m/name/] </tag>
786 Show contents of an OSPF LSA database. Options could be used to filter
787 entries.
788
789 <tag><label id="cli-show-rip-interfaces">show rip interfaces [<m/name/] ["<m/interface/"]</tag>
790 Show detailed information about RIP interfaces.
791
792 <tag><label id="cli-show-rip-neighbors">show rip neighbors [<m/name/] ["<m/interface/"]</tag>
793 Show a list of RIP neighbors and associated state.
794
795 <tag><label id="cli-show-static">show static [<m/name/]</tag>
796 Show detailed information about static routes.
797
798 <tag><label id="cli-show-bfd-sessions">show bfd sessions [<m/name/]</tag>
799 Show information about BFD sessions.
800
801 <tag><label id="cli-show-symbols">show symbols [table|filter|function|protocol|template|roa|<m/symbol/]</tag>
802 Show the list of symbols defined in the configuration (names of
803 protocols, routing tables etc.).
804
805 <tag><label id="cli-show-route">show route [[for] <m/prefix/|<m/IP/] [table <m/t/] [filter <m/f/|where <m/c/] [(export|preexport|noexport) <m/p/] [protocol <m/p/] [<m/options/]</tag>
806 Show contents of a routing table (by default of the main one or the
807 table attached to a respective protocol), that is routes, their metrics
808 and (in case the <cf/all/ switch is given) all their attributes.
809
810 <p>You can specify a <m/prefix/ if you want to print routes for a
811 specific network. If you use <cf>for <m/prefix or IP/</cf>, you'll get
812 the entry which will be used for forwarding of packets to the given
813 destination. By default, all routes for each network are printed with
814 the selected one at the top, unless <cf/primary/ is given in which case
815 only the selected route is shown.
816
817 <p>You can also ask for printing only routes processed and accepted by
818 a given filter (<cf>filter <m/name/</cf> or <cf>filter { <m/filter/ }
819 </cf> or matching a given condition (<cf>where <m/condition/</cf>).
820
821 The <cf/export/, <cf/preexport/ and <cf/noexport/ switches ask for
822 printing of routes that are exported to the specified protocol.
823 With <cf/preexport/, the export filter of the protocol is skipped.
824 With <cf/noexport/, routes rejected by the export filter are printed
825 instead. Note that routes not exported to the protocol for other reasons
826 (e.g. secondary routes or routes imported from that protocol) are not
827 printed even with <cf/noexport/.
828
829 <p>You can also select just routes added by a specific protocol.
830 <cf>protocol <m/p/</cf>.
831
832 <p>If BIRD is configured to keep filtered routes (see <cf/import keep
833 filtered/ option), you can show them instead of routes by using
834 <cf/filtered/ switch.
835
836 <p>The <cf/stats/ switch requests showing of route statistics (the
837 number of networks, number of routes before and after filtering). If
838 you use <cf/count/ instead, only the statistics will be printed.
839
840 <tag><label id="cli-show-roa">show roa [<m/prefix/ | in <m/prefix/ | for <m/prefix/] [as <m/num/] [table <m/t/]</tag>
841 Show contents of a ROA table (by default of the first one). You can
842 specify a <m/prefix/ to print ROA entries for a specific network. If you
843 use <cf>for <m/prefix/</cf>, you'll get all entries relevant for route
844 validation of the network prefix; i.e., ROA entries whose prefixes cover
845 the network prefix. Or you can use <cf>in <m/prefix/</cf> to get ROA
846 entries covered by the network prefix. You could also use <cf/as/ option
847 to show just entries for given AS.
848
849 <tag><label id="cli-add-roa">add roa <m/prefix/ max <m/num/ as <m/num/ [table <m/t/]</tag>
850 Add a new ROA entry to a ROA table. Such entry is called <it/dynamic/
851 compared to <it/static/ entries specified in the config file. These
852 dynamic entries survive reconfiguration.
853
854 <tag><label id="cli-delete-roa">delete roa <m/prefix/ max <m/num/ as <m/num/ [table <m/t/]</tag>
855 Delete the specified ROA entry from a ROA table. Only dynamic ROA
856 entries (i.e., the ones added by <cf/add roa/ command) can be deleted.
857
858 <tag><label id="cli-flush-roa">flush roa [table <m/t/]</tag>
859 Remove all dynamic ROA entries from a ROA table.
860
861 <tag><label id="cli-configure">configure [soft] ["<m/config file/"] [timeout [<m/num/]]</tag>
862 Reload configuration from a given file. BIRD will smoothly switch itself
863 to the new configuration, protocols are reconfigured if possible,
864 restarted otherwise. Changes in filters usually lead to restart of
865 affected protocols.
866
867 If <cf/soft/ option is used, changes in filters does not cause BIRD to
868 restart affected protocols, therefore already accepted routes (according
869 to old filters) would be still propagated, but new routes would be
870 processed according to the new filters.
871
872 If <cf/timeout/ option is used, config timer is activated. The new
873 configuration could be either confirmed using <cf/configure confirm/
874 command, or it will be reverted to the old one when the config timer
875 expires. This is useful for cases when reconfiguration breaks current
876 routing and a router becomes inaccessible for an administrator. The
877 config timeout expiration is equivalent to <cf/configure undo/
878 command. The timeout duration could be specified, default is 300 s.
879
880 <tag><label id="cli-configure-confirm">configure confirm</tag>
881 Deactivate the config undo timer and therefore confirm the current
882 configuration.
883
884 <tag><label id="cli-configure-undo">configure undo</tag>
885 Undo the last configuration change and smoothly switch back to the
886 previous (stored) configuration. If the last configuration change was
887 soft, the undo change is also soft. There is only one level of undo, but
888 in some specific cases when several reconfiguration requests are given
889 immediately in a row and the intermediate ones are skipped then the undo
890 also skips them back.
891
892 <tag><label id="cli-configure-check">configure check ["<m/config file/"]</tag>
893 Read and parse given config file, but do not use it. useful for checking
894 syntactic and some semantic validity of an config file.
895
896 <tag><label id="cli-enable-disable-restart">enable|disable|restart <m/name/|"<m/pattern/"|all</tag>
897 Enable, disable or restart a given protocol instance, instances matching
898 the <cf><m/pattern/</cf> or <cf/all/ instances.
899
900 <tag><label id="cli-reload">reload [in|out] <m/name/|"<m/pattern/"|all</tag>
901 Reload a given protocol instance, that means re-import routes from the
902 protocol instance and re-export preferred routes to the instance. If
903 <cf/in/ or <cf/out/ options are used, the command is restricted to one
904 direction (re-import or re-export).
905
906 This command is useful if appropriate filters have changed but the
907 protocol instance was not restarted (or reloaded), therefore it still
908 propagates the old set of routes. For example when <cf/configure soft/
909 command was used to change filters.
910
911 Re-export always succeeds, but re-import is protocol-dependent and might
912 fail (for example, if BGP neighbor does not support route-refresh
913 extension). In that case, re-export is also skipped. Note that for the
914 pipe protocol, both directions are always reloaded together (<cf/in/ or
915 <cf/out/ options are ignored in that case).
916
917 <tag><label id="cli-down">down</tag>
918 Shut BIRD down.
919
920 <tag><label id="cli-debug">debug <m/protocol/|<m/pattern/|all all|off|{ states|routes|filters|events|packets [, <m/.../] }</tag>
921 Control protocol debugging.
922
923 <tag><label id="cli-dump">dump resources|sockets|interfaces|neighbors|attributes|routes|protocols</tag>
924 Dump contents of internal data structures to the debugging output.
925
926 <tag><label id="cli-echo">echo all|off|{ <m/list of log classes/ } [ <m/buffer-size/ ]</tag>
927 Control echoing of log messages to the command-line output.
928 See <ref id="opt-log" name="log option"> for a list of log classes.
929
930 <tag><label id="cli-eval">eval <m/expr/</tag>
931 Evaluate given expression.
932 </descrip>
933
934
935 <chapt>Filters
936 <label id="filters">
937
938 <sect>Introduction
939 <label id="filters-intro">
940
941 <p>BIRD contains a simple programming language. (No, it can't yet read mail :-).
942 There are two objects in this language: filters and functions. Filters are
943 interpreted by BIRD core when a route is being passed between protocols and
944 routing tables. The filter language contains control structures such as if's and
945 switches, but it allows no loops. An example of a filter using many features can
946 be found in <file>filter/test.conf</file>.
947
948 <p>Filter gets the route, looks at its attributes and modifies some of them if
949 it wishes. At the end, it decides whether to pass the changed route through
950 (using <cf/accept/) or whether to <cf/reject/ it. A simple filter looks like
951 this:
952
953 <code>
954 filter not_too_far
955 int var;
956 {
957 if defined( rip_metric ) then
958 var = rip_metric;
959 else {
960 var = 1;
961 rip_metric = 1;
962 }
963 if rip_metric &gt; 10 then
964 reject "RIP metric is too big";
965 else
966 accept "ok";
967 }
968 </code>
969
970 <p>As you can see, a filter has a header, a list of local variables, and a body.
971 The header consists of the <cf/filter/ keyword followed by a (unique) name of
972 filter. The list of local variables consists of <cf><M>type name</M>;</cf>
973 pairs where each pair defines one local variable. The body consists of <cf>
974 { <M>statements</M> }</cf>. Each <m/statement/ is terminated by a <cf/;/. You
975 can group several statements to a single compound statement by using braces
976 (<cf>{ <M>statements</M> }</cf>) which is useful if you want to make a bigger
977 block of code conditional.
978
979 <p>BIRD supports functions, so that you don't have to repeat the same blocks of
980 code over and over. Functions can have zero or more parameters and they can have
981 local variables. Recursion is not allowed. Function definitions look like this:
982
983 <code>
984 function name ()
985 int local_variable;
986 {
987 local_variable = 5;
988 }
989
990 function with_parameters (int parameter)
991 {
992 print parameter;
993 }
994 </code>
995
996 <p>Unlike in C, variables are declared after the <cf/function/ line, but before
997 the first <cf/{/. You can't declare variables in nested blocks. Functions are
998 called like in C: <cf>name(); with_parameters(5);</cf>. Function may return
999 values using the <cf>return <m/[expr]/</cf> command. Returning a value exits
1000 from current function (this is similar to C).
1001
1002 <p>Filters are declared in a way similar to functions except they can't have
1003 explicit parameters. They get a route table entry as an implicit parameter, it
1004 is also passed automatically to any functions called. The filter must terminate
1005 with either <cf/accept/ or <cf/reject/ statement. If there's a runtime error in
1006 filter, the route is rejected.
1007
1008 <p>A nice trick to debug filters is to use <cf>show route filter <m/name/</cf>
1009 from the command line client. An example session might look like:
1010
1011 <code>
1012 pavel@bug:~/bird$ ./birdc -s bird.ctl
1013 BIRD 0.0.0 ready.
1014 bird> show route
1015 10.0.0.0/8 dev eth0 [direct1 23:21] (240)
1016 195.113.30.2/32 dev tunl1 [direct1 23:21] (240)
1017 127.0.0.0/8 dev lo [direct1 23:21] (240)
1018 bird> show route ?
1019 show route [<prefix>] [table <t>] [filter <f>] [all] [primary]...
1020 bird> show route filter { if 127.0.0.5 &tilde; net then accept; }
1021 127.0.0.0/8 dev lo [direct1 23:21] (240)
1022 bird>
1023 </code>
1024
1025
1026 <sect>Data types
1027 <label id="data-types">
1028
1029 <p>Each variable and each value has certain type. Booleans, integers and enums
1030 are incompatible with each other (that is to prevent you from shooting in the
1031 foot).
1032
1033 <descrip>
1034 <tag><label id="type-bool">bool</tag>
1035 This is a boolean type, it can have only two values, <cf/true/ and
1036 <cf/false/. Boolean is the only type you can use in <cf/if/ statements.
1037
1038 <tag><label id="type-int">int</tag>
1039 This is a general integer type. It is an unsigned 32bit type; i.e., you
1040 can expect it to store values from 0 to 4294967295. Overflows are not
1041 checked. You can use <cf/0x1234/ syntax to write hexadecimal values.
1042
1043 <tag><label id="type-pair">pair</tag>
1044 This is a pair of two short integers. Each component can have values
1045 from 0 to 65535. Literals of this type are written as <cf/(1234,5678)/.
1046 The same syntax can also be used to construct a pair from two arbitrary
1047 integer expressions (for example <cf/(1+2,a)/).
1048
1049 <tag><label id="type-quad">quad</tag>
1050 This is a dotted quad of numbers used to represent router IDs (and
1051 others). Each component can have a value from 0 to 255. Literals of
1052 this type are written like IPv4 addresses.
1053
1054 <tag><label id="type-string">string</tag>
1055 This is a string of characters. There are no ways to modify strings in
1056 filters. You can pass them between functions, assign them to variables
1057 of type <cf/string/, print such variables, use standard string
1058 comparison operations (e.g. <cf/=, !=, &lt;, &gt;, &lt;=, &gt;=/), but
1059 you can't concatenate two strings. String literals are written as
1060 <cf/"This is a string constant"/. Additionally matching (<cf/&tilde;,
1061 !&tilde;/) operators could be used to match a string value against
1062 a shell pattern (represented also as a string).
1063
1064 <tag><label id="type-ip">ip</tag>
1065 This type can hold a single IP address. Depending on the compile-time
1066 configuration of BIRD you are using, it is either an IPv4 or IPv6
1067 address. IP addresses are written in the standard notation
1068 (<cf/10.20.30.40/ or <cf/fec0:3:4::1/). You can apply special operator
1069 <cf>.mask(<M>num</M>)</cf> on values of type ip. It masks out all but
1070 first <cf><M>num</M></cf> bits from the IP address. So
1071 <cf/1.2.3.4.mask(8) = 1.0.0.0/ is true.
1072
1073 <tag><label id="type-prefix">prefix</tag>
1074 This type can hold a network prefix consisting of IP address and prefix
1075 length. Prefix literals are written as <cf><m/ipaddress//<m/pxlen/</cf>,
1076 or <cf><m>ipaddress</m>/<m>netmask</m></cf>. There are two special
1077 operators on prefixes: <cf/.ip/ which extracts the IP address from the
1078 pair, and <cf/.len/, which separates prefix length from the pair.
1079 So <cf>1.2.0.0/16.len = 16</cf> is true.
1080
1081 <tag><label id="type-ec">ec</tag>
1082 This is a specialized type used to represent BGP extended community
1083 values. It is essentially a 64bit value, literals of this type are
1084 usually written as <cf>(<m/kind/, <m/key/, <m/value/)</cf>, where
1085 <cf/kind/ is a kind of extended community (e.g. <cf/rt/ / <cf/ro/ for a
1086 route target / route origin communities), the format and possible values
1087 of <cf/key/ and <cf/value/ are usually integers, but it depends on the
1088 used kind. Similarly to pairs, ECs can be constructed using expressions
1089 for <cf/key/ and <cf/value/ parts, (e.g. <cf/(ro, myas, 3*10)/, where
1090 <cf/myas/ is an integer variable).
1091
1092 <tag><label id="type-lc">lc</tag>
1093 This is a specialized type used to represent BGP large community
1094 values. It is essentially a triplet of 32bit values, where the first
1095 value is reserved for the AS number of the issuer, while meaning of
1096 remaining parts is defined by the issuer. Literals of this type are
1097 written as <cf/(123, 456, 789)/, with any integer values. Similarly to
1098 pairs, LCs can be constructed using expressions for its parts, (e.g.
1099 <cf/(myas, 10+20, 3*10)/, where <cf/myas/ is an integer variable).
1100
1101 <tag><label id="type-set">int|pair|quad|ip|prefix|ec|lc|enum set</tag>
1102 Filters recognize four types of sets. Sets are similar to strings: you
1103 can pass them around but you can't modify them. Literals of type <cf>int
1104 set</cf> look like <cf> [ 1, 2, 5..7 ]</cf>. As you can see, both simple
1105 values and ranges are permitted in sets.
1106
1107 For pair sets, expressions like <cf/(123,*)/ can be used to denote
1108 ranges (in that case <cf/(123,0)..(123,65535)/). You can also use
1109 <cf/(123,5..100)/ for range <cf/(123,5)..(123,100)/. You can also use
1110 <cf/*/ and <cf/a..b/ expressions in the first part of a pair, note that
1111 such expressions are translated to a set of intervals, which may be
1112 memory intensive. E.g. <cf/(*,4..20)/ is translated to <cf/(0,4..20),
1113 (1,4..20), (2,4..20), ... (65535, 4..20)/.
1114
1115 EC sets use similar expressions like pair sets, e.g. <cf/(rt, 123,
1116 10..20)/ or <cf/(ro, 123, *)/. Expressions requiring the translation
1117 (like <cf/(rt, *, 3)/) are not allowed (as they usually have 4B range
1118 for ASNs).
1119
1120 Also LC sets use similar expressions like pair sets. You can use ranges
1121 and wildcards, but if one field uses that, more specific (later) fields
1122 must be wildcards. E.g., <cf/(10, 20..30, *)/ or <cf/(10, 20, 30..40)/
1123 is valid, while <cf/(10, *, 20..30)/ or <cf/(10, 20..30, 40)/ is not
1124 valid.
1125
1126 You can also use expressions for int, pair, EC and LC set values.
1127 However, it must be possible to evaluate these expressions before daemon
1128 boots. So you can use only constants inside them. E.g.
1129
1130 <code>
1131 define one=1;
1132 define myas=64500;
1133 int set odds;
1134 pair set ps;
1135 ec set es;
1136
1137 odds = [ one, 2+1, 6-one, 2*2*2-1, 9, 11 ];
1138 ps = [ (1,one+one), (3,4)..(4,8), (5,*), (6,3..6), (7..9,*) ];
1139 es = [ (rt, myas, 3*10), (rt, myas+one, 0..16*16*16-1), (ro, myas+2, *) ];
1140 </code>
1141
1142 Sets of prefixes are special: their literals does not allow ranges, but
1143 allows prefix patterns that are written
1144 as <cf><M>ipaddress</M>/<M>pxlen</M>{<M>low</M>,<M>high</M>}</cf>.
1145 Prefix <cf><m>ip1</m>/<m>len1</m></cf> matches prefix
1146 pattern <cf><m>ip2</m>/<m>len2</m>{<m>l</m>,<m>h</m>}</cf> if the
1147 first <cf>min(len1, len2)</cf> bits of <cf/ip1/ and <cf/ip2/ are
1148 identical and <cf>len1 &lt;= ip1 &lt;= len2</cf>. A valid prefix pattern
1149 has to satisfy <cf>low &lt;= high</cf>, but <cf/pxlen/ is not
1150 constrained by <cf/low/ or <cf/high/. Obviously, a prefix matches a
1151 prefix set literal if it matches any prefix pattern in the prefix set
1152 literal.
1153
1154 There are also two shorthands for prefix patterns: <cf><m/address//<m/len/+</cf>
1155 is a shorthand for <cf><m/address//<m/len/{<m/len/,<m/maxlen/}</cf>
1156 (where <cf><m/maxlen/</cf> is 32 for IPv4 and 128 for IPv6), that means
1157 network prefix <cf><m/address//<m/len/</cf> and all its subnets.
1158 <cf><m/address//<m/len/-</cf> is a shorthand for
1159 <cf><m/address//<m/len/{0,<m/len/}</cf>, that means network prefix
1160 <cf><m/address//<m/len/</cf> and all its supernets (network prefixes
1161 that contain it).
1162
1163 For example, <cf>[ 1.0.0.0/8, 2.0.0.0/8+, 3.0.0.0/8-, 4.0.0.0/8{16,24}
1164 ]</cf> matches prefix <cf>1.0.0.0/8</cf>, all subprefixes of
1165 <cf>2.0.0.0/8</cf>, all superprefixes of <cf>3.0.0.0/8</cf> and prefixes
1166 <cf/4.X.X.X/ whose prefix length is 16 to 24. <cf>[ 0.0.0.0/0{20,24} ]</cf>
1167 matches all prefixes (regardless of IP address) whose prefix length is
1168 20 to 24, <cf>[ 1.2.3.4/32- ]</cf> matches any prefix that contains IP
1169 address <cf>1.2.3.4</cf>. <cf>1.2.0.0/16 &tilde; [ 1.0.0.0/8{15,17} ]</cf>
1170 is true, but <cf>1.0.0.0/16 &tilde; [ 1.0.0.0/8- ]</cf> is false.
1171
1172 Cisco-style patterns like <cf>10.0.0.0/8 ge 16 le 24</cf> can be expressed
1173 in BIRD as <cf>10.0.0.0/8{16,24}</cf>, <cf>192.168.0.0/16 le 24</cf> as
1174 <cf>192.168.0.0/16{16,24}</cf> and <cf>192.168.0.0/16 ge 24</cf> as
1175 <cf>192.168.0.0/16{24,32}</cf>.
1176
1177 <tag><label id="type-enum">enum</tag>
1178 Enumeration types are fixed sets of possibilities. You can't define your
1179 own variables of such type, but some route attributes are of enumeration
1180 type. Enumeration types are incompatible with each other.
1181
1182 <tag><label id="type-bgppath">bgppath</tag>
1183 BGP path is a list of autonomous system numbers. You can't write
1184 literals of this type. There are several special operators on bgppaths:
1185
1186 <cf><m/P/.first</cf> returns the first ASN (the neighbor ASN) in path <m/P/.
1187
1188 <cf><m/P/.last</cf> returns the last ASN (the source ASN) in path <m/P/.
1189
1190 <cf><m/P/.last_nonaggregated</cf> returns the last ASN in the non-aggregated part of the path <m/P/.
1191
1192 Both <cf/first/ and <cf/last/ return zero if there is no appropriate
1193 ASN, for example if the path contains an AS set element as the first (or
1194 the last) part. If the path ends with an AS set, <cf/last_nonaggregated/
1195 may be used to get last ASN before any AS set.
1196
1197 <cf><m/P/.len</cf> returns the length of path <m/P/.
1198
1199 <cf><m/P/.empty</cf> resets path <m/P/ to empty path.
1200
1201 <cf>prepend(<m/P/,<m/A/)</cf> prepends ASN <m/A/ to path <m/P/ and
1202 returns the result.
1203
1204 <cf>delete(<m/P/,<m/A/)</cf> deletes all instances of ASN <m/A/ from
1205 from path <m/P/ and returns the result. <m/A/ may also be an integer
1206 set, in that case the operator deletes all ASNs from path <m/P/ that are
1207 also members of set <m/A/.
1208
1209 <cf>filter(<m/P/,<m/A/)</cf> deletes all ASNs from path <m/P/ that are
1210 not members of integer set <m/A/. I.e., <cf/filter/ do the same as
1211 <cf/delete/ with inverted set <m/A/.
1212
1213 Statement <cf><m/P/ = prepend(<m/P/, <m/A/);</cf> can be shortened to
1214 <cf><m/P/.prepend(<m/A/);</cf> if <m/P/ is appropriate route attribute
1215 (for example <cf/bgp_path/). Similarly for <cf/delete/ and <cf/filter/.
1216
1217 <tag><label id="type-bgpmask">bgpmask</tag>
1218 BGP masks are patterns used for BGP path matching (using <cf>path
1219 &tilde; [= 2 3 5 * =]</cf> syntax). The masks resemble wildcard patterns
1220 as used by UNIX shells. Autonomous system numbers match themselves,
1221 <cf/*/ matches any (even empty) sequence of arbitrary AS numbers and
1222 <cf/?/ matches one arbitrary AS number. For example, if <cf>bgp_path</cf>
1223 is 4 3 2 1, then: <tt>bgp_path &tilde; [= * 4 3 * =]</tt> is true,
1224 but <tt>bgp_path &tilde; [= * 4 5 * =]</tt> is false. BGP mask
1225 expressions can also contain integer expressions enclosed in parenthesis
1226 and integer variables, for example <tt>[= * 4 (1+2) a =]</tt>. You can
1227 also use ranges, for example <tt>[= * 3..5 2 100..200 * =]</tt>.
1228 There is also old (deprecated) syntax that uses / .. / instead of [= .. =]
1229 and ? instead of *.
1230
1231 <tag><label id="type-clist">clist</tag>
1232 Clist is similar to a set, except that unlike other sets, it can be
1233 modified. The type is used for community list (a set of pairs) and for
1234 cluster list (a set of quads). There exist no literals of this type.
1235 There are three special operators on clists:
1236
1237 <cf><m/C/.len</cf> returns the length of clist <m/C/.
1238
1239 <cf><m/C/.empty</cf> resets clist <m/C/ to empty clist.
1240
1241 <cf>add(<m/C/,<m/P/)</cf> adds pair (or quad) <m/P/ to clist <m/C/ and
1242 returns the result. If item <m/P/ is already in clist <m/C/, it does
1243 nothing. <m/P/ may also be a clist, in that case all its members are
1244 added; i.e., it works as clist union.
1245
1246 <cf>delete(<m/C/,<m/P/)</cf> deletes pair (or quad) <m/P/ from clist
1247 <m/C/ and returns the result. If clist <m/C/ does not contain item
1248 <m/P/, it does nothing. <m/P/ may also be a pair (or quad) set, in that
1249 case the operator deletes all items from clist <m/C/ that are also
1250 members of set <m/P/. Moreover, <m/P/ may also be a clist, which works
1251 analogously; i.e., it works as clist difference.
1252
1253 <cf>filter(<m/C/,<m/P/)</cf> deletes all items from clist <m/C/ that are
1254 not members of pair (or quad) set <m/P/. I.e., <cf/filter/ do the same
1255 as <cf/delete/ with inverted set <m/P/. <m/P/ may also be a clist, which
1256 works analogously; i.e., it works as clist intersection.
1257
1258 Statement <cf><m/C/ = add(<m/C/, <m/P/);</cf> can be shortened to
1259 <cf><m/C/.add(<m/P/);</cf> if <m/C/ is appropriate route attribute (for
1260 example <cf/bgp_community/). Similarly for <cf/delete/ and <cf/filter/.
1261
1262 <tag><label id="type-eclist">eclist</tag>
1263 Eclist is a data type used for BGP extended community lists. Eclists
1264 are very similar to clists, but they are sets of ECs instead of pairs.
1265 The same operations (like <cf/add/, <cf/delete/ or <cf/&tilde;/ and
1266 <cf/!&tilde;/ membership operators) can be used to modify or test
1267 eclists, with ECs instead of pairs as arguments.
1268
1269 <tag/lclist/
1270 Lclist is a data type used for BGP large community lists. Like eclists,
1271 lclists are very similar to clists, but they are sets of LCs instead of
1272 pairs. The same operations (like <cf/add/, <cf/delete/ or <cf/&tilde;/
1273 and <cf/!&tilde;/ membership operators) can be used to modify or test
1274 lclists, with LCs instead of pairs as arguments.
1275 </descrip>
1276
1277
1278 <sect>Operators
1279 <label id="operators">
1280
1281 <p>The filter language supports common integer operators <cf>(+,-,*,/)</cf>,
1282 parentheses <cf/(a*(b+c))/, comparison <cf/(a=b, a!=b, a&lt;b, a&gt;=b)/.
1283 Logical operations include unary not (<cf/!/), and (<cf/&amp;&amp;/) and or
1284 (<cf/&verbar;&verbar;/). Special operators include (<cf/&tilde;/,
1285 <cf/!&tilde;/) for "is (not) element of a set" operation - it can be used on
1286 element and set of elements of the same type (returning true if element is
1287 contained in the given set), or on two strings (returning true if first string
1288 matches a shell-like pattern stored in second string) or on IP and prefix
1289 (returning true if IP is within the range defined by that prefix), or on prefix
1290 and prefix (returning true if first prefix is more specific than second one) or
1291 on bgppath and bgpmask (returning true if the path matches the mask) or on
1292 number and bgppath (returning true if the number is in the path) or on bgppath
1293 and int (number) set (returning true if any ASN from the path is in the set) or
1294 on pair/quad and clist (returning true if the pair/quad is element of the
1295 clist) or on clist and pair/quad set (returning true if there is an element of
1296 the clist that is also a member of the pair/quad set).
1297
1298 <p>There is one operator related to ROA infrastructure - <cf/roa_check()/. It
1299 examines a ROA table and does <rfc id="6483"> route origin validation for a
1300 given network prefix. The basic usage is <cf>roa_check(<m/table/)</cf>, which
1301 checks current route (which should be from BGP to have AS_PATH argument) in the
1302 specified ROA table and returns ROA_UNKNOWN if there is no relevant ROA,
1303 ROA_VALID if there is a matching ROA, or ROA_INVALID if there are some relevant
1304 ROAs but none of them match. There is also an extended variant
1305 <cf>roa_check(<m/table/, <m/prefix/, <m/asn/)</cf>, which allows to specify a
1306 prefix and an ASN as arguments.
1307
1308
1309 <sect>Control structures
1310 <label id="control-structures">
1311
1312 <p>Filters support two control structures: conditions and case switches.
1313
1314 <p>Syntax of a condition is: <cf>if <M>boolean expression</M> then <m/command1/;
1315 else <m/command2/;</cf> and you can use <cf>{ <m/command_1/; <m/command_2/;
1316 <M>...</M> }</cf> instead of either command. The <cf>else</cf> clause may be
1317 omitted. If the <cf><m>boolean expression</m></cf> is true, <m/command1/ is
1318 executed, otherwise <m/command2/ is executed.
1319
1320 <p>The <cf>case</cf> is similar to case from Pascal. Syntax is <cf>case
1321 <m/expr/ { else: | <m/num_or_prefix [ .. num_or_prefix]/: <m/statement/ ; [
1322 ... ] }</cf>. The expression after <cf>case</cf> can be of any type which can be
1323 on the left side of the &tilde; operator and anything that could be a member of
1324 a set is allowed before <cf/:/. Multiple commands are allowed without <cf/{}/
1325 grouping. If <cf><m/expr/</cf> matches one of the <cf/:/ clauses, statements
1326 between it and next <cf/:/ statement are executed. If <cf><m/expr/</cf> matches
1327 neither of the <cf/:/ clauses, the statements after <cf/else:/ are executed.
1328
1329 <p>Here is example that uses <cf/if/ and <cf/case/ structures:
1330
1331 <code>
1332 case arg1 {
1333 2: print "two"; print "I can do more commands without {}";
1334 3 .. 5: print "three to five";
1335 else: print "something else";
1336 }
1337
1338 if 1234 = i then printn "."; else {
1339 print "not 1234";
1340 print "You need {} around multiple commands";
1341 }
1342 </code>
1343
1344
1345 <sect>Route attributes
1346 <label id="route-attributes">
1347
1348 <p>A filter is implicitly passed a route, and it can access its attributes just
1349 like it accesses variables. Attempts to access undefined attribute result in a
1350 runtime error; you can check if an attribute is defined by using the
1351 <cf>defined( <m>attribute</m> )</cf> operator. One notable exception to this
1352 rule are attributes of clist type, where undefined value is regarded as empty
1353 clist for most purposes.
1354
1355 <descrip>
1356 <tag><label id="rta-net"><m/prefix/ net</tag>
1357 Network the route is talking about. Read-only. (See the chapter about
1358 routing tables.)
1359
1360 <tag><label id="rta-scope"><m/enum/ scope</tag>
1361 The scope of the route. Possible values: <cf/SCOPE_HOST/ for routes
1362 local to this host, <cf/SCOPE_LINK/ for those specific for a physical
1363 link, <cf/SCOPE_SITE/ and <cf/SCOPE_ORGANIZATION/ for private routes and
1364 <cf/SCOPE_UNIVERSE/ for globally visible routes. This attribute is not
1365 interpreted by BIRD and can be used to mark routes in filters. The
1366 default value for new routes is <cf/SCOPE_UNIVERSE/.
1367
1368 <tag><label id="rta-preference"><m/int/ preference</tag>
1369 Preference of the route. Valid values are 0-65535. (See the chapter
1370 about routing tables.)
1371
1372 <tag><label id="rta-from"><m/ip/ from</tag>
1373 The router which the route has originated from.
1374
1375 <tag><label id="rta-gw"><m/ip/ gw</tag>
1376 Next hop packets routed using this route should be forwarded to.
1377
1378 <tag><label id="rta-proto"><m/string/ proto</tag>
1379 The name of the protocol which the route has been imported from.
1380 Read-only.
1381
1382 <tag><label id="rta-source"><m/enum/ source</tag>
1383 what protocol has told me about this route. Possible values:
1384 <cf/RTS_DUMMY/, <cf/RTS_STATIC/, <cf/RTS_INHERIT/, <cf/RTS_DEVICE/,
1385 <cf/RTS_STATIC_DEVICE/, <cf/RTS_REDIRECT/, <cf/RTS_RIP/, <cf/RTS_OSPF/,
1386 <cf/RTS_OSPF_IA/, <cf/RTS_OSPF_EXT1/, <cf/RTS_OSPF_EXT2/, <cf/RTS_BGP/,
1387 <cf/RTS_PIPE/, <cf/RTS_BABEL/.
1388
1389 <tag><label id="rta-cast"><m/enum/ cast</tag>
1390 Route type (Currently <cf/RTC_UNICAST/ for normal routes,
1391 <cf/RTC_BROADCAST/, <cf/RTC_MULTICAST/, <cf/RTC_ANYCAST/ will be used in
1392 the future for broadcast, multicast and anycast routes). Read-only.
1393
1394 <tag><label id="rta-dest"><m/enum/ dest</tag>
1395 Type of destination the packets should be sent to
1396 (<cf/RTD_ROUTER/ for forwarding to a neighboring router,
1397 <cf/RTD_DEVICE/ for routing to a directly-connected network,
1398 <cf/RTD_MULTIPATH/ for multipath destinations,
1399 <cf/RTD_BLACKHOLE/ for packets to be silently discarded,
1400 <cf/RTD_UNREACHABLE/, <cf/RTD_PROHIBIT/ for packets that should be
1401 returned with ICMP host unreachable / ICMP administratively prohibited
1402 messages). Can be changed, but only to <cf/RTD_BLACKHOLE/,
1403 <cf/RTD_UNREACHABLE/ or <cf/RTD_PROHIBIT/.
1404
1405 <tag><label id="rta-ifname"><m/string/ ifname</tag>
1406 Name of the outgoing interface. Sink routes (like blackhole, unreachable
1407 or prohibit) and multipath routes have no interface associated with
1408 them, so <cf/ifname/ returns an empty string for such routes. Read-only.
1409
1410 <tag><label id="rta-ifindex"><m/int/ ifindex</tag>
1411 Index of the outgoing interface. System wide index of the interface. May
1412 be used for interface matching, however indexes might change on interface
1413 creation/removal. Zero is returned for routes with undefined outgoing
1414 interfaces. Read-only.
1415
1416 <tag><label id="rta-igp-metric"><m/int/ igp_metric</tag>
1417 The optional attribute that can be used to specify a distance to the
1418 network for routes that do not have a native protocol metric attribute
1419 (like <cf/ospf_metric1/ for OSPF routes). It is used mainly by BGP to
1420 compare internal distances to boundary routers (see below). It is also
1421 used when the route is exported to OSPF as a default value for OSPF type
1422 1 metric.
1423 </descrip>
1424
1425 <p>There also exist some protocol-specific attributes which are described in the
1426 corresponding protocol sections.
1427
1428
1429 <sect>Other statements
1430 <label id="other-statements">
1431
1432 <p>The following statements are available:
1433
1434 <descrip>
1435 <tag><label id="assignment"><m/variable/ = <m/expr/</tag>
1436 Set variable to a given value.
1437
1438 <tag><label id="filter-accept-reject">accept|reject [ <m/expr/ ]</tag>
1439 Accept or reject the route, possibly printing <cf><m>expr</m></cf>.
1440
1441 <tag><label id="return">return <m/expr/</tag>
1442 Return <cf><m>expr</m></cf> from the current function, the function ends
1443 at this point.
1444
1445 <tag><label id="print">print|printn <m/expr/ [<m/, expr.../]</tag>
1446 Prints given expressions; useful mainly while debugging filters. The
1447 <cf/printn/ variant does not terminate the line.
1448
1449 <tag><label id="quitbird">quitbird</tag>
1450 Terminates BIRD. Useful when debugging the filter interpreter.
1451 </descrip>
1452
1453
1454 <chapt>Protocols
1455 <label id="protocols">
1456
1457 <sect>Babel
1458 <label id="babel">
1459
1460 <sect1>Introduction
1461 <label id="babel-intro">
1462
1463 <p>The Babel protocol
1464 (<rfc id="6126">) is a loop-avoiding distance-vector routing protocol that is
1465 robust and efficient both in ordinary wired networks and in wireless mesh
1466 networks. Babel is conceptually very simple in its operation and "just works"
1467 in its default configuration, though some configuration is possible and in some
1468 cases desirable.
1469
1470 <p>While the Babel protocol is dual stack (i.e., can carry both IPv4 and IPv6
1471 routes over the same IPv6 transport), BIRD presently implements only the IPv6
1472 subset of the protocol. No Babel extensions are implemented, but the BIRD
1473 implementation can coexist with implementations using the extensions (and will
1474 just ignore extension messages).
1475
1476 <p>The Babel protocol implementation in BIRD is currently in alpha stage.
1477
1478 <sect1>Configuration
1479 <label id="babel-config">
1480
1481 <p>Babel supports no global configuration options apart from those common to all
1482 other protocols, but supports the following per-interface configuration options:
1483
1484 <code>
1485 protocol babel [<name>] {
1486 interface <interface pattern> {
1487 type <wired|wireless>;
1488 rxcost <number>;
1489 hello interval <number>;
1490 update interval <number>;
1491 port <number>;
1492 tx class|dscp <number>;
1493 tx priority <number>;
1494 rx buffer <number>;
1495 tx length <number>;
1496 check link <switch>;
1497 };
1498 }
1499 </code>
1500
1501 <descrip>
1502 <tag><label id="babel-type">type wired|wireless </tag>
1503 This option specifies the interface type: Wired or wireless. Wired
1504 interfaces are considered more reliable, and so the default hello
1505 interval is higher, and a neighbour is considered unreachable after only
1506 a small number of "hello" packets are lost. On wireless interfaces,
1507 hello packets are sent more often, and the ETX link quality estimation
1508 technique is used to compute the metrics of routes discovered over this
1509 interface. This technique will gradually degrade the metric of routes
1510 when packets are lost rather than the more binary up/down mechanism of
1511 wired type links. Default: <cf/wired/.
1512
1513 <tag><label id="babel-rxcost">rxcost <m/num/</tag>
1514 This specifies the RX cost of the interface. The route metrics will be
1515 computed from this value with a mechanism determined by the interface
1516 <cf/type/. Default: 96 for wired interfaces, 256 for wireless.
1517
1518 <tag><label id="babel-hello">hello interval <m/num/</tag>
1519 Interval at which periodic "hello" messages are sent on this interface,
1520 in seconds. Default: 4 seconds.
1521
1522 <tag><label id="babel-update">update interval <m/num/</tag>
1523 Interval at which periodic (full) updates are sent. Default: 4 times the
1524 hello interval.
1525
1526 <tag><label id="babel-port">port <m/number/</tag>
1527 This option selects an UDP port to operate on. The default is to operate
1528 on port 6696 as specified in the Babel RFC.
1529
1530 <tag><label id="babel-tx-class">tx class|dscp|priority <m/number/</tag>
1531 These options specify the ToS/DiffServ/Traffic class/Priority of the
1532 outgoing Babel packets. See <ref id="proto-tx-class" name="tx class"> common
1533 option for detailed description.
1534
1535 <tag><label id="babel-rx-buffer">rx buffer <m/number/</tag>
1536 This option specifies the size of buffers used for packet processing.
1537 The buffer size should be bigger than maximal size of received packets.
1538 The default value is the interface MTU, and the value will be clamped to a
1539 minimum of 512 bytes + IP packet overhead.
1540
1541 <tag><label id="babel-tx-length">tx length <m/number/</tag>
1542 This option specifies the maximum length of generated Babel packets. To
1543 avoid IP fragmentation, it should not exceed the interface MTU value.
1544 The default value is the interface MTU value, and the value will be
1545 clamped to a minimum of 512 bytes + IP packet overhead.
1546
1547 <tag><label id="babel-check-link">check link <m/switch/</tag>
1548 If set, the hardware link state (as reported by OS) is taken into
1549 consideration. When the link disappears (e.g. an ethernet cable is
1550 unplugged), neighbors are immediately considered unreachable and all
1551 routes received from them are withdrawn. It is possible that some
1552 hardware drivers or platforms do not implement this feature. Default:
1553 yes.
1554 </descrip>
1555
1556 <sect1>Attributes
1557 <label id="babel-attr">
1558
1559 <p>Babel defines just one attribute: the internal babel metric of the route. It
1560 is exposed as the <cf/babel_metric/ attribute and has range from 1 to infinity
1561 (65535).
1562
1563 <sect1>Example
1564 <label id="babel-exam">
1565
1566 <p><code>
1567 protocol babel {
1568 interface "eth*" {
1569 type wired;
1570 };
1571 interface "wlan0", "wlan1" {
1572 type wireless;
1573 hello interval 1;
1574 rxcost 512;
1575 };
1576 interface "tap0";
1577
1578 # This matches the default of babeld: redistribute all addresses
1579 # configured on local interfaces, plus re-distribute all routes received
1580 # from other babel peers.
1581
1582 export where (source = RTS_DEVICE) || (source = RTS_BABEL);
1583 }
1584 </code>
1585
1586
1587 <sect>BFD
1588 <label id="bfd">
1589
1590 <sect1>Introduction
1591 <label id="bfd-intro">
1592
1593 <p>Bidirectional Forwarding Detection (BFD) is not a routing protocol itself, it
1594 is an independent tool providing liveness and failure detection. Routing
1595 protocols like OSPF and BGP use integrated periodic "hello" messages to monitor
1596 liveness of neighbors, but detection times of these mechanisms are high (e.g. 40
1597 seconds by default in OSPF, could be set down to several seconds). BFD offers
1598 universal, fast and low-overhead mechanism for failure detection, which could be
1599 attached to any routing protocol in an advisory role.
1600
1601 <p>BFD consists of mostly independent BFD sessions. Each session monitors an
1602 unicast bidirectional path between two BFD-enabled routers. This is done by
1603 periodically sending control packets in both directions. BFD does not handle
1604 neighbor discovery, BFD sessions are created on demand by request of other
1605 protocols (like OSPF or BGP), which supply appropriate information like IP
1606 addresses and associated interfaces. When a session changes its state, these
1607 protocols are notified and act accordingly (e.g. break an OSPF adjacency when
1608 the BFD session went down).
1609
1610 <p>BIRD implements basic BFD behavior as defined in <rfc id="5880"> (some
1611 advanced features like the echo mode or authentication are not implemented), IP
1612 transport for BFD as defined in <rfc id="5881"> and <rfc id="5883"> and
1613 interaction with client protocols as defined in <rfc id="5882">.
1614
1615 <p>Note that BFD implementation in BIRD is currently a new feature in
1616 development, expect some rough edges and possible UI and configuration changes
1617 in the future. Also note that we currently support at most one protocol instance.
1618
1619 <p>BFD packets are sent with a dynamic source port number. Linux systems use by
1620 default a bit different dynamic port range than the IANA approved one
1621 (49152-65535). If you experience problems with compatibility, please adjust
1622 <cf>/proc/sys/net/ipv4/ip_local_port_range</cf>
1623
1624 <sect1>Configuration
1625 <label id="bfd-config">
1626
1627 <p>BFD configuration consists mainly of multiple definitions of interfaces.
1628 Most BFD config options are session specific. When a new session is requested
1629 and dynamically created, it is configured from one of these definitions. For
1630 sessions to directly connected neighbors, <cf/interface/ definitions are chosen
1631 based on the interface associated with the session, while <cf/multihop/
1632 definition is used for multihop sessions. If no definition is relevant, the
1633 session is just created with the default configuration. Therefore, an empty BFD
1634 configuration is often sufficient.
1635
1636 <p>Note that to use BFD for other protocols like OSPF or BGP, these protocols
1637 also have to be configured to request BFD sessions, usually by <cf/bfd/ option.
1638
1639 <p>Some of BFD session options require <m/time/ value, which has to be specified
1640 with the appropriate unit: <m/num/ <cf/s/|<cf/ms/|<cf/us/. Although microseconds
1641 are allowed as units, practical minimum values are usually in order of tens of
1642 milliseconds.
1643
1644 <code>
1645 protocol bfd [&lt;name&gt;] {
1646 interface &lt;interface pattern&gt; {
1647 interval &lt;time&gt;;
1648 min rx interval &lt;time&gt;;
1649 min tx interval &lt;time&gt;;
1650 idle tx interval &lt;time&gt;;
1651 multiplier &lt;num&gt;;
1652 passive &lt;switch&gt;;
1653 authentication none;
1654 authentication simple;
1655 authentication [meticulous] keyed md5|sha1;
1656 password "&lt;text&gt;";
1657 password "&lt;text&gt;" {
1658 id &lt;num&gt;;
1659 generate from "&lt;date&gt;";
1660 generate to "&lt;date&gt;";
1661 accept from "&lt;date&gt;";
1662 accept to "&lt;date&gt;";
1663 from "&lt;date&gt;";
1664 to "&lt;date&gt;";
1665 };
1666 };
1667 multihop {
1668 interval &lt;time&gt;;
1669 min rx interval &lt;time&gt;;
1670 min tx interval &lt;time&gt;;
1671 idle tx interval &lt;time&gt;;
1672 multiplier &lt;num&gt;;
1673 passive &lt;switch&gt;;
1674 };
1675 neighbor &lt;ip&gt; [dev "&lt;interface&gt;"] [local &lt;ip&gt;] [multihop &lt;switch&gt;];
1676 }
1677 </code>
1678
1679 <descrip>
1680 <tag><label id="bfd-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
1681 Interface definitions allow to specify options for sessions associated
1682 with such interfaces and also may contain interface specific options.
1683 See <ref id="proto-iface" name="interface"> common option for a detailed
1684 description of interface patterns. Note that contrary to the behavior of
1685 <cf/interface/ definitions of other protocols, BFD protocol would accept
1686 sessions (in default configuration) even on interfaces not covered by
1687 such definitions.
1688
1689 <tag><label id="bfd-multihop">multihop { <m/options/ }</tag>
1690 Multihop definitions allow to specify options for multihop BFD sessions,
1691 in the same manner as <cf/interface/ definitions are used for directly
1692 connected sessions. Currently only one such definition (for all multihop
1693 sessions) could be used.
1694
1695 <tag><label id="bfd-neighbor">neighbor <m/ip/ [dev "<m/interface/"] [local <m/ip/] [multihop <m/switch/]</tag>
1696 BFD sessions are usually created on demand as requested by other
1697 protocols (like OSPF or BGP). This option allows to explicitly add
1698 a BFD session to the specified neighbor regardless of such requests.
1699
1700 The session is identified by the IP address of the neighbor, with
1701 optional specification of used interface and local IP. By default
1702 the neighbor must be directly connected, unless the session is
1703 configured as multihop. Note that local IP must be specified for
1704 multihop sessions.
1705 </descrip>
1706
1707 <p>Session specific options (part of <cf/interface/ and <cf/multihop/ definitions):
1708
1709 <descrip>
1710 <tag><label id="bfd-interval">interval <m/time/</tag>
1711 BFD ensures availability of the forwarding path associated with the
1712 session by periodically sending BFD control packets in both
1713 directions. The rate of such packets is controlled by two options,
1714 <cf/min rx interval/ and <cf/min tx interval/ (see below). This option
1715 is just a shorthand to set both of these options together.
1716
1717 <tag><label id="bfd-min-rx-interval">min rx interval <m/time/</tag>
1718 This option specifies the minimum RX interval, which is announced to the
1719 neighbor and used there to limit the neighbor's rate of generated BFD
1720 control packets. Default: 10 ms.
1721
1722 <tag><label id="bfd-min-tx-interval">min tx interval <m/time/</tag>
1723 This option specifies the desired TX interval, which controls the rate
1724 of generated BFD control packets (together with <cf/min rx interval/
1725 announced by the neighbor). Note that this value is used only if the BFD
1726 session is up, otherwise the value of <cf/idle tx interval/ is used
1727 instead. Default: 100 ms.
1728
1729 <tag><label id="bfd-idle-tx-interval">idle tx interval <m/time/</tag>
1730 In order to limit unnecessary traffic in cases where a neighbor is not
1731 available or not running BFD, the rate of generated BFD control packets
1732 is lower when the BFD session is not up. This option specifies the
1733 desired TX interval in such cases instead of <cf/min tx interval/.
1734 Default: 1 s.
1735
1736 <tag><label id="bfd-multiplier">multiplier <m/num/</tag>
1737 Failure detection time for BFD sessions is based on established rate of
1738 BFD control packets (<cf>min rx/tx interval</cf>) multiplied by this
1739 multiplier, which is essentially (ignoring jitter) a number of missed
1740 packets after which the session is declared down. Note that rates and
1741 multipliers could be different in each direction of a BFD session.
1742 Default: 5.
1743
1744 <tag><label id="bfd-passive">passive <m/switch/</tag>
1745 Generally, both BFD session endpoints try to establish the session by
1746 sending control packets to the other side. This option allows to enable
1747 passive mode, which means that the router does not send BFD packets
1748 until it has received one from the other side. Default: disabled.
1749
1750 <tag>authentication none</tag>
1751 No passwords are sent in BFD packets. This is the default value.
1752
1753 <tag>authentication simple</tag>
1754 Every packet carries 16 bytes of password. Received packets lacking this
1755 password are ignored. This authentication mechanism is very weak.
1756
1757 <tag>authentication [meticulous] keyed md5|sha1</tag>
1758 An authentication code is appended to each packet. The cryptographic
1759 algorithm is keyed MD5 or keyed SHA-1. Note that the algorithm is common
1760 for all keys (on one interface), in contrast to OSPF or RIP, where it
1761 is a per-key option. Passwords (keys) are not sent open via network.
1762
1763 The <cf/meticulous/ variant means that cryptographic sequence numbers
1764 are increased for each sent packet, while in the basic variant they are
1765 increased about once per second. Generally, the <cf/meticulous/ variant
1766 offers better resistance to replay attacks but may require more
1767 computation.
1768
1769 <tag>password "<M>text</M>"</tag>
1770 Specifies a password used for authentication. See <ref id="dsc-pass"
1771 name="password"> common option for detailed description. Note that
1772 password option <cf/algorithm/ is not available in BFD protocol. The
1773 algorithm is selected by <cf/authentication/ option for all passwords.
1774
1775 </descrip>
1776
1777 <sect1>Example
1778 <label id="bfd-exam">
1779
1780 <p><code>
1781 protocol bfd {
1782 interface "eth*" {
1783 min rx interval 20 ms;
1784 min tx interval 50 ms;
1785 idle tx interval 300 ms;
1786 };
1787 interface "gre*" {
1788 interval 200 ms;
1789 multiplier 10;
1790 passive;
1791 };
1792 multihop {
1793 interval 200 ms;
1794 multiplier 10;
1795 };
1796
1797 neighbor 192.168.1.10;
1798 neighbor 192.168.2.2 dev "eth2";
1799 neighbor 192.168.10.1 local 192.168.1.1 multihop;
1800 }
1801 </code>
1802
1803
1804 <sect>BGP
1805 <label id="bgp">
1806
1807 <p>The Border Gateway Protocol is the routing protocol used for backbone level
1808 routing in the today's Internet. Contrary to other protocols, its convergence
1809 does not rely on all routers following the same rules for route selection,
1810 making it possible to implement any routing policy at any router in the network,
1811 the only restriction being that if a router advertises a route, it must accept
1812 and forward packets according to it.
1813
1814 <p>BGP works in terms of autonomous systems (often abbreviated as AS). Each AS
1815 is a part of the network with common management and common routing policy. It is
1816 identified by a unique 16-bit number (ASN). Routers within each AS usually
1817 exchange AS-internal routing information with each other using an interior
1818 gateway protocol (IGP, such as OSPF or RIP). Boundary routers at the border of
1819 the AS communicate global (inter-AS) network reachability information with their
1820 neighbors in the neighboring AS'es via exterior BGP (eBGP) and redistribute
1821 received information to other routers in the AS via interior BGP (iBGP).
1822
1823 <p>Each BGP router sends to its neighbors updates of the parts of its routing
1824 table it wishes to export along with complete path information (a list of AS'es
1825 the packet will travel through if it uses the particular route) in order to
1826 avoid routing loops.
1827
1828 <p>BIRD supports all requirements of the BGP4 standard as defined in
1829 <rfc id="4271"> It also supports the community attributes (<rfc id="1997">),
1830 capability negotiation (<rfc id="5492">), MD5 password authentication (<rfc
1831 id="2385">), extended communities (<rfc id="4360">), route reflectors (<rfc
1832 id="4456">), graceful restart (<rfc id="4724">), multiprotocol extensions
1833 (<rfc id="4760">), 4B AS numbers (<rfc id="4893">), and 4B AS numbers in
1834 extended communities (<rfc id="5668">).
1835
1836
1837 For IPv6, it uses the standard multiprotocol extensions defined in
1838 <rfc id="4760"> and applied to IPv6 according to <rfc id="2545">.
1839
1840 <sect1>Route selection rules
1841 <label id="bgp-route-select-rules">
1842
1843 <p>BGP doesn't have any simple metric, so the rules for selection of an optimal
1844 route among multiple BGP routes with the same preference are a bit more complex
1845 and they are implemented according to the following algorithm. It starts the
1846 first rule, if there are more "best" routes, then it uses the second rule to
1847 choose among them and so on.
1848
1849 <itemize>
1850 <item>Prefer route with the highest Local Preference attribute.
1851 <item>Prefer route with the shortest AS path.
1852 <item>Prefer IGP origin over EGP and EGP origin over incomplete.
1853 <item>Prefer the lowest value of the Multiple Exit Discriminator.
1854 <item>Prefer routes received via eBGP over ones received via iBGP.
1855 <item>Prefer routes with lower internal distance to a boundary router.
1856 <item>Prefer the route with the lowest value of router ID of the
1857 advertising router.
1858 </itemize>
1859
1860 <sect1>IGP routing table
1861 <label id="bgp-igp-routing-table">
1862
1863 <p>BGP is mainly concerned with global network reachability and with routes to
1864 other autonomous systems. When such routes are redistributed to routers in the
1865 AS via BGP, they contain IP addresses of a boundary routers (in route attribute
1866 NEXT_HOP). BGP depends on existing IGP routing table with AS-internal routes to
1867 determine immediate next hops for routes and to know their internal distances to
1868 boundary routers for the purpose of BGP route selection. In BIRD, there is
1869 usually one routing table used for both IGP routes and BGP routes.
1870
1871 <sect1>Configuration
1872 <label id="bgp-config">
1873
1874 <p>Each instance of the BGP corresponds to one neighboring router. This allows
1875 to set routing policy and all the other parameters differently for each neighbor
1876 using the following configuration parameters:
1877
1878 <descrip>
1879 <tag><label id="bgp-local">local [<m/ip/] as <m/number/</tag>
1880 Define which AS we are part of. (Note that contrary to other IP routers,
1881 BIRD is able to act as a router located in multiple AS'es simultaneously,
1882 but in such cases you need to tweak the BGP paths manually in the filters
1883 to get consistent behavior.) Optional <cf/ip/ argument specifies a source
1884 address, equivalent to the <cf/source address/ option (see below). This
1885 parameter is mandatory.
1886
1887 <tag><label id="bgp-neighbor">neighbor [<m/ip/] [port <m/number/] [as <m/number/]</tag>
1888 Define neighboring router this instance will be talking to and what AS
1889 it is located in. In case the neighbor is in the same AS as we are, we
1890 automatically switch to iBGP. Optionally, the remote port may also be
1891 specified. The parameter may be used multiple times with different
1892 sub-options (e.g., both <cf/neighbor 10.0.0.1 as 65000;/ and
1893 <cf/neighbor 10.0.0.1; neighbor as 65000;/ are valid). This parameter is
1894 mandatory.
1895
1896 <tag><label id="bgp-iface">interface <m/string/</tag>
1897 Define interface we should use for link-local BGP IPv6 sessions.
1898 Interface can also be specified as a part of <cf/neighbor address/
1899 (e.g., <cf/neighbor fe80::1234%eth0 as 65000;/). The option may also be
1900 used for non link-local sessions when it is necessary to explicitly
1901 specify an interface, but only for direct (not multihop) sessions.
1902
1903 <tag><label id="bgp-direct">direct</tag>
1904 Specify that the neighbor is directly connected. The IP address of the
1905 neighbor must be from a directly reachable IP range (i.e. associated
1906 with one of your router's interfaces), otherwise the BGP session
1907 wouldn't start but it would wait for such interface to appear. The
1908 alternative is the <cf/multihop/ option. Default: enabled for eBGP.
1909
1910 <tag><label id="bgp-multihop">multihop [<m/number/]</tag>
1911 Configure multihop BGP session to a neighbor that isn't directly
1912 connected. Accurately, this option should be used if the configured
1913 neighbor IP address does not match with any local network subnets. Such
1914 IP address have to be reachable through system routing table. The
1915 alternative is the <cf/direct/ option. For multihop BGP it is
1916 recommended to explicitly configure the source address to have it
1917 stable. Optional <cf/number/ argument can be used to specify the number
1918 of hops (used for TTL). Note that the number of networks (edges) in a
1919 path is counted; i.e., if two BGP speakers are separated by one router,
1920 the number of hops is 2. Default: enabled for iBGP.
1921
1922 <tag><label id="bgp-source-address">source address <m/ip/</tag>
1923 Define local address we should use for next hop calculation and as a
1924 source address for the BGP session. Default: the address of the local
1925 end of the interface our neighbor is connected to.
1926
1927 <tag><label id="bgp-next-hop-self">next hop self</tag>
1928 Avoid calculation of the Next Hop attribute and always advertise our own
1929 source address as a next hop. This needs to be used only occasionally to
1930 circumvent misconfigurations of other routers. Default: disabled.
1931
1932 <tag><label id="bgp-next-hop-keep">next hop keep</tag>
1933 Forward the received Next Hop attribute even in situations where the
1934 local address should be used instead, like when the route is sent to an
1935 interface with a different subnet. Default: disabled.
1936
1937 <tag><label id="bgp-missing-lladdr">missing lladdr self|drop|ignore</tag>
1938 Next Hop attribute in BGP-IPv6 sometimes contains just the global IPv6
1939 address, but sometimes it has to contain both global and link-local IPv6
1940 addresses. This option specifies what to do if BIRD have to send both
1941 addresses but does not know link-local address. This situation might
1942 happen when routes from other protocols are exported to BGP, or when
1943 improper updates are received from BGP peers. <cf/self/ means that BIRD
1944 advertises its own local address instead. <cf/drop/ means that BIRD
1945 skips that prefixes and logs error. <cf/ignore/ means that BIRD ignores
1946 the problem and sends just the global address (and therefore forms
1947 improper BGP update). Default: <cf/self/, unless BIRD is configured as a
1948 route server (option <cf/rs client/), in that case default is <cf/ignore/,
1949 because route servers usually do not forward packets themselves.
1950
1951 <tag><label id="bgp-gateway">gateway direct|recursive</tag>
1952 For received routes, their <cf/gw/ (immediate next hop) attribute is
1953 computed from received <cf/bgp_next_hop/ attribute. This option
1954 specifies how it is computed. Direct mode means that the IP address from
1955 <cf/bgp_next_hop/ is used if it is directly reachable, otherwise the
1956 neighbor IP address is used. Recursive mode means that the gateway is
1957 computed by an IGP routing table lookup for the IP address from
1958 <cf/bgp_next_hop/. Note that there is just one level of indirection in
1959 recursive mode - the route obtained by the lookup must not be recursive
1960 itself, to prevent mutually recursive routes.
1961
1962 Recursive mode is the behavior specified by the BGP
1963 standard. Direct mode is simpler, does not require any routes in a
1964 routing table, and was used in older versions of BIRD, but does not
1965 handle well nontrivial iBGP setups and multihop. Recursive mode is
1966 incompatible with <ref id="dsc-table-sorted" name="sorted tables">. Default:
1967 <cf/direct/ for direct sessions, <cf/recursive/ for multihop sessions.
1968
1969 <tag><label id="bgp-igp-table">igp table <m/name/</tag>
1970 Specifies a table that is used as an IGP routing table. Default: the
1971 same as the table BGP is connected to.
1972
1973 <tag><label id="bgp-check-link">check link <M>switch</M></tag>
1974 BGP could use hardware link state into consideration. If enabled,
1975 BIRD tracks the link state of the associated interface and when link
1976 disappears (e.g. an ethernet cable is unplugged), the BGP session is
1977 immediately shut down. Note that this option cannot be used with
1978 multihop BGP. Default: disabled.
1979
1980 <tag><label id="bgp-bfd">bfd <M>switch</M></tag>
1981 BGP could use BFD protocol as an advisory mechanism for neighbor
1982 liveness and failure detection. If enabled, BIRD setups a BFD session
1983 for the BGP neighbor and tracks its liveness by it. This has an
1984 advantage of an order of magnitude lower detection times in case of
1985 failure. Note that BFD protocol also has to be configured, see
1986 <ref id="bfd" name="BFD"> section for details. Default: disabled.
1987
1988 <tag><label id="bgp-ttl-security">ttl security <m/switch/</tag>
1989 Use GTSM (<rfc id="5082"> - the generalized TTL security mechanism). GTSM
1990 protects against spoofed packets by ignoring received packets with a
1991 smaller than expected TTL. To work properly, GTSM have to be enabled on
1992 both sides of a BGP session. If both <cf/ttl security/ and
1993 <cf/multihop/ options are enabled, <cf/multihop/ option should specify
1994 proper hop value to compute expected TTL. Kernel support required:
1995 Linux: 2.6.34+ (IPv4), 2.6.35+ (IPv6), BSD: since long ago, IPv4 only.
1996 Note that full (ICMP protection, for example) <rfc id="5082"> support is
1997 provided by Linux only. Default: disabled.
1998
1999 <tag><label id="bgp-pass">password <m/string/</tag>
2000 Use this password for MD5 authentication of BGP sessions (<rfc id="2385">). When
2001 used on BSD systems, see also <cf/setkey/ option below. Default: no
2002 authentication.
2003
2004 <tag><label id="bgp-setkey">setkey <m/switch/</tag>
2005 On BSD systems, keys for TCP MD5 authentication are stored in the global
2006 SA/SP database, which can be accessed by external utilities (e.g.
2007 setkey(8)). BIRD configures security associations in the SA/SP database
2008 automatically based on <cf/password/ options (see above), this option
2009 allows to disable automatic updates by BIRD when manual configuration by
2010 external utilities is preferred. Note that automatic SA/SP database
2011 updates are currently implemented only for FreeBSD. Passwords have to be
2012 set manually by an external utility on NetBSD and OpenBSD. Default:
2013 enabled (ignored on non-FreeBSD).
2014
2015 <tag><label id="bgp-passive">passive <m/switch/</tag>
2016 Standard BGP behavior is both initiating outgoing connections and
2017 accepting incoming connections. In passive mode, outgoing connections
2018 are not initiated. Default: off.
2019
2020 <tag><label id="bgp-rr-client">rr client</tag>
2021 Be a route reflector and treat the neighbor as a route reflection
2022 client. Default: disabled.
2023
2024 <tag><label id="bgp-rr-cluster-id">rr cluster id <m/IPv4 address/</tag>
2025 Route reflectors use cluster id to avoid route reflection loops. When
2026 there is one route reflector in a cluster it usually uses its router id
2027 as a cluster id, but when there are more route reflectors in a cluster,
2028 these need to be configured (using this option) to use a common cluster
2029 id. Clients in a cluster need not know their cluster id and this option
2030 is not allowed for them. Default: the same as router id.
2031
2032 <tag><label id="bgp-rs-client">rs client</tag>
2033 Be a route server and treat the neighbor as a route server client.
2034 A route server is used as a replacement for full mesh EBGP routing in
2035 Internet exchange points in a similar way to route reflectors used in
2036 IBGP routing. BIRD does not implement obsoleted <rfc id="1863">, but
2037 uses ad-hoc implementation, which behaves like plain EBGP but reduces
2038 modifications to advertised route attributes to be transparent (for
2039 example does not prepend its AS number to AS PATH attribute and
2040 keeps MED attribute). Default: disabled.
2041
2042 <tag><label id="bgp-secondary">secondary <m/switch/</tag>
2043 Usually, if an export filter rejects a selected route, no other route is
2044 propagated for that network. This option allows to try the next route in
2045 order until one that is accepted is found or all routes for that network
2046 are rejected. This can be used for route servers that need to propagate
2047 different tables to each client but do not want to have these tables
2048 explicitly (to conserve memory). This option requires that the connected
2049 routing table is <ref id="dsc-table-sorted" name="sorted">. Default: off.
2050
2051 <tag><label id="bgp-add-paths">add paths <m/switch/|rx|tx</tag>
2052 Standard BGP can propagate only one path (route) per destination network
2053 (usually the selected one). This option controls the add-path protocol
2054 extension, which allows to advertise any number of paths to a
2055 destination. Note that to be active, add-path has to be enabled on both
2056 sides of the BGP session, but it could be enabled separately for RX and
2057 TX direction. When active, all available routes accepted by the export
2058 filter are advertised to the neighbor. Default: off.
2059
2060 <tag><label id="bgp-allow-local-pref">allow bgp_local_pref <m/switch/</tag>
2061 A standard BGP implementation do not send the Local Preference attribute
2062 to eBGP neighbors and ignore this attribute if received from eBGP
2063 neighbors, as per <rfc id="4271">. When this option is enabled on an
2064 eBGP session, this attribute will be sent to and accepted from the peer,
2065 which is useful for example if you have a setup like in <rfc id="7938">.
2066 The option does not affect iBGP sessions. Default: off.
2067
2068 <tag><label id="bgp-allow-local-as">allow local as [<m/number/]</tag>
2069 BGP prevents routing loops by rejecting received routes with the local
2070 AS number in the AS path. This option allows to loose or disable the
2071 check. Optional <cf/number/ argument can be used to specify the maximum
2072 number of local ASNs in the AS path that is allowed for received
2073 routes. When the option is used without the argument, the check is
2074 completely disabled and you should ensure loop-free behavior by some
2075 other means. Default: 0 (no local AS number allowed).
2076
2077 <tag><label id="bgp-enable-route-refresh">enable route refresh <m/switch/</tag>
2078 After the initial route exchange, BGP protocol uses incremental updates
2079 to keep BGP speakers synchronized. Sometimes (e.g., if BGP speaker
2080 changes its import filter, or if there is suspicion of inconsistency) it
2081 is necessary to do a new complete route exchange. BGP protocol extension
2082 Route Refresh (<rfc id="2918">) allows BGP speaker to request
2083 re-advertisement of all routes from its neighbor. BGP protocol
2084 extension Enhanced Route Refresh (<rfc id="7313">) specifies explicit
2085 begin and end for such exchanges, therefore the receiver can remove
2086 stale routes that were not advertised during the exchange. This option
2087 specifies whether BIRD advertises these capabilities and supports
2088 related procedures. Note that even when disabled, BIRD can send route
2089 refresh requests. Default: on.
2090
2091 <tag><label id="bgp-graceful-restart">graceful restart <m/switch/|aware</tag>
2092 When a BGP speaker restarts or crashes, neighbors will discard all
2093 received paths from the speaker, which disrupts packet forwarding even
2094 when the forwarding plane of the speaker remains intact. <rfc
2095 id="4724"> specifies an optional graceful restart mechanism to
2096 alleviate this issue. This option controls the mechanism. It has three
2097 states: Disabled, when no support is provided. Aware, when the graceful
2098 restart support is announced and the support for restarting neighbors
2099 is provided, but no local graceful restart is allowed (i.e.
2100 receiving-only role). Enabled, when the full graceful restart
2101 support is provided (i.e. both restarting and receiving role). Note
2102 that proper support for local graceful restart requires also
2103 configuration of other protocols. Default: aware.
2104
2105 <tag><label id="bgp-graceful-restart-time">graceful restart time <m/number/</tag>
2106 The restart time is announced in the BGP graceful restart capability
2107 and specifies how long the neighbor would wait for the BGP session to
2108 re-establish after a restart before deleting stale routes. Default:
2109 120 seconds.
2110
2111 <tag><label id="bgp-interpret-communities">interpret communities <m/switch/</tag>
2112 <rfc id="1997"> demands that BGP speaker should process well-known
2113 communities like no-export (65535, 65281) or no-advertise (65535,
2114 65282). For example, received route carrying a no-adverise community
2115 should not be advertised to any of its neighbors. If this option is
2116 enabled (which is by default), BIRD has such behavior automatically (it
2117 is evaluated when a route is exported to the BGP protocol just before
2118 the export filter). Otherwise, this integrated processing of
2119 well-known communities is disabled. In that case, similar behavior can
2120 be implemented in the export filter. Default: on.
2121
2122 <tag><label id="bgp-enable-as4">enable as4 <m/switch/</tag>
2123 BGP protocol was designed to use 2B AS numbers and was extended later to
2124 allow 4B AS number. BIRD supports 4B AS extension, but by disabling this
2125 option it can be persuaded not to advertise it and to maintain old-style
2126 sessions with its neighbors. This might be useful for circumventing bugs
2127 in neighbor's implementation of 4B AS extension. Even when disabled
2128 (off), BIRD behaves internally as AS4-aware BGP router. Default: on.
2129
2130 <tag><label id="bgp-enable-extended-messages">enable extended messages <m/switch/</tag>
2131 The BGP protocol uses maximum message length of 4096 bytes. This option
2132 provides an extension to allow extended messages with length up
2133 to 65535 bytes. Default: off.
2134
2135 <tag><label id="bgp-capabilities">capabilities <m/switch/</tag>
2136 Use capability advertisement to advertise optional capabilities. This is
2137 standard behavior for newer BGP implementations, but there might be some
2138 older BGP implementations that reject such connection attempts. When
2139 disabled (off), features that request it (4B AS support) are also
2140 disabled. Default: on, with automatic fallback to off when received
2141 capability-related error.
2142
2143 <tag><label id="bgp-advertise-ipv4">advertise ipv4 <m/switch/</tag>
2144 Advertise IPv4 multiprotocol capability. This is not a correct behavior
2145 according to the strict interpretation of <rfc id="4760">, but it is
2146 widespread and required by some BGP implementations (Cisco and Quagga).
2147 This option is relevant to IPv4 mode with enabled capability
2148 advertisement only. Default: on.
2149
2150 <tag><label id="bgp-route-limit">route limit <m/number/</tag>
2151 The maximal number of routes that may be imported from the protocol. If
2152 the route limit is exceeded, the connection is closed with an error.
2153 Limit is currently implemented as <cf>import limit <m/number/ action
2154 restart</cf>. This option is obsolete and it is replaced by
2155 <ref id="proto-import-limit" name="import limit option">. Default: no limit.
2156
2157 <tag><label id="bgp-disable-after-error">disable after error <m/switch/</tag>
2158 When an error is encountered (either locally or by the other side),
2159 disable the instance automatically and wait for an administrator to fix
2160 the problem manually. Default: off.
2161
2162 <tag><label id="bgp-hold-time">hold time <m/number/</tag>
2163 Time in seconds to wait for a Keepalive message from the other side
2164 before considering the connection stale. Default: depends on agreement
2165 with the neighboring router, we prefer 240 seconds if the other side is
2166 willing to accept it.
2167
2168 <tag><label id="bgp-startup-hold-time">startup hold time <m/number/</tag>
2169 Value of the hold timer used before the routers have a chance to exchange
2170 open messages and agree on the real value. Default: 240 seconds.
2171
2172 <tag><label id="bgp-keepalive-time">keepalive time <m/number/</tag>
2173 Delay in seconds between sending of two consecutive Keepalive messages.
2174 Default: One third of the hold time.
2175
2176 <tag><label id="bgp-connect-delay-time">connect delay time <m/number/</tag>
2177 Delay in seconds between protocol startup and the first attempt to
2178 connect. Default: 5 seconds.
2179
2180 <tag><label id="bgp-connect-retry-time">connect retry time <m/number/</tag>
2181 Time in seconds to wait before retrying a failed attempt to connect.
2182 Default: 120 seconds.
2183
2184 <tag><label id="bgp-error-wait-time">error wait time <m/number/,<m/number/</tag>
2185 Minimum and maximum delay in seconds between a protocol failure (either
2186 local or reported by the peer) and automatic restart. Doesn't apply
2187 when <cf/disable after error/ is configured. If consecutive errors
2188 happen, the delay is increased exponentially until it reaches the
2189 maximum. Default: 60, 300.
2190
2191 <tag><label id="bgp-error-forget-time">error forget time <m/number/</tag>
2192 Maximum time in seconds between two protocol failures to treat them as a
2193 error sequence which makes <cf/error wait time/ increase exponentially.
2194 Default: 300 seconds.
2195
2196 <tag><label id="bgp-path-metric">path metric <m/switch/</tag>
2197 Enable comparison of path lengths when deciding which BGP route is the
2198 best one. Default: on.
2199
2200 <tag><label id="bgp-med-metric">med metric <m/switch/</tag>
2201 Enable comparison of MED attributes (during best route selection) even
2202 between routes received from different ASes. This may be useful if all
2203 MED attributes contain some consistent metric, perhaps enforced in
2204 import filters of AS boundary routers. If this option is disabled, MED
2205 attributes are compared only if routes are received from the same AS
2206 (which is the standard behavior). Default: off.
2207
2208 <tag><label id="bgp-deterministic-med">deterministic med <m/switch/</tag>
2209 BGP route selection algorithm is often viewed as a comparison between
2210 individual routes (e.g. if a new route appears and is better than the
2211 current best one, it is chosen as the new best one). But the proper
2212 route selection, as specified by <rfc id="4271">, cannot be fully
2213 implemented in that way. The problem is mainly in handling the MED
2214 attribute. BIRD, by default, uses an simplification based on individual
2215 route comparison, which in some cases may lead to temporally dependent
2216 behavior (i.e. the selection is dependent on the order in which routes
2217 appeared). This option enables a different (and slower) algorithm
2218 implementing proper <rfc id="4271"> route selection, which is
2219 deterministic. Alternative way how to get deterministic behavior is to
2220 use <cf/med metric/ option. This option is incompatible with <ref
2221 id="dsc-table-sorted" name="sorted tables">. Default: off.
2222
2223 <tag><label id="bgp-igp-metric">igp metric <m/switch/</tag>
2224 Enable comparison of internal distances to boundary routers during best
2225 route selection. Default: on.
2226
2227 <tag><label id="bgp-prefer-older">prefer older <m/switch/</tag>
2228 Standard route selection algorithm breaks ties by comparing router IDs.
2229 This changes the behavior to prefer older routes (when both are external
2230 and from different peer). For details, see <rfc id="5004">. Default: off.
2231
2232 <tag><label id="bgp-default-med">default bgp_med <m/number/</tag>
2233 Value of the Multiple Exit Discriminator to be used during route
2234 selection when the MED attribute is missing. Default: 0.
2235
2236 <tag><label id="bgp-default-local-pref">default bgp_local_pref <m/number/</tag>
2237 A default value for the Local Preference attribute. It is used when
2238 a new Local Preference attribute is attached to a route by the BGP
2239 protocol itself (for example, if a route is received through eBGP and
2240 therefore does not have such attribute). Default: 100 (0 in pre-1.2.0
2241 versions of BIRD).
2242 </descrip>
2243
2244 <sect1>Attributes
2245 <label id="bgp-attr">
2246
2247 <p>BGP defines several route attributes. Some of them (those marked with
2248 `<tt/I/' in the table below) are available on internal BGP connections only,
2249 some of them (marked with `<tt/O/') are optional.
2250
2251 <descrip>
2252 <tag><label id="rta-bgp-path">bgppath bgp_path/</tag>
2253 Sequence of AS numbers describing the AS path the packet will travel
2254 through when forwarded according to the particular route. In case of
2255 internal BGP it doesn't contain the number of the local AS.
2256
2257 <tag><label id="rta-bgp-local-pref">int bgp_local_pref/ [I]</tag>
2258 Local preference value used for selection among multiple BGP routes (see
2259 the selection rules above). It's used as an additional metric which is
2260 propagated through the whole local AS.
2261
2262 <tag><label id="rta-bgp-med">int bgp_med/ [O]</tag>
2263 The Multiple Exit Discriminator of the route is an optional attribute
2264 which is used on external (inter-AS) links to convey to an adjacent AS
2265 the optimal entry point into the local AS. The received attribute is
2266 also propagated over internal BGP links. The attribute value is zeroed
2267 when a route is exported to an external BGP instance to ensure that the
2268 attribute received from a neighboring AS is not propagated to other
2269 neighboring ASes. A new value might be set in the export filter of an
2270 external BGP instance. See <rfc id="4451"> for further discussion of
2271 BGP MED attribute.
2272
2273 <tag><label id="rta-bgp-origin">enum bgp_origin/</tag>
2274 Origin of the route: either <cf/ORIGIN_IGP/ if the route has originated
2275 in an interior routing protocol or <cf/ORIGIN_EGP/ if it's been imported
2276 from the <tt>EGP</tt> protocol (nowadays it seems to be obsolete) or
2277 <cf/ORIGIN_INCOMPLETE/ if the origin is unknown.
2278
2279 <tag><label id="rta-bgp-next-hop">ip bgp_next_hop/</tag>
2280 Next hop to be used for forwarding of packets to this destination. On
2281 internal BGP connections, it's an address of the originating router if
2282 it's inside the local AS or a boundary router the packet will leave the
2283 AS through if it's an exterior route, so each BGP speaker within the AS
2284 has a chance to use the shortest interior path possible to this point.
2285
2286 <tag><label id="rta-bgp-atomic-aggr">void bgp_atomic_aggr/ [O]</tag>
2287 This is an optional attribute which carries no value, but the sole
2288 presence of which indicates that the route has been aggregated from
2289 multiple routes by some router on the path from the originator.
2290
2291 <!-- we don't handle aggregators right since they are of a very obscure type
2292 <tag>bgp_aggregator</tag>
2293 -->
2294 <tag><label id="rta-bgp-community">clist bgp_community/ [O]</tag>
2295 List of community values associated with the route. Each such value is a
2296 pair (represented as a <cf/pair/ data type inside the filters) of 16-bit
2297 integers, the first of them containing the number of the AS which
2298 defines the community and the second one being a per-AS identifier.
2299 There are lots of uses of the community mechanism, but generally they
2300 are used to carry policy information like "don't export to USA peers".
2301 As each AS can define its own routing policy, it also has a complete
2302 freedom about which community attributes it defines and what will their
2303 semantics be.
2304
2305 <tag><label id="rta-bgp-ext-community">eclist bgp_ext_community/ [O]</tag>
2306 List of extended community values associated with the route. Extended
2307 communities have similar usage as plain communities, but they have an
2308 extended range (to allow 4B ASNs) and a nontrivial structure with a type
2309 field. Individual community values are represented using an <cf/ec/ data
2310 type inside the filters.
2311
2312 <tag><label id="rta-bgp-large-community">lclist <cf/bgp_large_community/ [O]</tag>
2313 List of large community values associated with the route. Large BGP
2314 communities is another variant of communities, but contrary to extended
2315 communities they behave very much the same way as regular communities,
2316 just larger -- they are uniform untyped triplets of 32bit numbers.
2317 Individual community values are represented using an <cf/lc/ data type
2318 inside the filters.
2319
2320 <tag><label id="rta-bgp-originator-id">quad bgp_originator_id/ [I, O]</tag>
2321 This attribute is created by the route reflector when reflecting the
2322 route and contains the router ID of the originator of the route in the
2323 local AS.
2324
2325 <tag><label id="rta-bgp-cluster-list">clist bgp_cluster_list/ [I, O]</tag>
2326 This attribute contains a list of cluster IDs of route reflectors. Each
2327 route reflector prepends its cluster ID when reflecting the route.
2328 </descrip>
2329
2330 <sect1>Example
2331 <label id="bgp-exam">
2332
2333 <p><code>
2334 protocol bgp {
2335 local as 65000; # Use a private AS number
2336 neighbor 198.51.100.130 as 64496; # Our neighbor ...
2337 multihop; # ... which is connected indirectly
2338 export filter { # We use non-trivial export rules
2339 if source = RTS_STATIC then { # Export only static routes
2340 # Assign our community
2341 bgp_community.add((65000,64501));
2342 # Artificially increase path length
2343 # by advertising local AS number twice
2344 if bgp_path ~ [= 65000 =] then
2345 bgp_path.prepend(65000);
2346 accept;
2347 }
2348 reject;
2349 };
2350 import all;
2351 source address 198.51.100.14; # Use a non-standard source address
2352 }
2353 </code>
2354
2355
2356 <sect>Device
2357 <label id="device">
2358
2359 <p>The Device protocol is not a real routing protocol. It doesn't generate any
2360 routes and it only serves as a module for getting information about network
2361 interfaces from the kernel.
2362
2363 <p>Except for very unusual circumstances, you probably should include this
2364 protocol in the configuration since almost all other protocols require network
2365 interfaces to be defined for them to work with.
2366
2367 <sect1>Configuration
2368 <label id="device-config">
2369
2370 <p><descrip>
2371
2372 <tag><label id="device-scan-time">scan time <m/number/</tag>
2373 Time in seconds between two scans of the network interface list. On
2374 systems where we are notified about interface status changes
2375 asynchronously (such as newer versions of Linux), we need to scan the
2376 list only in order to avoid confusion by lost notification messages,
2377 so the default time is set to a large value.
2378
2379 <tag><label id="device-primary">primary [ "<m/mask/" ] <m/prefix/</tag>
2380 If a network interface has more than one network address, BIRD has to
2381 choose one of them as a primary one. By default, BIRD chooses the
2382 lexicographically smallest address as the primary one.
2383
2384 This option allows to specify which network address should be chosen as
2385 a primary one. Network addresses that match <m/prefix/ are preferred to
2386 non-matching addresses. If more <cf/primary/ options are used, the first
2387 one has the highest preference. If "<m/mask/" is specified, then such
2388 <cf/primary/ option is relevant only to matching network interfaces.
2389
2390 In all cases, an address marked by operating system as secondary cannot
2391 be chosen as the primary one.
2392 </descrip>
2393
2394 <p>As the Device protocol doesn't generate any routes, it cannot have
2395 any attributes. Example configuration looks like this:
2396
2397 <p><code>
2398 protocol device {
2399 scan time 10; # Scan the interfaces often
2400 primary "eth0" 192.168.1.1;
2401 primary 192.168.0.0/16;
2402 }
2403 </code>
2404
2405
2406 <sect>Direct
2407 <label id="direct">
2408
2409 <p>The Direct protocol is a simple generator of device routes for all the
2410 directly connected networks according to the list of interfaces provided by the
2411 kernel via the Device protocol.
2412
2413 <p>The question is whether it is a good idea to have such device routes in BIRD
2414 routing table. OS kernel usually handles device routes for directly connected
2415 networks by itself so we don't need (and don't want) to export these routes to
2416 the kernel protocol. OSPF protocol creates device routes for its interfaces
2417 itself and BGP protocol is usually used for exporting aggregate routes. Although
2418 there are some use cases that use the direct protocol (like abusing eBGP as an
2419 IGP routing protocol), in most cases it is not needed to have these device
2420 routes in BIRD routing table and to use the direct protocol.
2421
2422 <p>There is one notable case when you definitely want to use the direct protocol
2423 -- running BIRD on BSD systems. Having high priority device routes for directly
2424 connected networks from the direct protocol protects kernel device routes from
2425 being overwritten or removed by IGP routes during some transient network
2426 conditions, because a lower priority IGP route for the same network is not
2427 exported to the kernel routing table. This is an issue on BSD systems only, as
2428 on Linux systems BIRD cannot change non-BIRD route in the kernel routing table.
2429
2430 <p>There are just few configuration options for the Direct protocol:
2431
2432 <p><descrip>
2433 <tag><label id="direct-iface">interface <m/pattern/ [, <m/.../]</tag>
2434 By default, the Direct protocol will generate device routes for all the
2435 interfaces available. If you want to restrict it to some subset of
2436 interfaces or addresses (e.g. if you're using multiple routing tables
2437 for policy routing and some of the policy domains don't contain all
2438 interfaces), just use this clause. See <ref id="proto-iface" name="interface">
2439 common option for detailed description. The Direct protocol uses
2440 extended interface clauses.
2441
2442 <tag><label id="direct-check-link">check link <m/switch/</tag>
2443 If enabled, a hardware link state (reported by OS) is taken into
2444 consideration. Routes for directly connected networks are generated only
2445 if link up is reported and they are withdrawn when link disappears
2446 (e.g., an ethernet cable is unplugged). Default value is no.
2447 </descrip>
2448
2449 <p>Direct device routes don't contain any specific attributes.
2450
2451 <p>Example config might look like this:
2452
2453 <p><code>
2454 protocol direct {
2455 interface "-arc*", "*"; # Exclude the ARCnets
2456 }
2457 </code>
2458
2459
2460 <sect>Kernel
2461 <label id="krt">
2462
2463 <p>The Kernel protocol is not a real routing protocol. Instead of communicating
2464 with other routers in the network, it performs synchronization of BIRD's routing
2465 tables with the OS kernel. Basically, it sends all routing table updates to the
2466 kernel and from time to time it scans the kernel tables to see whether some
2467 routes have disappeared (for example due to unnoticed up/down transition of an
2468 interface) or whether an `alien' route has been added by someone else (depending
2469 on the <cf/learn/ switch, such routes are either ignored or accepted to our
2470 table).
2471
2472 <p>Unfortunately, there is one thing that makes the routing table synchronization
2473 a bit more complicated. In the kernel routing table there are also device routes
2474 for directly connected networks. These routes are usually managed by OS itself
2475 (as a part of IP address configuration) and we don't want to touch that. They
2476 are completely ignored during the scan of the kernel tables and also the export
2477 of device routes from BIRD tables to kernel routing tables is restricted to
2478 prevent accidental interference. This restriction can be disabled using
2479 <cf/device routes/ switch.
2480
2481 <p>If your OS supports only a single routing table, you can configure only one
2482 instance of the Kernel protocol. If it supports multiple tables (in order to
2483 allow policy routing; such an OS is for example Linux), you can run as many
2484 instances as you want, but each of them must be connected to a different BIRD
2485 routing table and to a different kernel table.
2486
2487 <p>Because the kernel protocol is partially integrated with the connected
2488 routing table, there are two limitations - it is not possible to connect more
2489 kernel protocols to the same routing table and changing route destination
2490 (gateway) in an export filter of a kernel protocol does not work. Both
2491 limitations can be overcome using another routing table and the pipe protocol.
2492
2493 <sect1>Configuration
2494 <label id="krt-config">
2495
2496 <p><descrip>
2497 <tag><label id="krt-persist">persist <m/switch/</tag>
2498 Tell BIRD to leave all its routes in the routing tables when it exits
2499 (instead of cleaning them up).
2500
2501 <tag><label id="krt-scan-time">scan time <m/number/</tag>
2502 Time in seconds between two consecutive scans of the kernel routing
2503 table.
2504
2505 <tag><label id="krt-learn">learn <m/switch/</tag>
2506 Enable learning of routes added to the kernel routing tables by other
2507 routing daemons or by the system administrator. This is possible only on
2508 systems which support identification of route authorship.
2509
2510 <tag><label id="krt-device-routes">device routes <m/switch/</tag>
2511 Enable export of device routes to the kernel routing table. By default,
2512 such routes are rejected (with the exception of explicitly configured
2513 device routes from the static protocol) regardless of the export filter
2514 to protect device routes in kernel routing table (managed by OS itself)
2515 from accidental overwriting or erasing.
2516
2517 <tag><label id="krt-kernel-table">kernel table <m/number/</tag>
2518 Select which kernel table should this particular instance of the Kernel
2519 protocol work with. Available only on systems supporting multiple
2520 routing tables.
2521
2522 <tag><label id="krt-metric">metric <m/number/</tag> (Linux)
2523 Use specified value as a kernel metric (priority) for all routes sent to
2524 the kernel. When multiple routes for the same network are in the kernel
2525 routing table, the Linux kernel chooses one with lower metric. Also,
2526 routes with different metrics do not clash with each other, therefore
2527 using dedicated metric value is a reliable way to avoid overwriting
2528 routes from other sources (e.g. kernel device routes). Metric 0 has a
2529 special meaning of undefined metric, in which either OS default is used,
2530 or per-route metric can be set using <cf/krt_metric/ attribute. Default:
2531 0 (undefined).
2532
2533 <tag><label id="krt-graceful-restart">graceful restart <m/switch/</tag>
2534 Participate in graceful restart recovery. If this option is enabled and
2535 a graceful restart recovery is active, the Kernel protocol will defer
2536 synchronization of routing tables until the end of the recovery. Note
2537 that import of kernel routes to BIRD is not affected.
2538
2539 <tag><label id="krt-merge-paths">merge paths <M>switch</M> [limit <M>number</M>]</tag>
2540 Usually, only best routes are exported to the kernel protocol. With path
2541 merging enabled, both best routes and equivalent non-best routes are
2542 merged during export to generate one ECMP (equal-cost multipath) route
2543 for each network. This is useful e.g. for BGP multipath. Note that best
2544 routes are still pivotal for route export (responsible for most
2545 properties of resulting ECMP routes), while exported non-best routes are
2546 responsible just for additional multipath next hops. This option also
2547 allows to specify a limit on maximal number of nexthops in one route. By
2548 default, multipath merging is disabled. If enabled, default value of the
2549 limit is 16.
2550 </descrip>
2551
2552 <sect1>Attributes
2553 <label id="krt-attr">
2554
2555 <p>The Kernel protocol defines several attributes. These attributes are
2556 translated to appropriate system (and OS-specific) route attributes. We support
2557 these attributes:
2558
2559 <descrip>
2560 <tag><label id="rta-krt-source">int krt_source/</tag>
2561 The original source of the imported kernel route. The value is
2562 system-dependent. On Linux, it is a value of the protocol field of the
2563 route. See /etc/iproute2/rt_protos for common values. On BSD, it is
2564 based on STATIC and PROTOx flags. The attribute is read-only.
2565
2566 <tag><label id="rta-krt-metric">int krt_metric/</tag> (Linux)
2567 The kernel metric of the route. When multiple same routes are in a
2568 kernel routing table, the Linux kernel chooses one with lower metric.
2569 Note that preferred way to set kernel metric is to use protocol option
2570 <cf/metric/, unless per-route metric values are needed.
2571
2572 <tag><label id="rta-krt-prefsrc">ip krt_prefsrc/</tag> (Linux)
2573 The preferred source address. Used in source address selection for
2574 outgoing packets. Has to be one of the IP addresses of the router.
2575
2576 <tag><label id="rta-krt-realm">int krt_realm/</tag> (Linux)
2577 The realm of the route. Can be used for traffic classification.
2578
2579 <tag><label id="rta-krt-scope">int krt_scope/</tag> (Linux IPv4)
2580 The scope of the route. Valid values are 0-254, although Linux kernel
2581 may reject some values depending on route type and nexthop. It is
2582 supposed to represent `indirectness' of the route, where nexthops of
2583 routes are resolved through routes with a higher scope, but in current
2584 kernels anything below <it/link/ (253) is treated as <it/global/ (0).
2585 When not present, global scope is implied for all routes except device
2586 routes, where link scope is used by default.
2587 </descrip>
2588
2589 <p>In Linux, there is also a plenty of obscure route attributes mostly focused
2590 on tuning TCP performance of local connections. BIRD supports most of these
2591 attributes, see Linux or iproute2 documentation for their meaning. Attributes
2592 <cf/krt_lock_*/ and <cf/krt_feature_*/ have type bool, others have type int.
2593 Supported attributes are:
2594
2595 <cf/krt_mtu/, <cf/krt_lock_mtu/, <cf/krt_window/, <cf/krt_lock_window/,
2596 <cf/krt_rtt/, <cf/krt_lock_rtt/, <cf/krt_rttvar/, <cf/krt_lock_rttvar/,
2597 <cf/krt_sstresh/, <cf/krt_lock_sstresh/, <cf/krt_cwnd/, <cf/krt_lock_cwnd/,
2598 <cf/krt_advmss/, <cf/krt_lock_advmss/, <cf/krt_reordering/, <cf/krt_lock_reordering/,
2599 <cf/krt_hoplimit/, <cf/krt_lock_hoplimit/, <cf/krt_rto_min/, <cf/krt_lock_rto_min/,
2600 <cf/krt_initcwnd/, <cf/krt_initrwnd/, <cf/krt_quickack/,
2601 <cf/krt_feature_ecn/, <cf/krt_feature_allfrag/
2602
2603 <sect1>Example
2604 <label id="krt-exam">
2605
2606 <p>A simple configuration can look this way:
2607
2608 <p><code>
2609 protocol kernel {
2610 export all;
2611 }
2612 </code>
2613
2614 <p>Or for a system with two routing tables:
2615
2616 <p><code>
2617 protocol kernel { # Primary routing table
2618 learn; # Learn alien routes from the kernel
2619 persist; # Don't remove routes on bird shutdown
2620 scan time 10; # Scan kernel routing table every 10 seconds
2621 import all;
2622 export all;
2623 }
2624
2625 protocol kernel { # Secondary routing table
2626 table auxtable;
2627 kernel table 100;
2628 export all;
2629 }
2630 </code>
2631
2632
2633 <sect>OSPF
2634 <label id="ospf">
2635
2636 <sect1>Introduction
2637 <label id="ospf-intro">
2638
2639 <p>Open Shortest Path First (OSPF) is a quite complex interior gateway
2640 protocol. The current IPv4 version (OSPFv2) is defined in <rfc id="2328"> and
2641 the current IPv6 version (OSPFv3) is defined in <rfc id="5340"> It's a link
2642 state (a.k.a. shortest path first) protocol -- each router maintains a database
2643 describing the autonomous system's topology. Each participating router has an
2644 identical copy of the database and all routers run the same algorithm
2645 calculating a shortest path tree with themselves as a root. OSPF chooses the
2646 least cost path as the best path.
2647
2648 <p>In OSPF, the autonomous system can be split to several areas in order to
2649 reduce the amount of resources consumed for exchanging the routing information
2650 and to protect the other areas from incorrect routing data. Topology of the area
2651 is hidden to the rest of the autonomous system.
2652
2653 <p>Another very important feature of OSPF is that it can keep routing information
2654 from other protocols (like Static or BGP) in its link state database as external
2655 routes. Each external route can be tagged by the advertising router, making it
2656 possible to pass additional information between routers on the boundary of the
2657 autonomous system.
2658
2659 <p>OSPF quickly detects topological changes in the autonomous system (such as
2660 router interface failures) and calculates new loop-free routes after a short
2661 period of convergence. Only a minimal amount of routing traffic is involved.
2662
2663 <p>Each router participating in OSPF routing periodically sends Hello messages
2664 to all its interfaces. This allows neighbors to be discovered dynamically. Then
2665 the neighbors exchange theirs parts of the link state database and keep it
2666 identical by flooding updates. The flooding process is reliable and ensures that
2667 each router detects all changes.
2668
2669 <sect1>Configuration
2670 <label id="ospf-config">
2671
2672 <p>In the main part of configuration, there can be multiple definitions of OSPF
2673 areas, each with a different id. These definitions includes many other switches
2674 and multiple definitions of interfaces. Definition of interface may contain many
2675 switches and constant definitions and list of neighbors on nonbroadcast
2676 networks.
2677
2678 <code>
2679 protocol ospf &lt;name&gt; {
2680 rfc1583compat &lt;switch&gt;;
2681 instance id &lt;num&gt;;
2682 stub router &lt;switch&gt;;
2683 tick &lt;num&gt;;
2684 ecmp &lt;switch&gt; [limit &lt;num&gt;];
2685 merge external &lt;switch&gt;;
2686 area &lt;id&gt; {
2687 stub;
2688 nssa;
2689 summary &lt;switch&gt;;
2690 default nssa &lt;switch&gt;;
2691 default cost &lt;num&gt;;
2692 default cost2 &lt;num&gt;;
2693 translator &lt;switch&gt;;
2694 translator stability &lt;num&gt;;
2695
2696 networks {
2697 &lt;prefix&gt;;
2698 &lt;prefix&gt; hidden;
2699 }
2700 external {
2701 &lt;prefix&gt;;
2702 &lt;prefix&gt; hidden;
2703 &lt;prefix&gt; tag &lt;num&gt;;
2704 }
2705 stubnet &lt;prefix&gt;;
2706 stubnet &lt;prefix&gt; {
2707 hidden &lt;switch&gt;;
2708 summary &lt;switch&gt;;
2709 cost &lt;num&gt;;
2710 }
2711 interface &lt;interface pattern&gt; [instance &lt;num&gt;] {
2712 cost &lt;num&gt;;
2713 stub &lt;switch&gt;;
2714 hello &lt;num&gt;;
2715 poll &lt;num&gt;;
2716 retransmit &lt;num&gt;;
2717 priority &lt;num&gt;;
2718 wait &lt;num&gt;;
2719 dead count &lt;num&gt;;
2720 dead &lt;num&gt;;
2721 secondary &lt;switch&gt;;
2722 rx buffer [normal|large|&lt;num&gt;];
2723 tx length &lt;num&gt;;
2724 type [broadcast|bcast|pointopoint|ptp|
2725 nonbroadcast|nbma|pointomultipoint|ptmp];
2726 link lsa suppression &lt;switch&gt;;
2727 strict nonbroadcast &lt;switch&gt;;
2728 real broadcast &lt;switch&gt;;
2729 ptp netmask &lt;switch&gt;;
2730 check link &lt;switch&gt;;
2731 bfd &lt;switch&gt;;
2732 ecmp weight &lt;num&gt;;
2733 ttl security [&lt;switch&gt;; | tx only]
2734 tx class|dscp &lt;num&gt;;
2735 tx priority &lt;num&gt;;
2736 authentication none|simple|cryptographic;
2737 password "&lt;text&gt;";
2738 password "&lt;text&gt;" {
2739 id &lt;num&gt;;
2740 generate from "&lt;date&gt;";
2741 generate to "&lt;date&gt;";
2742 accept from "&lt;date&gt;";
2743 accept to "&lt;date&gt;";
2744 from "&lt;date&gt;";
2745 to "&lt;date&gt;";
2746 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
2747 };
2748 neighbors {
2749 &lt;ip&gt;;
2750 &lt;ip&gt; eligible;
2751 };
2752 };
2753 virtual link &lt;id&gt; [instance &lt;num&gt;] {
2754 hello &lt;num&gt;;
2755 retransmit &lt;num&gt;;
2756 wait &lt;num&gt;;
2757 dead count &lt;num&gt;;
2758 dead &lt;num&gt;;
2759 authentication none|simple|cryptographic;
2760 password "&lt;text&gt;";
2761 password "&lt;text&gt;" {
2762 id &lt;num&gt;;
2763 generate from "&lt;date&gt;";
2764 generate to "&lt;date&gt;";
2765 accept from "&lt;date&gt;";
2766 accept to "&lt;date&gt;";
2767 from "&lt;date&gt;";
2768 to "&lt;date&gt;";
2769 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
2770 };
2771 };
2772 };
2773 }
2774 </code>
2775
2776 <descrip>
2777 <tag><label id="ospf-rfc1583compat">rfc1583compat <M>switch</M></tag>
2778 This option controls compatibility of routing table calculation with
2779 <rfc id="1583">. Default value is no.
2780
2781 <tag><label id="ospf-instance-id">instance id <m/num/</tag>
2782 When multiple OSPF protocol instances are active on the same links, they
2783 should use different instance IDs to distinguish their packets. Although
2784 it could be done on per-interface basis, it is often preferred to set
2785 one instance ID to whole OSPF domain/topology (e.g., when multiple
2786 instances are used to represent separate logical topologies on the same
2787 physical network). This option specifies the default instance ID for all
2788 interfaces of the OSPF instance. Note that this option, if used, must
2789 precede interface definitions. Default value is 0.
2790
2791 <tag><label id="ospf-stub-router">stub router <M>switch</M></tag>
2792 This option configures the router to be a stub router, i.e., a router
2793 that participates in the OSPF topology but does not allow transit
2794 traffic. In OSPFv2, this is implemented by advertising maximum metric
2795 for outgoing links. In OSPFv3, the stub router behavior is announced by
2796 clearing the R-bit in the router LSA. See <rfc id="6987"> for details.
2797 Default value is no.
2798
2799 <tag><label id="ospf-tick">tick <M>num</M></tag>
2800 The routing table calculation and clean-up of areas' databases is not
2801 performed when a single link state change arrives. To lower the CPU
2802 utilization, it's processed later at periodical intervals of <m/num/
2803 seconds. The default value is 1.
2804
2805 <tag><label id="ospf-ecmp">ecmp <M>switch</M> [limit <M>number</M>]</tag>
2806 This option specifies whether OSPF is allowed to generate ECMP
2807 (equal-cost multipath) routes. Such routes are used when there are
2808 several directions to the destination, each with the same (computed)
2809 cost. This option also allows to specify a limit on maximum number of
2810 nexthops in one route. By default, ECMP is disabled. If enabled,
2811 default value of the limit is 16.
2812
2813 <tag><label id="ospf-merge-external">merge external <M>switch</M></tag>
2814 This option specifies whether OSPF should merge external routes from
2815 different routers/LSAs for the same destination. When enabled together
2816 with <cf/ecmp/, equal-cost external routes will be combined to multipath
2817 routes in the same way as regular routes. When disabled, external routes
2818 from different LSAs are treated as separate even if they represents the
2819 same destination. Default value is no.
2820
2821 <tag><label id="ospf-area">area <M>id</M></tag>
2822 This defines an OSPF area with given area ID (an integer or an IPv4
2823 address, similarly to a router ID). The most important area is the
2824 backbone (ID 0) to which every other area must be connected.
2825
2826 <tag><label id="ospf-stub">stub</tag>
2827 This option configures the area to be a stub area. External routes are
2828 not flooded into stub areas. Also summary LSAs can be limited in stub
2829 areas (see option <cf/summary/). By default, the area is not a stub
2830 area.
2831
2832 <tag><label id="ospf-nssa">nssa</tag>
2833 This option configures the area to be a NSSA (Not-So-Stubby Area). NSSA
2834 is a variant of a stub area which allows a limited way of external route
2835 propagation. Global external routes are not propagated into a NSSA, but
2836 an external route can be imported into NSSA as a (area-wide) NSSA-LSA
2837 (and possibly translated and/or aggregated on area boundary). By
2838 default, the area is not NSSA.
2839
2840 <tag><label id="ospf-summary">summary <M>switch</M></tag>
2841 This option controls propagation of summary LSAs into stub or NSSA
2842 areas. If enabled, summary LSAs are propagated as usual, otherwise just
2843 the default summary route (0.0.0.0/0) is propagated (this is sometimes
2844 called totally stubby area). If a stub area has more area boundary
2845 routers, propagating summary LSAs could lead to more efficient routing
2846 at the cost of larger link state database. Default value is no.
2847
2848 <tag><label id="ospf-default-nssa">default nssa <M>switch</M></tag>
2849 When <cf/summary/ option is enabled, default summary route is no longer
2850 propagated to the NSSA. In that case, this option allows to originate
2851 default route as NSSA-LSA to the NSSA. Default value is no.
2852
2853 <tag><label id="ospf-default-cost">default cost <M>num</M></tag>
2854 This option controls the cost of a default route propagated to stub and
2855 NSSA areas. Default value is 1000.
2856
2857 <tag><label id="ospf-default-cost2">default cost2 <M>num</M></tag>
2858 When a default route is originated as NSSA-LSA, its cost can use either
2859 type 1 or type 2 metric. This option allows to specify the cost of a
2860 default route in type 2 metric. By default, type 1 metric (option
2861 <cf/default cost/) is used.
2862
2863 <tag><label id="ospf-translator">translator <M>switch</M></tag>
2864 This option controls translation of NSSA-LSAs into external LSAs. By
2865 default, one translator per NSSA is automatically elected from area
2866 boundary routers. If enabled, this area boundary router would
2867 unconditionally translate all NSSA-LSAs regardless of translator
2868 election. Default value is no.
2869
2870 <tag><label id="ospf-translator-stability">translator stability <M>num</M></tag>
2871 This option controls the translator stability interval (in seconds).
2872 When the new translator is elected, the old one keeps translating until
2873 the interval is over. Default value is 40.
2874
2875 <tag><label id="ospf-networks">networks { <m/set/ }</tag>
2876 Definition of area IP ranges. This is used in summary LSA origination.
2877 Hidden networks are not propagated into other areas.
2878
2879 <tag><label id="ospf-external">external { <m/set/ }</tag>
2880 Definition of external area IP ranges for NSSAs. This is used for
2881 NSSA-LSA translation. Hidden networks are not translated into external
2882 LSAs. Networks can have configured route tag.
2883
2884 <tag><label id="ospf-stubnet">stubnet <m/prefix/ { <m/options/ }</tag>
2885 Stub networks are networks that are not transit networks between OSPF
2886 routers. They are also propagated through an OSPF area as a part of a
2887 link state database. By default, BIRD generates a stub network record
2888 for each primary network address on each OSPF interface that does not
2889 have any OSPF neighbors, and also for each non-primary network address
2890 on each OSPF interface. This option allows to alter a set of stub
2891 networks propagated by this router.
2892
2893 Each instance of this option adds a stub network with given network
2894 prefix to the set of propagated stub network, unless option <cf/hidden/
2895 is used. It also suppresses default stub networks for given network
2896 prefix. When option <cf/summary/ is used, also default stub networks
2897 that are subnetworks of given stub network are suppressed. This might be
2898 used, for example, to aggregate generated stub networks.
2899
2900 <tag><label id="ospf-iface">interface <M>pattern</M> [instance <m/num/]</tag>
2901 Defines that the specified interfaces belong to the area being defined.
2902 See <ref id="proto-iface" name="interface"> common option for detailed
2903 description. In OSPFv2, extended interface clauses are used, because
2904 each network prefix is handled as a separate virtual interface.
2905
2906 You can specify alternative instance ID for the interface definition,
2907 therefore it is possible to have several instances of that interface
2908 with different options or even in different areas. For OSPFv2, instance
2909 ID support is an extension (<rfc id="6549">) and is supposed to be set
2910 per-protocol. For OSPFv3, it is an integral feature.
2911
2912 <tag><label id="ospf-virtual-link">virtual link <M>id</M> [instance <m/num/]</tag>
2913 Virtual link to router with the router id. Virtual link acts as a
2914 point-to-point interface belonging to backbone. The actual area is used
2915 as a transport area. This item cannot be in the backbone. Like with
2916 <cf/interface/ option, you could also use several virtual links to one
2917 destination with different instance IDs.
2918
2919 <tag><label id="ospf-cost">cost <M>num</M></tag>
2920 Specifies output cost (metric) of an interface. Default value is 10.
2921
2922 <tag><label id="ospf-stub-iface">stub <M>switch</M></tag>
2923 If set to interface it does not listen to any packet and does not send
2924 any hello. Default value is no.
2925
2926 <tag><label id="ospf-hello">hello <M>num</M></tag>
2927 Specifies interval in seconds between sending of Hello messages. Beware,
2928 all routers on the same network need to have the same hello interval.
2929 Default value is 10.
2930
2931 <tag><label id="ospf-poll">poll <M>num</M></tag>
2932 Specifies interval in seconds between sending of Hello messages for some
2933 neighbors on NBMA network. Default value is 20.
2934
2935 <tag><label id="ospf-retransmit">retransmit <M>num</M></tag>
2936 Specifies interval in seconds between retransmissions of unacknowledged
2937 updates. Default value is 5.
2938
2939 <tag><label id="ospf-priority">priority <M>num</M></tag>
2940 On every multiple access network (e.g., the Ethernet) Designated Router
2941 and Backup Designated router are elected. These routers have some special
2942 functions in the flooding process. Higher priority increases preferences
2943 in this election. Routers with priority 0 are not eligible. Default
2944 value is 1.
2945
2946 <tag><label id="ospf-wait">wait <M>num</M></tag>
2947 After start, router waits for the specified number of seconds between
2948 starting election and building adjacency. Default value is 4*<m/hello/.
2949
2950 <tag><label id="ospf-dead-count">dead count <M>num</M></tag>
2951 When the router does not receive any messages from a neighbor in
2952 <m/dead count/*<m/hello/ seconds, it will consider the neighbor down.
2953
2954 <tag><label id="ospf-dead">dead <M>num</M></tag>
2955 When the router does not receive any messages from a neighbor in
2956 <m/dead/ seconds, it will consider the neighbor down. If both directives
2957 <cf/dead count/ and <cf/dead/ are used, <cf/dead/ has precedence.
2958
2959 <tag><label id="ospf-secondary">secondary <M>switch</M></tag>
2960 On BSD systems, older versions of BIRD supported OSPFv2 only for the
2961 primary IP address of an interface, other IP ranges on the interface
2962 were handled as stub networks. Since v1.4.1, regular operation on
2963 secondary IP addresses is supported, but disabled by default for
2964 compatibility. This option allows to enable it. The option is a
2965 transitional measure, will be removed in the next major release as the
2966 behavior will be changed. On Linux systems, the option is irrelevant, as
2967 operation on non-primary addresses is already the regular behavior.
2968
2969 <tag><label id="ospf-rx-buffer">rx buffer <M>num</M></tag>
2970 This option allows to specify the size of buffers used for packet
2971 processing. The buffer size should be bigger than maximal size of any
2972 packets. By default, buffers are dynamically resized as needed, but a
2973 fixed value could be specified. Value <cf/large/ means maximal allowed
2974 packet size - 65535.
2975
2976 <tag><label id="ospf-tx-length">tx length <M>num</M></tag>
2977 Transmitted OSPF messages that contain large amount of information are
2978 segmented to separate OSPF packets to avoid IP fragmentation. This
2979 option specifies the soft ceiling for the length of generated OSPF
2980 packets. Default value is the MTU of the network interface. Note that
2981 larger OSPF packets may still be generated if underlying OSPF messages
2982 cannot be splitted (e.g. when one large LSA is propagated).
2983
2984 <tag><label id="ospf-type-bcast">type broadcast|bcast</tag>
2985 BIRD detects a type of a connected network automatically, but sometimes
2986 it's convenient to force use of a different type manually. On broadcast
2987 networks (like ethernet), flooding and Hello messages are sent using
2988 multicasts (a single packet for all the neighbors). A designated router
2989 is elected and it is responsible for synchronizing the link-state
2990 databases and originating network LSAs. This network type cannot be used
2991 on physically NBMA networks and on unnumbered networks (networks without
2992 proper IP prefix).
2993
2994 <tag><label id="ospf-type-ptp">type pointopoint|ptp</tag>
2995 Point-to-point networks connect just 2 routers together. No election is
2996 performed and no network LSA is originated, which makes it simpler and
2997 faster to establish. This network type is useful not only for physically
2998 PtP ifaces (like PPP or tunnels), but also for broadcast networks used
2999 as PtP links. This network type cannot be used on physically NBMA
3000 networks.
3001
3002 <tag><label id="ospf-type-nbma">type nonbroadcast|nbma</tag>
3003 On NBMA networks, the packets are sent to each neighbor separately
3004 because of lack of multicast capabilities. Like on broadcast networks,
3005 a designated router is elected, which plays a central role in propagation
3006 of LSAs. This network type cannot be used on unnumbered networks.
3007
3008 <tag><label id="ospf-type-ptmp">type pointomultipoint|ptmp</tag>
3009 This is another network type designed to handle NBMA networks. In this
3010 case the NBMA network is treated as a collection of PtP links. This is
3011 useful if not every pair of routers on the NBMA network has direct
3012 communication, or if the NBMA network is used as an (possibly
3013 unnumbered) PtP link.
3014
3015 <tag><label id="ospf-link-lsa-suppression">link lsa suppression <m/switch/</tag>
3016 In OSPFv3, link LSAs are generated for each link, announcing link-local
3017 IPv6 address of the router to its local neighbors. These are useless on
3018 PtP or PtMP networks and this option allows to suppress the link LSA
3019 origination for such interfaces. The option is ignored on other than PtP
3020 or PtMP interfaces. Default value is no.
3021
3022 <tag><label id="ospf-strict-nonbroadcast">strict nonbroadcast <m/switch/</tag>
3023 If set, don't send hello to any undefined neighbor. This switch is
3024 ignored on other than NBMA or PtMP interfaces. Default value is no.
3025
3026 <tag><label id="ospf-real-broadcast">real broadcast <m/switch/</tag>
3027 In <cf/type broadcast/ or <cf/type ptp/ network configuration, OSPF
3028 packets are sent as IP multicast packets. This option changes the
3029 behavior to using old-fashioned IP broadcast packets. This may be useful
3030 as a workaround if IP multicast for some reason does not work or does
3031 not work reliably. This is a non-standard option and probably is not
3032 interoperable with other OSPF implementations. Default value is no.
3033
3034 <tag><label id="ospf-ptp-netmask">ptp netmask <m/switch/</tag>
3035 In <cf/type ptp/ network configurations, OSPFv2 implementations should
3036 ignore received netmask field in hello packets and should send hello
3037 packets with zero netmask field on unnumbered PtP links. But some OSPFv2
3038 implementations perform netmask checking even for PtP links. This option
3039 specifies whether real netmask will be used in hello packets on <cf/type
3040 ptp/ interfaces. You should ignore this option unless you meet some
3041 compatibility problems related to this issue. Default value is no for
3042 unnumbered PtP links, yes otherwise.
3043
3044 <tag><label id="ospf-check-link">check link <M>switch</M></tag>
3045 If set, a hardware link state (reported by OS) is taken into consideration.
3046 When a link disappears (e.g. an ethernet cable is unplugged), neighbors
3047 are immediately considered unreachable and only the address of the iface
3048 (instead of whole network prefix) is propagated. It is possible that
3049 some hardware drivers or platforms do not implement this feature.
3050 Default value is no.
3051
3052 <tag><label id="ospf-bfd">bfd <M>switch</M></tag>
3053 OSPF could use BFD protocol as an advisory mechanism for neighbor
3054 liveness and failure detection. If enabled, BIRD setups a BFD session
3055 for each OSPF neighbor and tracks its liveness by it. This has an
3056 advantage of an order of magnitude lower detection times in case of
3057 failure. Note that BFD protocol also has to be configured, see
3058 <ref id="bfd" name="BFD"> section for details. Default value is no.
3059
3060 <tag><label id="ospf-ttl-security">ttl security [<m/switch/ | tx only]</tag>
3061 TTL security is a feature that protects routing protocols from remote
3062 spoofed packets by using TTL 255 instead of TTL 1 for protocol packets
3063 destined to neighbors. Because TTL is decremented when packets are
3064 forwarded, it is non-trivial to spoof packets with TTL 255 from remote
3065 locations. Note that this option would interfere with OSPF virtual
3066 links.
3067
3068 If this option is enabled, the router will send OSPF packets with TTL
3069 255 and drop received packets with TTL less than 255. If this option si
3070 set to <cf/tx only/, TTL 255 is used for sent packets, but is not
3071 checked for received packets. Default value is no.
3072
3073 <tag><label id="ospf-tx-class">tx class|dscp|priority <m/num/</tag>
3074 These options specify the ToS/DiffServ/Traffic class/Priority of the
3075 outgoing OSPF packets. See <ref id="proto-tx-class" name="tx class"> common
3076 option for detailed description.
3077
3078 <tag><label id="ospf-ecmp-weight">ecmp weight <M>num</M></tag>
3079 When ECMP (multipath) routes are allowed, this value specifies a
3080 relative weight used for nexthops going through the iface. Allowed
3081 values are 1-256. Default value is 1.
3082
3083 <tag><label id="ospf-auth-none">authentication none</tag>
3084 No passwords are sent in OSPF packets. This is the default value.
3085
3086 <tag><label id="ospf-auth-simple">authentication simple</tag>
3087 Every packet carries 8 bytes of password. Received packets lacking this
3088 password are ignored. This authentication mechanism is very weak.
3089 This option is not available in OSPFv3.
3090
3091 <tag><label id="ospf-auth-cryptographic">authentication cryptographic</tag>
3092 An authentication code is appended to every packet. The specific
3093 cryptographic algorithm is selected by option <cf/algorithm/ for each
3094 key. The default cryptographic algorithm for OSPFv2 keys is Keyed-MD5
3095 and for OSPFv3 keys is HMAC-SHA-256. Passwords are not sent open via
3096 network, so this mechanism is quite secure. Packets can still be read by
3097 an attacker.
3098
3099 <tag><label id="ospf-pass">password "<M>text</M>"</tag>
3100 Specifies a password used for authentication. See
3101 <ref id="proto-pass" name="password"> common option for detailed
3102 description.
3103
3104 <tag><label id="ospf-neighbors">neighbors { <m/set/ } </tag>
3105 A set of neighbors to which Hello messages on NBMA or PtMP networks are
3106 to be sent. For NBMA networks, some of them could be marked as eligible.
3107 In OSPFv3, link-local addresses should be used, using global ones is
3108 possible, but it is nonstandard and might be problematic. And definitely,
3109 link-local and global addresses should not be mixed.
3110 </descrip>
3111
3112 <sect1>Attributes
3113 <label id="ospf-attr">
3114
3115 <p>OSPF defines four route attributes. Each internal route has a <cf/metric/.
3116
3117 <p>Metric is ranging from 1 to infinity (65535). External routes use
3118 <cf/metric type 1/ or <cf/metric type 2/. A <cf/metric of type 1/ is comparable
3119 with internal <cf/metric/, a <cf/metric of type 2/ is always longer than any
3120 <cf/metric of type 1/ or any <cf/internal metric/. <cf/Internal metric/ or
3121 <cf/metric of type 1/ is stored in attribute <cf/ospf_metric1/, <cf/metric type
3122 2/ is stored in attribute <cf/ospf_metric2/. If you specify both metrics only
3123 metric1 is used.
3124
3125 <p>Each external route can also carry attribute <cf/ospf_tag/ which is a 32-bit
3126 integer which is used when exporting routes to other protocols; otherwise, it
3127 doesn't affect routing inside the OSPF domain at all. The fourth attribute
3128 <cf/ospf_router_id/ is a router ID of the router advertising that route /
3129 network. This attribute is read-only. Default is <cf/ospf_metric2 = 10000/ and
3130 <cf/ospf_tag = 0/.
3131
3132 <sect1>Example
3133 <label id="ospf-exam">
3134
3135 <p><code>
3136 protocol ospf MyOSPF {
3137 rfc1583compat yes;
3138 tick 2;
3139 export filter {
3140 if source = RTS_BGP then {
3141 ospf_metric1 = 100;
3142 accept;
3143 }
3144 reject;
3145 };
3146 area 0.0.0.0 {
3147 interface "eth*" {
3148 cost 11;
3149 hello 15;
3150 priority 100;
3151 retransmit 7;
3152 authentication simple;
3153 password "aaa";
3154 };
3155 interface "ppp*" {
3156 cost 100;
3157 authentication cryptographic;
3158 password "abc" {
3159 id 1;
3160 generate to "22-04-2003 11:00:06";
3161 accept from "17-01-2001 12:01:05";
3162 algorithm hmac sha384;
3163 };
3164 password "def" {
3165 id 2;
3166 generate to "22-07-2005 17:03:21";
3167 accept from "22-02-2001 11:34:06";
3168 algorithm hmac sha512;
3169 };
3170 };
3171 interface "arc0" {
3172 cost 10;
3173 stub yes;
3174 };
3175 interface "arc1";
3176 };
3177 area 120 {
3178 stub yes;
3179 networks {
3180 172.16.1.0/24;
3181 172.16.2.0/24 hidden;
3182 }
3183 interface "-arc0" , "arc*" {
3184 type nonbroadcast;
3185 authentication none;
3186 strict nonbroadcast yes;
3187 wait 120;
3188 poll 40;
3189 dead count 8;
3190 neighbors {
3191 192.168.120.1 eligible;
3192 192.168.120.2;
3193 192.168.120.10;
3194 };
3195 };
3196 };
3197 }
3198 </code>
3199
3200
3201 <sect>Pipe
3202 <label id="pipe">
3203
3204 <sect1>Introduction
3205 <label id="pipe-intro">
3206
3207 <p>The Pipe protocol serves as a link between two routing tables, allowing
3208 routes to be passed from a table declared as primary (i.e., the one the pipe is
3209 connected to using the <cf/table/ configuration keyword) to the secondary one
3210 (declared using <cf/peer table/) and vice versa, depending on what's allowed by
3211 the filters. Export filters control export of routes from the primary table to
3212 the secondary one, import filters control the opposite direction.
3213
3214 <p>The Pipe protocol may work in the transparent mode mode or in the opaque
3215 mode. In the transparent mode, the Pipe protocol retransmits all routes from
3216 one table to the other table, retaining their original source and attributes.
3217 If import and export filters are set to accept, then both tables would have
3218 the same content. The transparent mode is the default mode.
3219
3220 <p>In the opaque mode, the Pipe protocol retransmits optimal route from one
3221 table to the other table in a similar way like other protocols send and receive
3222 routes. Retransmitted route will have the source set to the Pipe protocol, which
3223 may limit access to protocol specific route attributes. This mode is mainly for
3224 compatibility, it is not suggested for new configs. The mode can be changed by
3225 <tt/mode/ option.
3226
3227 <p>The primary use of multiple routing tables and the Pipe protocol is for
3228 policy routing, where handling of a single packet doesn't depend only on its
3229 destination address, but also on its source address, source interface, protocol
3230 type and other similar parameters. In many systems (Linux being a good example),
3231 the kernel allows to enforce routing policies by defining routing rules which
3232 choose one of several routing tables to be used for a packet according to its
3233 parameters. Setting of these rules is outside the scope of BIRD's work (on
3234 Linux, you can use the <tt/ip/ command), but you can create several routing
3235 tables in BIRD, connect them to the kernel ones, use filters to control which
3236 routes appear in which tables and also you can employ the Pipe protocol for
3237 exporting a selected subset of one table to another one.
3238
3239 <sect1>Configuration
3240 <label id="pipe-config">
3241
3242 <p><descrip>
3243 <tag><label id="pipe-peer-table">peer table <m/table/</tag>
3244 Defines secondary routing table to connect to. The primary one is
3245 selected by the <cf/table/ keyword.
3246
3247 <tag><label id="pipe-mode">mode opaque|transparent</tag>
3248 Specifies the mode for the pipe to work in. Default is transparent.
3249 </descrip>
3250
3251 <sect1>Attributes
3252 <label id="pipe-attr">
3253
3254 <p>The Pipe protocol doesn't define any route attributes.
3255
3256 <sect1>Example
3257 <label id="pipe-exam">
3258
3259 <p>Let's consider a router which serves as a boundary router of two different
3260 autonomous systems, each of them connected to a subset of interfaces of the
3261 router, having its own exterior connectivity and wishing to use the other AS as
3262 a backup connectivity in case of outage of its own exterior line.
3263
3264 <p>Probably the simplest solution to this situation is to use two routing tables
3265 (we'll call them <cf/as1/ and <cf/as2/) and set up kernel routing rules, so that
3266 packets having arrived from interfaces belonging to the first AS will be routed
3267 according to <cf/as1/ and similarly for the second AS. Thus we have split our
3268 router to two logical routers, each one acting on its own routing table, having
3269 its own routing protocols on its own interfaces. In order to use the other AS's
3270 routes for backup purposes, we can pass the routes between the tables through a
3271 Pipe protocol while decreasing their preferences and correcting their BGP paths
3272 to reflect the AS boundary crossing.
3273
3274 <code>
3275 table as1; # Define the tables
3276 table as2;
3277
3278 protocol kernel kern1 { # Synchronize them with the kernel
3279 table as1;
3280 kernel table 1;
3281 }
3282
3283 protocol kernel kern2 {
3284 table as2;
3285 kernel table 2;
3286 }
3287
3288 protocol bgp bgp1 { # The outside connections
3289 table as1;
3290 local as 1;
3291 neighbor 192.168.0.1 as 1001;
3292 export all;
3293 import all;
3294 }
3295
3296 protocol bgp bgp2 {
3297 table as2;
3298 local as 2;
3299 neighbor 10.0.0.1 as 1002;
3300 export all;
3301 import all;
3302 }
3303
3304 protocol pipe { # The Pipe
3305 table as1;
3306 peer table as2;
3307 export filter {
3308 if net ~ [ 1.0.0.0/8+] then { # Only AS1 networks
3309 if preference>10 then preference = preference-10;
3310 if source=RTS_BGP then bgp_path.prepend(1);
3311 accept;
3312 }
3313 reject;
3314 };
3315 import filter {
3316 if net ~ [ 2.0.0.0/8+] then { # Only AS2 networks
3317 if preference>10 then preference = preference-10;
3318 if source=RTS_BGP then bgp_path.prepend(2);
3319 accept;
3320 }
3321 reject;
3322 };
3323 }
3324 </code>
3325
3326
3327 <sect>RAdv
3328 <label id="radv">
3329
3330 <sect1>Introduction
3331 <label id="radv-intro">
3332
3333 <p>The RAdv protocol is an implementation of Router Advertisements, which are
3334 used in the IPv6 stateless autoconfiguration. IPv6 routers send (in irregular
3335 time intervals or as an answer to a request) advertisement packets to connected
3336 networks. These packets contain basic information about a local network (e.g. a
3337 list of network prefixes), which allows network hosts to autoconfigure network
3338 addresses and choose a default route. BIRD implements router behavior as defined
3339 in <rfc id="4861"> and also the DNS extensions from <rfc id="6106">.
3340
3341 <sect1>Configuration
3342 <label id="radv-config">
3343
3344 <p>There are several classes of definitions in RAdv configuration -- interface
3345 definitions, prefix definitions and DNS definitions:
3346
3347 <descrip>
3348 <tag><label id="radv-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
3349 Interface definitions specify a set of interfaces on which the
3350 protocol is activated and contain interface specific options.
3351 See <ref id="proto-iface" name="interface"> common options for
3352 detailed description.
3353
3354 <tag><label id="radv-prefix">prefix <m/prefix/ { <m/options/ }</tag>
3355 Prefix definitions allow to modify a list of advertised prefixes. By
3356 default, the advertised prefixes are the same as the network prefixes
3357 assigned to the interface. For each network prefix, the matching prefix
3358 definition is found and its options are used. If no matching prefix
3359 definition is found, the prefix is used with default options.
3360
3361 Prefix definitions can be either global or interface-specific. The
3362 second ones are part of interface options. The prefix definition
3363 matching is done in the first-match style, when interface-specific
3364 definitions are processed before global definitions. As expected, the
3365 prefix definition is matching if the network prefix is a subnet of the
3366 prefix in prefix definition.
3367
3368 <tag><label id="radv-rdnss">rdnss { <m/options/ }</tag>
3369 RDNSS definitions allow to specify a list of advertised recursive DNS
3370 servers together with their options. As options are seldom necessary,
3371 there is also a short variant <cf>rdnss <m/address/</cf> that just
3372 specifies one DNS server. Multiple definitions are cumulative. RDNSS
3373 definitions may also be interface-specific when used inside interface
3374 options. By default, interface uses both global and interface-specific
3375 options, but that can be changed by <cf/rdnss local/ option.
3376 dsc-iface
3377 <tag><label id="radv-dnssl">dnssl { <m/options/ }</tag>
3378 DNSSL definitions allow to specify a list of advertised DNS search
3379 domains together with their options. Like <cf/rdnss/ above, multiple
3380 definitions are cumulative, they can be used also as interface-specific
3381 options and there is a short variant <cf>dnssl <m/domain/</cf> that just
3382 specifies one DNS search domain.
3383
3384 <tag><label id="radv-trigger">trigger <m/prefix/</tag>
3385 RAdv protocol could be configured to change its behavior based on
3386 availability of routes. When this option is used, the protocol waits in
3387 suppressed state until a <it/trigger route/ (for the specified network)
3388 is exported to the protocol, the protocol also returnsd to suppressed
3389 state if the <it/trigger route/ disappears. Note that route export
3390 depends on specified export filter, as usual. This option could be used,
3391 e.g., for handling failover in multihoming scenarios.
3392
3393 During suppressed state, router advertisements are generated, but with
3394 some fields zeroed. Exact behavior depends on which fields are zeroed,
3395 this can be configured by <cf/sensitive/ option for appropriate
3396 fields. By default, just <cf/default lifetime/ (also called <cf/router
3397 lifetime/) is zeroed, which means hosts cannot use the router as a
3398 default router. <cf/preferred lifetime/ and <cf/valid lifetime/ could
3399 also be configured as <cf/sensitive/ for a prefix, which would cause
3400 autoconfigured IPs to be deprecated or even removed.
3401 </descrip>
3402
3403 <p>Interface specific options:
3404
3405 <descrip>
3406 <tag><label id="radv-iface-max-ra-interval">max ra interval <m/expr/</tag>
3407 Unsolicited router advertisements are sent in irregular time intervals.
3408 This option specifies the maximum length of these intervals, in seconds.
3409 Valid values are 4-1800. Default: 600
3410
3411 <tag><label id="radv-iface-min-ra-interval">min ra interval <m/expr/</tag>
3412 This option specifies the minimum length of that intervals, in seconds.
3413 Must be at least 3 and at most 3/4 * <cf/max ra interval/. Default:
3414 about 1/3 * <cf/max ra interval/.
3415
3416 <tag><label id="radv-iface-min-delay">min delay <m/expr/</tag>
3417 The minimum delay between two consecutive router advertisements, in
3418 seconds. Default: 3
3419
3420 <tag><label id="radv-iface-managed">managed <m/switch/</tag>
3421 This option specifies whether hosts should use DHCPv6 for IP address
3422 configuration. Default: no
3423
3424 <tag><label id="radv-iface-other-config">other config <m/switch/</tag>
3425 This option specifies whether hosts should use DHCPv6 to receive other
3426 configuration information. Default: no
3427
3428 <tag><label id="radv-iface-link-mtu">link mtu <m/expr/</tag>
3429 This option specifies which value of MTU should be used by hosts. 0
3430 means unspecified. Default: 0
3431
3432 <tag><label id="radv-iface-reachable-time">reachable time <m/expr/</tag>
3433 This option specifies the time (in milliseconds) how long hosts should
3434 assume a neighbor is reachable (from the last confirmation). Maximum is
3435 3600000, 0 means unspecified. Default 0.
3436
3437 <tag><label id="radv-iface-retrans-timer">retrans timer <m/expr/</tag>
3438 This option specifies the time (in milliseconds) how long hosts should
3439 wait before retransmitting Neighbor Solicitation messages. 0 means
3440 unspecified. Default 0.
3441
3442 <tag><label id="radv-iface-current-hop-limit">current hop limit <m/expr/</tag>
3443 This option specifies which value of Hop Limit should be used by
3444 hosts. Valid values are 0-255, 0 means unspecified. Default: 64
3445
3446 <tag><label id="radv-iface-default-lifetime">default lifetime <m/expr/ [sensitive <m/switch/]</tag>
3447 This option specifies the time (in seconds) how long (after the receipt
3448 of RA) hosts may use the router as a default router. 0 means do not use
3449 as a default router. For <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
3450 Default: 3 * <cf/max ra interval/, <cf/sensitive/ yes.
3451
3452 <tag><label id="radv-iface-linger-time">linger time <m/expr/</tag>
3453 When a prefix disappears, it is advertised for some time with 0
3454 lifetime, to inform clients the prefix is no longer usable. This option
3455 sets the time for how long it is advertised (in seconds). Maximum is
3456 3600, 0 means disabled. Default: 300.
3457
3458 <tag><label id="radv-iface-default-preference-low">default preference low|medium|high</tag>
3459 This option specifies the Default Router Preference value to advertise
3460 to hosts. Default: medium.
3461
3462 <tag><label id="radv-iface-rdnss-local">rdnss local <m/switch/</tag>
3463 Use only local (interface-specific) RDNSS definitions for this
3464 interface. Otherwise, both global and local definitions are used. Could
3465 also be used to disable RDNSS for given interface if no local definitons
3466 are specified. Default: no.
3467
3468 <tag><label id="radv-iface-dnssl-local">dnssl local <m/switch/</tag>
3469 Use only local DNSSL definitions for this interface. See <cf/rdnss local/
3470 option above. Default: no.
3471 </descrip>
3472
3473
3474 <p>Prefix specific options
3475
3476 <descrip>
3477 <tag><label id="radv-prefix-skip">skip <m/switch/</tag>
3478 This option allows to specify that given prefix should not be
3479 advertised. This is useful for making exceptions from a default policy
3480 of advertising all prefixes. Note that for withdrawing an already
3481 advertised prefix it is more useful to advertise it with zero valid
3482 lifetime. Default: no
3483
3484 <tag><label id="radv-prefix-onlink">onlink <m/switch/</tag>
3485 This option specifies whether hosts may use the advertised prefix for
3486 onlink determination. Default: yes
3487
3488 <tag><label id="radv-prefix-autonomous">autonomous <m/switch/</tag>
3489 This option specifies whether hosts may use the advertised prefix for
3490 stateless autoconfiguration. Default: yes
3491
3492 <tag><label id="radv-prefix-valid-lifetime">valid lifetime <m/expr/ [sensitive <m/switch/]</tag>
3493 This option specifies the time (in seconds) how long (after the
3494 receipt of RA) the prefix information is valid, i.e., autoconfigured
3495 IP addresses can be assigned and hosts with that IP addresses are
3496 considered directly reachable. 0 means the prefix is no longer
3497 valid. For <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
3498 Default: 86400 (1 day), <cf/sensitive/ no.
3499
3500 <tag><label id="radv-prefix-preferred-lifetime">preferred lifetime <m/expr/ [sensitive <m/switch/]</tag>
3501 This option specifies the time (in seconds) how long (after the
3502 receipt of RA) IP addresses generated from the prefix using stateless
3503 autoconfiguration remain preferred. For <cf/sensitive/ option,
3504 see <ref id="radv-trigger" name="trigger">. Default: 14400 (4 hours),
3505 <cf/sensitive/ no.
3506 </descrip>
3507
3508
3509 <p>RDNSS specific options:
3510
3511 <descrip>
3512 <tag><label id="radv-rdnss-ns">ns <m/address/</tag>
3513 This option specifies one recursive DNS server. Can be used multiple
3514 times for multiple servers. It is mandatory to have at least one
3515 <cf/ns/ option in <cf/rdnss/ definition.
3516
3517 <tag><label id="radv-rdnss-lifetime">lifetime [mult] <m/expr/</tag>
3518 This option specifies the time how long the RDNSS information may be
3519 used by clients after the receipt of RA. It is expressed either in
3520 seconds or (when <cf/mult/ is used) in multiples of <cf/max ra
3521 interval/. Note that RDNSS information is also invalidated when
3522 <cf/default lifetime/ expires. 0 means these addresses are no longer
3523 valid DNS servers. Default: 3 * <cf/max ra interval/.
3524 </descrip>
3525
3526
3527 <p>DNSSL specific options:
3528
3529 <descrip>
3530 <tag><label id="radv-dnssl-domain">domain <m/address/</tag>
3531 This option specifies one DNS search domain. Can be used multiple times
3532 for multiple domains. It is mandatory to have at least one <cf/domain/
3533 option in <cf/dnssl/ definition.
3534
3535 <tag><label id="radv-dnssl-lifetime">lifetime [mult] <m/expr/</tag>
3536 This option specifies the time how long the DNSSL information may be
3537 used by clients after the receipt of RA. Details are the same as for
3538 RDNSS <cf/lifetime/ option above. Default: 3 * <cf/max ra interval/.
3539 </descrip>
3540
3541
3542 <sect1>Example
3543 <label id="radv-exam">
3544
3545 <p><code>
3546 protocol radv {
3547 interface "eth2" {
3548 max ra interval 5; # Fast failover with more routers
3549 managed yes; # Using DHCPv6 on eth2
3550 prefix ::/0 {
3551 autonomous off; # So do not autoconfigure any IP
3552 };
3553 };
3554
3555 interface "eth*"; # No need for any other options
3556
3557 prefix 2001:0DB8:1234::/48 {
3558 preferred lifetime 0; # Deprecated address range
3559 };
3560
3561 prefix 2001:0DB8:2000::/48 {
3562 autonomous off; # Do not autoconfigure
3563 };
3564
3565 rdnss 2001:0DB8:1234::10; # Short form of RDNSS
3566
3567 rdnss {
3568 lifetime mult 10;
3569 ns 2001:0DB8:1234::11;
3570 ns 2001:0DB8:1234::12;
3571 };
3572
3573 dnssl {
3574 lifetime 3600;
3575 domain "abc.com";
3576 domain "xyz.com";
3577 };
3578 }
3579 </code>
3580
3581
3582 <sect>RIP
3583 <label id="rip">
3584
3585 <sect1>Introduction
3586 <label id="rip-intro">
3587
3588 <p>The RIP protocol (also sometimes called Rest In Pieces) is a simple protocol,
3589 where each router broadcasts (to all its neighbors) distances to all networks it
3590 can reach. When a router hears distance to another network, it increments it and
3591 broadcasts it back. Broadcasts are done in regular intervals. Therefore, if some
3592 network goes unreachable, routers keep telling each other that its distance is
3593 the original distance plus 1 (actually, plus interface metric, which is usually
3594 one). After some time, the distance reaches infinity (that's 15 in RIP) and all
3595 routers know that network is unreachable. RIP tries to minimize situations where
3596 counting to infinity is necessary, because it is slow. Due to infinity being 16,
3597 you can't use RIP on networks where maximal distance is higher than 15
3598 hosts.
3599
3600 <p>BIRD supports RIPv1 (<rfc id="1058">), RIPv2 (<rfc id="2453">), RIPng (<rfc
3601 id="2080">), and RIP cryptographic authentication (<rfc id="4822">).
3602
3603 <p>RIP is a very simple protocol, and it has a lot of shortcomings. Slow
3604 convergence, big network load and inability to handle larger networks makes it
3605 pretty much obsolete. It is still usable on very small networks.
3606
3607 <sect1>Configuration
3608 <label id="rip-config">
3609
3610 <p>RIP configuration consists mainly of common protocol options and interface
3611 definitions, most RIP options are interface specific.
3612
3613 <code>
3614 protocol rip [&lt;name&gt;] {
3615 infinity &lt;number&gt;;
3616 ecmp &lt;switch&gt; [limit &lt;number&gt;];
3617 interface &lt;interface pattern&gt; {
3618 metric &lt;number&gt;;
3619 mode multicast|broadcast;
3620 passive &lt;switch&gt;;
3621 address &lt;ip&gt;;
3622 port &lt;number&gt;;
3623 version 1|2;
3624 split horizon &lt;switch&gt;;
3625 poison reverse &lt;switch&gt;;
3626 check zero &lt;switch&gt;;
3627 update time &lt;number&gt;;
3628 timeout time &lt;number&gt;;
3629 garbage time &lt;number&gt;;
3630 ecmp weight &lt;number&gt;;
3631 ttl security &lt;switch&gt;; | tx only;
3632 tx class|dscp &lt;number&gt;;
3633 tx priority &lt;number&gt;;
3634 rx buffer &lt;number&gt;;
3635 tx length &lt;number&gt;;
3636 check link &lt;switch&gt;;
3637 authentication none|plaintext|cryptographic;
3638 password "&lt;text&gt;";
3639 password "&lt;text&gt;" {
3640 id &lt;num&gt;;
3641 generate from "&lt;date&gt;";
3642 generate to "&lt;date&gt;";
3643 accept from "&lt;date&gt;";
3644 accept to "&lt;date&gt;";
3645 from "&lt;date&gt;";
3646 to "&lt;date&gt;";
3647 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
3648 };
3649 };
3650 }
3651 </code>
3652
3653 <descrip>
3654 <tag><label id="rip-infinity">infinity <M>number</M></tag>
3655 Selects the distance of infinity. Bigger values will make
3656 protocol convergence even slower. The default value is 16.
3657
3658 <tag><label id="rip-ecmp">ecmp <M>switch</M> [limit <M>number</M>]</tag>
3659 This option specifies whether RIP is allowed to generate ECMP
3660 (equal-cost multipath) routes. Such routes are used when there are
3661 several directions to the destination, each with the same (computed)
3662 cost. This option also allows to specify a limit on maximum number of
3663 nexthops in one route. By default, ECMP is disabled. If enabled,
3664 default value of the limit is 16.
3665
3666 <tag><label id="rip-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
3667 Interface definitions specify a set of interfaces on which the
3668 protocol is activated and contain interface specific options.
3669 See <ref id="proto-iface" name="interface"> common options for
3670 detailed description.
3671 </descrip>
3672
3673 <p>Interface specific options:
3674
3675 <descrip>
3676 <tag><label id="rip-iface-metric">metric <m/num/</tag>
3677 This option specifies the metric of the interface. When a route is
3678 received from the interface, its metric is increased by this value
3679 before further processing. Valid values are 1-255, but values higher
3680 than infinity has no further meaning. Default: 1.
3681
3682 <tag><label id="rip-iface-mode">mode multicast|broadcast</tag>
3683 This option selects the mode for RIP to use on the interface. The
3684 default is multicast mode for RIPv2 and broadcast mode for RIPv1.
3685 RIPng always uses the multicast mode.
3686
3687 <tag><label id="rip-iface-passive">passive <m/switch/</tag>
3688 Passive interfaces receive routing updates but do not transmit any
3689 messages. Default: no.
3690
3691 <tag><label id="rip-iface-address">address <m/ip/</tag>
3692 This option specifies a destination address used for multicast or
3693 broadcast messages, the default is the official RIP (224.0.0.9) or RIPng
3694 (ff02::9) multicast address, or an appropriate broadcast address in the
3695 broadcast mode.
3696
3697 <tag><label id="rip-iface-port">port <m/number/</tag>
3698 This option selects an UDP port to operate on, the default is the
3699 official RIP (520) or RIPng (521) port.
3700
3701 <tag><label id="rip-iface-version">version 1|2</tag>
3702 This option selects the version of RIP used on the interface. For RIPv1,
3703 automatic subnet aggregation is not implemented, only classful network
3704 routes and host routes are propagated. Note that BIRD allows RIPv1 to be
3705 configured with features that are defined for RIPv2 only, like
3706 authentication or using multicast sockets. The default is RIPv2 for IPv4
3707 RIP, the option is not supported for RIPng, as no further versions are
3708 defined.
3709
3710 <tag><label id="rip-iface-version-only">version only <m/switch/</tag>
3711 Regardless of RIP version configured for the interface, BIRD accepts
3712 incoming packets of any RIP version. This option restrict accepted
3713 packets to the configured version. Default: no.
3714
3715 <tag><label id="rip-iface-split-horizon">split horizon <m/switch/</tag>
3716 Split horizon is a scheme for preventing routing loops. When split
3717 horizon is active, routes are not regularly propagated back to the
3718 interface from which they were received. They are either not propagated
3719 back at all (plain split horizon) or propagated back with an infinity
3720 metric (split horizon with poisoned reverse). Therefore, other routers
3721 on the interface will not consider the router as a part of an
3722 independent path to the destination of the route. Default: yes.
3723
3724 <tag><label id="rip-iface-poison-reverse">poison reverse <m/switch/</tag>
3725 When split horizon is active, this option specifies whether the poisoned
3726 reverse variant (propagating routes back with an infinity metric) is
3727 used. The poisoned reverse has some advantages in faster convergence,
3728 but uses more network traffic. Default: yes.
3729
3730 <tag><label id="rip-iface-check-zero">check zero <m/switch/</tag>
3731 Received RIPv1 packets with non-zero values in reserved fields should
3732 be discarded. This option specifies whether the check is performed or
3733 such packets are just processed as usual. Default: yes.
3734
3735 <tag><label id="rip-iface-update-time">update time <m/number/</tag>
3736 Specifies the number of seconds between periodic updates. A lower number
3737 will mean faster convergence but bigger network load. Default: 30.
3738
3739 <tag><label id="rip-iface-timeout-time">timeout time <m/number/</tag>
3740 Specifies the time interval (in seconds) between the last received route
3741 announcement and the route expiration. After that, the network is
3742 considered unreachable, but still is propagated with infinity distance.
3743 Default: 180.
3744
3745 <tag><label id="rip-iface-garbage-time">garbage time <m/number/</tag>
3746 Specifies the time interval (in seconds) between the route expiration
3747 and the removal of the unreachable network entry. The garbage interval,
3748 when a route with infinity metric is propagated, is used for both
3749 internal (after expiration) and external (after withdrawal) routes.
3750 Default: 120.
3751
3752 <tag><label id="rip-iface-ecmp-weight">ecmp weight <m/number/</tag>
3753 When ECMP (multipath) routes are allowed, this value specifies a
3754 relative weight used for nexthops going through the iface. Valid
3755 values are 1-256. Default value is 1.
3756
3757 <tag><label id="rip-iface-auth">authentication none|plaintext|cryptographic</tag>
3758 Selects authentication method to be used. <cf/none/ means that packets
3759 are not authenticated at all, <cf/plaintext/ means that a plaintext
3760 password is embedded into each packet, and <cf/cryptographic/ means that
3761 packets are authenticated using some cryptographic hash function
3762 selected by option <cf/algorithm/ for each key. The default
3763 cryptographic algorithm for RIP keys is Keyed-MD5. If you set
3764 authentication to not-none, it is a good idea to add <cf>password</cf>
3765 section. Default: none.
3766
3767 <tag><label id="rip-iface-pass">password "<m/text/"</tag>
3768 Specifies a password used for authentication. See <ref id="proto-pass"
3769 name="password"> common option for detailed description.
3770
3771 <tag><label id="rip-iface-ttl-security">ttl security [<m/switch/ | tx only]</tag>
3772 TTL security is a feature that protects routing protocols from remote
3773 spoofed packets by using TTL 255 instead of TTL 1 for protocol packets
3774 destined to neighbors. Because TTL is decremented when packets are
3775 forwarded, it is non-trivial to spoof packets with TTL 255 from remote
3776 locations.
3777
3778 If this option is enabled, the router will send RIP packets with TTL 255
3779 and drop received packets with TTL less than 255. If this option si set
3780 to <cf/tx only/, TTL 255 is used for sent packets, but is not checked
3781 for received packets. Such setting does not offer protection, but offers
3782 compatibility with neighbors regardless of whether they use ttl
3783 security.
3784
3785 For RIPng, TTL security is a standard behavior (required by <rfc
3786 id="2080">) and therefore default value is yes. For IPv4 RIP, default
3787 value is no.
3788
3789 <tag><label id="rip-iface-tx-class">tx class|dscp|priority <m/number/</tag>
3790 These options specify the ToS/DiffServ/Traffic class/Priority of the
3791 outgoing RIP packets. See <ref id="proto-tx-class" name="tx class"> common
3792 option for detailed description.
3793
3794 <tag><label id="rip-iface-rx-buffer">rx buffer <m/number/</tag>
3795 This option specifies the size of buffers used for packet processing.
3796 The buffer size should be bigger than maximal size of received packets.
3797 The default value is 532 for IPv4 RIP and interface MTU value for RIPng.
3798
3799 <tag><label id="rip-iface-tx-length">tx length <m/number/</tag>
3800 This option specifies the maximum length of generated RIP packets. To
3801 avoid IP fragmentation, it should not exceed the interface MTU value.
3802 The default value is 532 for IPv4 RIP and interface MTU value for RIPng.
3803
3804 <tag><label id="rip-iface-check-link">check link <m/switch/</tag>
3805 If set, the hardware link state (as reported by OS) is taken into
3806 consideration. When the link disappears (e.g. an ethernet cable is
3807 unplugged), neighbors are immediately considered unreachable and all
3808 routes received from them are withdrawn. It is possible that some
3809 hardware drivers or platforms do not implement this feature.
3810 Default: no.
3811 </descrip>
3812
3813 <sect1>Attributes
3814 <label id="rip-attr">
3815
3816 <p>RIP defines two route attributes:
3817
3818 <descrip>
3819 <tag><label id="rta-rip-metric">int rip_metric/</tag>
3820 RIP metric of the route (ranging from 0 to <cf/infinity/). When routes
3821 from different RIP instances are available and all of them have the same
3822 preference, BIRD prefers the route with lowest <cf/rip_metric/. When a
3823 non-RIP route is exported to RIP, the default metric is 1.
3824
3825 <tag><label id="rta-rip-tag">int rip_tag/</tag>
3826 RIP route tag: a 16-bit number which can be used to carry additional
3827 information with the route (for example, an originating AS number in
3828 case of external routes). When a non-RIP route is exported to RIP, the
3829 default tag is 0.
3830 </descrip>
3831
3832 <sect1>Example
3833 <label id="rip-exam">
3834
3835 <p><code>
3836 protocol rip {
3837 import all;
3838 export all;
3839 interface "eth*" {
3840 metric 2;
3841 port 1520;
3842 mode multicast;
3843 update time 12;
3844 timeout time 60;
3845 authentication cryptographic;
3846 password "secret" { algorithm hmac sha256; };
3847 };
3848 }
3849 </code>
3850
3851
3852 <sect>Static
3853 <label id="static">
3854
3855 <p>The Static protocol doesn't communicate with other routers in the network,
3856 but instead it allows you to define routes manually. This is often used for
3857 specifying how to forward packets to parts of the network which don't use
3858 dynamic routing at all and also for defining sink routes (i.e., those telling to
3859 return packets as undeliverable if they are in your IP block, you don't have any
3860 specific destination for them and you don't want to send them out through the
3861 default route to prevent routing loops).
3862
3863 <p>There are five types of static routes: `classical' routes telling to forward
3864 packets to a neighboring router, multipath routes specifying several (possibly
3865 weighted) neighboring routers, device routes specifying forwarding to hosts on a
3866 directly connected network, recursive routes computing their nexthops by doing
3867 route table lookups for a given IP, and special routes (sink, blackhole etc.)
3868 which specify a special action to be done instead of forwarding the packet.
3869
3870 <p>When the particular destination is not available (the interface is down or
3871 the next hop of the route is not a neighbor at the moment), Static just
3872 uninstalls the route from the table it is connected to and adds it again as soon
3873 as the destination becomes adjacent again.
3874
3875 <p>There are three classes of definitions in Static protocol configuration --
3876 global options, static route definitions, and per-route options. Usually, the
3877 definition of the protocol contains mainly a list of static routes.
3878
3879 <p>Global options:
3880
3881 <descrip>
3882 <tag><label id="static-check-link">check link <m/switch/</tag>
3883 If set, hardware link states of network interfaces are taken into
3884 consideration. When link disappears (e.g. ethernet cable is unplugged),
3885 static routes directing to that interface are removed. It is possible
3886 that some hardware drivers or platforms do not implement this feature.
3887 Default: off.
3888
3889 <tag><label id="static-igp-table">igp table <m/name/</tag>
3890 Specifies a table that is used for route table lookups of recursive
3891 routes. Default: the same table as the protocol is connected to.
3892 </descrip>
3893
3894 <p>Route definitions (each may also contain a block of per-route options):
3895
3896 <descrip>
3897 <tag><label id="static-route-via-ip">route <m/prefix/ via <m/ip/</tag>
3898 Static route through a neighboring router. For link-local next hops,
3899 interface can be specified as a part of the address (e.g.,
3900 <cf/via fe80::1234%eth0/).
3901
3902 <tag><label id="static-route-via-mpath">route <m/prefix/ multipath via <m/ip/ [weight <m/num/] [bfd <m/switch/] [via <m/.../]</tag>
3903 Static multipath route. Contains several nexthops (gateways), possibly
3904 with their weights.
3905
3906 <tag><label id="static-route-via-iface">route <m/prefix/ via <m/"interface"/</tag>
3907 Static device route through an interface to hosts on a directly
3908 connected network.
3909
3910 <tag><label id="static-route-recursive">route <m/prefix/ recursive <m/ip/</tag>
3911 Static recursive route, its nexthop depends on a route table lookup for
3912 given IP address.
3913
3914 <tag><label id="static-route-drop">route <m/prefix/ blackhole|unreachable|prohibit</tag>
3915 Special routes specifying to silently drop the packet, return it as
3916 unreachable or return it as administratively prohibited. First two
3917 targets are also known as <cf/drop/ and <cf/reject/.
3918 </descrip>
3919
3920 <p>Per-route options:
3921
3922 <descrip>
3923 <tag><label id="static-route-bfd">bfd <m/switch/</tag>
3924 The Static protocol could use BFD protocol for next hop liveness
3925 detection. If enabled, a BFD session to the route next hop is created
3926 and the static route is BFD-controlled -- the static route is announced
3927 only if the next hop liveness is confirmed by BFD. If the BFD session
3928 fails, the static route is removed. Note that this is a bit different
3929 compared to other protocols, which may use BFD as an advisory mechanism
3930 for fast failure detection but ignores it if a BFD session is not even
3931 established.
3932
3933 This option can be used for static routes with a direct next hop, or
3934 also for for individual next hops in a static multipath route (see
3935 above). Note that BFD protocol also has to be configured, see
3936 <ref id="bfd" name="BFD"> section for details. Default value is no.
3937
3938 <tag><label id="static-route-filter"><m/filter expression/</tag>
3939 This is a special option that allows filter expressions to be configured
3940 on per-route basis. Can be used multiple times. These expressions are
3941 evaluated when the route is originated, similarly to the import filter
3942 of the static protocol. This is especially useful for configuring route
3943 attributes, e.g., <cf/ospf_metric1 = 100;/ for a route that will be
3944 exported to the OSPF protocol.
3945 </descrip>
3946
3947 <p>Static routes have no specific attributes.
3948
3949 <p>Example static config might look like this:
3950
3951 <p><code>
3952 protocol static {
3953 table testable; # Connect to a non-default routing table
3954 check link; # Advertise routes only if link is up
3955 route 0.0.0.0/0 via 198.51.100.130; # Default route
3956 route 10.0.0.0/8 multipath # Multipath route
3957 via 198.51.100.10 weight 2
3958 via 198.51.100.20 bfd # BFD-controlled next hop
3959 via 192.0.2.1;
3960 route 203.0.113.0/24 unreachable; # Sink route
3961 route 10.2.0.0/24 via "arc0"; # Secondary network
3962 route 192.168.10.0/24 via 198.51.100.100 {
3963 ospf_metric1 = 20; # Set extended attribute
3964 }
3965 route 192.168.10.0/24 via 198.51.100.100 {
3966 ospf_metric2 = 100; # Set extended attribute
3967 ospf_tag = 2; # Set extended attribute
3968 bfd; # BFD-controlled route
3969 }
3970 }
3971 </code>
3972
3973
3974 <chapt>Conclusions
3975 <label id="conclusion">
3976
3977 <sect>Future work
3978 <label id="future-work">
3979
3980 <p>Although BIRD supports all the commonly used routing protocols, there are
3981 still some features which would surely deserve to be implemented in future
3982 versions of BIRD:
3983
3984 <itemize>
3985 <item>Opaque LSA's
3986 <item>Route aggregation and flap dampening
3987 <item>Multipath routes
3988 <item>Multicast routing protocols
3989 <item>Ports to other systems
3990 </itemize>
3991
3992
3993 <sect>Getting more help
3994 <label id="help">
3995
3996 <p>If you use BIRD, you're welcome to join the bird-users mailing list
3997 (<HTMLURL URL="mailto:bird-users@network.cz" name="bird-users@network.cz">)
3998 where you can share your experiences with the other users and consult
3999 your problems with the authors. To subscribe to the list, visit
4000 <HTMLURL URL="http://bird.network.cz/?m_list" name="http://bird.network.cz/?m_list">.
4001 The home page of BIRD can be found at <HTMLURL URL="http://bird.network.cz/" name="http://bird.network.cz/">.
4002
4003 <p>BIRD is a relatively young system and it probably contains some bugs. You can
4004 report any problems to the bird-users list and the authors will be glad to solve
4005 them, but before you do so, please make sure you have read the available
4006 documentation and that you are running the latest version (available at
4007 <HTMLURL URL="ftp://bird.network.cz/pub/bird" name="bird.network.cz:/pub/bird">).
4008 (Of course, a patch which fixes the bug is always welcome as an attachment.)
4009
4010 <p>If you want to understand what is going inside, Internet standards are a good
4011 and interesting reading. You can get them from
4012 <HTMLURL URL="ftp://ftp.rfc-editor.org/" name="ftp.rfc-editor.org"> (or a
4013 nicely sorted version from <HTMLURL URL="ftp://atrey.karlin.mff.cuni.cz/pub/rfc"
4014 name="atrey.karlin.mff.cuni.cz:/pub/rfc">).
4015
4016 <p><it/Good luck!/
4017
4018 </book>
4019
4020 <!--
4021 LocalWords: GPL IPv GateD BGPv RIPv OSPFv Linux sgml html dvi sgmltools Pavel
4022 LocalWords: linuxdoc dtd descrip config conf syslog stderr auth ospf bgp Mbps
4023 LocalWords: router's eval expr num birdc ctl UNIX if's enums bool int ip GCC
4024 LocalWords: len ipaddress pxlen netmask enum bgppath bgpmask clist gw md eth
4025 LocalWords: RTS printn quitbird iBGP AS'es eBGP RFC multiprotocol IGP Machek
4026 LocalWords: EGP misconfigurations keepalive pref aggr aggregator BIRD's RTC
4027 LocalWords: OS'es AS's multicast nolisten misconfigured UID blackhole MRTD MTU
4028 LocalWords: uninstalls ethernets IP binutils ANYCAST anycast dest RTD ICMP rfc
4029 LocalWords: compat multicasts nonbroadcast pointopoint loopback sym stats
4030 LocalWords: Perl SIGHUP dd mm yy HH MM SS EXT IA UNICAST multihop Discriminator txt
4031 LocalWords: proto wildcard Ondrej Filip
4032 -->