]> git.ipfire.org Git - thirdparty/bird.git/blob - doc/bird.sgml
Custom route attributes
[thirdparty/bird.git] / doc / bird.sgml
1 <!doctype birddoc system>
2
3 <!--
4 BIRD 2.0 documentation
5
6 This documentation can have 4 forms: sgml (this is master copy), html, ASCII
7 text and dvi/postscript (generated from sgml using sgmltools). You should always
8 edit master copy.
9
10 This is a slightly modified linuxdoc dtd. Anything in <descrip> tags is
11 considered definition of configuration primitives, <cf> is fragment of
12 configuration within normal text, <m> is "meta" information within fragment of
13 configuration - something in config which is not keyword.
14
15 (set-fill-column 80)
16
17 Copyright 1999,2000 Pavel Machek <pavel@ucw.cz>, distribute under GPL version 2 or later.
18
19 -->
20
21 <book>
22
23 <title>BIRD 2.0 User's Guide
24 <author>
25 Ondrej Filip <it/&lt;feela@network.cz&gt;/,
26 Pavel Machek <it/&lt;pavel@ucw.cz&gt;/,
27 Martin Mares <it/&lt;mj@ucw.cz&gt;/,
28 Maria Jan Matejka <it/&lt;mq@jmq.cz&gt;/,
29 Ondrej Zajicek <it/&lt;santiago@crfreenet.org&gt;/
30 </author>
31
32 <abstract>
33 This document contains user documentation for the BIRD Internet Routing Daemon project.
34 </abstract>
35
36 <!-- Table of contents -->
37 <toc>
38
39 <!-- Begin the document -->
40
41
42 <chapt>Introduction
43 <label id="intro">
44
45 <sect>What is BIRD
46 <label id="what-is-bird">
47
48 <p>The name `BIRD' is actually an acronym standing for `BIRD Internet Routing
49 Daemon'. Let's take a closer look at the meaning of the name:
50
51 <p><em/BIRD/: Well, we think we have already explained that. It's an acronym
52 standing for `BIRD Internet Routing Daemon', you remember, don't you? :-)
53
54 <p><em/Internet Routing/: It's a program (well, a daemon, as you are going to
55 discover in a moment) which works as a dynamic router in an Internet type
56 network (that is, in a network running either the IPv4 or the IPv6 protocol).
57 Routers are devices which forward packets between interconnected networks in
58 order to allow hosts not connected directly to the same local area network to
59 communicate with each other. They also communicate with the other routers in the
60 Internet to discover the topology of the network which allows them to find
61 optimal (in terms of some metric) rules for forwarding of packets (which are
62 called routing tables) and to adapt themselves to the changing conditions such
63 as outages of network links, building of new connections and so on. Most of
64 these routers are costly dedicated devices running obscure firmware which is
65 hard to configure and not open to any changes (on the other hand, their special
66 hardware design allows them to keep up with lots of high-speed network
67 interfaces, better than general-purpose computer does). Fortunately, most
68 operating systems of the UNIX family allow an ordinary computer to act as a
69 router and forward packets belonging to the other hosts, but only according to a
70 statically configured table.
71
72 <p>A <em/Routing Daemon/ is in UNIX terminology a non-interactive program
73 running on background which does the dynamic part of Internet routing, that is
74 it communicates with the other routers, calculates routing tables and sends them
75 to the OS kernel which does the actual packet forwarding. There already exist
76 other such routing daemons: routed (RIP only), GateD (non-free),
77 <HTMLURL URL="http://www.zebra.org" name="Zebra"> and
78 <HTMLURL URL="http://sourceforge.net/projects/mrt" name="MRTD">,
79 but their capabilities are limited and they are relatively hard to configure
80 and maintain.
81
82 <p>BIRD is an Internet Routing Daemon designed to avoid all of these shortcomings,
83 to support all the routing technology used in the today's Internet or planned to
84 be used in near future and to have a clean extensible architecture allowing new
85 routing protocols to be incorporated easily. Among other features, BIRD
86 supports:
87
88 <itemize>
89 <item>both IPv4 and IPv6 protocols
90 <item>multiple routing tables
91 <item>the Border Gateway Protocol (BGPv4)
92 <item>the Routing Information Protocol (RIPv2, RIPng)
93 <item>the Open Shortest Path First protocol (OSPFv2, OSPFv3)
94 <item>the Babel Routing Protocol
95 <item>the Router Advertisements for IPv6 hosts
96 <item>a virtual protocol for exchange of routes between different
97 routing tables on a single host
98 <item>a command-line interface allowing on-line control and inspection
99 of status of the daemon
100 <item>soft reconfiguration (no need to use complex online commands to
101 change the configuration, just edit the configuration file and
102 notify BIRD to re-read it and it will smoothly switch itself to
103 the new configuration, not disturbing routing protocols unless
104 they are affected by the configuration changes)
105 <item>a powerful language for route filtering
106 </itemize>
107
108 <p>BIRD has been developed at the Faculty of Math and Physics, Charles
109 University, Prague, Czech Republic as a student project. It can be freely
110 distributed under the terms of the GNU General Public License.
111
112 <p>BIRD has been designed to work on all UNIX-like systems. It has been
113 developed and tested under Linux 2.0 to 2.6, and then ported to FreeBSD, NetBSD
114 and OpenBSD, porting to other systems (even non-UNIX ones) should be relatively
115 easy due to its highly modular architecture.
116
117 <p>BIRD 1.x supported either IPv4 or IPv6 protocol, but had to be compiled separately
118 for each one. BIRD~2 supports both of them with a possibility of further extension.
119 BIRD~2 supports Linux at least 3.16, FreeBSD 10, NetBSD 7.0, and OpenBSD 5.8.
120 Anyway, it will probably work well also on older systems.
121
122 <sect>Installing BIRD
123 <label id="install">
124
125 <p>On a recent UNIX system with GNU development tools (GCC, binutils, m4, make)
126 and Perl, installing BIRD should be as easy as:
127
128 <code>
129 ./configure
130 make
131 make install
132 vi /usr/local/etc/bird.conf
133 bird
134 </code>
135
136 <p>You can use <tt>./configure --help</tt> to get a list of configure
137 options. The most important ones are: <tt/--with-protocols=/ to produce a slightly smaller
138 BIRD executable by configuring out routing protocols you don't use, and
139 <tt/--prefix=/ to install BIRD to a place different from <file>/usr/local</file>.
140
141
142 <sect>Running BIRD
143 <label id="argv">
144
145 <p>You can pass several command-line options to bird:
146
147 <descrip>
148 <tag><label id="argv-config">-c <m/config name/</tag>
149 use given configuration file instead of <it/prefix/<file>/etc/bird.conf</file>.
150
151 <tag><label id="argv-debug">-d</tag>
152 enable debug messages and run bird in foreground.
153
154 <tag><label id="argv-log-file">-D <m/filename of debug log/</tag>
155 log debugging information to given file instead of stderr.
156
157 <tag><label id="argv-foreground">-f</tag>
158 run bird in foreground.
159
160 <tag><label id="argv-group">-g <m/group/</tag>
161 use that group ID, see the next section for details.
162
163 <tag><label id="argv-help">-h, --help</tag>
164 display command-line options to bird.
165
166 <tag><label id="argv-local">-l</tag>
167 look for a configuration file and a communication socket in the current
168 working directory instead of in default system locations. However, paths
169 specified by options <cf/-c/, <cf/-s/ have higher priority.
170
171 <tag><label id="argv-parse">-p</tag>
172 just parse the config file and exit. Return value is zero if the config
173 file is valid, nonzero if there are some errors.
174
175 <tag><label id="argv-pid">-P <m/name of PID file/</tag>
176 create a PID file with given filename.
177
178 <tag><label id="argv-recovery">-R</tag>
179 apply graceful restart recovery after start.
180
181 <tag><label id="argv-socket">-s <m/name of communication socket/</tag>
182 use given filename for a socket for communications with the client,
183 default is <it/prefix/<file>/var/run/bird.ctl</file>.
184
185 <tag><label id="argv-user">-u <m/user/</tag>
186 drop privileges and use that user ID, see the next section for details.
187
188 <tag><label id="argv-version">--version</tag>
189 display bird version.
190 </descrip>
191
192 <p>BIRD writes messages about its work to log files or syslog (according to config).
193
194
195 <sect>Privileges
196 <label id="privileges">
197
198 <p>BIRD, as a routing daemon, uses several privileged operations (like setting
199 routing table and using raw sockets). Traditionally, BIRD is executed and runs
200 with root privileges, which may be prone to security problems. The recommended
201 way is to use a privilege restriction (options <cf/-u/, <cf/-g/). In that case
202 BIRD is executed with root privileges, but it changes its user and group ID to
203 an unprivileged ones, while using Linux capabilities to retain just required
204 privileges (capabilities CAP_NET_*). Note that the control socket is created
205 before the privileges are dropped, but the config file is read after that. The
206 privilege restriction is not implemented in BSD port of BIRD.
207
208 <p>An unprivileged user (as an argument to <cf/-u/ options) may be the user
209 <cf/nobody/, but it is suggested to use a new dedicated user account (like
210 <cf/bird/). The similar considerations apply for the group option, but there is
211 one more condition -- the users in the same group can use <file/birdc/ to
212 control BIRD.
213
214 <p>Finally, there is a possibility to use external tools to run BIRD in an
215 environment with restricted privileges. This may need some configuration, but it
216 is generally easy -- BIRD needs just the standard library, privileges to read
217 the config file and create the control socket and the CAP_NET_* capabilities.
218
219
220 <chapt>Architecture
221 <label id="architecture">
222
223 <sect>Routing tables
224 <label id="routing-tables">
225
226 <p>The heart of BIRD is a routing table. BIRD has several independent routing tables;
227 each of them contains routes of exactly one <m/nettype/ (see below). There are two
228 default tables -- <cf/master4/ for IPv4 routes and <cf/master6/ for IPv6 routes.
229 Other tables must be explicitly configured.
230
231 <p>
232 These routing tables are not kernel forwarding tables. No forwarding is done by
233 BIRD. If you want to forward packets using the routes in BIRD tables, you may
234 use the Kernel protocol (see below) to synchronize them with kernel FIBs.
235
236 <p>
237 Every nettype defines a (kind of) primary key on routes. Every route source can
238 supply one route for every possible primary key; new route announcement replaces
239 the old route from the same source, keeping other routes intact. BIRD always
240 chooses the best route for each primary key among the known routes and keeps the
241 others as suboptimal. When the best route is retracted, BIRD re-runs the best
242 route selection algorithm to find the current best route.
243
244 <p>
245 The global best route selection algorithm is (roughly) as follows:
246
247 <itemize>
248 <item>Preferences of the routes are compared.
249 <item>Source protocol instance preferences are compared.
250 <item>If source protocols are the same (e.g. BGP vs. BGP), the protocol's route selection algorithm is invoked.
251 <item>If source protocols are different (e.g. BGP vs. OSPF), result of the algorithm is undefined.
252 </itemize>
253
254 <p><label id="dsc-table-sorted">Usually, a routing table just chooses a selected
255 route from a list of entries for one network. But if the <cf/sorted/ option is
256 activated, these lists of entries are kept completely sorted (according to
257 preference or some protocol-dependent metric). This is needed for some features
258 of some protocols (e.g. <cf/secondary/ option of BGP protocol, which allows to
259 accept not just a selected route, but the first route (in the sorted list) that
260 is accepted by filters), but it is incompatible with some other features (e.g.
261 <cf/deterministic med/ option of BGP protocol, which activates a way of choosing
262 selected route that cannot be described using comparison and ordering). Minor
263 advantage is that routes are shown sorted in <cf/show route/, minor disadvantage
264 is that it is slightly more computationally expensive.
265
266 <sect>Routes and network types
267 <label id="routes">
268
269 <p>BIRD works with several types of routes. Some of them are typical IP routes,
270 others are better described as forwarding rules. We call them all routes,
271 regardless of this difference.
272
273 <p>Every route consists of several attributes (read more about them in the
274 <ref id="route-attributes" name="Route attributes"> section); the common for all
275 routes are:
276
277 <itemize>
278 <item>IP address of router which told us about this route
279 <item>Source protocol instance
280 <item>Route preference
281 <item>Optional attributes defined by protocols
282 </itemize>
283
284 <p>Other attributes depend on nettypes. Some of them are part of the primary key, these are marked (PK).
285
286 <sect1>IPv4 and IPv6 routes
287 <label id="ip-routes">
288
289 <p>The traditional routes. Configuration keywords are <cf/ipv4/ and <cf/ipv6/.
290
291 <itemize>
292 <item>(PK) Route destination (IP prefix together with its length)
293 <item>Route next hops (see below)
294 </itemize>
295
296 <sect1>IPv6 source-specific routes
297 <label id="ip-sadr-routes">
298
299 <p>The IPv6 routes containing both destination and source prefix. They are used
300 for source-specific routing (SSR), also called source-address dependent routing
301 (SADR), see <rfc id="8043">. Currently limited mostly to the Babel protocol.
302 Configuration keyword is <cf/ipv6 sadr/.
303
304 <itemize>
305 <item>(PK) Route destination (IP prefix together with its length)
306 <item>(PK) Route source (IP prefix together with its length)
307 <item>Route next hops (see below)
308 </itemize>
309
310 <sect1>VPN IPv4 and IPv6 routes
311 <label id="vpn-routes">
312
313 <p>Routes for IPv4 and IPv6 with VPN Route Distinguisher (<rfc id="4364">).
314 Configuration keywords are <cf/vpn4/ and <cf/vpn6/.
315
316 <itemize>
317 <item>(PK) Route destination (IP prefix together with its length)
318 <item>(PK) Route distinguisher (according to <rfc id="4364">)
319 <item>Route next hops
320 </itemize>
321
322 <sect1>Route Origin Authorization for IPv4 and IPv6
323 <label id="roa-routes">
324
325 <p>These entries can be used to validate route origination of BGP routes.
326 A ROA entry specifies prefixes which could be originated by an AS number.
327 Their keywords are <cf/roa4/ and <cf/roa6/.
328
329 <itemize>
330 <item>(PK) IP prefix together with its length
331 <item>(PK) Matching prefix maximal length
332 <item>(PK) AS number
333 </itemize>
334
335 <sect1>Flowspec for IPv4 and IPv6
336 <label id="flow-routes">
337
338 <p>Flowspec rules are a form of firewall and traffic flow control rules
339 distributed mostly via BGP. These rules may help the operators stop various
340 network attacks in the beginning before eating up the whole bandwidth.
341 Configuration keywords are <cf/flow4/ and <cf/flow6/.
342
343 <itemize>
344 <item>(PK) IP prefix together with its length
345 <item>(PK) Flow definition data
346 <item>Flow action (encoded internally as BGP communities according to <rfc id="5575">)
347 </itemize>
348
349 <sect1>MPLS switching rules
350 <label id="mpls-routes">
351
352 <p>This nettype is currently a stub before implementing more support of <rfc id="3031">.
353 BIRD currently does not support any label distribution protocol nor any label assignment method.
354 Only the Kernel, Pipe and Static protocols can use MPLS tables.
355 Configuration keyword is <cf/mpls/.
356
357 <itemize>
358 <item>(PK) MPLS label
359 <item>Route next hops
360 </itemize>
361
362 <sect1>Route next hops
363 <label id="route-next-hop">
364
365 <p>This is not a nettype. The route next hop is a complex attribute common for many
366 nettypes as you can see before. Every next hop has its assigned device
367 (either assumed from its IP address or set explicitly). It may have also
368 an IP address and an MPLS stack (one or both independently).
369 Maximal MPLS stack depth is set (in compile time) to 8 labels.
370
371 <p>Every route (when eligible to have a next hop) can have more than one next hop.
372 In that case, every next hop has also its weight.
373
374 <sect>Protocols and channels
375 <label id="protocols-concept">
376
377 <p>BIRD protocol is an abstract class of producers and consumers of the routes.
378 Each protocol may run in multiple instances and bind on one side to route
379 tables via channels, on the other side to specified listen sockets (BGP),
380 interfaces (Babel, OSPF, RIP), APIs (Kernel, Direct), or nothing (Static, Pipe).
381
382 <p>There are also two protocols that do not have any channels -- BFD and Device.
383 Both of them are kind of service for other protocols.
384
385 <p>Each protocol is connected to a routing table through a channel. Some protocols
386 support only one channel (OSPF, RIP), some protocols support more channels (BGP, Direct).
387 Each channel has two filters which can accept, reject and modify the routes.
388 An <it/export/ filter is applied to routes passed from the routing table to the protocol,
389 an <it/import/ filter is applied to routes in the opposite direction.
390
391 <sect>Graceful restart
392 <label id="graceful-restart">
393
394 <p>When BIRD is started after restart or crash, it repopulates routing tables in
395 an uncoordinated manner, like after clean start. This may be impractical in some
396 cases, because if the forwarding plane (i.e. kernel routing tables) remains
397 intact, then its synchronization with BIRD would temporarily disrupt packet
398 forwarding until protocols converge. Graceful restart is a mechanism that could
399 help with this issue. Generally, it works by starting protocols and letting them
400 repopulate routing tables while deferring route propagation until protocols
401 acknowledge their convergence. Note that graceful restart behavior have to be
402 configured for all relevant protocols and requires protocol-specific support
403 (currently implemented for Kernel and BGP protocols), it is activated for
404 particular boot by option <cf/-R/.
405
406
407 <chapt>Configuration
408 <label id="config">
409
410 <sect>Introduction
411 <label id="config-intro">
412
413 <p>BIRD is configured using a text configuration file. Upon startup, BIRD reads
414 <it/prefix/<file>/etc/bird.conf</file> (unless the <tt/-c/ command line option
415 is given). Configuration may be changed at user's request: if you modify the
416 config file and then signal BIRD with <tt/SIGHUP/, it will adjust to the new
417 config. Then there's the client which allows you to talk with BIRD in an
418 extensive way.
419
420 <p>In the config, everything on a line after <cf/#/ or inside <cf>/* */</cf> is
421 a comment, whitespace characters are treated as a single space. If there's a
422 variable number of options, they are grouped using the <cf/{ }/ brackets. Each
423 option is terminated by a <cf/;/. Configuration is case sensitive. There are two
424 ways how to name symbols (like protocol names, filter names, constants etc.).
425 You can either use a simple string starting with a letter followed by any
426 combination of letters and numbers (e.g. <cf/R123/, <cf/myfilter/, <cf/bgp5/) or
427 you can enclose the name into apostrophes (<cf/'/) and than you can use any
428 combination of numbers, letters. hyphens, dots and colons (e.g.
429 <cf/'1:strange-name'/, <cf/'-NAME-'/, <cf/'cool::name'/).
430
431 <p>Here is an example of a simple config file. It enables synchronization of
432 routing tables with OS kernel, learns network interfaces and runs RIP on all
433 network interfaces found.
434
435 <code>
436 protocol kernel {
437 ipv4 {
438 export all; # Default is export none
439 };
440 persist; # Don't remove routes on BIRD shutdown
441 }
442
443 protocol device {
444 }
445
446 protocol rip {
447 ipv4 {
448 import all;
449 export all;
450 };
451 interface "*";
452 }
453 </code>
454
455
456 <sect>Global options
457 <label id="global-opts">
458
459 <p><descrip>
460 <tag><label id="opt-include">include "<m/filename/";</tag>
461 This statement causes inclusion of a new file. The <m/filename/ could
462 also be a wildcard, in that case matching files are included in
463 alphabetic order. The maximal depth is 8. Note that this statement can
464 be used anywhere in the config file, even inside other options, but
465 always on the beginning of line. In the following example, the first
466 semicolon belongs to the <cf/include/, the second to <cf/ipv6 table/.
467 If the <file/tablename.conf/ contains exactly one token (the name of the
468 table), this construction is correct:
469 <code>
470 ipv6 table
471 include "tablename.conf";;
472 </code>
473
474 <tag><label id="opt-log">log "<m/filename/"|syslog [name <m/name/]|stderr all|{ <m/list of classes/ }</tag>
475 Set logging of messages having the given class (either <cf/all/ or
476 <cf/{ error|trace [, <m/.../] }/ etc.) into selected destination (a file specified
477 as a filename string, syslog with optional name argument, or the stderr
478 output). Classes are:
479 <cf/info/, <cf/warning/, <cf/error/ and <cf/fatal/ for messages about local problems,
480 <cf/debug/ for debugging messages,
481 <cf/trace/ when you want to know what happens in the network,
482 <cf/remote/ for messages about misbehavior of remote machines,
483 <cf/auth/ about authentication failures,
484 <cf/bug/ for internal BIRD bugs.
485 You may specify more than one <cf/log/ line to establish logging to
486 multiple destinations. Default: log everything to the system log.
487
488 <tag><label id="opt-debug-protocols">debug protocols all|off|{ states|routes|filters|interfaces|events|packets [, <m/.../] }</tag>
489 Set global defaults of protocol debugging options. See <cf/debug/ in the
490 following section. Default: off.
491
492 <tag><label id="opt-debug-commands">debug commands <m/number/</tag>
493 Control logging of client connections (0 for no logging, 1 for logging
494 of connects and disconnects, 2 and higher for logging of all client
495 commands). Default: 0.
496
497 <tag><label id="opt-debug-latency">debug latency <m/switch/</tag>
498 Activate tracking of elapsed time for internal events. Recent events
499 could be examined using <cf/dump events/ command. Default: off.
500
501 <tag><label id="opt-debug-latency-limit">debug latency limit <m/time/</tag>
502 If <cf/debug latency/ is enabled, this option allows to specify a limit
503 for elapsed time. Events exceeding the limit are logged. Default: 1 s.
504
505 <tag><label id="opt-watchdog-warn">watchdog warning <m/time/</tag>
506 Set time limit for I/O loop cycle. If one iteration took more time to
507 complete, a warning is logged. Default: 5 s.
508
509 <tag><label id="opt-watchdog-timeout">watchdog timeout <m/time/</tag>
510 Set time limit for I/O loop cycle. If the limit is breached, BIRD is
511 killed by abort signal. The timeout has effective granularity of
512 seconds, zero means disabled. Default: disabled (0).
513
514 <tag><label id="opt-mrtdump">mrtdump "<m/filename/"</tag>
515 Set MRTdump file name. This option must be specified to allow MRTdump
516 feature. Default: no dump file.
517
518 <tag><label id="opt-mrtdump-protocols">mrtdump protocols all|off|{ states|messages [, <m/.../] }</tag>
519 Set global defaults of MRTdump options. See <cf/mrtdump/ in the
520 following section. Default: off.
521
522 <tag><label id="opt-filter">filter <m/name local variables/{ <m/commands/ }</tag>
523 Define a filter. You can learn more about filters in the following
524 chapter.
525
526 <tag><label id="opt-function">function <m/name/ (<m/parameters/) <m/local variables/ { <m/commands/ }</tag>
527 Define a function. You can learn more about functions in the following chapter.
528
529 <tag><label id="opt-protocol">protocol rip|ospf|bgp|<m/.../ [<m/name/ [from <m/name2/]] { <m>protocol options</m> }</tag>
530 Define a protocol instance called <cf><m/name/</cf> (or with a name like
531 "rip5" generated automatically if you don't specify any
532 <cf><m/name/</cf>). You can learn more about configuring protocols in
533 their own chapters. When <cf>from <m/name2/</cf> expression is used,
534 initial protocol options are taken from protocol or template
535 <cf><m/name2/</cf> You can run more than one instance of most protocols
536 (like RIP or BGP). By default, no instances are configured.
537
538 <tag><label id="opt-template">template rip|ospf|bgp|<m/.../ [<m/name/ [from <m/name2/]] { <m>protocol options</m> }</tag>
539 Define a protocol template instance called <m/name/ (or with a name like
540 "bgp1" generated automatically if you don't specify any <m/name/).
541 Protocol templates can be used to group common options when many
542 similarly configured protocol instances are to be defined. Protocol
543 instances (and other templates) can use templates by using <cf/from/
544 expression and the name of the template. At the moment templates (and
545 <cf/from/ expression) are not implemented for OSPF protocol.
546
547 <tag><label id="opt-define">define <m/constant/ = <m/expression/</tag>
548 Define a constant. You can use it later in every place you could use a
549 value of the same type. Besides, there are some predefined numeric
550 constants based on /etc/iproute2/rt_* files. A list of defined constants
551 can be seen (together with other symbols) using 'show symbols' command.
552
553 <tag><label id="opt-attribute">attribute <m/type/ <m/name/</tag>
554 Define a custom route attribute. You can set and get it in filters like
555 any other route atribute. This feature is intended for marking routes
556 in import filters for export filtering purposes instead of locally
557 assigned BGP communities which have to be deleted in export filters.
558
559 <tag><label id="opt-router-id">router id <m/IPv4 address/</tag>
560 Set BIRD's router ID. It's a world-wide unique identification of your
561 router, usually one of router's IPv4 addresses. Default: the lowest
562 IPv4 address of a non-loopback interface.
563
564 <tag><label id="opt-router-id-from">router id from [-] [ "<m/mask/" ] [ <m/prefix/ ] [, <m/.../]</tag>
565 Set BIRD's router ID based on an IPv4 address of an interface specified by
566 an interface pattern.
567 See <ref id="proto-iface" name="interface"> section for detailed
568 description of interface patterns with extended clauses.
569
570 <tag><label id="opt-graceful-restart">graceful restart wait <m/number/</tag>
571 During graceful restart recovery, BIRD waits for convergence of routing
572 protocols. This option allows to specify a timeout for the recovery to
573 prevent waiting indefinitely if some protocols cannot converge. Default:
574 240 seconds.
575
576 <tag><label id="opt-timeformat">timeformat route|protocol|base|log "<m/format1/" [<m/limit/ "<m/format2/"]</tag>
577 This option allows to specify a format of date/time used by BIRD. The
578 first argument specifies for which purpose such format is used.
579 <cf/route/ is a format used in 'show route' command output,
580 <cf/protocol/ is used in 'show protocols' command output, <cf/base/ is
581 used for other commands and <cf/log/ is used in a log file.
582
583 "<m/format1/" is a format string using <it/strftime(3)/ notation (see
584 <it/man strftime/ for details). It is extended to support sub-second
585 time part with variable precision (up to microseconds) using "%f"
586 conversion code (e.g., "%T.%3f" is hh:mm:ss.sss time). <m/limit/ and
587 "<m/format2/" allow to specify the second format string for times in
588 past deeper than <m/limit/ seconds.
589
590 There are several shorthands: <cf/iso long/ is a ISO 8601 date/time
591 format (YYYY-MM-DD hh:mm:ss) that can be also specified using <cf/"%F
592 %T"/. Similarly, <cf/iso long ms/ and <cf/iso long us/ are ISO 8601
593 date/time formats with millisecond or microsecond precision.
594 <cf/iso short/ is a variant of ISO 8601 that uses just the time format
595 (hh:mm:ss) for near times (up to 20 hours in the past) and the date
596 format (YYYY-MM-DD) for far times. This is a shorthand for <cf/"%T"
597 72000 "%F"/. And there are also <cf/iso short ms/ and <cf/iso short us/
598 high-precision variants of that.
599
600 By default, BIRD uses the <cf/iso short ms/ format for <cf/route/ and
601 <cf/protocol/ times, and the <cf/iso long ms/ format for <cf/base/ and
602 <cf/log/ times.
603
604 <tag><label id="opt-table"><m/nettype/ table <m/name/ [sorted]</tag>
605 Create a new routing table. The default routing tables <cf/master4/ and
606 <cf/master6/ are created implicitly, other routing tables have to be
607 added by this command. Option <cf/sorted/ can be used to enable sorting
608 of routes, see <ref id="dsc-table-sorted" name="sorted table">
609 description for details.
610
611 <tag><label id="opt-eval">eval <m/expr/</tag>
612 Evaluates given filter expression. It is used by the developers for testing of filters.
613 </descrip>
614
615
616 <sect>Protocol options
617 <label id="protocol-opts">
618
619 <p>For each protocol instance, you can configure a bunch of options. Some of
620 them (those described in this section) are generic, some are specific to the
621 protocol (see sections talking about the protocols).
622
623 <p>Several options use a <m/switch/ argument. It can be either <cf/on/,
624 <cf/yes/ or a numeric expression with a non-zero value for the option to be
625 enabled or <cf/off/, <cf/no/ or a numeric expression evaluating to zero to
626 disable it. An empty <m/switch/ is equivalent to <cf/on/ ("silence means
627 agreement").
628
629 <descrip>
630 <tag><label id="proto-disabled">disabled <m/switch/</tag>
631 Disables the protocol. You can change the disable/enable status from the
632 command line interface without needing to touch the configuration.
633 Disabled protocols are not activated. Default: protocol is enabled.
634
635 <tag><label id="proto-debug">debug all|off|{ states|routes|filters|interfaces|events|packets [, <m/.../] }</tag>
636 Set protocol debugging options. If asked, each protocol is capable of
637 writing trace messages about its work to the log (with category
638 <cf/trace/). You can either request printing of <cf/all/ trace messages
639 or only of the types selected: <cf/states/ for protocol state changes
640 (protocol going up, down, starting, stopping etc.), <cf/routes/ for
641 routes exchanged with the routing table, <cf/filters/ for details on
642 route filtering, <cf/interfaces/ for interface change events sent to the
643 protocol, <cf/events/ for events internal to the protocol and <cf/packets/
644 for packets sent and received by the protocol. Default: off.
645
646 <tag><label id="proto-mrtdump">mrtdump all|off|{ states|messages [, <m/.../] }</tag>
647 Set protocol MRTdump flags. MRTdump is a standard binary format for
648 logging information from routing protocols and daemons. These flags
649 control what kind of information is logged from the protocol to the
650 MRTdump file (which must be specified by global <cf/mrtdump/ option, see
651 the previous section). Although these flags are similar to flags of
652 <cf/debug/ option, their meaning is different and protocol-specific. For
653 BGP protocol, <cf/states/ logs BGP state changes and <cf/messages/ logs
654 received BGP messages. Other protocols does not support MRTdump yet.
655
656 <tag><label id="proto-router-id">router id <m/IPv4 address/</tag>
657 This option can be used to override global router id for a given
658 protocol. Default: uses global router id.
659
660 <tag><label id="proto-description">description "<m/text/"</tag>
661 This is an optional description of the protocol. It is displayed as a
662 part of the output of 'show protocols all' command.
663
664 <tag><label id="proto-vrf">vrf "<m/text/"</tag>
665 Associate the protocol with specific VRF. The protocol will be
666 restricted to interfaces assigned to the VRF and will use sockets bound
667 to the VRF. Appropriate VRF interface must exist on OS level. For kernel
668 protocol, an appropriate table still must be explicitly selected by
669 <cf/table/ option. Note that for proper VRF support it is necessary to
670 use Linux kernel version at least 4.14, older versions have limited VRF
671 implementation.
672
673 <tag><label id="proto-channel"><m/channel name/ [{<m/channel config/}]</tag>
674 Every channel must be explicitly stated. See the protocol-specific
675 configuration for the list of supported channel names. See the
676 <ref id="channel-opts" name="channel configuration section"> for channel
677 definition.
678 </descrip>
679
680 <p>There are several options that give sense only with certain protocols:
681
682 <descrip>
683 <tag><label id="proto-iface">interface [-] [ "<m/mask/" ] [ <m/prefix/ ] [, <m/.../] [ { <m/option/; [<m/.../] } ]</tag>
684 Specifies a set of interfaces on which the protocol is activated with
685 given interface-specific options. A set of interfaces specified by one
686 interface option is described using an interface pattern. The interface
687 pattern consists of a sequence of clauses (separated by commas), each
688 clause is a mask specified as a shell-like pattern. Interfaces are
689 matched by their name.
690
691 An interface matches the pattern if it matches any of its clauses. If
692 the clause begins with <cf/-/, matching interfaces are excluded. Patterns
693 are processed left-to-right, thus <cf/interface "eth0", -"eth*", "*";/
694 means eth0 and all non-ethernets.
695
696 Some protocols (namely OSPFv2 and Direct) support extended clauses that
697 may contain a mask, a prefix, or both of them. An interface matches such
698 clause if its name matches the mask (if specified) and its address
699 matches the prefix (if specified). Extended clauses are used when the
700 protocol handles multiple addresses on an interface independently.
701
702 An interface option can be used more times with different interface-specific
703 options, in that case for given interface the first matching interface
704 option is used.
705
706 This option is allowed in Babel, BFD, Device, Direct, OSPF, RAdv and RIP
707 protocols. In OSPF protocol it is used in the <cf/area/ subsection.
708
709 Default: none.
710
711 Examples:
712
713 <cf>interface "*" { type broadcast; };</cf> - start the protocol on all
714 interfaces with <cf>type broadcast</cf> option.
715
716 <cf>interface "eth1", "eth4", "eth5" { type ptp; };</cf> - start the
717 protocol on enumerated interfaces with <cf>type ptp</cf> option.
718
719 <cf>interface -192.168.1.0/24, 192.168.0.0/16;</cf> - start the protocol
720 on all interfaces that have address from 192.168.0.0/16, but not from
721 192.168.1.0/24.
722
723 <cf>interface -192.168.1.0/24, 192.168.0.0/16;</cf> - start the protocol
724 on all interfaces that have address from 192.168.0.0/16, but not from
725 192.168.1.0/24.
726
727 <cf>interface "eth*" 192.168.1.0/24;</cf> - start the protocol on all
728 ethernet interfaces that have address from 192.168.1.0/24.
729
730 <tag><label id="proto-tx-class">tx class|dscp <m/num/</tag>
731 This option specifies the value of ToS/DS/Class field in IP headers of
732 the outgoing protocol packets. This may affect how the protocol packets
733 are processed by the network relative to the other network traffic. With
734 <cf/class/ keyword, the value (0-255) is used for the whole ToS/Class
735 octet (but two bits reserved for ECN are ignored). With <cf/dscp/
736 keyword, the value (0-63) is used just for the DS field in the octet.
737 Default value is 0xc0 (DSCP 0x30 - CS6).
738
739 <tag><label id="proto-tx-priority">tx priority <m/num/</tag>
740 This option specifies the local packet priority. This may affect how the
741 protocol packets are processed in the local TX queues. This option is
742 Linux specific. Default value is 7 (highest priority, privileged traffic).
743
744 <tag><label id="proto-pass">password "<m/password/" [ { <m>password options</m> } ]</tag>
745 Specifies a password that can be used by the protocol as a shared secret
746 key. Password option can be used more times to specify more passwords.
747 If more passwords are specified, it is a protocol-dependent decision
748 which one is really used. Specifying passwords does not mean that
749 authentication is enabled, authentication can be enabled by separate,
750 protocol-dependent <cf/authentication/ option.
751
752 This option is allowed in BFD, OSPF and RIP protocols. BGP has also
753 <cf/password/ option, but it is slightly different and described
754 separately.
755 Default: none.
756 </descrip>
757
758 <p>Password option can contain section with some (not necessary all) password sub-options:
759
760 <descrip>
761 <tag><label id="proto-pass-id">id <M>num</M></tag>
762 ID of the password, (1-255). If it is not used, BIRD will choose ID based
763 on an order of the password item in the interface. For example, second
764 password item in one interface will have default ID 2. ID is used by
765 some routing protocols to identify which password was used to
766 authenticate protocol packets.
767
768 <tag><label id="proto-pass-gen-from">generate from "<m/time/"</tag>
769 The start time of the usage of the password for packet signing.
770 The format of <cf><m/time/</cf> is <tt>dd-mm-yyyy HH:MM:SS</tt>.
771
772 <tag><label id="proto-pass-gen-to">generate to "<m/time/"</tag>
773 The last time of the usage of the password for packet signing.
774
775 <tag><label id="proto-pass-accept-from">accept from "<m/time/"</tag>
776 The start time of the usage of the password for packet verification.
777
778 <tag><label id="proto-pass-accept-to">accept to "<m/time/"</tag>
779 The last time of the usage of the password for packet verification.
780
781 <tag><label id="proto-pass-from">from "<m/time/"</tag>
782 Shorthand for setting both <cf/generate from/ and <cf/accept from/.
783
784 <tag><label id="proto-pass-to">to "<m/time/"</tag>
785 Shorthand for setting both <cf/generate to/ and <cf/accept to/.
786
787 <tag><label id="proto-pass-algorithm">algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 )</tag>
788 The message authentication algorithm for the password when cryptographic
789 authentication is enabled. The default value depends on the protocol.
790 For RIP and OSPFv2 it is Keyed-MD5 (for compatibility), for OSPFv3
791 protocol it is HMAC-SHA-256.
792
793 </descrip>
794
795
796 <sect>Channel options
797 <label id="channel-opts">
798
799 <p>Every channel belongs to a protocol and is configured inside its block. The
800 minimal channel config is empty, then it uses default values. The name of the
801 channel implies its nettype. Channel definitions can be inherited from protocol
802 templates. Multiple definitions of the same channel are forbidden, but channels
803 inherited from templates can be updated by new definitions.
804
805 <descrip>
806 <tag><label id="proto-table">table <m/name/</tag>
807 Specify a table to which the channel is connected. Default: the first
808 table of given nettype.
809
810 <tag><label id="proto-preference">preference <m/expr/</tag>
811 Sets the preference of routes generated by the protocol and imported
812 through this channel. Default: protocol dependent.
813
814 <tag><label id="proto-import">import all | none | filter <m/name/ | filter { <m/filter commands/ } | where <m/boolean filter expression/</tag>
815 Specify a filter to be used for filtering routes coming from the
816 protocol to the routing table. <cf/all/ is for keeping all routes,
817 <cf/none/ is for dropping all routes. Default: <cf/all/ (except for
818 EBGP).
819
820 <tag><label id="proto-export">export <m/filter/</tag>
821 This is similar to the <cf>import</cf> keyword, except that it works in
822 the direction from the routing table to the protocol. Default: <cf/none/
823 (except for EBGP).
824
825 <tag><label id="proto-import-keep-filtered">import keep filtered <m/switch/</tag>
826 Usually, if an import filter rejects a route, the route is forgotten.
827 When this option is active, these routes are kept in the routing table,
828 but they are hidden and not propagated to other protocols. But it is
829 possible to show them using <cf/show route filtered/. Note that this
830 option does not work for the pipe protocol. Default: off.
831
832 <tag><label id="proto-import-limit">import limit [<m/number/ | off ] [action warn | block | restart | disable]</tag>
833 Specify an import route limit (a maximum number of routes imported from
834 the protocol) and optionally the action to be taken when the limit is
835 hit. Warn action just prints warning log message. Block action discards
836 new routes coming from the protocol. Restart and disable actions shut
837 the protocol down like appropriate commands. Disable is the default
838 action if an action is not explicitly specified. Note that limits are
839 reset during protocol reconfigure, reload or restart. Default: <cf/off/.
840
841 <tag><label id="proto-receive-limit">receive limit [<m/number/ | off ] [action warn | block | restart | disable]</tag>
842 Specify an receive route limit (a maximum number of routes received from
843 the protocol and remembered). It works almost identically to <cf>import
844 limit</cf> option, the only difference is that if <cf/import keep
845 filtered/ option is active, filtered routes are counted towards the
846 limit and blocked routes are forgotten, as the main purpose of the
847 receive limit is to protect routing tables from overflow. Import limit,
848 on the contrary, counts accepted routes only and routes blocked by the
849 limit are handled like filtered routes. Default: <cf/off/.
850
851 <tag><label id="proto-export-limit">export limit [ <m/number/ | off ] [action warn | block | restart | disable]</tag>
852 Specify an export route limit, works similarly to the <cf>import
853 limit</cf> option, but for the routes exported to the protocol. This
854 option is experimental, there are some problems in details of its
855 behavior -- the number of exported routes can temporarily exceed the
856 limit without triggering it during protocol reload, exported routes
857 counter ignores route blocking and block action also blocks route
858 updates of already accepted routes -- and these details will probably
859 change in the future. Default: <cf/off/.
860 </descrip>
861
862 <p>This is a trivial example of RIP configured for IPv6 on all interfaces:
863 <code>
864 protocol rip ng {
865 ipv6;
866 interface "*";
867 }
868 </code>
869
870 <p>This is a non-trivial example.
871 <code>
872 protocol rip ng {
873 ipv6 {
874 table mytable6;
875 import filter { ... };
876 export filter { ... };
877 import limit 50;
878 };
879 interface "*";
880 }
881 </code>
882
883 <p>And this is even more complicated example using templates.
884 <code>
885 template bgp {
886 local 198.51.100.14 as 65000;
887
888 ipv4 {
889 table mytable4;
890 import filter { ... };
891 export none;
892 };
893 ipv6 {
894 table mytable6;
895 import filter { ... };
896 export none;
897 };
898 }
899
900 protocol bgp from {
901 neighbor 198.51.100.130 as 64496;
902
903 # IPv4 channel is inherited as-is, while IPv6
904 # channel is adjusted by export filter option
905 ipv6 {
906 export filter { ... };
907 };
908 }
909 </code>
910
911
912 <chapt>Remote control
913 <label id="remote-control">
914
915 <p>You can use the command-line client <file>birdc</file> to talk with a running
916 BIRD. Communication is done using a <file/bird.ctl/ UNIX domain socket (unless
917 changed with the <tt/-s/ option given to both the server and the client). The
918 commands can perform simple actions such as enabling/disabling of protocols,
919 telling BIRD to show various information, telling it to show routing table
920 filtered by filter, or asking BIRD to reconfigure. Press <tt/?/ at any time to
921 get online help. Option <tt/-r/ can be used to enable a restricted mode of BIRD
922 client, which allows just read-only commands (<cf/show .../). Option <tt/-v/ can
923 be passed to the client, to make it dump numeric return codes along with the
924 messages. You do not necessarily need to use <file/birdc/ to talk to BIRD, your
925 own applications could do that, too -- the format of communication between BIRD
926 and <file/birdc/ is stable (see the programmer's documentation).
927
928 <p>There is also lightweight variant of BIRD client called <file/birdcl/, which
929 does not support command line editing and history and has minimal dependencies.
930 This is useful for running BIRD in resource constrained environments, where
931 Readline library (required for regular BIRD client) is not available.
932
933 <p>Many commands have the <m/name/ of the protocol instance as an argument.
934 This argument can be omitted if there exists only a single instance.
935
936 <p>Here is a brief list of supported functions:
937
938 <descrip>
939 <tag><label id="cli-show-status">show status</tag>
940 Show router status, that is BIRD version, uptime and time from last
941 reconfiguration.
942
943 <tag><label id="cli-show-interfaces">show interfaces [summary]</tag>
944 Show the list of interfaces. For each interface, print its type, state,
945 MTU and addresses assigned.
946
947 <tag><label id="cli-show-protocols">show protocols [all]</tag>
948 Show list of protocol instances along with tables they are connected to
949 and protocol status, possibly giving verbose information, if <cf/all/ is
950 specified.
951
952 <!-- TODO: Move these protocol-specific remote control commands to the protocol sections -->
953 <tag><label id="cli-show-ospf-iface">show ospf interface [<m/name/] ["<m/interface/"]</tag>
954 Show detailed information about OSPF interfaces.
955
956 <tag><label id="cli-show-ospf-neighbors">show ospf neighbors [<m/name/] ["<m/interface/"]</tag>
957 Show a list of OSPF neighbors and a state of adjacency to them.
958
959 <tag><label id="cli-show-ospf-state">show ospf state [all] [<m/name/]</tag>
960 Show detailed information about OSPF areas based on a content of the
961 link-state database. It shows network topology, stub networks,
962 aggregated networks and routers from other areas and external routes.
963 The command shows information about reachable network nodes, use option
964 <cf/all/ to show information about all network nodes in the link-state
965 database.
966
967 <tag><label id="cli-show-ospf-topology">show ospf topology [all] [<m/name/]</tag>
968 Show a topology of OSPF areas based on a content of the link-state
969 database. It is just a stripped-down version of 'show ospf state'.
970
971 <tag><label id="cli-show-ospf-lsadb">show ospf lsadb [global | area <m/id/ | link] [type <m/num/] [lsid <m/id/] [self | router <m/id/] [<m/name/] </tag>
972 Show contents of an OSPF LSA database. Options could be used to filter
973 entries.
974
975 <tag><label id="cli-show-rip-interfaces">show rip interfaces [<m/name/] ["<m/interface/"]</tag>
976 Show detailed information about RIP interfaces.
977
978 <tag><label id="cli-show-rip-neighbors">show rip neighbors [<m/name/] ["<m/interface/"]</tag>
979 Show a list of RIP neighbors and associated state.
980
981 <tag><label id="cli-show-static">show static [<m/name/]</tag>
982 Show detailed information about static routes.
983
984 <tag><label id="cli-show-bfd-sessions">show bfd sessions [<m/name/]</tag>
985 Show information about BFD sessions.
986
987 <tag><label id="cli-show-symbols">show symbols [table|filter|function|protocol|template|roa|<m/symbol/]</tag>
988 Show the list of symbols defined in the configuration (names of
989 protocols, routing tables etc.).
990
991 <tag><label id="cli-show-route">show route [[for] <m/prefix/|<m/IP/] [table (<m/t/ | all)] [filter <m/f/|where <m/c/] [(export|preexport|noexport) <m/p/] [protocol <m/p/] [(stats|count)] [<m/options/]</tag>
992 Show contents of specified routing tables, that is routes, their metrics
993 and (in case the <cf/all/ switch is given) all their attributes.
994
995 <p>You can specify a <m/prefix/ if you want to print routes for a
996 specific network. If you use <cf>for <m/prefix or IP/</cf>, you'll get
997 the entry which will be used for forwarding of packets to the given
998 destination. By default, all routes for each network are printed with
999 the selected one at the top, unless <cf/primary/ is given in which case
1000 only the selected route is shown.
1001
1002 <p>The <cf/show route/ command can process one or multiple routing
1003 tables. The set of selected tables is determined on three levels: First,
1004 tables can be explicitly selected by <cf/table/ switch, which could be
1005 used multiple times, all tables are specified by <cf/table all/. Second,
1006 tables can be implicitly selected by channels or protocols that are
1007 arguments of several other switches (e.g., <cf/export/, <cf/protocol/).
1008 Last, the set of default tables is used: <cf/master4/, <cf/master6/ and
1009 each first table of any other network type.
1010
1011 <p>You can also ask for printing only routes processed and accepted by
1012 a given filter (<cf>filter <m/name/</cf> or <cf>filter { <m/filter/ }
1013 </cf> or matching a given condition (<cf>where <m/condition/</cf>).
1014
1015 The <cf/export/, <cf/preexport/ and <cf/noexport/ switches ask for
1016 printing of routes that are exported to the specified protocol or
1017 channel. With <cf/preexport/, the export filter of the channel is
1018 skipped. With <cf/noexport/, routes rejected by the export filter are
1019 printed instead. Note that routes not exported for other reasons
1020 (e.g. secondary routes or routes imported from that protocol) are not
1021 printed even with <cf/noexport/. These switches also imply that
1022 associated routing tables are selected instead of default ones.
1023
1024 <p>You can also select just routes added by a specific protocol.
1025 <cf>protocol <m/p/</cf>. This switch also implies that associated
1026 routing tables are selected instead of default ones.
1027
1028 <p>If BIRD is configured to keep filtered routes (see <cf/import keep
1029 filtered/ option), you can show them instead of routes by using
1030 <cf/filtered/ switch.
1031
1032 <p>The <cf/stats/ switch requests showing of route statistics (the
1033 number of networks, number of routes before and after filtering). If
1034 you use <cf/count/ instead, only the statistics will be printed.
1035
1036 <tag><label id="cli-mrt-dump">mrt dump table <m/name/|"<m/pattern/" to "<m/filename/" [filter <m/f/|where <m/c/]</tag>
1037 Dump content of a routing table to a specified file in MRT table dump
1038 format. See <ref id="mrt" name="MRT protocol"> for details.
1039
1040 <tag><label id="cli-configure">configure [soft] ["<m/config file/"] [timeout [<m/num/]]</tag>
1041 Reload configuration from a given file. BIRD will smoothly switch itself
1042 to the new configuration, protocols are reconfigured if possible,
1043 restarted otherwise. Changes in filters usually lead to restart of
1044 affected protocols.
1045
1046 If <cf/soft/ option is used, changes in filters does not cause BIRD to
1047 restart affected protocols, therefore already accepted routes (according
1048 to old filters) would be still propagated, but new routes would be
1049 processed according to the new filters.
1050
1051 If <cf/timeout/ option is used, config timer is activated. The new
1052 configuration could be either confirmed using <cf/configure confirm/
1053 command, or it will be reverted to the old one when the config timer
1054 expires. This is useful for cases when reconfiguration breaks current
1055 routing and a router becomes inaccessible for an administrator. The
1056 config timeout expiration is equivalent to <cf/configure undo/
1057 command. The timeout duration could be specified, default is 300 s.
1058
1059 <tag><label id="cli-configure-confirm">configure confirm</tag>
1060 Deactivate the config undo timer and therefore confirm the current
1061 configuration.
1062
1063 <tag><label id="cli-configure-undo">configure undo</tag>
1064 Undo the last configuration change and smoothly switch back to the
1065 previous (stored) configuration. If the last configuration change was
1066 soft, the undo change is also soft. There is only one level of undo, but
1067 in some specific cases when several reconfiguration requests are given
1068 immediately in a row and the intermediate ones are skipped then the undo
1069 also skips them back.
1070
1071 <tag><label id="cli-configure-check">configure check ["<m/config file/"]</tag>
1072 Read and parse given config file, but do not use it. useful for checking
1073 syntactic and some semantic validity of an config file.
1074
1075 <tag><label id="cli-enable-disable-restart">enable|disable|restart <m/name/|"<m/pattern/"|all</tag>
1076 Enable, disable or restart a given protocol instance, instances matching
1077 the <cf><m/pattern/</cf> or <cf/all/ instances.
1078
1079 <tag><label id="cli-reload">reload [in|out] <m/name/|"<m/pattern/"|all</tag>
1080 Reload a given protocol instance, that means re-import routes from the
1081 protocol instance and re-export preferred routes to the instance. If
1082 <cf/in/ or <cf/out/ options are used, the command is restricted to one
1083 direction (re-import or re-export).
1084
1085 This command is useful if appropriate filters have changed but the
1086 protocol instance was not restarted (or reloaded), therefore it still
1087 propagates the old set of routes. For example when <cf/configure soft/
1088 command was used to change filters.
1089
1090 Re-export always succeeds, but re-import is protocol-dependent and might
1091 fail (for example, if BGP neighbor does not support route-refresh
1092 extension). In that case, re-export is also skipped. Note that for the
1093 pipe protocol, both directions are always reloaded together (<cf/in/ or
1094 <cf/out/ options are ignored in that case).
1095
1096 <tag><label id="cli-down">down</tag>
1097 Shut BIRD down.
1098
1099 <tag><label id="cli-debug">debug <m/protocol/|<m/pattern/|all all|off|{ states|routes|filters|events|packets [, <m/.../] }</tag>
1100 Control protocol debugging.
1101
1102 <tag><label id="cli-dump">dump resources|sockets|interfaces|neighbors|attributes|routes|protocols</tag>
1103 Dump contents of internal data structures to the debugging output.
1104
1105 <tag><label id="cli-echo">echo all|off|{ <m/list of log classes/ } [ <m/buffer-size/ ]</tag>
1106 Control echoing of log messages to the command-line output.
1107 See <ref id="opt-log" name="log option"> for a list of log classes.
1108
1109 <tag><label id="cli-eval">eval <m/expr/</tag>
1110 Evaluate given expression.
1111 </descrip>
1112
1113
1114 <chapt>Filters
1115 <label id="filters">
1116
1117 <sect>Introduction
1118 <label id="filters-intro">
1119
1120 <p>BIRD contains a simple programming language. (No, it can't yet read mail :-).
1121 There are two objects in this language: filters and functions. Filters are
1122 interpreted by BIRD core when a route is being passed between protocols and
1123 routing tables. The filter language contains control structures such as if's and
1124 switches, but it allows no loops. An example of a filter using many features can
1125 be found in <file>filter/test.conf</file>.
1126
1127 <p>Filter gets the route, looks at its attributes and modifies some of them if
1128 it wishes. At the end, it decides whether to pass the changed route through
1129 (using <cf/accept/) or whether to <cf/reject/ it. A simple filter looks like
1130 this:
1131
1132 <code>
1133 filter not_too_far
1134 int var;
1135 {
1136 if defined( rip_metric ) then
1137 var = rip_metric;
1138 else {
1139 var = 1;
1140 rip_metric = 1;
1141 }
1142 if rip_metric &gt; 10 then
1143 reject "RIP metric is too big";
1144 else
1145 accept "ok";
1146 }
1147 </code>
1148
1149 <p>As you can see, a filter has a header, a list of local variables, and a body.
1150 The header consists of the <cf/filter/ keyword followed by a (unique) name of
1151 filter. The list of local variables consists of <cf><M>type name</M>;</cf>
1152 pairs where each pair defines one local variable. The body consists of <cf>
1153 { <M>statements</M> }</cf>. Each <m/statement/ is terminated by a <cf/;/. You
1154 can group several statements to a single compound statement by using braces
1155 (<cf>{ <M>statements</M> }</cf>) which is useful if you want to make a bigger
1156 block of code conditional.
1157
1158 <p>BIRD supports functions, so that you don't have to repeat the same blocks of
1159 code over and over. Functions can have zero or more parameters and they can have
1160 local variables. Recursion is not allowed. Function definitions look like this:
1161
1162 <code>
1163 function name ()
1164 int local_variable;
1165 {
1166 local_variable = 5;
1167 }
1168
1169 function with_parameters (int parameter)
1170 {
1171 print parameter;
1172 }
1173 </code>
1174
1175 <p>Unlike in C, variables are declared after the <cf/function/ line, but before
1176 the first <cf/{/. You can't declare variables in nested blocks. Functions are
1177 called like in C: <cf>name(); with_parameters(5);</cf>. Function may return
1178 values using the <cf>return <m/[expr]/</cf> command. Returning a value exits
1179 from current function (this is similar to C).
1180
1181 <p>Filters are declared in a way similar to functions except they can't have
1182 explicit parameters. They get a route table entry as an implicit parameter, it
1183 is also passed automatically to any functions called. The filter must terminate
1184 with either <cf/accept/ or <cf/reject/ statement. If there's a runtime error in
1185 filter, the route is rejected.
1186
1187 <p>A nice trick to debug filters is to use <cf>show route filter <m/name/</cf>
1188 from the command line client. An example session might look like:
1189
1190 <code>
1191 pavel@bug:~/bird$ ./birdc -s bird.ctl
1192 BIRD 0.0.0 ready.
1193 bird> show route
1194 10.0.0.0/8 dev eth0 [direct1 23:21] (240)
1195 195.113.30.2/32 dev tunl1 [direct1 23:21] (240)
1196 127.0.0.0/8 dev lo [direct1 23:21] (240)
1197 bird> show route ?
1198 show route [<prefix>] [table <t>] [filter <f>] [all] [primary]...
1199 bird> show route filter { if 127.0.0.5 &tilde; net then accept; }
1200 127.0.0.0/8 dev lo [direct1 23:21] (240)
1201 bird>
1202 </code>
1203
1204
1205 <sect>Data types
1206 <label id="data-types">
1207
1208 <p>Each variable and each value has certain type. Booleans, integers and enums
1209 are incompatible with each other (that is to prevent you from shooting in the
1210 foot).
1211
1212 <descrip>
1213 <tag><label id="type-bool">bool</tag>
1214 This is a boolean type, it can have only two values, <cf/true/ and
1215 <cf/false/. Boolean is the only type you can use in <cf/if/ statements.
1216
1217 <tag><label id="type-int">int</tag>
1218 This is a general integer type. It is an unsigned 32bit type; i.e., you
1219 can expect it to store values from 0 to 4294967295. Overflows are not
1220 checked. You can use <cf/0x1234/ syntax to write hexadecimal values.
1221
1222 <tag><label id="type-pair">pair</tag>
1223 This is a pair of two short integers. Each component can have values
1224 from 0 to 65535. Literals of this type are written as <cf/(1234,5678)/.
1225 The same syntax can also be used to construct a pair from two arbitrary
1226 integer expressions (for example <cf/(1+2,a)/).
1227
1228 <tag><label id="type-quad">quad</tag>
1229 This is a dotted quad of numbers used to represent router IDs (and
1230 others). Each component can have a value from 0 to 255. Literals of
1231 this type are written like IPv4 addresses.
1232
1233 <tag><label id="type-string">string</tag>
1234 This is a string of characters. There are no ways to modify strings in
1235 filters. You can pass them between functions, assign them to variables
1236 of type <cf/string/, print such variables, use standard string
1237 comparison operations (e.g. <cf/=, !=, &lt;, &gt;, &lt;=, &gt;=/), but
1238 you can't concatenate two strings. String literals are written as
1239 <cf/"This is a string constant"/. Additionally matching (<cf/&tilde;,
1240 !&tilde;/) operators could be used to match a string value against
1241 a shell pattern (represented also as a string).
1242
1243 <tag><label id="type-ip">ip</tag>
1244 This type can hold a single IP address. The IPv4 addresses are stored as
1245 IPv4-Mapped IPv6 addresses so one data type for both of them is used.
1246 Whether the address is IPv4 or not may be checked by <cf>.is_ip4</cf>
1247 which returns <cf/bool/. IP addresses are written in the standard
1248 notation (<cf/10.20.30.40/ or <cf/fec0:3:4::1/). You can apply special
1249 operator <cf>.mask(<M>num</M>)</cf> on values of type ip. It masks out
1250 all but first <cf><M>num</M></cf> bits from the IP address. So
1251 <cf/1.2.3.4.mask(8) = 1.0.0.0/ is true.
1252
1253 <tag><label id="type-prefix">prefix</tag>
1254 This type can hold a network prefix consisting of IP address, prefix
1255 length and several other values. This is the key in route tables.
1256
1257 Prefixes may be of several types, which can be determined by the special
1258 operator <cf/.type/. The type may be:
1259
1260 <cf/NET_IP4/ and <cf/NET_IP6/ prefixes hold an IP prefix. The literals
1261 are written as <cf><m/ipaddress//<m/pxlen/</cf>. There are two special
1262 operators on these: <cf/.ip/ which extracts the IP address from the
1263 pair, and <cf/.len/, which separates prefix length from the pair.
1264 So <cf>1.2.0.0/16.len = 16</cf> is true.
1265
1266 <cf/NET_IP6_SADR/ nettype holds both destination and source IPv6
1267 prefix. The literals are written as <cf><m/ipaddress//<m/pxlen/ from
1268 <m/ipaddress//<m/pxlen/</cf>, where the first part is the destination
1269 prefix and the second art is the source prefix. They support the same
1270 operators as IP prefixes, but just for the destination part.
1271
1272 <cf/NET_VPN4/ and <cf/NET_VPN6/ prefixes hold an IP prefix with VPN
1273 Route Distinguisher (<rfc id="4364">). They support the same special
1274 operators as IP prefixes, and also <cf/.rd/ which extracts the Route
1275 Distinguisher. Their literals are written
1276 as <cf><m/vpnrd/ <m/ipprefix/</cf>
1277
1278 <cf/NET_ROA4/ and <cf/NET_ROA6/ prefixes hold an IP prefix range
1279 together with an ASN. They support the same special operators as IP
1280 prefixes, and also <cf/.maxlen/ which extracts maximal prefix length,
1281 and <cf/.asn/ which extracts the ASN.
1282
1283 <cf/NET_FLOW4/ and <cf/NET_FLOW6/ hold an IP prefix together with a
1284 flowspec rule. Filters currently don't support flowspec parsing.
1285
1286 <cf/NET_MPLS/ holds a single MPLS label and its handling is currently
1287 not implemented.
1288
1289 <tag><label id="type-vpnrd">vpnrd</tag>
1290 This is a route distinguisher according to <rfc id="4364">. There are
1291 three kinds of RD's: <cf><m/asn/:<m/32bit int/</cf>, <cf><m/asn4/:<m/16bit int/</cf>
1292 and <cf><m/IPv4 address/:<m/32bit int/</cf>
1293
1294 <tag><label id="type-ec">ec</tag>
1295 This is a specialized type used to represent BGP extended community
1296 values. It is essentially a 64bit value, literals of this type are
1297 usually written as <cf>(<m/kind/, <m/key/, <m/value/)</cf>, where
1298 <cf/kind/ is a kind of extended community (e.g. <cf/rt/ / <cf/ro/ for a
1299 route target / route origin communities), the format and possible values
1300 of <cf/key/ and <cf/value/ are usually integers, but it depends on the
1301 used kind. Similarly to pairs, ECs can be constructed using expressions
1302 for <cf/key/ and <cf/value/ parts, (e.g. <cf/(ro, myas, 3*10)/, where
1303 <cf/myas/ is an integer variable).
1304
1305 <tag><label id="type-lc">lc</tag>
1306 This is a specialized type used to represent BGP large community
1307 values. It is essentially a triplet of 32bit values, where the first
1308 value is reserved for the AS number of the issuer, while meaning of
1309 remaining parts is defined by the issuer. Literals of this type are
1310 written as <cf/(123, 456, 789)/, with any integer values. Similarly to
1311 pairs, LCs can be constructed using expressions for its parts, (e.g.
1312 <cf/(myas, 10+20, 3*10)/, where <cf/myas/ is an integer variable).
1313
1314 <tag><label id="type-set">int|pair|quad|ip|prefix|ec|lc|enum set</tag>
1315 Filters recognize four types of sets. Sets are similar to strings: you
1316 can pass them around but you can't modify them. Literals of type <cf>int
1317 set</cf> look like <cf> [ 1, 2, 5..7 ]</cf>. As you can see, both simple
1318 values and ranges are permitted in sets.
1319
1320 For pair sets, expressions like <cf/(123,*)/ can be used to denote
1321 ranges (in that case <cf/(123,0)..(123,65535)/). You can also use
1322 <cf/(123,5..100)/ for range <cf/(123,5)..(123,100)/. You can also use
1323 <cf/*/ and <cf/a..b/ expressions in the first part of a pair, note that
1324 such expressions are translated to a set of intervals, which may be
1325 memory intensive. E.g. <cf/(*,4..20)/ is translated to <cf/(0,4..20),
1326 (1,4..20), (2,4..20), ... (65535, 4..20)/.
1327
1328 EC sets use similar expressions like pair sets, e.g. <cf/(rt, 123,
1329 10..20)/ or <cf/(ro, 123, *)/. Expressions requiring the translation
1330 (like <cf/(rt, *, 3)/) are not allowed (as they usually have 4B range
1331 for ASNs).
1332
1333 Also LC sets use similar expressions like pair sets. You can use ranges
1334 and wildcards, but if one field uses that, more specific (later) fields
1335 must be wildcards. E.g., <cf/(10, 20..30, *)/ or <cf/(10, 20, 30..40)/
1336 is valid, while <cf/(10, *, 20..30)/ or <cf/(10, 20..30, 40)/ is not
1337 valid.
1338
1339 You can also use expressions for int, pair, EC and LC set values.
1340 However, it must be possible to evaluate these expressions before daemon
1341 boots. So you can use only constants inside them. E.g.
1342
1343 <code>
1344 define one=1;
1345 define myas=64500;
1346 int set odds;
1347 pair set ps;
1348 ec set es;
1349
1350 odds = [ one, 2+1, 6-one, 2*2*2-1, 9, 11 ];
1351 ps = [ (1,one+one), (3,4)..(4,8), (5,*), (6,3..6), (7..9,*) ];
1352 es = [ (rt, myas, 3*10), (rt, myas+one, 0..16*16*16-1), (ro, myas+2, *) ];
1353 </code>
1354
1355 Sets of prefixes are special: their literals does not allow ranges, but
1356 allows prefix patterns that are written
1357 as <cf><M>ipaddress</M>/<M>pxlen</M>{<M>low</M>,<M>high</M>}</cf>.
1358 Prefix <cf><m>ip1</m>/<m>len1</m></cf> matches prefix
1359 pattern <cf><m>ip2</m>/<m>len2</m>{<m>l</m>,<m>h</m>}</cf> if the
1360 first <cf>min(len1, len2)</cf> bits of <cf/ip1/ and <cf/ip2/ are
1361 identical and <cf>len1 &lt;= ip1 &lt;= len2</cf>. A valid prefix pattern
1362 has to satisfy <cf>low &lt;= high</cf>, but <cf/pxlen/ is not
1363 constrained by <cf/low/ or <cf/high/. Obviously, a prefix matches a
1364 prefix set literal if it matches any prefix pattern in the prefix set
1365 literal.
1366
1367 There are also two shorthands for prefix patterns: <cf><m/address//<m/len/+</cf>
1368 is a shorthand for <cf><m/address//<m/len/{<m/len/,<m/maxlen/}</cf>
1369 (where <cf><m/maxlen/</cf> is 32 for IPv4 and 128 for IPv6), that means
1370 network prefix <cf><m/address//<m/len/</cf> and all its subnets.
1371 <cf><m/address//<m/len/-</cf> is a shorthand for
1372 <cf><m/address//<m/len/{0,<m/len/}</cf>, that means network prefix
1373 <cf><m/address//<m/len/</cf> and all its supernets (network prefixes
1374 that contain it).
1375
1376 For example, <cf>[ 1.0.0.0/8, 2.0.0.0/8+, 3.0.0.0/8-, 4.0.0.0/8{16,24}
1377 ]</cf> matches prefix <cf>1.0.0.0/8</cf>, all subprefixes of
1378 <cf>2.0.0.0/8</cf>, all superprefixes of <cf>3.0.0.0/8</cf> and prefixes
1379 <cf/4.X.X.X/ whose prefix length is 16 to 24. <cf>[ 0.0.0.0/0{20,24} ]</cf>
1380 matches all prefixes (regardless of IP address) whose prefix length is
1381 20 to 24, <cf>[ 1.2.3.4/32- ]</cf> matches any prefix that contains IP
1382 address <cf>1.2.3.4</cf>. <cf>1.2.0.0/16 &tilde; [ 1.0.0.0/8{15,17} ]</cf>
1383 is true, but <cf>1.0.0.0/16 &tilde; [ 1.0.0.0/8- ]</cf> is false.
1384
1385 Cisco-style patterns like <cf>10.0.0.0/8 ge 16 le 24</cf> can be expressed
1386 in BIRD as <cf>10.0.0.0/8{16,24}</cf>, <cf>192.168.0.0/16 le 24</cf> as
1387 <cf>192.168.0.0/16{16,24}</cf> and <cf>192.168.0.0/16 ge 24</cf> as
1388 <cf>192.168.0.0/16{24,32}</cf>.
1389
1390 It is possible to mix IPv4 and IPv6 prefixes/addresses in a prefix/ip set
1391 but its behavior may change between versions without any warning; don't do
1392 it unless you are more than sure what you are doing. (Really, don't do it.)
1393
1394 <tag><label id="type-enum">enum</tag>
1395 Enumeration types are fixed sets of possibilities. You can't define your
1396 own variables of such type, but some route attributes are of enumeration
1397 type. Enumeration types are incompatible with each other.
1398
1399 <tag><label id="type-bgppath">bgppath</tag>
1400 BGP path is a list of autonomous system numbers. You can't write
1401 literals of this type. There are several special operators on bgppaths:
1402
1403 <cf><m/P/.first</cf> returns the first ASN (the neighbor ASN) in path <m/P/.
1404
1405 <cf><m/P/.last</cf> returns the last ASN (the source ASN) in path <m/P/.
1406
1407 <cf><m/P/.last_nonaggregated</cf> returns the last ASN in the non-aggregated part of the path <m/P/.
1408
1409 Both <cf/first/ and <cf/last/ return zero if there is no appropriate
1410 ASN, for example if the path contains an AS set element as the first (or
1411 the last) part. If the path ends with an AS set, <cf/last_nonaggregated/
1412 may be used to get last ASN before any AS set.
1413
1414 <cf><m/P/.len</cf> returns the length of path <m/P/.
1415
1416 <cf><m/P/.empty</cf> makes the path <m/P/ empty.
1417
1418 <cf>prepend(<m/P/,<m/A/)</cf> prepends ASN <m/A/ to path <m/P/ and
1419 returns the result.
1420
1421 <cf>delete(<m/P/,<m/A/)</cf> deletes all instances of ASN <m/A/ from
1422 from path <m/P/ and returns the result. <m/A/ may also be an integer
1423 set, in that case the operator deletes all ASNs from path <m/P/ that are
1424 also members of set <m/A/.
1425
1426 <cf>filter(<m/P/,<m/A/)</cf> deletes all ASNs from path <m/P/ that are
1427 not members of integer set <m/A/. I.e., <cf/filter/ do the same as
1428 <cf/delete/ with inverted set <m/A/.
1429
1430 Statement <cf><m/P/ = prepend(<m/P/, <m/A/);</cf> can be shortened to
1431 <cf><m/P/.prepend(<m/A/);</cf> if <m/P/ is appropriate route attribute
1432 (for example <cf/bgp_path/). Similarly for <cf/delete/ and <cf/filter/.
1433
1434 <tag><label id="type-bgpmask">bgpmask</tag>
1435 BGP masks are patterns used for BGP path matching (using <cf>path
1436 &tilde; [= 2 3 5 * =]</cf> syntax). The masks resemble wildcard patterns
1437 as used by UNIX shells. Autonomous system numbers match themselves,
1438 <cf/*/ matches any (even empty) sequence of arbitrary AS numbers and
1439 <cf/?/ matches one arbitrary AS number. For example, if <cf>bgp_path</cf>
1440 is 4 3 2 1, then: <tt>bgp_path &tilde; [= * 4 3 * =]</tt> is true,
1441 but <tt>bgp_path &tilde; [= * 4 5 * =]</tt> is false. BGP mask
1442 expressions can also contain integer expressions enclosed in parenthesis
1443 and integer variables, for example <tt>[= * 4 (1+2) a =]</tt>. You can
1444 also use ranges, for example <tt>[= * 3..5 2 100..200 * =]</tt>.
1445
1446 <tag><label id="type-clist">clist</tag>
1447 Clist is similar to a set, except that unlike other sets, it can be
1448 modified. The type is used for community list (a set of pairs) and for
1449 cluster list (a set of quads). There exist no literals of this type.
1450 There are three special operators on clists:
1451
1452 <cf><m/C/.len</cf> returns the length of clist <m/C/.
1453
1454 <cf><m/C/.empty</cf> makes the list <m/C/ empty.
1455
1456 <cf>add(<m/C/,<m/P/)</cf> adds pair (or quad) <m/P/ to clist <m/C/ and
1457 returns the result. If item <m/P/ is already in clist <m/C/, it does
1458 nothing. <m/P/ may also be a clist, in that case all its members are
1459 added; i.e., it works as clist union.
1460
1461 <cf>delete(<m/C/,<m/P/)</cf> deletes pair (or quad) <m/P/ from clist
1462 <m/C/ and returns the result. If clist <m/C/ does not contain item
1463 <m/P/, it does nothing. <m/P/ may also be a pair (or quad) set, in that
1464 case the operator deletes all items from clist <m/C/ that are also
1465 members of set <m/P/. Moreover, <m/P/ may also be a clist, which works
1466 analogously; i.e., it works as clist difference.
1467
1468 <cf>filter(<m/C/,<m/P/)</cf> deletes all items from clist <m/C/ that are
1469 not members of pair (or quad) set <m/P/. I.e., <cf/filter/ do the same
1470 as <cf/delete/ with inverted set <m/P/. <m/P/ may also be a clist, which
1471 works analogously; i.e., it works as clist intersection.
1472
1473 Statement <cf><m/C/ = add(<m/C/, <m/P/);</cf> can be shortened to
1474 <cf><m/C/.add(<m/P/);</cf> if <m/C/ is appropriate route attribute (for
1475 example <cf/bgp_community/). Similarly for <cf/delete/ and <cf/filter/.
1476
1477 <tag><label id="type-eclist">eclist</tag>
1478 Eclist is a data type used for BGP extended community lists. Eclists
1479 are very similar to clists, but they are sets of ECs instead of pairs.
1480 The same operations (like <cf/add/, <cf/delete/ or <cf/&tilde;/ and
1481 <cf/!&tilde;/ membership operators) can be used to modify or test
1482 eclists, with ECs instead of pairs as arguments.
1483
1484 <tag><label id="type-lclist">lclist/</tag>
1485 Lclist is a data type used for BGP large community lists. Like eclists,
1486 lclists are very similar to clists, but they are sets of LCs instead of
1487 pairs. The same operations (like <cf/add/, <cf/delete/ or <cf/&tilde;/
1488 and <cf/!&tilde;/ membership operators) can be used to modify or test
1489 lclists, with LCs instead of pairs as arguments.
1490 </descrip>
1491
1492
1493 <sect>Operators
1494 <label id="operators">
1495
1496 <p>The filter language supports common integer operators <cf>(+,-,*,/)</cf>,
1497 parentheses <cf/(a*(b+c))/, comparison <cf/(a=b, a!=b, a&lt;b, a&gt;=b)/.
1498 Logical operations include unary not (<cf/!/), and (<cf/&amp;&amp;/), and or
1499 (<cf/&verbar;&verbar;/). Special operators include (<cf/&tilde;/,
1500 <cf/!&tilde;/) for "is (not) element of a set" operation - it can be used on
1501 element and set of elements of the same type (returning true if element is
1502 contained in the given set), or on two strings (returning true if first string
1503 matches a shell-like pattern stored in second string) or on IP and prefix
1504 (returning true if IP is within the range defined by that prefix), or on prefix
1505 and prefix (returning true if first prefix is more specific than second one) or
1506 on bgppath and bgpmask (returning true if the path matches the mask) or on
1507 number and bgppath (returning true if the number is in the path) or on bgppath
1508 and int (number) set (returning true if any ASN from the path is in the set) or
1509 on pair/quad and clist (returning true if the pair/quad is element of the
1510 clist) or on clist and pair/quad set (returning true if there is an element of
1511 the clist that is also a member of the pair/quad set).
1512
1513 <p>There is one operator related to ROA infrastructure - <cf/roa_check()/. It
1514 examines a ROA table and does <rfc id="6483"> route origin validation for a
1515 given network prefix. The basic usage is <cf>roa_check(<m/table/)</cf>, which
1516 checks current route (which should be from BGP to have AS_PATH argument) in the
1517 specified ROA table and returns ROA_UNKNOWN if there is no relevant ROA,
1518 ROA_VALID if there is a matching ROA, or ROA_INVALID if there are some relevant
1519 ROAs but none of them match. There is also an extended variant
1520 <cf>roa_check(<m/table/, <m/prefix/, <m/asn/)</cf>, which allows to specify a
1521 prefix and an ASN as arguments.
1522
1523
1524 <sect>Control structures
1525 <label id="control-structures">
1526
1527 <p>Filters support two control structures: conditions and case switches.
1528
1529 <p>Syntax of a condition is: <cf>if <M>boolean expression</M> then <m/commandT/;
1530 else <m/commandF/;</cf> and you can use <cf>{ <m/command1/; <m/command2/;
1531 <M>...</M> }</cf> instead of either command. The <cf>else</cf> clause may be
1532 omitted. If the <cf><m>boolean expression</m></cf> is true, <m/commandT/ is
1533 executed, otherwise <m/commandF/ is executed.
1534
1535 <p>The <cf>case</cf> is similar to case from Pascal. Syntax is <cf>case
1536 <m/expr/ { else: | <m/num_or_prefix [ .. num_or_prefix]/: <m/statement/ ; [
1537 ... ] }</cf>. The expression after <cf>case</cf> can be of any type which can be
1538 on the left side of the &tilde; operator and anything that could be a member of
1539 a set is allowed before <cf/:/. Multiple commands are allowed without <cf/{}/
1540 grouping. If <cf><m/expr/</cf> matches one of the <cf/:/ clauses, statements
1541 between it and next <cf/:/ statement are executed. If <cf><m/expr/</cf> matches
1542 neither of the <cf/:/ clauses, the statements after <cf/else:/ are executed.
1543
1544 <p>Here is example that uses <cf/if/ and <cf/case/ structures:
1545
1546 <code>
1547 case arg1 {
1548 2: print "two"; print "I can do more commands without {}";
1549 3 .. 5: print "three to five";
1550 else: print "something else";
1551 }
1552
1553 if 1234 = i then printn "."; else {
1554 print "not 1234";
1555 print "You need {} around multiple commands";
1556 }
1557 </code>
1558
1559
1560 <sect>Route attributes
1561 <label id="route-attributes">
1562
1563 <p>A filter is implicitly passed a route, and it can access its attributes just
1564 like it accesses variables. Attempts to access undefined attribute result in a
1565 runtime error; you can check if an attribute is defined by using the
1566 <cf>defined( <m>attribute</m> )</cf> operator. One notable exception to this
1567 rule are attributes of bgppath and *clist types, where undefined value is
1568 regarded as empty bgppath/*clist for most purposes.
1569
1570 <descrip>
1571 <tag><label id="rta-net"><m/prefix/ net</tag>
1572 The network prefix or anything else the route is talking about. The
1573 primary key of the routing table. Read-only. (See the <ref id="routes"
1574 name="chapter about routes">.)
1575
1576 <tag><label id="rta-scope"><m/enum/ scope</tag>
1577 The scope of the route. Possible values: <cf/SCOPE_HOST/ for routes
1578 local to this host, <cf/SCOPE_LINK/ for those specific for a physical
1579 link, <cf/SCOPE_SITE/ and <cf/SCOPE_ORGANIZATION/ for private routes and
1580 <cf/SCOPE_UNIVERSE/ for globally visible routes. This attribute is not
1581 interpreted by BIRD and can be used to mark routes in filters. The
1582 default value for new routes is <cf/SCOPE_UNIVERSE/.
1583
1584 <tag><label id="rta-preference"><m/int/ preference</tag>
1585 Preference of the route. Valid values are 0-65535. (See the chapter
1586 about routing tables.)
1587
1588 <tag><label id="rta-from"><m/ip/ from</tag>
1589 The router which the route has originated from.
1590
1591 <tag><label id="rta-gw"><m/ip/ gw</tag>
1592 Next hop packets routed using this route should be forwarded to.
1593
1594 <tag><label id="rta-proto"><m/string/ proto</tag>
1595 The name of the protocol which the route has been imported from.
1596 Read-only.
1597
1598 <tag><label id="rta-source"><m/enum/ source</tag>
1599 what protocol has told me about this route. Possible values:
1600 <cf/RTS_DUMMY/, <cf/RTS_STATIC/, <cf/RTS_INHERIT/, <cf/RTS_DEVICE/,
1601 <cf/RTS_STATIC_DEVICE/, <cf/RTS_REDIRECT/, <cf/RTS_RIP/, <cf/RTS_OSPF/,
1602 <cf/RTS_OSPF_IA/, <cf/RTS_OSPF_EXT1/, <cf/RTS_OSPF_EXT2/, <cf/RTS_BGP/,
1603 <cf/RTS_PIPE/, <cf/RTS_BABEL/.
1604
1605 <tag><label id="rta-dest"><m/enum/ dest</tag>
1606 Type of destination the packets should be sent to
1607 (<cf/RTD_ROUTER/ for forwarding to a neighboring router,
1608 <cf/RTD_DEVICE/ for routing to a directly-connected network,
1609 <cf/RTD_MULTIPATH/ for multipath destinations,
1610 <cf/RTD_BLACKHOLE/ for packets to be silently discarded,
1611 <cf/RTD_UNREACHABLE/, <cf/RTD_PROHIBIT/ for packets that should be
1612 returned with ICMP host unreachable / ICMP administratively prohibited
1613 messages). Can be changed, but only to <cf/RTD_BLACKHOLE/,
1614 <cf/RTD_UNREACHABLE/ or <cf/RTD_PROHIBIT/.
1615
1616 <tag><label id="rta-ifname"><m/string/ ifname</tag>
1617 Name of the outgoing interface. Sink routes (like blackhole, unreachable
1618 or prohibit) and multipath routes have no interface associated with
1619 them, so <cf/ifname/ returns an empty string for such routes. Setting it
1620 would also change route to a direct one (remove gateway).
1621
1622 <tag><label id="rta-ifindex"><m/int/ ifindex</tag>
1623 Index of the outgoing interface. System wide index of the interface. May
1624 be used for interface matching, however indexes might change on interface
1625 creation/removal. Zero is returned for routes with undefined outgoing
1626 interfaces. Read-only.
1627
1628 <tag><label id="rta-igp-metric"><m/int/ igp_metric</tag>
1629 The optional attribute that can be used to specify a distance to the
1630 network for routes that do not have a native protocol metric attribute
1631 (like <cf/ospf_metric1/ for OSPF routes). It is used mainly by BGP to
1632 compare internal distances to boundary routers (see below).
1633 </descrip>
1634
1635 <p>There also exist protocol-specific attributes which are described in the
1636 corresponding protocol sections.
1637
1638
1639 <sect>Other statements
1640 <label id="other-statements">
1641
1642 <p>The following statements are available:
1643
1644 <descrip>
1645 <tag><label id="assignment"><m/variable/ = <m/expr/</tag>
1646 Set variable to a given value.
1647
1648 <tag><label id="filter-accept-reject">accept|reject [ <m/expr/ ]</tag>
1649 Accept or reject the route, possibly printing <cf><m>expr</m></cf>.
1650
1651 <tag><label id="return">return <m/expr/</tag>
1652 Return <cf><m>expr</m></cf> from the current function, the function ends
1653 at this point.
1654
1655 <tag><label id="print">print|printn <m/expr/ [<m/, expr.../]</tag>
1656 Prints given expressions; useful mainly while debugging filters. The
1657 <cf/printn/ variant does not terminate the line.
1658
1659 <tag><label id="quitbird">quitbird</tag>
1660 Terminates BIRD. Useful when debugging the filter interpreter.
1661 </descrip>
1662
1663
1664 <chapt>Protocols
1665 <label id="protocols">
1666
1667 <sect>Babel
1668 <label id="babel">
1669
1670 <sect1>Introduction
1671 <label id="babel-intro">
1672
1673 <p>The Babel protocol
1674 (<rfc id="6126">) is a loop-avoiding distance-vector routing protocol that is
1675 robust and efficient both in ordinary wired networks and in wireless mesh
1676 networks. Babel is conceptually very simple in its operation and "just works"
1677 in its default configuration, though some configuration is possible and in some
1678 cases desirable.
1679
1680 <p>The Babel protocol is dual stack; i.e., it can carry both IPv4 and IPv6
1681 routes over the same IPv6 transport. For sending and receiving Babel packets,
1682 only a link-local IPv6 address is needed.
1683
1684 <p>BIRD implements an extension for IPv6 source-specific routing (SSR or SADR),
1685 but must be configured accordingly to use it. SADR-enabled Babel router can
1686 interoperate with non-SADR Babel router, but the later would ignore routes
1687 with specific (non-zero) source prefix.
1688
1689 <sect1>Configuration
1690 <label id="babel-config">
1691
1692 <p>The Babel protocol support both IPv4 and IPv6 channels; both can be
1693 configured simultaneously. It can also be configured with <ref
1694 id="ip-sadr-routes" name="IPv6 SADR"> channel instead of regular IPv6
1695 channel, in such case SADR support is enabled. Babel supports no global
1696 configuration options apart from those common to all other protocols, but
1697 supports the following per-interface configuration options:
1698
1699 <code>
1700 protocol babel [<name>] {
1701 ipv4 { <channel config> };
1702 ipv6 [sadr] { <channel config> };
1703 randomize router id <switch>;
1704 interface <interface pattern> {
1705 type <wired|wireless>;
1706 rxcost <number>;
1707 limit <number>;
1708 hello interval <time>;
1709 update interval <time>;
1710 port <number>;
1711 tx class|dscp <number>;
1712 tx priority <number>;
1713 rx buffer <number>;
1714 tx length <number>;
1715 check link <switch>;
1716 next hop ipv4 <address>;
1717 next hop ipv6 <address>;
1718 };
1719 }
1720 </code>
1721
1722 <descrip>
1723 <tag><label id="babel-channel">ipv4 | ipv6 [sadr] <m/channel config/</tag>
1724 The supported channels are IPv4, IPv6, and IPv6 SADR.
1725
1726 <tag><label id="babel-random-router-id">randomize router id <m/switch/</tag>
1727 If enabled, Bird will randomize the top 32 bits of its router ID whenever
1728 the protocol instance starts up. If a Babel node restarts, it loses its
1729 sequence number, which can cause its routes to be rejected by peers until
1730 the state is cleared out by other nodes in the network (which can take on
1731 the order of minutes). Enabling this option causes Bird to pick a random
1732 router ID every time it starts up, which avoids this problem at the cost
1733 of not having stable router IDs in the network. Default: no.
1734
1735 <tag><label id="babel-type">type wired|wireless </tag>
1736 This option specifies the interface type: Wired or wireless. On wired
1737 interfaces a neighbor is considered unreachable after a small number of
1738 Hello packets are lost, as described by <cf/limit/ option. On wireless
1739 interfaces the ETX link quality estimation technique is used to compute
1740 the metrics of routes discovered over this interface. This technique will
1741 gradually degrade the metric of routes when packets are lost rather than
1742 the more binary up/down mechanism of wired type links. Default:
1743 <cf/wired/.
1744
1745 <tag><label id="babel-rxcost">rxcost <m/num/</tag>
1746 This option specifies the nominal RX cost of the interface. The effective
1747 neighbor costs for route metrics will be computed from this value with a
1748 mechanism determined by the interface <cf/type/. Note that in contrast to
1749 other routing protocols like RIP or OSPF, the <cf/rxcost/ specifies the
1750 cost of RX instead of TX, so it affects primarily neighbors' route
1751 selection and not local route selection. Default: 96 for wired interfaces,
1752 256 for wireless.
1753
1754 <tag><label id="babel-limit">limit <m/num/</tag>
1755 BIRD keeps track of received Hello messages from each neighbor to
1756 establish neighbor reachability. For wired type interfaces, this option
1757 specifies how many of last 16 hellos have to be correctly received in
1758 order to neighbor is assumed to be up. The option is ignored on wireless
1759 type interfaces, where gradual cost degradation is used instead of sharp
1760 limit. Default: 12.
1761
1762 <tag><label id="babel-hello">hello interval <m/time/ s|ms</tag>
1763 Interval at which periodic Hello messages are sent on this interface,
1764 with time units. Default: 4 seconds.
1765
1766 <tag><label id="babel-update">update interval <m/time/ s|ms</tag>
1767 Interval at which periodic (full) updates are sent, with time
1768 units. Default: 4 times the hello interval.
1769
1770 <tag><label id="babel-port">port <m/number/</tag>
1771 This option selects an UDP port to operate on. The default is to operate
1772 on port 6696 as specified in the Babel RFC.
1773
1774 <tag><label id="babel-tx-class">tx class|dscp|priority <m/number/</tag>
1775 These options specify the ToS/DiffServ/Traffic class/Priority of the
1776 outgoing Babel packets. See <ref id="proto-tx-class" name="tx class"> common
1777 option for detailed description.
1778
1779 <tag><label id="babel-rx-buffer">rx buffer <m/number/</tag>
1780 This option specifies the size of buffers used for packet processing.
1781 The buffer size should be bigger than maximal size of received packets.
1782 The default value is the interface MTU, and the value will be clamped to a
1783 minimum of 512 bytes + IP packet overhead.
1784
1785 <tag><label id="babel-tx-length">tx length <m/number/</tag>
1786 This option specifies the maximum length of generated Babel packets. To
1787 avoid IP fragmentation, it should not exceed the interface MTU value.
1788 The default value is the interface MTU value, and the value will be
1789 clamped to a minimum of 512 bytes + IP packet overhead.
1790
1791 <tag><label id="babel-check-link">check link <m/switch/</tag>
1792 If set, the hardware link state (as reported by OS) is taken into
1793 consideration. When the link disappears (e.g. an ethernet cable is
1794 unplugged), neighbors are immediately considered unreachable and all
1795 routes received from them are withdrawn. It is possible that some
1796 hardware drivers or platforms do not implement this feature. Default:
1797 yes.
1798
1799 <tag><label id="babel-next-hop-ipv4">next hop ipv4 <m/address/</tag>
1800 Set the next hop address advertised for IPv4 routes advertised on this
1801 interface. Default: the preferred IPv4 address of the interface.
1802
1803 <tag><label id="babel-next-hop-ipv6">next hop ipv6 <m/address/</tag>
1804 Set the next hop address advertised for IPv6 routes advertised on this
1805 interface. If not set, the same link-local address that is used as the
1806 source for Babel packets will be used. In normal operation, it should not
1807 be necessary to set this option.
1808 </descrip>
1809
1810 <sect1>Attributes
1811 <label id="babel-attr">
1812
1813 <p>Babel defines just one attribute: the internal babel metric of the route. It
1814 is exposed as the <cf/babel_metric/ attribute and has range from 1 to infinity
1815 (65535).
1816
1817 <sect1>Example
1818 <label id="babel-exam">
1819
1820 <p><code>
1821 protocol babel {
1822 interface "eth*" {
1823 type wired;
1824 };
1825 interface "wlan0", "wlan1" {
1826 type wireless;
1827 hello interval 1;
1828 rxcost 512;
1829 };
1830 interface "tap0";
1831
1832 # This matches the default of babeld: redistribute all addresses
1833 # configured on local interfaces, plus re-distribute all routes received
1834 # from other babel peers.
1835
1836 ipv4 {
1837 export where (source = RTS_DEVICE) || (source = RTS_BABEL);
1838 };
1839 ipv6 {
1840 export where (source = RTS_DEVICE) || (source = RTS_BABEL);
1841 };
1842 }
1843 </code>
1844
1845 <sect1>Known issues
1846 <label id="babel-issues">
1847
1848 <p>When retracting a route, Babel generates an unreachable route for a little
1849 while (according to RFC). The interaction of this behavior with other protocols
1850 is not well tested and strange things may happen.
1851
1852
1853 <sect>BFD
1854 <label id="bfd">
1855
1856 <sect1>Introduction
1857 <label id="bfd-intro">
1858
1859 <p>Bidirectional Forwarding Detection (BFD) is not a routing protocol itself, it
1860 is an independent tool providing liveness and failure detection. Routing
1861 protocols like OSPF and BGP use integrated periodic "hello" messages to monitor
1862 liveness of neighbors, but detection times of these mechanisms are high (e.g. 40
1863 seconds by default in OSPF, could be set down to several seconds). BFD offers
1864 universal, fast and low-overhead mechanism for failure detection, which could be
1865 attached to any routing protocol in an advisory role.
1866
1867 <p>BFD consists of mostly independent BFD sessions. Each session monitors an
1868 unicast bidirectional path between two BFD-enabled routers. This is done by
1869 periodically sending control packets in both directions. BFD does not handle
1870 neighbor discovery, BFD sessions are created on demand by request of other
1871 protocols (like OSPF or BGP), which supply appropriate information like IP
1872 addresses and associated interfaces. When a session changes its state, these
1873 protocols are notified and act accordingly (e.g. break an OSPF adjacency when
1874 the BFD session went down).
1875
1876 <p>BIRD implements basic BFD behavior as defined in <rfc id="5880"> (some
1877 advanced features like the echo mode or authentication are not implemented), IP
1878 transport for BFD as defined in <rfc id="5881"> and <rfc id="5883"> and
1879 interaction with client protocols as defined in <rfc id="5882">.
1880 We currently support at most one protocol instance.
1881
1882 <p>BFD packets are sent with a dynamic source port number. Linux systems use by
1883 default a bit different dynamic port range than the IANA approved one
1884 (49152-65535). If you experience problems with compatibility, please adjust
1885 <cf>/proc/sys/net/ipv4/ip_local_port_range</cf>
1886
1887 <sect1>Configuration
1888 <label id="bfd-config">
1889
1890 <p>BFD configuration consists mainly of multiple definitions of interfaces.
1891 Most BFD config options are session specific. When a new session is requested
1892 and dynamically created, it is configured from one of these definitions. For
1893 sessions to directly connected neighbors, <cf/interface/ definitions are chosen
1894 based on the interface associated with the session, while <cf/multihop/
1895 definition is used for multihop sessions. If no definition is relevant, the
1896 session is just created with the default configuration. Therefore, an empty BFD
1897 configuration is often sufficient.
1898
1899 <p>Note that to use BFD for other protocols like OSPF or BGP, these protocols
1900 also have to be configured to request BFD sessions, usually by <cf/bfd/ option.
1901
1902 <p>Some of BFD session options require <m/time/ value, which has to be specified
1903 with the appropriate unit: <m/num/ <cf/s/|<cf/ms/|<cf/us/. Although microseconds
1904 are allowed as units, practical minimum values are usually in order of tens of
1905 milliseconds.
1906
1907 <code>
1908 protocol bfd [&lt;name&gt;] {
1909 interface &lt;interface pattern&gt; {
1910 interval &lt;time&gt;;
1911 min rx interval &lt;time&gt;;
1912 min tx interval &lt;time&gt;;
1913 idle tx interval &lt;time&gt;;
1914 multiplier &lt;num&gt;;
1915 passive &lt;switch&gt;;
1916 authentication none;
1917 authentication simple;
1918 authentication [meticulous] keyed md5|sha1;
1919 password "&lt;text&gt;";
1920 password "&lt;text&gt;" {
1921 id &lt;num&gt;;
1922 generate from "&lt;date&gt;";
1923 generate to "&lt;date&gt;";
1924 accept from "&lt;date&gt;";
1925 accept to "&lt;date&gt;";
1926 from "&lt;date&gt;";
1927 to "&lt;date&gt;";
1928 };
1929 };
1930 multihop {
1931 interval &lt;time&gt;;
1932 min rx interval &lt;time&gt;;
1933 min tx interval &lt;time&gt;;
1934 idle tx interval &lt;time&gt;;
1935 multiplier &lt;num&gt;;
1936 passive &lt;switch&gt;;
1937 };
1938 neighbor &lt;ip&gt; [dev "&lt;interface&gt;"] [local &lt;ip&gt;] [multihop &lt;switch&gt;];
1939 }
1940 </code>
1941
1942 <descrip>
1943 <tag><label id="bfd-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
1944 Interface definitions allow to specify options for sessions associated
1945 with such interfaces and also may contain interface specific options.
1946 See <ref id="proto-iface" name="interface"> common option for a detailed
1947 description of interface patterns. Note that contrary to the behavior of
1948 <cf/interface/ definitions of other protocols, BFD protocol would accept
1949 sessions (in default configuration) even on interfaces not covered by
1950 such definitions.
1951
1952 <tag><label id="bfd-multihop">multihop { <m/options/ }</tag>
1953 Multihop definitions allow to specify options for multihop BFD sessions,
1954 in the same manner as <cf/interface/ definitions are used for directly
1955 connected sessions. Currently only one such definition (for all multihop
1956 sessions) could be used.
1957
1958 <tag><label id="bfd-neighbor">neighbor <m/ip/ [dev "<m/interface/"] [local <m/ip/] [multihop <m/switch/]</tag>
1959 BFD sessions are usually created on demand as requested by other
1960 protocols (like OSPF or BGP). This option allows to explicitly add
1961 a BFD session to the specified neighbor regardless of such requests.
1962
1963 The session is identified by the IP address of the neighbor, with
1964 optional specification of used interface and local IP. By default
1965 the neighbor must be directly connected, unless the session is
1966 configured as multihop. Note that local IP must be specified for
1967 multihop sessions.
1968 </descrip>
1969
1970 <p>Session specific options (part of <cf/interface/ and <cf/multihop/ definitions):
1971
1972 <descrip>
1973 <tag><label id="bfd-interval">interval <m/time/</tag>
1974 BFD ensures availability of the forwarding path associated with the
1975 session by periodically sending BFD control packets in both
1976 directions. The rate of such packets is controlled by two options,
1977 <cf/min rx interval/ and <cf/min tx interval/ (see below). This option
1978 is just a shorthand to set both of these options together.
1979
1980 <tag><label id="bfd-min-rx-interval">min rx interval <m/time/</tag>
1981 This option specifies the minimum RX interval, which is announced to the
1982 neighbor and used there to limit the neighbor's rate of generated BFD
1983 control packets. Default: 10 ms.
1984
1985 <tag><label id="bfd-min-tx-interval">min tx interval <m/time/</tag>
1986 This option specifies the desired TX interval, which controls the rate
1987 of generated BFD control packets (together with <cf/min rx interval/
1988 announced by the neighbor). Note that this value is used only if the BFD
1989 session is up, otherwise the value of <cf/idle tx interval/ is used
1990 instead. Default: 100 ms.
1991
1992 <tag><label id="bfd-idle-tx-interval">idle tx interval <m/time/</tag>
1993 In order to limit unnecessary traffic in cases where a neighbor is not
1994 available or not running BFD, the rate of generated BFD control packets
1995 is lower when the BFD session is not up. This option specifies the
1996 desired TX interval in such cases instead of <cf/min tx interval/.
1997 Default: 1 s.
1998
1999 <tag><label id="bfd-multiplier">multiplier <m/num/</tag>
2000 Failure detection time for BFD sessions is based on established rate of
2001 BFD control packets (<cf>min rx/tx interval</cf>) multiplied by this
2002 multiplier, which is essentially (ignoring jitter) a number of missed
2003 packets after which the session is declared down. Note that rates and
2004 multipliers could be different in each direction of a BFD session.
2005 Default: 5.
2006
2007 <tag><label id="bfd-passive">passive <m/switch/</tag>
2008 Generally, both BFD session endpoints try to establish the session by
2009 sending control packets to the other side. This option allows to enable
2010 passive mode, which means that the router does not send BFD packets
2011 until it has received one from the other side. Default: disabled.
2012
2013 <tag>authentication none</tag>
2014 No passwords are sent in BFD packets. This is the default value.
2015
2016 <tag>authentication simple</tag>
2017 Every packet carries 16 bytes of password. Received packets lacking this
2018 password are ignored. This authentication mechanism is very weak.
2019
2020 <tag>authentication [meticulous] keyed md5|sha1</tag>
2021 An authentication code is appended to each packet. The cryptographic
2022 algorithm is keyed MD5 or keyed SHA-1. Note that the algorithm is common
2023 for all keys (on one interface), in contrast to OSPF or RIP, where it
2024 is a per-key option. Passwords (keys) are not sent open via network.
2025
2026 The <cf/meticulous/ variant means that cryptographic sequence numbers
2027 are increased for each sent packet, while in the basic variant they are
2028 increased about once per second. Generally, the <cf/meticulous/ variant
2029 offers better resistance to replay attacks but may require more
2030 computation.
2031
2032 <tag>password "<M>text</M>"</tag>
2033 Specifies a password used for authentication. See <ref id="proto-pass"
2034 name="password"> common option for detailed description. Note that
2035 password option <cf/algorithm/ is not available in BFD protocol. The
2036 algorithm is selected by <cf/authentication/ option for all passwords.
2037
2038 </descrip>
2039
2040 <sect1>Example
2041 <label id="bfd-exam">
2042
2043 <p><code>
2044 protocol bfd {
2045 interface "eth*" {
2046 min rx interval 20 ms;
2047 min tx interval 50 ms;
2048 idle tx interval 300 ms;
2049 };
2050 interface "gre*" {
2051 interval 200 ms;
2052 multiplier 10;
2053 passive;
2054 };
2055 multihop {
2056 interval 200 ms;
2057 multiplier 10;
2058 };
2059
2060 neighbor 192.168.1.10;
2061 neighbor 192.168.2.2 dev "eth2";
2062 neighbor 192.168.10.1 local 192.168.1.1 multihop;
2063 }
2064 </code>
2065
2066
2067 <sect>BGP
2068 <label id="bgp">
2069
2070 <p>The Border Gateway Protocol is the routing protocol used for backbone level
2071 routing in the today's Internet. Contrary to other protocols, its convergence
2072 does not rely on all routers following the same rules for route selection,
2073 making it possible to implement any routing policy at any router in the network,
2074 the only restriction being that if a router advertises a route, it must accept
2075 and forward packets according to it.
2076
2077 <p>BGP works in terms of autonomous systems (often abbreviated as AS). Each AS
2078 is a part of the network with common management and common routing policy. It is
2079 identified by a unique 16-bit number (ASN). Routers within each AS usually
2080 exchange AS-internal routing information with each other using an interior
2081 gateway protocol (IGP, such as OSPF or RIP). Boundary routers at the border of
2082 the AS communicate global (inter-AS) network reachability information with their
2083 neighbors in the neighboring AS'es via exterior BGP (eBGP) and redistribute
2084 received information to other routers in the AS via interior BGP (iBGP).
2085
2086 <p>Each BGP router sends to its neighbors updates of the parts of its routing
2087 table it wishes to export along with complete path information (a list of AS'es
2088 the packet will travel through if it uses the particular route) in order to
2089 avoid routing loops.
2090
2091 <sect1>Supported standards
2092 <label id="bgp-standards">
2093
2094 <p>
2095 <itemize>
2096 <item> <rfc id="4271"> - Border Gateway Protocol 4 (BGP)
2097 <item> <rfc id="1997"> - BGP Communities Attribute
2098 <item> <rfc id="2385"> - Protection of BGP Sessions via TCP MD5 Signature
2099 <item> <rfc id="2545"> - Use of BGP Multiprotocol Extensions for IPv6
2100 <item> <rfc id="2918"> - Route Refresh Capability
2101 <item> <rfc id="3107"> - Carrying Label Information in BGP
2102 <item> <rfc id="4360"> - BGP Extended Communities Attribute
2103 <item> <rfc id="4364"> - BGP/MPLS IPv4 Virtual Private Networks
2104 <item> <rfc id="4456"> - BGP Route Reflection
2105 <item> <rfc id="4486"> - Subcodes for BGP Cease Notification Message
2106 <item> <rfc id="4659"> - BGP/MPLS IPv6 Virtual Private Networks
2107 <item> <rfc id="4724"> - Graceful Restart Mechanism for BGP
2108 <item> <rfc id="4760"> - Multiprotocol extensions for BGP
2109 <item> <rfc id="4798"> - Connecting IPv6 Islands over IPv4 MPLS
2110 <item> <rfc id="5065"> - AS confederations for BGP
2111 <item> <rfc id="5082"> - Generalized TTL Security Mechanism
2112 <item> <rfc id="5492"> - Capabilities Advertisement with BGP
2113 <item> <rfc id="5549"> - Advertising IPv4 NLRI with an IPv6 Next Hop
2114 <item> <rfc id="5575"> - Dissemination of Flow Specification Rules
2115 <item> <rfc id="5668"> - 4-Octet AS Specific BGP Extended Community
2116 <item> <rfc id="6286"> - AS-Wide Unique BGP Identifier
2117 <item> <rfc id="6608"> - Subcodes for BGP Finite State Machine Error
2118 <item> <rfc id="6793"> - BGP Support for 4-Octet AS Numbers
2119 <item> <rfc id="7313"> - Enhanced Route Refresh Capability for BGP
2120 <item> <rfc id="7606"> - Revised Error Handling for BGP UPDATE Messages
2121 <item> <rfc id="7911"> - Advertisement of Multiple Paths in BGP
2122 <item> <rfc id="7947"> - Internet Exchange BGP Route Server
2123 <item> <rfc id="8092"> - BGP Large Communities Attribute
2124 <item> <rfc id="8203"> - BGP Administrative Shutdown Communication
2125 <item> <rfc id="8212"> - Default EBGP Route Propagation Behavior without Policies
2126 </itemize>
2127
2128 <sect1>Route selection rules
2129 <label id="bgp-route-select-rules">
2130
2131 <p>BGP doesn't have any simple metric, so the rules for selection of an optimal
2132 route among multiple BGP routes with the same preference are a bit more complex
2133 and they are implemented according to the following algorithm. It starts the
2134 first rule, if there are more "best" routes, then it uses the second rule to
2135 choose among them and so on.
2136
2137 <itemize>
2138 <item>Prefer route with the highest Local Preference attribute.
2139 <item>Prefer route with the shortest AS path.
2140 <item>Prefer IGP origin over EGP and EGP origin over incomplete.
2141 <item>Prefer the lowest value of the Multiple Exit Discriminator.
2142 <item>Prefer routes received via eBGP over ones received via iBGP.
2143 <item>Prefer routes with lower internal distance to a boundary router.
2144 <item>Prefer the route with the lowest value of router ID of the
2145 advertising router.
2146 </itemize>
2147
2148 <sect1>IGP routing table
2149 <label id="bgp-igp-routing-table">
2150
2151 <p>BGP is mainly concerned with global network reachability and with routes to
2152 other autonomous systems. When such routes are redistributed to routers in the
2153 AS via BGP, they contain IP addresses of a boundary routers (in route attribute
2154 NEXT_HOP). BGP depends on existing IGP routing table with AS-internal routes to
2155 determine immediate next hops for routes and to know their internal distances to
2156 boundary routers for the purpose of BGP route selection. In BIRD, there is
2157 usually one routing table used for both IGP routes and BGP routes.
2158
2159 <sect1>Protocol configuration
2160 <label id="bgp-proto-config">
2161
2162 <p>Each instance of the BGP corresponds to one neighboring router. This allows
2163 to set routing policy and all the other parameters differently for each neighbor
2164 using the following configuration parameters:
2165
2166 <descrip>
2167 <tag><label id="bgp-local">local [<m/ip/] [port <m/number/] [as <m/number/]</tag>
2168 Define which AS we are part of. (Note that contrary to other IP routers,
2169 BIRD is able to act as a router located in multiple AS'es simultaneously,
2170 but in such cases you need to tweak the BGP paths manually in the filters
2171 to get consistent behavior.) Optional <cf/ip/ argument specifies a source
2172 address, equivalent to the <cf/source address/ option (see below).
2173 Optional <cf/port/ argument specifies the local BGP port instead of
2174 standard port 179. The parameter may be used multiple times with
2175 different sub-options (e.g., both <cf/local 10.0.0.1 as 65000;/ and
2176 <cf/local 10.0.0.1; local as 65000;/ are valid). This parameter is
2177 mandatory.
2178
2179 <tag><label id="bgp-neighbor">neighbor [<m/ip/] [port <m/number/] [as <m/number/]</tag>
2180 Define neighboring router this instance will be talking to and what AS
2181 it is located in. In case the neighbor is in the same AS as we are, we
2182 automatically switch to iBGP. Optionally, the remote port may also be
2183 specified. Like <cf/local/ parameter, this parameter may also be used
2184 multiple times with different sub-options. This parameter is mandatory.
2185
2186 <tag><label id="bgp-iface">interface <m/string/</tag>
2187 Define interface we should use for link-local BGP IPv6 sessions.
2188 Interface can also be specified as a part of <cf/neighbor address/
2189 (e.g., <cf/neighbor fe80::1234%eth0 as 65000;/). The option may also be
2190 used for non link-local sessions when it is necessary to explicitly
2191 specify an interface, but only for direct (not multihop) sessions.
2192
2193 <tag><label id="bgp-direct">direct</tag>
2194 Specify that the neighbor is directly connected. The IP address of the
2195 neighbor must be from a directly reachable IP range (i.e. associated
2196 with one of your router's interfaces), otherwise the BGP session
2197 wouldn't start but it would wait for such interface to appear. The
2198 alternative is the <cf/multihop/ option. Default: enabled for eBGP.
2199
2200 <tag><label id="bgp-multihop">multihop [<m/number/]</tag>
2201 Configure multihop BGP session to a neighbor that isn't directly
2202 connected. Accurately, this option should be used if the configured
2203 neighbor IP address does not match with any local network subnets. Such
2204 IP address have to be reachable through system routing table. The
2205 alternative is the <cf/direct/ option. For multihop BGP it is
2206 recommended to explicitly configure the source address to have it
2207 stable. Optional <cf/number/ argument can be used to specify the number
2208 of hops (used for TTL). Note that the number of networks (edges) in a
2209 path is counted; i.e., if two BGP speakers are separated by one router,
2210 the number of hops is 2. Default: enabled for iBGP.
2211
2212 <tag><label id="bgp-source-address">source address <m/ip/</tag>
2213 Define local address we should use for next hop calculation and as a
2214 source address for the BGP session. Default: the address of the local
2215 end of the interface our neighbor is connected to.
2216
2217 <tag><label id="bgp-strict-bind">strict bind <m/switch/</tag>
2218 Specify whether BGP listening socket should be bound to a specific local
2219 address (the same as the <cf/source address/) and associated interface,
2220 or to all addresses. Binding to a specific address could be useful in
2221 cases like running multiple BIRD instances on a machine, each using its
2222 IP address. Note that listening sockets bound to a specific address and
2223 to all addresses collide, therefore either all BGP protocols (of the
2224 same address family and using the same local port) should have set
2225 <cf/strict bind/, or none of them. Default: disabled.
2226
2227 <tag><label id="bgp-check-link">check link <M>switch</M></tag>
2228 BGP could use hardware link state into consideration. If enabled,
2229 BIRD tracks the link state of the associated interface and when link
2230 disappears (e.g. an ethernet cable is unplugged), the BGP session is
2231 immediately shut down. Note that this option cannot be used with
2232 multihop BGP. Default: enabled for direct BGP, disabled otherwise.
2233
2234 <tag><label id="bgp-bfd">bfd <M>switch</M>|graceful</tag>
2235 BGP could use BFD protocol as an advisory mechanism for neighbor
2236 liveness and failure detection. If enabled, BIRD setups a BFD session
2237 for the BGP neighbor and tracks its liveness by it. This has an
2238 advantage of an order of magnitude lower detection times in case of
2239 failure. When a neighbor failure is detected, the BGP session is
2240 restarted. Optionally, it can be configured (by <cf/graceful/ argument)
2241 to trigger graceful restart instead of regular restart. Note that BFD
2242 protocol also has to be configured, see <ref id="bfd" name="BFD">
2243 section for details. Default: disabled.
2244
2245 <tag><label id="bgp-ttl-security">ttl security <m/switch/</tag>
2246 Use GTSM (<rfc id="5082"> - the generalized TTL security mechanism). GTSM
2247 protects against spoofed packets by ignoring received packets with a
2248 smaller than expected TTL. To work properly, GTSM have to be enabled on
2249 both sides of a BGP session. If both <cf/ttl security/ and
2250 <cf/multihop/ options are enabled, <cf/multihop/ option should specify
2251 proper hop value to compute expected TTL. Kernel support required:
2252 Linux: 2.6.34+ (IPv4), 2.6.35+ (IPv6), BSD: since long ago, IPv4 only.
2253 Note that full (ICMP protection, for example) <rfc id="5082"> support is
2254 provided by Linux only. Default: disabled.
2255
2256 <tag><label id="bgp-password">password <m/string/</tag>
2257 Use this password for MD5 authentication of BGP sessions (<rfc id="2385">). When
2258 used on BSD systems, see also <cf/setkey/ option below. Default: no
2259 authentication.
2260
2261 <tag><label id="bgp-setkey">setkey <m/switch/</tag>
2262 On BSD systems, keys for TCP MD5 authentication are stored in the global
2263 SA/SP database, which can be accessed by external utilities (e.g.
2264 setkey(8)). BIRD configures security associations in the SA/SP database
2265 automatically based on <cf/password/ options (see above), this option
2266 allows to disable automatic updates by BIRD when manual configuration by
2267 external utilities is preferred. Note that automatic SA/SP database
2268 updates are currently implemented only for FreeBSD. Passwords have to be
2269 set manually by an external utility on NetBSD and OpenBSD. Default:
2270 enabled (ignored on non-FreeBSD).
2271
2272 <tag><label id="bgp-passive">passive <m/switch/</tag>
2273 Standard BGP behavior is both initiating outgoing connections and
2274 accepting incoming connections. In passive mode, outgoing connections
2275 are not initiated. Default: off.
2276
2277 <tag><label id="bgp-confederation">confederation <m/number/</tag>
2278 BGP confederations (<rfc id="5065">) are collections of autonomous
2279 systems that act as one entity to external systems, represented by one
2280 confederation identifier (instead of AS numbers). This option allows to
2281 enable BGP confederation behavior and to specify the local confederation
2282 identifier. When BGP confederations are used, all BGP speakers that are
2283 members of the BGP confederation should have the same confederation
2284 identifier configured. Default: 0 (no confederation).
2285
2286 <tag><label id="bgp-confederation-member">confederation member <m/switch/</tag>
2287 When BGP confederations are used, this option allows to specify whether
2288 the BGP neighbor is a member of the same confederation as the local BGP
2289 speaker. The option is unnecessary (and ignored) for IBGP sessions, as
2290 the same AS number implies the same confederation. Default: no.
2291
2292 <tag><label id="bgp-rr-client">rr client</tag>
2293 Be a route reflector and treat the neighbor as a route reflection
2294 client. Default: disabled.
2295
2296 <tag><label id="bgp-rr-cluster-id">rr cluster id <m/IPv4 address/</tag>
2297 Route reflectors use cluster id to avoid route reflection loops. When
2298 there is one route reflector in a cluster it usually uses its router id
2299 as a cluster id, but when there are more route reflectors in a cluster,
2300 these need to be configured (using this option) to use a common cluster
2301 id. Clients in a cluster need not know their cluster id and this option
2302 is not allowed for them. Default: the same as router id.
2303
2304 <tag><label id="bgp-rs-client">rs client</tag>
2305 Be a route server and treat the neighbor as a route server client.
2306 A route server is used as a replacement for full mesh EBGP routing in
2307 Internet exchange points in a similar way to route reflectors used in
2308 IBGP routing. BIRD does not implement obsoleted <rfc id="1863">, but
2309 uses ad-hoc implementation, which behaves like plain EBGP but reduces
2310 modifications to advertised route attributes to be transparent (for
2311 example does not prepend its AS number to AS PATH attribute and
2312 keeps MED attribute). Default: disabled.
2313
2314 <tag><label id="bgp-allow-local-pref">allow bgp_local_pref <m/switch/</tag>
2315 A standard BGP implementation do not send the Local Preference attribute
2316 to eBGP neighbors and ignore this attribute if received from eBGP
2317 neighbors, as per <rfc id="4271">. When this option is enabled on an
2318 eBGP session, this attribute will be sent to and accepted from the peer,
2319 which is useful for example if you have a setup like in <rfc id="7938">.
2320 The option does not affect iBGP sessions. Default: off.
2321
2322 <tag><label id="bgp-allow-local-as">allow local as [<m/number/]</tag>
2323 BGP prevents routing loops by rejecting received routes with the local
2324 AS number in the AS path. This option allows to loose or disable the
2325 check. Optional <cf/number/ argument can be used to specify the maximum
2326 number of local ASNs in the AS path that is allowed for received
2327 routes. When the option is used without the argument, the check is
2328 completely disabled and you should ensure loop-free behavior by some
2329 other means. Default: 0 (no local AS number allowed).
2330
2331 <tag><label id="bgp-enable-route-refresh">enable route refresh <m/switch/</tag>
2332 After the initial route exchange, BGP protocol uses incremental updates
2333 to keep BGP speakers synchronized. Sometimes (e.g., if BGP speaker
2334 changes its import filter, or if there is suspicion of inconsistency) it
2335 is necessary to do a new complete route exchange. BGP protocol extension
2336 Route Refresh (<rfc id="2918">) allows BGP speaker to request
2337 re-advertisement of all routes from its neighbor. BGP protocol
2338 extension Enhanced Route Refresh (<rfc id="7313">) specifies explicit
2339 begin and end for such exchanges, therefore the receiver can remove
2340 stale routes that were not advertised during the exchange. This option
2341 specifies whether BIRD advertises these capabilities and supports
2342 related procedures. Note that even when disabled, BIRD can send route
2343 refresh requests. Default: on.
2344
2345 <tag><label id="bgp-graceful-restart">graceful restart <m/switch/|aware</tag>
2346 When a BGP speaker restarts or crashes, neighbors will discard all
2347 received paths from the speaker, which disrupts packet forwarding even
2348 when the forwarding plane of the speaker remains intact. <rfc id="4724">
2349 specifies an optional graceful restart mechanism to alleviate this
2350 issue. This option controls the mechanism. It has three states:
2351 Disabled, when no support is provided. Aware, when the graceful restart
2352 support is announced and the support for restarting neighbors is
2353 provided, but no local graceful restart is allowed (i.e. receiving-only
2354 role). Enabled, when the full graceful restart support is provided
2355 (i.e. both restarting and receiving role). Restarting role could be also
2356 configured per-channel. Note that proper support for local graceful
2357 restart requires also configuration of other protocols. Default: aware.
2358
2359 <tag><label id="bgp-graceful-restart-time">graceful restart time <m/number/</tag>
2360 The restart time is announced in the BGP graceful restart capability
2361 and specifies how long the neighbor would wait for the BGP session to
2362 re-establish after a restart before deleting stale routes. Default:
2363 120 seconds.
2364
2365 <tag><label id="bgp-long-lived-graceful-restart">long lived graceful restart <m/switch/|aware</tag>
2366 The long-lived graceful restart is an extension of the traditional
2367 <ref id="bgp-graceful-restart" name="BGP graceful restart">, where stale
2368 routes are kept even after the <ref id="bgp-graceful-restart-time"
2369 name="restart time"> expires for additional long-lived stale time, but
2370 they are marked with the LLGR_STALE community, depreferenced, and
2371 withdrawn from routers not supporting LLGR. Like traditional BGP
2372 graceful restart, it has three states: disabled, aware (receiving-only),
2373 and enabled. Note that long-lived graceful restart requires at least
2374 aware level of traditional BGP graceful restart. Default: aware, unless
2375 graceful restart is disabled.
2376
2377 <tag><label id="bgp-long-lived-stale-time">long lived stale time <m/number/</tag>
2378 The long-lived stale time is announced in the BGP long-lived graceful
2379 restart capability and specifies how long the neighbor would keep stale
2380 routes depreferenced during long-lived graceful restart until either the
2381 session is re-stablished and synchronized or the stale time expires and
2382 routes are removed. Default: 3600 seconds.
2383
2384 <tag><label id="bgp-interpret-communities">interpret communities <m/switch/</tag>
2385 <rfc id="1997"> demands that BGP speaker should process well-known
2386 communities like no-export (65535, 65281) or no-advertise (65535,
2387 65282). For example, received route carrying a no-adverise community
2388 should not be advertised to any of its neighbors. If this option is
2389 enabled (which is by default), BIRD has such behavior automatically (it
2390 is evaluated when a route is exported to the BGP protocol just before
2391 the export filter). Otherwise, this integrated processing of
2392 well-known communities is disabled. In that case, similar behavior can
2393 be implemented in the export filter. Default: on.
2394
2395 <tag><label id="bgp-enable-as4">enable as4 <m/switch/</tag>
2396 BGP protocol was designed to use 2B AS numbers and was extended later to
2397 allow 4B AS number. BIRD supports 4B AS extension, but by disabling this
2398 option it can be persuaded not to advertise it and to maintain old-style
2399 sessions with its neighbors. This might be useful for circumventing bugs
2400 in neighbor's implementation of 4B AS extension. Even when disabled
2401 (off), BIRD behaves internally as AS4-aware BGP router. Default: on.
2402
2403 <tag><label id="bgp-enable-extended-messages">enable extended messages <m/switch/</tag>
2404 The BGP protocol uses maximum message length of 4096 bytes. This option
2405 provides an extension to allow extended messages with length up
2406 to 65535 bytes. Default: off.
2407
2408 <tag><label id="bgp-capabilities">capabilities <m/switch/</tag>
2409 Use capability advertisement to advertise optional capabilities. This is
2410 standard behavior for newer BGP implementations, but there might be some
2411 older BGP implementations that reject such connection attempts. When
2412 disabled (off), features that request it (4B AS support) are also
2413 disabled. Default: on, with automatic fallback to off when received
2414 capability-related error.
2415
2416 <tag><label id="bgp-advertise-ipv4">advertise ipv4 <m/switch/</tag>
2417 Advertise IPv4 multiprotocol capability. This is not a correct behavior
2418 according to the strict interpretation of <rfc id="4760">, but it is
2419 widespread and required by some BGP implementations (Cisco and Quagga).
2420 This option is relevant to IPv4 mode with enabled capability
2421 advertisement only. Default: on.
2422
2423 <tag><label id="bgp-disable-after-error">disable after error <m/switch/</tag>
2424 When an error is encountered (either locally or by the other side),
2425 disable the instance automatically and wait for an administrator to fix
2426 the problem manually. Default: off.
2427
2428 <tag><label id="bgp-disable-after-cease">disable after cease <m/switch/|<m/set-of-flags/</tag>
2429 When a Cease notification is received, disable the instance
2430 automatically and wait for an administrator to fix the problem manually.
2431 When used with <m/switch/ argument, it means handle every Cease subtype
2432 with the exception of <cf/connection collision/. Default: off.
2433
2434 The <m/set-of-flags/ allows to narrow down relevant Cease subtypes. The
2435 syntax is <cf>{<m/flag/ [, <m/.../] }</cf>, where flags are: <cf/cease/,
2436 <cf/prefix limit hit/, <cf/administrative shutdown/,
2437 <cf/peer deconfigured/, <cf/administrative reset/,
2438 <cf/connection rejected/, <cf/configuration change/,
2439 <cf/connection collision/, <cf/out of resources/.
2440
2441 <tag><label id="bgp-hold-time">hold time <m/number/</tag>
2442 Time in seconds to wait for a Keepalive message from the other side
2443 before considering the connection stale. Default: depends on agreement
2444 with the neighboring router, we prefer 240 seconds if the other side is
2445 willing to accept it.
2446
2447 <tag><label id="bgp-startup-hold-time">startup hold time <m/number/</tag>
2448 Value of the hold timer used before the routers have a chance to exchange
2449 open messages and agree on the real value. Default: 240 seconds.
2450
2451 <tag><label id="bgp-keepalive-time">keepalive time <m/number/</tag>
2452 Delay in seconds between sending of two consecutive Keepalive messages.
2453 Default: One third of the hold time.
2454
2455 <tag><label id="bgp-connect-delay-time">connect delay time <m/number/</tag>
2456 Delay in seconds between protocol startup and the first attempt to
2457 connect. Default: 5 seconds.
2458
2459 <tag><label id="bgp-connect-retry-time">connect retry time <m/number/</tag>
2460 Time in seconds to wait before retrying a failed attempt to connect.
2461 Default: 120 seconds.
2462
2463 <tag><label id="bgp-error-wait-time">error wait time <m/number/,<m/number/</tag>
2464 Minimum and maximum delay in seconds between a protocol failure (either
2465 local or reported by the peer) and automatic restart. Doesn't apply
2466 when <cf/disable after error/ is configured. If consecutive errors
2467 happen, the delay is increased exponentially until it reaches the
2468 maximum. Default: 60, 300.
2469
2470 <tag><label id="bgp-error-forget-time">error forget time <m/number/</tag>
2471 Maximum time in seconds between two protocol failures to treat them as a
2472 error sequence which makes <cf/error wait time/ increase exponentially.
2473 Default: 300 seconds.
2474
2475 <tag><label id="bgp-path-metric">path metric <m/switch/</tag>
2476 Enable comparison of path lengths when deciding which BGP route is the
2477 best one. Default: on.
2478
2479 <tag><label id="bgp-med-metric">med metric <m/switch/</tag>
2480 Enable comparison of MED attributes (during best route selection) even
2481 between routes received from different ASes. This may be useful if all
2482 MED attributes contain some consistent metric, perhaps enforced in
2483 import filters of AS boundary routers. If this option is disabled, MED
2484 attributes are compared only if routes are received from the same AS
2485 (which is the standard behavior). Default: off.
2486
2487 <tag><label id="bgp-deterministic-med">deterministic med <m/switch/</tag>
2488 BGP route selection algorithm is often viewed as a comparison between
2489 individual routes (e.g. if a new route appears and is better than the
2490 current best one, it is chosen as the new best one). But the proper
2491 route selection, as specified by <rfc id="4271">, cannot be fully
2492 implemented in that way. The problem is mainly in handling the MED
2493 attribute. BIRD, by default, uses an simplification based on individual
2494 route comparison, which in some cases may lead to temporally dependent
2495 behavior (i.e. the selection is dependent on the order in which routes
2496 appeared). This option enables a different (and slower) algorithm
2497 implementing proper <rfc id="4271"> route selection, which is
2498 deterministic. Alternative way how to get deterministic behavior is to
2499 use <cf/med metric/ option. This option is incompatible with <ref
2500 id="dsc-table-sorted" name="sorted tables">. Default: off.
2501
2502 <tag><label id="bgp-igp-metric">igp metric <m/switch/</tag>
2503 Enable comparison of internal distances to boundary routers during best
2504 route selection. Default: on.
2505
2506 <tag><label id="bgp-prefer-older">prefer older <m/switch/</tag>
2507 Standard route selection algorithm breaks ties by comparing router IDs.
2508 This changes the behavior to prefer older routes (when both are external
2509 and from different peer). For details, see <rfc id="5004">. Default: off.
2510
2511 <tag><label id="bgp-default-med">default bgp_med <m/number/</tag>
2512 Value of the Multiple Exit Discriminator to be used during route
2513 selection when the MED attribute is missing. Default: 0.
2514
2515 <tag><label id="bgp-default-local-pref">default bgp_local_pref <m/number/</tag>
2516 A default value for the Local Preference attribute. It is used when
2517 a new Local Preference attribute is attached to a route by the BGP
2518 protocol itself (for example, if a route is received through eBGP and
2519 therefore does not have such attribute). Default: 100 (0 in pre-1.2.0
2520 versions of BIRD).
2521 </descrip>
2522
2523 <sect1>Channel configuration
2524 <label id="bgp-channel-config">
2525
2526 <p>BGP supports several AFIs and SAFIs over one connection. Every AFI/SAFI
2527 announced to the peer corresponds to one channel. The table of supported AFI/SAFIs
2528 together with their appropriate channels follows.
2529
2530 <table loc="h">
2531 <tabular ca="l|l|l|r|r">
2532 <bf/Channel name/ | <bf/Table nettype/ | <bf/IGP table allowed/ | <bf/AFI/ | <bf/SAFI/
2533 @<hline>
2534 <cf/ipv4/ | <cf/ipv4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 1
2535 @ <cf/ipv6/ | <cf/ipv6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 1
2536 @ <cf/ipv4 multicast/ | <cf/ipv4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 2
2537 @ <cf/ipv6 multicast/ | <cf/ipv6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 2
2538 @ <cf/ipv4 mpls/ | <cf/ipv4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 4
2539 @ <cf/ipv6 mpls/ | <cf/ipv6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 4
2540 @ <cf/vpn4 mpls/ | <cf/vpn4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 128
2541 @ <cf/vpn6 mpls/ | <cf/vpn6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 128
2542 @ <cf/vpn4 multicast/ | <cf/vpn4/ | <cf/ipv4/ and <cf/ipv6/ | 1 | 129
2543 @ <cf/vpn6 multicast/ | <cf/vpn6/ | <cf/ipv4/ and <cf/ipv6/ | 2 | 129
2544 @ <cf/flow4/ | <cf/flow4/ | --- | 1 | 133
2545 @ <cf/flow6/ | <cf/flow6/ | --- | 2 | 133
2546 </tabular>
2547 </table>
2548
2549 <p>Due to <rfc id="8212">, external BGP protocol requires explicit configuration
2550 of import and export policies (in contrast to other protocols, where default
2551 policies of <cf/import all/ and <cf/export none/ are used in absence of explicit
2552 configuration). Note that blanket policies like <cf/all/ or <cf/none/ can still
2553 be used in explicit configuration.
2554
2555 <p>BGP channels have additional config options (together with the common ones):
2556
2557 <descrip>
2558 <tag><label id="bgp-next-hop-keep">next hop keep</tag>
2559 Forward the received Next Hop attribute even in situations where the
2560 local address should be used instead, like when the route is sent to an
2561 interface with a different subnet. Default: disabled.
2562
2563 <tag><label id="bgp-next-hop-self">next hop self</tag>
2564 Avoid calculation of the Next Hop attribute and always advertise our own
2565 source address as a next hop. This needs to be used only occasionally to
2566 circumvent misconfigurations of other routers. Default: disabled.
2567
2568 <tag><label id="bgp-next-hop-address">next hop address <m/ip/</tag>
2569 Avoid calculation of the Next Hop attribute and always advertise this address
2570 as a next hop.
2571
2572 <tag><label id="bgp-missing-lladdr">missing lladdr self|drop|ignore</tag>
2573 Next Hop attribute in BGP-IPv6 sometimes contains just the global IPv6
2574 address, but sometimes it has to contain both global and link-local IPv6
2575 addresses. This option specifies what to do if BIRD have to send both
2576 addresses but does not know link-local address. This situation might
2577 happen when routes from other protocols are exported to BGP, or when
2578 improper updates are received from BGP peers. <cf/self/ means that BIRD
2579 advertises its own local address instead. <cf/drop/ means that BIRD
2580 skips that prefixes and logs error. <cf/ignore/ means that BIRD ignores
2581 the problem and sends just the global address (and therefore forms
2582 improper BGP update). Default: <cf/self/, unless BIRD is configured as a
2583 route server (option <cf/rs client/), in that case default is <cf/ignore/,
2584 because route servers usually do not forward packets themselves.
2585
2586 <tag><label id="bgp-gateway">gateway direct|recursive</tag>
2587 For received routes, their <cf/gw/ (immediate next hop) attribute is
2588 computed from received <cf/bgp_next_hop/ attribute. This option
2589 specifies how it is computed. Direct mode means that the IP address from
2590 <cf/bgp_next_hop/ is used if it is directly reachable, otherwise the
2591 neighbor IP address is used. Recursive mode means that the gateway is
2592 computed by an IGP routing table lookup for the IP address from
2593 <cf/bgp_next_hop/. Note that there is just one level of indirection in
2594 recursive mode - the route obtained by the lookup must not be recursive
2595 itself, to prevent mutually recursive routes.
2596
2597 Recursive mode is the behavior specified by the BGP
2598 standard. Direct mode is simpler, does not require any routes in a
2599 routing table, and was used in older versions of BIRD, but does not
2600 handle well nontrivial iBGP setups and multihop. Recursive mode is
2601 incompatible with <ref id="dsc-table-sorted" name="sorted tables">. Default:
2602 <cf/direct/ for direct sessions, <cf/recursive/ for multihop sessions.
2603
2604 <tag><label id="bgp-igp-table">igp table <m/name/</tag>
2605 Specifies a table that is used as an IGP routing table. The type of this
2606 table must be as allowed in the table above. This option is allowed once
2607 for every allowed table type. Default: the same as the main table
2608 the channel is connected to (if eligible).
2609
2610 <tag><label id="bgp-secondary">secondary <m/switch/</tag>
2611 Usually, if an export filter rejects a selected route, no other route is
2612 propagated for that network. This option allows to try the next route in
2613 order until one that is accepted is found or all routes for that network
2614 are rejected. This can be used for route servers that need to propagate
2615 different tables to each client but do not want to have these tables
2616 explicitly (to conserve memory). This option requires that the connected
2617 routing table is <ref id="dsc-table-sorted" name="sorted">. Default: off.
2618
2619 <tag><label id="bgp-extended-next-hop">extended next hop <m/switch/</tag>
2620 BGP expects that announced next hops have the same address family as
2621 associated network prefixes. This option provides an extension to use
2622 IPv4 next hops with IPv6 prefixes and vice versa. For IPv4 / VPNv4
2623 channels, the behavior is controlled by the Extended Next Hop Encoding
2624 capability, as described in <rfc id="5549">. For IPv6 / VPNv6 channels,
2625 just IPv4-mapped IPv6 addresses are used, as described in
2626 <rfc id="4798"> and <rfc id="4659">. Default: off.
2627
2628 <tag><label id="bgp-add-paths">add paths <m/switch/|rx|tx</tag>
2629 Standard BGP can propagate only one path (route) per destination network
2630 (usually the selected one). This option controls the add-path protocol
2631 extension, which allows to advertise any number of paths to a
2632 destination. Note that to be active, add-path has to be enabled on both
2633 sides of the BGP session, but it could be enabled separately for RX and
2634 TX direction. When active, all available routes accepted by the export
2635 filter are advertised to the neighbor. Default: off.
2636
2637 <tag><label id="bgp-graceful-restart-c">graceful restart <m/switch/</tag>
2638 Although BGP graceful restart is configured mainly by protocol-wide
2639 <ref id="bgp-graceful-restart" name="options">, it is possible to
2640 configure restarting role per AFI/SAFI pair by this channel option.
2641 The option is ignored if graceful restart is disabled by protocol-wide
2642 option. Default: off in aware mode, on in full mode.
2643
2644 <tag><label id="bgp-long-lived-graceful-restart-c">long lived graceful restart <m/switch/</tag>
2645 BGP long-lived graceful restart is configured mainly by protocol-wide
2646 <ref id="bgp-long-lived-graceful-restart" name="options">, but the
2647 restarting role can be set per AFI/SAFI pair by this channel option.
2648 The option is ignored if long-lived graceful restart is disabled by
2649 protocol-wide option. Default: off in aware mode, on in full mode.
2650
2651 <tag><label id="bgp-long-lived-stale-time-c">long lived stale time <m/number/</tag>
2652 Like previous graceful restart channel options, this option allows to
2653 set <ref id="bgp-long-lived-stale-time" name="long lived stale time">
2654 per AFI/SAFI pair instead of per protocol. Default: set by protocol-wide
2655 option.
2656 </descrip>
2657
2658 <sect1>Attributes
2659 <label id="bgp-attr">
2660
2661 <p>BGP defines several route attributes. Some of them (those marked with
2662 `<tt/I/' in the table below) are available on internal BGP connections only,
2663 some of them (marked with `<tt/O/') are optional.
2664
2665 <descrip>
2666 <tag><label id="rta-bgp-path">bgppath bgp_path</tag>
2667 Sequence of AS numbers describing the AS path the packet will travel
2668 through when forwarded according to the particular route. In case of
2669 internal BGP it doesn't contain the number of the local AS.
2670
2671 <tag><label id="rta-bgp-local-pref">int bgp_local_pref [I]</tag>
2672 Local preference value used for selection among multiple BGP routes (see
2673 the selection rules above). It's used as an additional metric which is
2674 propagated through the whole local AS.
2675
2676 <tag><label id="rta-bgp-med">int bgp_med [O]</tag>
2677 The Multiple Exit Discriminator of the route is an optional attribute
2678 which is used on external (inter-AS) links to convey to an adjacent AS
2679 the optimal entry point into the local AS. The received attribute is
2680 also propagated over internal BGP links. The attribute value is zeroed
2681 when a route is exported to an external BGP instance to ensure that the
2682 attribute received from a neighboring AS is not propagated to other
2683 neighboring ASes. A new value might be set in the export filter of an
2684 external BGP instance. See <rfc id="4451"> for further discussion of
2685 BGP MED attribute.
2686
2687 <tag><label id="rta-bgp-origin">enum bgp_origin</tag>
2688 Origin of the route: either <cf/ORIGIN_IGP/ if the route has originated
2689 in an interior routing protocol or <cf/ORIGIN_EGP/ if it's been imported
2690 from the <tt>EGP</tt> protocol (nowadays it seems to be obsolete) or
2691 <cf/ORIGIN_INCOMPLETE/ if the origin is unknown.
2692
2693 <tag><label id="rta-bgp-next-hop">ip bgp_next_hop</tag>
2694 Next hop to be used for forwarding of packets to this destination. On
2695 internal BGP connections, it's an address of the originating router if
2696 it's inside the local AS or a boundary router the packet will leave the
2697 AS through if it's an exterior route, so each BGP speaker within the AS
2698 has a chance to use the shortest interior path possible to this point.
2699
2700 <tag><label id="rta-bgp-atomic-aggr">void bgp_atomic_aggr [O]</tag>
2701 This is an optional attribute which carries no value, but the sole
2702 presence of which indicates that the route has been aggregated from
2703 multiple routes by some router on the path from the originator.
2704
2705 <!-- we don't handle aggregators right since they are of a very obscure type
2706 <tag>bgp_aggregator</tag>
2707 -->
2708 <tag><label id="rta-bgp-community">clist bgp_community [O]</tag>
2709 List of community values associated with the route. Each such value is a
2710 pair (represented as a <cf/pair/ data type inside the filters) of 16-bit
2711 integers, the first of them containing the number of the AS which
2712 defines the community and the second one being a per-AS identifier.
2713 There are lots of uses of the community mechanism, but generally they
2714 are used to carry policy information like "don't export to USA peers".
2715 As each AS can define its own routing policy, it also has a complete
2716 freedom about which community attributes it defines and what will their
2717 semantics be.
2718
2719 <tag><label id="rta-bgp-ext-community">eclist bgp_ext_community [O]</tag>
2720 List of extended community values associated with the route. Extended
2721 communities have similar usage as plain communities, but they have an
2722 extended range (to allow 4B ASNs) and a nontrivial structure with a type
2723 field. Individual community values are represented using an <cf/ec/ data
2724 type inside the filters.
2725
2726 <tag><label id="rta-bgp-large-community">lclist bgp_large_community [O]</tag>
2727 List of large community values associated with the route. Large BGP
2728 communities is another variant of communities, but contrary to extended
2729 communities they behave very much the same way as regular communities,
2730 just larger -- they are uniform untyped triplets of 32bit numbers.
2731 Individual community values are represented using an <cf/lc/ data type
2732 inside the filters.
2733
2734 <tag><label id="rta-bgp-originator-id">quad bgp_originator_id [I, O]</tag>
2735 This attribute is created by the route reflector when reflecting the
2736 route and contains the router ID of the originator of the route in the
2737 local AS.
2738
2739 <tag><label id="rta-bgp-cluster-list">clist bgp_cluster_list [I, O]</tag>
2740 This attribute contains a list of cluster IDs of route reflectors. Each
2741 route reflector prepends its cluster ID when reflecting the route.
2742 </descrip>
2743
2744 <sect1>Example
2745 <label id="bgp-exam">
2746
2747 <p><code>
2748 protocol bgp {
2749 local 198.51.100.14 as 65000; # Use a private AS number
2750 neighbor 198.51.100.130 as 64496; # Our neighbor ...
2751 multihop; # ... which is connected indirectly
2752 ipv4 {
2753 export filter { # We use non-trivial export rules
2754 if source = RTS_STATIC then { # Export only static routes
2755 # Assign our community
2756 bgp_community.add((65000,64501));
2757 # Artificially increase path length
2758 # by advertising local AS number twice
2759 if bgp_path ~ [= 65000 =] then
2760 bgp_path.prepend(65000);
2761 accept;
2762 }
2763 reject;
2764 };
2765 import all;
2766 next hop self; # advertise this router as next hop
2767 igp table myigptable4; # IGP table for routes with IPv4 nexthops
2768 igp table myigptable6; # IGP table for routes with IPv6 nexthops
2769 };
2770 ipv6 {
2771 export filter mylargefilter; # We use a named filter
2772 import all;
2773 missing lladdr self;
2774 igp table myigptable4; # IGP table for routes with IPv4 nexthops
2775 igp table myigptable6; # IGP table for routes with IPv6 nexthops
2776 };
2777 ipv4 multicast {
2778 import all;
2779 export filter someotherfilter;
2780 table mymulticasttable4; # Another IPv4 table, dedicated for multicast
2781 igp table myigptable4;
2782 };
2783 }
2784 </code>
2785
2786
2787 <sect>Device
2788 <label id="device">
2789
2790 <p>The Device protocol is not a real routing protocol. It doesn't generate any
2791 routes and it only serves as a module for getting information about network
2792 interfaces from the kernel. This protocol supports no channel.
2793
2794 <p>Except for very unusual circumstances, you probably should include this
2795 protocol in the configuration since almost all other protocols require network
2796 interfaces to be defined for them to work with.
2797
2798 <sect1>Configuration
2799 <label id="device-config">
2800
2801 <p><descrip>
2802 <tag><label id="device-scan-time">scan time <m/number/</tag>
2803 Time in seconds between two scans of the network interface list. On
2804 systems where we are notified about interface status changes
2805 asynchronously (such as newer versions of Linux), we need to scan the
2806 list only in order to avoid confusion by lost notification messages,
2807 so the default time is set to a large value.
2808
2809 <tag><label id="device-iface">interface <m/pattern/ [, <m/.../]</tag>
2810 By default, the Device protocol handles all interfaces without any
2811 configuration. Interface definitions allow to specify optional
2812 parameters for specific interfaces. See <ref id="proto-iface"
2813 name="interface"> common option for detailed description. Currently only
2814 one interface option is available:
2815
2816 <tag><label id="device-preferred">preferred <m/ip/</tag>
2817 If a network interface has more than one IP address, BIRD chooses one of
2818 them as a preferred one. Preferred IP address is used as source address
2819 for packets or announced next hop by routing protocols. Precisely, BIRD
2820 chooses one preferred IPv4 address, one preferred IPv6 address and one
2821 preferred link-local IPv6 address. By default, BIRD chooses the first
2822 found IP address as the preferred one.
2823
2824 This option allows to specify which IP address should be preferred. May
2825 be used multiple times for different address classes (IPv4, IPv6, IPv6
2826 link-local). In all cases, an address marked by operating system as
2827 secondary cannot be chosen as the primary one.
2828 </descrip>
2829
2830 <p>As the Device protocol doesn't generate any routes, it cannot have
2831 any attributes. Example configuration looks like this:
2832
2833 <p><code>
2834 protocol device {
2835 scan time 10; # Scan the interfaces often
2836 interface "eth0" {
2837 preferred 192.168.1.1;
2838 preferred 2001:db8:1:10::1;
2839 };
2840 }
2841 </code>
2842
2843
2844 <sect>Direct
2845 <label id="direct">
2846
2847 <p>The Direct protocol is a simple generator of device routes for all the
2848 directly connected networks according to the list of interfaces provided by the
2849 kernel via the Device protocol. The Direct protocol supports both IPv4 and IPv6
2850 channels; both can be configured simultaneously. It can also be configured with
2851 <ref id="ip-sadr-routes" name="IPv6 SADR"> channel instead of regular IPv6
2852 channel in order to be used together with SADR-enabled Babel protocol.
2853
2854 <p>The question is whether it is a good idea to have such device routes in BIRD
2855 routing table. OS kernel usually handles device routes for directly connected
2856 networks by itself so we don't need (and don't want) to export these routes to
2857 the kernel protocol. OSPF protocol creates device routes for its interfaces
2858 itself and BGP protocol is usually used for exporting aggregate routes. But the
2859 Direct protocol is necessary for distance-vector protocols like RIP or Babel to
2860 announce local networks.
2861
2862 <p>There are just few configuration options for the Direct protocol:
2863
2864 <p><descrip>
2865 <tag><label id="direct-iface">interface <m/pattern/ [, <m/.../]</tag>
2866 By default, the Direct protocol will generate device routes for all the
2867 interfaces available. If you want to restrict it to some subset of
2868 interfaces or addresses (e.g. if you're using multiple routing tables
2869 for policy routing and some of the policy domains don't contain all
2870 interfaces), just use this clause. See <ref id="proto-iface" name="interface">
2871 common option for detailed description. The Direct protocol uses
2872 extended interface clauses.
2873
2874 <tag><label id="direct-check-link">check link <m/switch/</tag>
2875 If enabled, a hardware link state (reported by OS) is taken into
2876 consideration. Routes for directly connected networks are generated only
2877 if link up is reported and they are withdrawn when link disappears
2878 (e.g., an ethernet cable is unplugged). Default value is no.
2879 </descrip>
2880
2881 <p>Direct device routes don't contain any specific attributes.
2882
2883 <p>Example config might look like this:
2884
2885 <p><code>
2886 protocol direct {
2887 ipv4;
2888 ipv6;
2889 interface "-arc*", "*"; # Exclude the ARCnets
2890 }
2891 </code>
2892
2893
2894 <sect>Kernel
2895 <label id="krt">
2896
2897 <p>The Kernel protocol is not a real routing protocol. Instead of communicating
2898 with other routers in the network, it performs synchronization of BIRD's routing
2899 tables with the OS kernel. Basically, it sends all routing table updates to the
2900 kernel and from time to time it scans the kernel tables to see whether some
2901 routes have disappeared (for example due to unnoticed up/down transition of an
2902 interface) or whether an `alien' route has been added by someone else (depending
2903 on the <cf/learn/ switch, such routes are either ignored or accepted to our
2904 table).
2905
2906 <p>Note that routes created by OS kernel itself, namely direct routes
2907 representing IP subnets of associated interfaces, are not imported even with
2908 <cf/learn/ enabled. You can use <ref id="direct" name="Direct protocol"> to
2909 generate these direct routes.
2910
2911 <p>If your OS supports only a single routing table, you can configure only one
2912 instance of the Kernel protocol. If it supports multiple tables (in order to
2913 allow policy routing; such an OS is for example Linux), you can run as many
2914 instances as you want, but each of them must be connected to a different BIRD
2915 routing table and to a different kernel table.
2916
2917 <p>Because the kernel protocol is partially integrated with the connected
2918 routing table, there are two limitations - it is not possible to connect more
2919 kernel protocols to the same routing table and changing route destination
2920 (gateway) in an export filter of a kernel protocol does not work. Both
2921 limitations can be overcome using another routing table and the pipe protocol.
2922
2923 <p>The Kernel protocol supports both IPv4 and IPv6 channels; only one channel
2924 can be configured in each protocol instance. On Linux, it also supports <ref
2925 id="ip-sadr-routes" name="IPv6 SADR"> and <ref id="mpls-routes" name="MPLS">
2926 channels.
2927
2928 <sect1>Configuration
2929 <label id="krt-config">
2930
2931 <p><descrip>
2932 <tag><label id="krt-persist">persist <m/switch/</tag>
2933 Tell BIRD to leave all its routes in the routing tables when it exits
2934 (instead of cleaning them up).
2935
2936 <tag><label id="krt-scan-time">scan time <m/number/</tag>
2937 Time in seconds between two consecutive scans of the kernel routing
2938 table.
2939
2940 <tag><label id="krt-learn">learn <m/switch/</tag>
2941 Enable learning of routes added to the kernel routing tables by other
2942 routing daemons or by the system administrator. This is possible only on
2943 systems which support identification of route authorship.
2944
2945 <tag><label id="krt-kernel-table">kernel table <m/number/</tag>
2946 Select which kernel table should this particular instance of the Kernel
2947 protocol work with. Available only on systems supporting multiple
2948 routing tables.
2949
2950 <tag><label id="krt-metric">metric <m/number/</tag> (Linux)
2951 Use specified value as a kernel metric (priority) for all routes sent to
2952 the kernel. When multiple routes for the same network are in the kernel
2953 routing table, the Linux kernel chooses one with lower metric. Also,
2954 routes with different metrics do not clash with each other, therefore
2955 using dedicated metric value is a reliable way to avoid overwriting
2956 routes from other sources (e.g. kernel device routes). Metric 0 has a
2957 special meaning of undefined metric, in which either OS default is used,
2958 or per-route metric can be set using <cf/krt_metric/ attribute. Default:
2959 32.
2960
2961 <tag><label id="krt-graceful-restart">graceful restart <m/switch/</tag>
2962 Participate in graceful restart recovery. If this option is enabled and
2963 a graceful restart recovery is active, the Kernel protocol will defer
2964 synchronization of routing tables until the end of the recovery. Note
2965 that import of kernel routes to BIRD is not affected.
2966
2967 <tag><label id="krt-merge-paths">merge paths <M>switch</M> [limit <M>number</M>]</tag>
2968 Usually, only best routes are exported to the kernel protocol. With path
2969 merging enabled, both best routes and equivalent non-best routes are
2970 merged during export to generate one ECMP (equal-cost multipath) route
2971 for each network. This is useful e.g. for BGP multipath. Note that best
2972 routes are still pivotal for route export (responsible for most
2973 properties of resulting ECMP routes), while exported non-best routes are
2974 responsible just for additional multipath next hops. This option also
2975 allows to specify a limit on maximal number of nexthops in one route. By
2976 default, multipath merging is disabled. If enabled, default value of the
2977 limit is 16.
2978 </descrip>
2979
2980 <sect1>Attributes
2981 <label id="krt-attr">
2982
2983 <p>The Kernel protocol defines several attributes. These attributes are
2984 translated to appropriate system (and OS-specific) route attributes. We support
2985 these attributes:
2986
2987 <descrip>
2988 <tag><label id="rta-krt-source">int krt_source</tag>
2989 The original source of the imported kernel route. The value is
2990 system-dependent. On Linux, it is a value of the protocol field of the
2991 route. See /etc/iproute2/rt_protos for common values. On BSD, it is
2992 based on STATIC and PROTOx flags. The attribute is read-only.
2993
2994 <tag><label id="rta-krt-metric">int krt_metric</tag> (Linux)
2995 The kernel metric of the route. When multiple same routes are in a
2996 kernel routing table, the Linux kernel chooses one with lower metric.
2997 Note that preferred way to set kernel metric is to use protocol option
2998 <cf/metric/, unless per-route metric values are needed.
2999
3000 <tag><label id="rta-krt-prefsrc">ip krt_prefsrc</tag> (Linux)
3001 The preferred source address. Used in source address selection for
3002 outgoing packets. Has to be one of the IP addresses of the router.
3003
3004 <tag><label id="rta-krt-realm">int krt_realm</tag> (Linux)
3005 The realm of the route. Can be used for traffic classification.
3006
3007 <tag><label id="rta-krt-scope">int krt_scope</tag> (Linux IPv4)
3008 The scope of the route. Valid values are 0-254, although Linux kernel
3009 may reject some values depending on route type and nexthop. It is
3010 supposed to represent `indirectness' of the route, where nexthops of
3011 routes are resolved through routes with a higher scope, but in current
3012 kernels anything below <it/link/ (253) is treated as <it/global/ (0).
3013 When not present, global scope is implied for all routes except device
3014 routes, where link scope is used by default.
3015 </descrip>
3016
3017 <p>In Linux, there is also a plenty of obscure route attributes mostly focused
3018 on tuning TCP performance of local connections. BIRD supports most of these
3019 attributes, see Linux or iproute2 documentation for their meaning. Attributes
3020 <cf/krt_lock_*/ and <cf/krt_feature_*/ have type bool, others have type int.
3021 Supported attributes are:
3022
3023 <cf/krt_mtu/, <cf/krt_lock_mtu/, <cf/krt_window/, <cf/krt_lock_window/,
3024 <cf/krt_rtt/, <cf/krt_lock_rtt/, <cf/krt_rttvar/, <cf/krt_lock_rttvar/,
3025 <cf/krt_sstresh/, <cf/krt_lock_sstresh/, <cf/krt_cwnd/, <cf/krt_lock_cwnd/,
3026 <cf/krt_advmss/, <cf/krt_lock_advmss/, <cf/krt_reordering/, <cf/krt_lock_reordering/,
3027 <cf/krt_hoplimit/, <cf/krt_lock_hoplimit/, <cf/krt_rto_min/, <cf/krt_lock_rto_min/,
3028 <cf/krt_initcwnd/, <cf/krt_initrwnd/, <cf/krt_quickack/,
3029 <cf/krt_feature_ecn/, <cf/krt_feature_allfrag/
3030
3031 <sect1>Example
3032 <label id="krt-exam">
3033
3034 <p>A simple configuration can look this way:
3035
3036 <p><code>
3037 protocol kernel {
3038 export all;
3039 }
3040 </code>
3041
3042 <p>Or for a system with two routing tables:
3043
3044 <p><code>
3045 protocol kernel { # Primary routing table
3046 learn; # Learn alien routes from the kernel
3047 persist; # Don't remove routes on bird shutdown
3048 scan time 10; # Scan kernel routing table every 10 seconds
3049 ipv4 {
3050 import all;
3051 export all;
3052 };
3053 }
3054
3055 protocol kernel { # Secondary routing table
3056 kernel table 100;
3057 ipv4 {
3058 table auxtable;
3059 export all;
3060 };
3061 }
3062 </code>
3063
3064
3065 <sect>MRT
3066 <label id="mrt">
3067
3068 <sect1>Introduction
3069 <label id="mrt-intro">
3070
3071 <p>The MRT protocol is a component responsible for handling the Multi-Threaded
3072 Routing Toolkit (MRT) routing information export format, which is mainly used
3073 for collecting and analyzing of routing information from BGP routers. The MRT
3074 protocol can be configured to do periodic dumps of routing tables, created MRT
3075 files can be analyzed later by other tools. Independent MRT table dumps can also
3076 be requested from BIRD client. There is also a feature to save incoming BGP
3077 messages in MRT files, but it is controlled by <ref id="proto-mrtdump"
3078 name="mrtdump"> options independently of MRT protocol, although that might
3079 change in the future.
3080
3081 BIRD implements the main MRT format specification as defined in <rfc id="6396">
3082 and the ADD_PATH extension (<rfc id="8050">).
3083
3084 <sect1>Configuration
3085 <label id="mrt-config">
3086
3087 <p>MRT configuration consists of several statements describing routing table
3088 dumps. Multiple independent periodic dumps can be done as multiple MRT protocol
3089 instances. The MRT protocol does not use channels. There are two mandatory
3090 statements: <cf/filename/ and <cf/period/.
3091
3092 The behavior can be modified by following configuration parameters:
3093
3094 <descrip>
3095 <tag><label id="mrt-table">table <m/name/ | "<m/pattern/"</tag>
3096 Specify a routing table (or a set of routing tables described by a
3097 wildcard pattern) that are to be dumped by the MRT protocol instance.
3098 Default: the master table.
3099
3100 <tag><label id="mrt-filter">filter { <m/filter commands/ }</tag>
3101 The MRT protocol allows to specify a filter that is applied to routes as
3102 they are dumped. Rejected routes are ignored and not saved to the MRT
3103 dump file. Default: no filter.
3104
3105 <tag><label id="mrt-where">where <m/filter expression/</tag>
3106 An alternative way to specify a filter for the MRT protocol.
3107
3108 <tag><label id="mrt-filename">filename "<m/filename/"</tag>
3109 Specify a filename for MRT dump files. The filename may contain time
3110 format sequences with <it/strftime(3)/ notation (see <it/man strftime/
3111 for details), there is also a sequence "%N" that is expanded to the name
3112 of dumped table. Therefore, each periodic dump of each table can be
3113 saved to a different file. Mandatory, see example below.
3114
3115 <tag><label id="mrt-period">period <m/number/</tag>
3116 Specify the time interval (in seconds) between periodic dumps.
3117 Mandatory.
3118
3119 <tag><label id="mrt-always-add-path">always add path <m/switch/</tag>
3120 The MRT format uses special records (specified in <rfc id="8050">) for
3121 routes received using BGP ADD_PATH extension to keep Path ID, while
3122 other routes use regular records. This has advantage of better
3123 compatibility with tools that do not know special records, but it loses
3124 information about which route is the best route. When this option is
3125 enabled, both ADD_PATH and non-ADD_PATH routes are stored in ADD_PATH
3126 records and order of routes for network is preserved. Default: disabled.
3127 </descrip>
3128
3129 <sect1>Example
3130 <label id="mrt-exam">
3131
3132 <p><code>
3133 protocol mrt {
3134 table "tab*";
3135 where source = RTS_BGP;
3136 filename "/var/log/bird/%N_%F_%T.mrt";
3137 period 300;
3138 }
3139 </code>
3140
3141
3142 <sect>OSPF
3143 <label id="ospf">
3144
3145 <sect1>Introduction
3146 <label id="ospf-intro">
3147
3148 <p>Open Shortest Path First (OSPF) is a quite complex interior gateway
3149 protocol. The current IPv4 version (OSPFv2) is defined in <rfc id="2328"> and
3150 the current IPv6 version (OSPFv3) is defined in <rfc id="5340"> It's a link
3151 state (a.k.a. shortest path first) protocol -- each router maintains a database
3152 describing the autonomous system's topology. Each participating router has an
3153 identical copy of the database and all routers run the same algorithm
3154 calculating a shortest path tree with themselves as a root. OSPF chooses the
3155 least cost path as the best path.
3156
3157 <p>In OSPF, the autonomous system can be split to several areas in order to
3158 reduce the amount of resources consumed for exchanging the routing information
3159 and to protect the other areas from incorrect routing data. Topology of the area
3160 is hidden to the rest of the autonomous system.
3161
3162 <p>Another very important feature of OSPF is that it can keep routing information
3163 from other protocols (like Static or BGP) in its link state database as external
3164 routes. Each external route can be tagged by the advertising router, making it
3165 possible to pass additional information between routers on the boundary of the
3166 autonomous system.
3167
3168 <p>OSPF quickly detects topological changes in the autonomous system (such as
3169 router interface failures) and calculates new loop-free routes after a short
3170 period of convergence. Only a minimal amount of routing traffic is involved.
3171
3172 <p>Each router participating in OSPF routing periodically sends Hello messages
3173 to all its interfaces. This allows neighbors to be discovered dynamically. Then
3174 the neighbors exchange theirs parts of the link state database and keep it
3175 identical by flooding updates. The flooding process is reliable and ensures that
3176 each router detects all changes.
3177
3178 <sect1>Configuration
3179 <label id="ospf-config">
3180
3181 <p>First, the desired OSPF version can be specified by using <cf/ospf v2/ or
3182 <cf/ospf v3/ as a protocol type. By default, OSPFv2 is used. In the main part of
3183 configuration, there can be multiple definitions of OSPF areas, each with a
3184 different id. These definitions includes many other switches and multiple
3185 definitions of interfaces. Definition of interface may contain many switches and
3186 constant definitions and list of neighbors on nonbroadcast networks.
3187
3188 <p>OSPFv2 needs one IPv4 channel. OSPFv3 needs either one IPv6 channel, or one
3189 IPv4 channel (<rfc id="5838">). Therefore, it is possible to use OSPFv3 for both
3190 IPv4 and Pv6 routing, but it is necessary to have two protocol instances anyway.
3191 If no channel is configured, appropriate channel is defined with default
3192 parameters.
3193
3194 <code>
3195 protocol ospf [v2|v3] &lt;name&gt; {
3196 rfc1583compat &lt;switch&gt;;
3197 rfc5838 &lt;switch&gt;;
3198 instance id &lt;num&gt;;
3199 stub router &lt;switch&gt;;
3200 tick &lt;num&gt;;
3201 ecmp &lt;switch&gt; [limit &lt;num&gt;];
3202 merge external &lt;switch&gt;;
3203 area &lt;id&gt; {
3204 stub;
3205 nssa;
3206 summary &lt;switch&gt;;
3207 default nssa &lt;switch&gt;;
3208 default cost &lt;num&gt;;
3209 default cost2 &lt;num&gt;;
3210 translator &lt;switch&gt;;
3211 translator stability &lt;num&gt;;
3212
3213 networks {
3214 &lt;prefix&gt;;
3215 &lt;prefix&gt; hidden;
3216 }
3217 external {
3218 &lt;prefix&gt;;
3219 &lt;prefix&gt; hidden;
3220 &lt;prefix&gt; tag &lt;num&gt;;
3221 }
3222 stubnet &lt;prefix&gt;;
3223 stubnet &lt;prefix&gt; {
3224 hidden &lt;switch&gt;;
3225 summary &lt;switch&gt;;
3226 cost &lt;num&gt;;
3227 }
3228 interface &lt;interface pattern&gt; [instance &lt;num&gt;] {
3229 cost &lt;num&gt;;
3230 stub &lt;switch&gt;;
3231 hello &lt;num&gt;;
3232 poll &lt;num&gt;;
3233 retransmit &lt;num&gt;;
3234 priority &lt;num&gt;;
3235 wait &lt;num&gt;;
3236 dead count &lt;num&gt;;
3237 dead &lt;num&gt;;
3238 secondary &lt;switch&gt;;
3239 rx buffer [normal|large|&lt;num&gt;];
3240 tx length &lt;num&gt;;
3241 type [broadcast|bcast|pointopoint|ptp|
3242 nonbroadcast|nbma|pointomultipoint|ptmp];
3243 link lsa suppression &lt;switch&gt;;
3244 strict nonbroadcast &lt;switch&gt;;
3245 real broadcast &lt;switch&gt;;
3246 ptp netmask &lt;switch&gt;;
3247 check link &lt;switch&gt;;
3248 bfd &lt;switch&gt;;
3249 ecmp weight &lt;num&gt;;
3250 ttl security [&lt;switch&gt;; | tx only]
3251 tx class|dscp &lt;num&gt;;
3252 tx priority &lt;num&gt;;
3253 authentication none|simple|cryptographic;
3254 password "&lt;text&gt;";
3255 password "&lt;text&gt;" {
3256 id &lt;num&gt;;
3257 generate from "&lt;date&gt;";
3258 generate to "&lt;date&gt;";
3259 accept from "&lt;date&gt;";
3260 accept to "&lt;date&gt;";
3261 from "&lt;date&gt;";
3262 to "&lt;date&gt;";
3263 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
3264 };
3265 neighbors {
3266 &lt;ip&gt;;
3267 &lt;ip&gt; eligible;
3268 };
3269 };
3270 virtual link &lt;id&gt; [instance &lt;num&gt;] {
3271 hello &lt;num&gt;;
3272 retransmit &lt;num&gt;;
3273 wait &lt;num&gt;;
3274 dead count &lt;num&gt;;
3275 dead &lt;num&gt;;
3276 authentication none|simple|cryptographic;
3277 password "&lt;text&gt;";
3278 password "&lt;text&gt;" {
3279 id &lt;num&gt;;
3280 generate from "&lt;date&gt;";
3281 generate to "&lt;date&gt;";
3282 accept from "&lt;date&gt;";
3283 accept to "&lt;date&gt;";
3284 from "&lt;date&gt;";
3285 to "&lt;date&gt;";
3286 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
3287 };
3288 };
3289 };
3290 }
3291 </code>
3292
3293 <descrip>
3294 <tag><label id="ospf-rfc1583compat">rfc1583compat <M>switch</M></tag>
3295 This option controls compatibility of routing table calculation with
3296 <rfc id="1583">. Default value is no.
3297
3298 <tag><label id="ospf-rfc5838">rfc5838 <m/switch/</tag>
3299 Basic OSPFv3 is limited to IPv6 unicast routing. The <rfc id="5838">
3300 extension defines support for more address families (IPv4, IPv6, both
3301 unicast and multicast). The extension is enabled by default, but can be
3302 disabled if necessary, as it restricts the range of available instance
3303 IDs. Default value is yes.
3304
3305 <tag><label id="ospf-instance-id">instance id <m/num/</tag>
3306 When multiple OSPF protocol instances are active on the same links, they
3307 should use different instance IDs to distinguish their packets. Although
3308 it could be done on per-interface basis, it is often preferred to set
3309 one instance ID to whole OSPF domain/topology (e.g., when multiple
3310 instances are used to represent separate logical topologies on the same
3311 physical network). This option specifies the instance ID for all
3312 interfaces of the OSPF instance, but can be overridden by
3313 <cf/interface/ option. Default value is 0 unless OSPFv3-AF extended
3314 address families are used, see <rfc id="5838"> for that case.
3315
3316 <tag><label id="ospf-stub-router">stub router <M>switch</M></tag>
3317 This option configures the router to be a stub router, i.e., a router
3318 that participates in the OSPF topology but does not allow transit
3319 traffic. In OSPFv2, this is implemented by advertising maximum metric
3320 for outgoing links. In OSPFv3, the stub router behavior is announced by
3321 clearing the R-bit in the router LSA. See <rfc id="6987"> for details.
3322 Default value is no.
3323
3324 <tag><label id="ospf-tick">tick <M>num</M></tag>
3325 The routing table calculation and clean-up of areas' databases is not
3326 performed when a single link state change arrives. To lower the CPU
3327 utilization, it's processed later at periodical intervals of <m/num/
3328 seconds. The default value is 1.
3329
3330 <tag><label id="ospf-ecmp">ecmp <M>switch</M> [limit <M>number</M>]</tag>
3331 This option specifies whether OSPF is allowed to generate ECMP
3332 (equal-cost multipath) routes. Such routes are used when there are
3333 several directions to the destination, each with the same (computed)
3334 cost. This option also allows to specify a limit on maximum number of
3335 nexthops in one route. By default, ECMP is enabled if supported by
3336 Kernel. Default value of the limit is 16.
3337
3338 <tag><label id="ospf-merge-external">merge external <M>switch</M></tag>
3339 This option specifies whether OSPF should merge external routes from
3340 different routers/LSAs for the same destination. When enabled together
3341 with <cf/ecmp/, equal-cost external routes will be combined to multipath
3342 routes in the same way as regular routes. When disabled, external routes
3343 from different LSAs are treated as separate even if they represents the
3344 same destination. Default value is no.
3345
3346 <tag><label id="ospf-area">area <M>id</M></tag>
3347 This defines an OSPF area with given area ID (an integer or an IPv4
3348 address, similarly to a router ID). The most important area is the
3349 backbone (ID 0) to which every other area must be connected.
3350
3351 <tag><label id="ospf-stub">stub</tag>
3352 This option configures the area to be a stub area. External routes are
3353 not flooded into stub areas. Also summary LSAs can be limited in stub
3354 areas (see option <cf/summary/). By default, the area is not a stub
3355 area.
3356
3357 <tag><label id="ospf-nssa">nssa</tag>
3358 This option configures the area to be a NSSA (Not-So-Stubby Area). NSSA
3359 is a variant of a stub area which allows a limited way of external route
3360 propagation. Global external routes are not propagated into a NSSA, but
3361 an external route can be imported into NSSA as a (area-wide) NSSA-LSA
3362 (and possibly translated and/or aggregated on area boundary). By
3363 default, the area is not NSSA.
3364
3365 <tag><label id="ospf-summary">summary <M>switch</M></tag>
3366 This option controls propagation of summary LSAs into stub or NSSA
3367 areas. If enabled, summary LSAs are propagated as usual, otherwise just
3368 the default summary route (0.0.0.0/0) is propagated (this is sometimes
3369 called totally stubby area). If a stub area has more area boundary
3370 routers, propagating summary LSAs could lead to more efficient routing
3371 at the cost of larger link state database. Default value is no.
3372
3373 <tag><label id="ospf-default-nssa">default nssa <M>switch</M></tag>
3374 When <cf/summary/ option is enabled, default summary route is no longer
3375 propagated to the NSSA. In that case, this option allows to originate
3376 default route as NSSA-LSA to the NSSA. Default value is no.
3377
3378 <tag><label id="ospf-default-cost">default cost <M>num</M></tag>
3379 This option controls the cost of a default route propagated to stub and
3380 NSSA areas. Default value is 1000.
3381
3382 <tag><label id="ospf-default-cost2">default cost2 <M>num</M></tag>
3383 When a default route is originated as NSSA-LSA, its cost can use either
3384 type 1 or type 2 metric. This option allows to specify the cost of a
3385 default route in type 2 metric. By default, type 1 metric (option
3386 <cf/default cost/) is used.
3387
3388 <tag><label id="ospf-translator">translator <M>switch</M></tag>
3389 This option controls translation of NSSA-LSAs into external LSAs. By
3390 default, one translator per NSSA is automatically elected from area
3391 boundary routers. If enabled, this area boundary router would
3392 unconditionally translate all NSSA-LSAs regardless of translator
3393 election. Default value is no.
3394
3395 <tag><label id="ospf-translator-stability">translator stability <M>num</M></tag>
3396 This option controls the translator stability interval (in seconds).
3397 When the new translator is elected, the old one keeps translating until
3398 the interval is over. Default value is 40.
3399
3400 <tag><label id="ospf-networks">networks { <m/set/ }</tag>
3401 Definition of area IP ranges. This is used in summary LSA origination.
3402 Hidden networks are not propagated into other areas.
3403
3404 <tag><label id="ospf-external">external { <m/set/ }</tag>
3405 Definition of external area IP ranges for NSSAs. This is used for
3406 NSSA-LSA translation. Hidden networks are not translated into external
3407 LSAs. Networks can have configured route tag.
3408
3409 <tag><label id="ospf-stubnet">stubnet <m/prefix/ { <m/options/ }</tag>
3410 Stub networks are networks that are not transit networks between OSPF
3411 routers. They are also propagated through an OSPF area as a part of a
3412 link state database. By default, BIRD generates a stub network record
3413 for each primary network address on each OSPF interface that does not
3414 have any OSPF neighbors, and also for each non-primary network address
3415 on each OSPF interface. This option allows to alter a set of stub
3416 networks propagated by this router.
3417
3418 Each instance of this option adds a stub network with given network
3419 prefix to the set of propagated stub network, unless option <cf/hidden/
3420 is used. It also suppresses default stub networks for given network
3421 prefix. When option <cf/summary/ is used, also default stub networks
3422 that are subnetworks of given stub network are suppressed. This might be
3423 used, for example, to aggregate generated stub networks.
3424
3425 <tag><label id="ospf-iface">interface <M>pattern</M> [instance <m/num/]</tag>
3426 Defines that the specified interfaces belong to the area being defined.
3427 See <ref id="proto-iface" name="interface"> common option for detailed
3428 description. In OSPFv2, extended interface clauses are used, because
3429 each network prefix is handled as a separate virtual interface.
3430
3431 You can specify alternative instance ID for the interface definition,
3432 therefore it is possible to have several instances of that interface
3433 with different options or even in different areas. For OSPFv2, instance
3434 ID support is an extension (<rfc id="6549">) and is supposed to be set
3435 per-protocol. For OSPFv3, it is an integral feature.
3436
3437 <tag><label id="ospf-virtual-link">virtual link <M>id</M> [instance <m/num/]</tag>
3438 Virtual link to router with the router id. Virtual link acts as a
3439 point-to-point interface belonging to backbone. The actual area is used
3440 as a transport area. This item cannot be in the backbone. Like with
3441 <cf/interface/ option, you could also use several virtual links to one
3442 destination with different instance IDs.
3443
3444 <tag><label id="ospf-cost">cost <M>num</M></tag>
3445 Specifies output cost (metric) of an interface. Default value is 10.
3446
3447 <tag><label id="ospf-stub-iface">stub <M>switch</M></tag>
3448 If set to interface it does not listen to any packet and does not send
3449 any hello. Default value is no.
3450
3451 <tag><label id="ospf-hello">hello <M>num</M></tag>
3452 Specifies interval in seconds between sending of Hello messages. Beware,
3453 all routers on the same network need to have the same hello interval.
3454 Default value is 10.
3455
3456 <tag><label id="ospf-poll">poll <M>num</M></tag>
3457 Specifies interval in seconds between sending of Hello messages for some
3458 neighbors on NBMA network. Default value is 20.
3459
3460 <tag><label id="ospf-retransmit">retransmit <M>num</M></tag>
3461 Specifies interval in seconds between retransmissions of unacknowledged
3462 updates. Default value is 5.
3463
3464 <tag><label id="ospf-priority">priority <M>num</M></tag>
3465 On every multiple access network (e.g., the Ethernet) Designated Router
3466 and Backup Designated router are elected. These routers have some special
3467 functions in the flooding process. Higher priority increases preferences
3468 in this election. Routers with priority 0 are not eligible. Default
3469 value is 1.
3470
3471 <tag><label id="ospf-wait">wait <M>num</M></tag>
3472 After start, router waits for the specified number of seconds between
3473 starting election and building adjacency. Default value is 4*<m/hello/.
3474
3475 <tag><label id="ospf-dead-count">dead count <M>num</M></tag>
3476 When the router does not receive any messages from a neighbor in
3477 <m/dead count/*<m/hello/ seconds, it will consider the neighbor down.
3478
3479 <tag><label id="ospf-dead">dead <M>num</M></tag>
3480 When the router does not receive any messages from a neighbor in
3481 <m/dead/ seconds, it will consider the neighbor down. If both directives
3482 <cf/dead count/ and <cf/dead/ are used, <cf/dead/ has precedence.
3483
3484 <tag><label id="ospf-secondary">secondary <M>switch</M></tag>
3485 On BSD systems, older versions of BIRD supported OSPFv2 only for the
3486 primary IP address of an interface, other IP ranges on the interface
3487 were handled as stub networks. Since v1.4.1, regular operation on
3488 secondary IP addresses is supported, but disabled by default for
3489 compatibility. This option allows to enable it. The option is a
3490 transitional measure, will be removed in the next major release as the
3491 behavior will be changed. On Linux systems, the option is irrelevant, as
3492 operation on non-primary addresses is already the regular behavior.
3493
3494 <tag><label id="ospf-rx-buffer">rx buffer <M>num</M></tag>
3495 This option allows to specify the size of buffers used for packet
3496 processing. The buffer size should be bigger than maximal size of any
3497 packets. By default, buffers are dynamically resized as needed, but a
3498 fixed value could be specified. Value <cf/large/ means maximal allowed
3499 packet size - 65535.
3500
3501 <tag><label id="ospf-tx-length">tx length <M>num</M></tag>
3502 Transmitted OSPF messages that contain large amount of information are
3503 segmented to separate OSPF packets to avoid IP fragmentation. This
3504 option specifies the soft ceiling for the length of generated OSPF
3505 packets. Default value is the MTU of the network interface. Note that
3506 larger OSPF packets may still be generated if underlying OSPF messages
3507 cannot be splitted (e.g. when one large LSA is propagated).
3508
3509 <tag><label id="ospf-type-bcast">type broadcast|bcast</tag>
3510 BIRD detects a type of a connected network automatically, but sometimes
3511 it's convenient to force use of a different type manually. On broadcast
3512 networks (like ethernet), flooding and Hello messages are sent using
3513 multicasts (a single packet for all the neighbors). A designated router
3514 is elected and it is responsible for synchronizing the link-state
3515 databases and originating network LSAs. This network type cannot be used
3516 on physically NBMA networks and on unnumbered networks (networks without
3517 proper IP prefix).
3518
3519 <tag><label id="ospf-type-ptp">type pointopoint|ptp</tag>
3520 Point-to-point networks connect just 2 routers together. No election is
3521 performed and no network LSA is originated, which makes it simpler and
3522 faster to establish. This network type is useful not only for physically
3523 PtP ifaces (like PPP or tunnels), but also for broadcast networks used
3524 as PtP links. This network type cannot be used on physically NBMA
3525 networks.
3526
3527 <tag><label id="ospf-type-nbma">type nonbroadcast|nbma</tag>
3528 On NBMA networks, the packets are sent to each neighbor separately
3529 because of lack of multicast capabilities. Like on broadcast networks,
3530 a designated router is elected, which plays a central role in propagation
3531 of LSAs. This network type cannot be used on unnumbered networks.
3532
3533 <tag><label id="ospf-type-ptmp">type pointomultipoint|ptmp</tag>
3534 This is another network type designed to handle NBMA networks. In this
3535 case the NBMA network is treated as a collection of PtP links. This is
3536 useful if not every pair of routers on the NBMA network has direct
3537 communication, or if the NBMA network is used as an (possibly
3538 unnumbered) PtP link.
3539
3540 <tag><label id="ospf-link-lsa-suppression">link lsa suppression <m/switch/</tag>
3541 In OSPFv3, link LSAs are generated for each link, announcing link-local
3542 IPv6 address of the router to its local neighbors. These are useless on
3543 PtP or PtMP networks and this option allows to suppress the link LSA
3544 origination for such interfaces. The option is ignored on other than PtP
3545 or PtMP interfaces. Default value is no.
3546
3547 <tag><label id="ospf-strict-nonbroadcast">strict nonbroadcast <m/switch/</tag>
3548 If set, don't send hello to any undefined neighbor. This switch is
3549 ignored on other than NBMA or PtMP interfaces. Default value is no.
3550
3551 <tag><label id="ospf-real-broadcast">real broadcast <m/switch/</tag>
3552 In <cf/type broadcast/ or <cf/type ptp/ network configuration, OSPF
3553 packets are sent as IP multicast packets. This option changes the
3554 behavior to using old-fashioned IP broadcast packets. This may be useful
3555 as a workaround if IP multicast for some reason does not work or does
3556 not work reliably. This is a non-standard option and probably is not
3557 interoperable with other OSPF implementations. Default value is no.
3558
3559 <tag><label id="ospf-ptp-netmask">ptp netmask <m/switch/</tag>
3560 In <cf/type ptp/ network configurations, OSPFv2 implementations should
3561 ignore received netmask field in hello packets and should send hello
3562 packets with zero netmask field on unnumbered PtP links. But some OSPFv2
3563 implementations perform netmask checking even for PtP links. This option
3564 specifies whether real netmask will be used in hello packets on <cf/type
3565 ptp/ interfaces. You should ignore this option unless you meet some
3566 compatibility problems related to this issue. Default value is no for
3567 unnumbered PtP links, yes otherwise.
3568
3569 <tag><label id="ospf-check-link">check link <M>switch</M></tag>
3570 If set, a hardware link state (reported by OS) is taken into consideration.
3571 When a link disappears (e.g. an ethernet cable is unplugged), neighbors
3572 are immediately considered unreachable and only the address of the iface
3573 (instead of whole network prefix) is propagated. It is possible that
3574 some hardware drivers or platforms do not implement this feature.
3575 Default value is yes.
3576
3577 <tag><label id="ospf-bfd">bfd <M>switch</M></tag>
3578 OSPF could use BFD protocol as an advisory mechanism for neighbor
3579 liveness and failure detection. If enabled, BIRD setups a BFD session
3580 for each OSPF neighbor and tracks its liveness by it. This has an
3581 advantage of an order of magnitude lower detection times in case of
3582 failure. Note that BFD protocol also has to be configured, see
3583 <ref id="bfd" name="BFD"> section for details. Default value is no.
3584
3585 <tag><label id="ospf-ttl-security">ttl security [<m/switch/ | tx only]</tag>
3586 TTL security is a feature that protects routing protocols from remote
3587 spoofed packets by using TTL 255 instead of TTL 1 for protocol packets
3588 destined to neighbors. Because TTL is decremented when packets are
3589 forwarded, it is non-trivial to spoof packets with TTL 255 from remote
3590 locations. Note that this option would interfere with OSPF virtual
3591 links.
3592
3593 If this option is enabled, the router will send OSPF packets with TTL
3594 255 and drop received packets with TTL less than 255. If this option si
3595 set to <cf/tx only/, TTL 255 is used for sent packets, but is not
3596 checked for received packets. Default value is no.
3597
3598 <tag><label id="ospf-tx-class">tx class|dscp|priority <m/num/</tag>
3599 These options specify the ToS/DiffServ/Traffic class/Priority of the
3600 outgoing OSPF packets. See <ref id="proto-tx-class" name="tx class"> common
3601 option for detailed description.
3602
3603 <tag><label id="ospf-ecmp-weight">ecmp weight <M>num</M></tag>
3604 When ECMP (multipath) routes are allowed, this value specifies a
3605 relative weight used for nexthops going through the iface. Allowed
3606 values are 1-256. Default value is 1.
3607
3608 <tag><label id="ospf-auth-none">authentication none</tag>
3609 No passwords are sent in OSPF packets. This is the default value.
3610
3611 <tag><label id="ospf-auth-simple">authentication simple</tag>
3612 Every packet carries 8 bytes of password. Received packets lacking this
3613 password are ignored. This authentication mechanism is very weak.
3614 This option is not available in OSPFv3.
3615
3616 <tag><label id="ospf-auth-cryptographic">authentication cryptographic</tag>
3617 An authentication code is appended to every packet. The specific
3618 cryptographic algorithm is selected by option <cf/algorithm/ for each
3619 key. The default cryptographic algorithm for OSPFv2 keys is Keyed-MD5
3620 and for OSPFv3 keys is HMAC-SHA-256. Passwords are not sent open via
3621 network, so this mechanism is quite secure. Packets can still be read by
3622 an attacker.
3623
3624 <tag><label id="ospf-pass">password "<M>text</M>"</tag>
3625 Specifies a password used for authentication. See
3626 <ref id="proto-pass" name="password"> common option for detailed
3627 description.
3628
3629 <tag><label id="ospf-neighbors">neighbors { <m/set/ } </tag>
3630 A set of neighbors to which Hello messages on NBMA or PtMP networks are
3631 to be sent. For NBMA networks, some of them could be marked as eligible.
3632 In OSPFv3, link-local addresses should be used, using global ones is
3633 possible, but it is nonstandard and might be problematic. And definitely,
3634 link-local and global addresses should not be mixed.
3635 </descrip>
3636
3637 <sect1>Attributes
3638 <label id="ospf-attr">
3639
3640 <p>OSPF defines four route attributes. Each internal route has a <cf/metric/.
3641
3642 <p>Metric is ranging from 1 to infinity (65535). External routes use
3643 <cf/metric type 1/ or <cf/metric type 2/. A <cf/metric of type 1/ is comparable
3644 with internal <cf/metric/, a <cf/metric of type 2/ is always longer than any
3645 <cf/metric of type 1/ or any <cf/internal metric/. <cf/Internal metric/ or
3646 <cf/metric of type 1/ is stored in attribute <cf/ospf_metric1/, <cf/metric type
3647 2/ is stored in attribute <cf/ospf_metric2/.
3648
3649 When both metrics are specified then <cf/metric of type 2/ is used. This is
3650 relevant e.g. when a type 2 external route is propagated from one OSPF domain to
3651 another and <cf/ospf_metric1/ is an internal distance to the original ASBR,
3652 while <cf/ospf_metric2/ stores the type 2 metric. Note that in such cases if
3653 <cf/ospf_metric1/ is non-zero then <cf/ospf_metric2/ is increased by one to
3654 ensure monotonicity of metric, as internal distance is reset to zero when an
3655 external route is announced.
3656
3657 <p>Each external route can also carry attribute <cf/ospf_tag/ which is a 32-bit
3658 integer which is used when exporting routes to other protocols; otherwise, it
3659 doesn't affect routing inside the OSPF domain at all. The fourth attribute
3660 <cf/ospf_router_id/ is a router ID of the router advertising that route /
3661 network. This attribute is read-only. Default is <cf/ospf_metric2 = 10000/ and
3662 <cf/ospf_tag = 0/.
3663
3664 <sect1>Example
3665 <label id="ospf-exam">
3666
3667 <p><code>
3668 protocol ospf MyOSPF {
3669 ipv4 {
3670 export filter {
3671 if source = RTS_BGP then {
3672 ospf_metric1 = 100;
3673 accept;
3674 }
3675 reject;
3676 };
3677 };
3678 area 0.0.0.0 {
3679 interface "eth*" {
3680 cost 11;
3681 hello 15;
3682 priority 100;
3683 retransmit 7;
3684 authentication simple;
3685 password "aaa";
3686 };
3687 interface "ppp*" {
3688 cost 100;
3689 authentication cryptographic;
3690 password "abc" {
3691 id 1;
3692 generate to "22-04-2003 11:00:06";
3693 accept from "17-01-2001 12:01:05";
3694 algorithm hmac sha384;
3695 };
3696 password "def" {
3697 id 2;
3698 generate to "22-07-2005 17:03:21";
3699 accept from "22-02-2001 11:34:06";
3700 algorithm hmac sha512;
3701 };
3702 };
3703 interface "arc0" {
3704 cost 10;
3705 stub yes;
3706 };
3707 interface "arc1";
3708 };
3709 area 120 {
3710 stub yes;
3711 networks {
3712 172.16.1.0/24;
3713 172.16.2.0/24 hidden;
3714 }
3715 interface "-arc0" , "arc*" {
3716 type nonbroadcast;
3717 authentication none;
3718 strict nonbroadcast yes;
3719 wait 120;
3720 poll 40;
3721 dead count 8;
3722 neighbors {
3723 192.168.120.1 eligible;
3724 192.168.120.2;
3725 192.168.120.10;
3726 };
3727 };
3728 };
3729 }
3730 </code>
3731
3732
3733 <sect>Pipe
3734 <label id="pipe">
3735
3736 <sect1>Introduction
3737 <label id="pipe-intro">
3738
3739 <p>The Pipe protocol serves as a link between two routing tables, allowing
3740 routes to be passed from a table declared as primary (i.e., the one the pipe is
3741 connected to using the <cf/table/ configuration keyword) to the secondary one
3742 (declared using <cf/peer table/) and vice versa, depending on what's allowed by
3743 the filters. Export filters control export of routes from the primary table to
3744 the secondary one, import filters control the opposite direction. Both tables
3745 must be of the same nettype.
3746
3747 <p>The Pipe protocol retransmits all routes from one table to the other table,
3748 retaining their original source and attributes. If import and export filters
3749 are set to accept, then both tables would have the same content.
3750
3751 <p>The primary use of multiple routing tables and the Pipe protocol is for
3752 policy routing, where handling of a single packet doesn't depend only on its
3753 destination address, but also on its source address, source interface, protocol
3754 type and other similar parameters. In many systems (Linux being a good example),
3755 the kernel allows to enforce routing policies by defining routing rules which
3756 choose one of several routing tables to be used for a packet according to its
3757 parameters. Setting of these rules is outside the scope of BIRD's work (on
3758 Linux, you can use the <tt/ip/ command), but you can create several routing
3759 tables in BIRD, connect them to the kernel ones, use filters to control which
3760 routes appear in which tables and also you can employ the Pipe protocol for
3761 exporting a selected subset of one table to another one.
3762
3763 <sect1>Configuration
3764 <label id="pipe-config">
3765
3766 <p>Essentially, the Pipe protocol is just a channel connected to a table on both
3767 sides. Therefore, the configuration block for <cf/protocol pipe/ shall directly
3768 include standard channel config options; see the example below.
3769
3770 <p><descrip>
3771 <tag><label id="pipe-peer-table">peer table <m/table/</tag>
3772 Defines secondary routing table to connect to. The primary one is
3773 selected by the <cf/table/ keyword.
3774 </descrip>
3775
3776 <sect1>Attributes
3777 <label id="pipe-attr">
3778
3779 <p>The Pipe protocol doesn't define any route attributes.
3780
3781 <sect1>Example
3782 <label id="pipe-exam">
3783
3784 <p>Let's consider a router which serves as a boundary router of two different
3785 autonomous systems, each of them connected to a subset of interfaces of the
3786 router, having its own exterior connectivity and wishing to use the other AS as
3787 a backup connectivity in case of outage of its own exterior line.
3788
3789 <p>Probably the simplest solution to this situation is to use two routing tables
3790 (we'll call them <cf/as1/ and <cf/as2/) and set up kernel routing rules, so that
3791 packets having arrived from interfaces belonging to the first AS will be routed
3792 according to <cf/as1/ and similarly for the second AS. Thus we have split our
3793 router to two logical routers, each one acting on its own routing table, having
3794 its own routing protocols on its own interfaces. In order to use the other AS's
3795 routes for backup purposes, we can pass the routes between the tables through a
3796 Pipe protocol while decreasing their preferences and correcting their BGP paths
3797 to reflect the AS boundary crossing.
3798
3799 <code>
3800 ipv4 table as1; # Define the tables
3801 ipv4 table as2;
3802
3803 protocol kernel kern1 { # Synchronize them with the kernel
3804 ipv4 { table as1; export all; };
3805 kernel table 1;
3806 }
3807
3808 protocol kernel kern2 {
3809 ipv4 { table as2; export all; };
3810 kernel table 2;
3811 }
3812
3813 protocol bgp bgp1 { # The outside connections
3814 ipv4 { table as1; import all; export all; };
3815 local as 1;
3816 neighbor 192.168.0.1 as 1001;
3817 }
3818
3819 protocol bgp bgp2 {
3820 ipv4 { table as2; import all; export all; };
3821 local as 2;
3822 neighbor 10.0.0.1 as 1002;
3823 }
3824
3825 protocol pipe { # The Pipe
3826 table as1;
3827 peer table as2;
3828 export filter {
3829 if net ~ [ 1.0.0.0/8+] then { # Only AS1 networks
3830 if preference>10 then preference = preference-10;
3831 if source=RTS_BGP then bgp_path.prepend(1);
3832 accept;
3833 }
3834 reject;
3835 };
3836 import filter {
3837 if net ~ [ 2.0.0.0/8+] then { # Only AS2 networks
3838 if preference>10 then preference = preference-10;
3839 if source=RTS_BGP then bgp_path.prepend(2);
3840 accept;
3841 }
3842 reject;
3843 };
3844 }
3845 </code>
3846
3847
3848 <sect>RAdv
3849 <label id="radv">
3850
3851 <sect1>Introduction
3852 <label id="radv-intro">
3853
3854 <p>The RAdv protocol is an implementation of Router Advertisements, which are
3855 used in the IPv6 stateless autoconfiguration. IPv6 routers send (in irregular
3856 time intervals or as an answer to a request) advertisement packets to connected
3857 networks. These packets contain basic information about a local network (e.g. a
3858 list of network prefixes), which allows network hosts to autoconfigure network
3859 addresses and choose a default route. BIRD implements router behavior as defined
3860 in <rfc id="4861">, router preferences and specific routes (<rfc id="4191">),
3861 and DNS extensions (<rfc id="6106">).
3862
3863 <p>The RAdv protocols supports just IPv6 channel.
3864
3865 <sect1>Configuration
3866 <label id="radv-config">
3867
3868 <p>There are several classes of definitions in RAdv configuration -- interface
3869 definitions, prefix definitions and DNS definitions:
3870
3871 <descrip>
3872 <tag><label id="radv-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
3873 Interface definitions specify a set of interfaces on which the
3874 protocol is activated and contain interface specific options.
3875 See <ref id="proto-iface" name="interface"> common options for
3876 detailed description.
3877
3878 <tag><label id="radv-prefix">prefix <m/prefix/ { <m/options/ }</tag>
3879 Prefix definitions allow to modify a list of advertised prefixes. By
3880 default, the advertised prefixes are the same as the network prefixes
3881 assigned to the interface. For each network prefix, the matching prefix
3882 definition is found and its options are used. If no matching prefix
3883 definition is found, the prefix is used with default options.
3884
3885 Prefix definitions can be either global or interface-specific. The
3886 second ones are part of interface options. The prefix definition
3887 matching is done in the first-match style, when interface-specific
3888 definitions are processed before global definitions. As expected, the
3889 prefix definition is matching if the network prefix is a subnet of the
3890 prefix in prefix definition.
3891
3892 <tag><label id="radv-rdnss">rdnss { <m/options/ }</tag>
3893 RDNSS definitions allow to specify a list of advertised recursive DNS
3894 servers together with their options. As options are seldom necessary,
3895 there is also a short variant <cf>rdnss <m/address/</cf> that just
3896 specifies one DNS server. Multiple definitions are cumulative. RDNSS
3897 definitions may also be interface-specific when used inside interface
3898 options. By default, interface uses both global and interface-specific
3899 options, but that can be changed by <cf/rdnss local/ option.
3900
3901 <tag><label id="radv-dnssl">dnssl { <m/options/ }</tag>
3902 DNSSL definitions allow to specify a list of advertised DNS search
3903 domains together with their options. Like <cf/rdnss/ above, multiple
3904 definitions are cumulative, they can be used also as interface-specific
3905 options and there is a short variant <cf>dnssl <m/domain/</cf> that just
3906 specifies one DNS search domain.
3907
3908 <tag><label id="radv-trigger">trigger <m/prefix/</tag>
3909 RAdv protocol could be configured to change its behavior based on
3910 availability of routes. When this option is used, the protocol waits in
3911 suppressed state until a <it/trigger route/ (for the specified network)
3912 is exported to the protocol, the protocol also returnsd to suppressed
3913 state if the <it/trigger route/ disappears. Note that route export
3914 depends on specified export filter, as usual. This option could be used,
3915 e.g., for handling failover in multihoming scenarios.
3916
3917 During suppressed state, router advertisements are generated, but with
3918 some fields zeroed. Exact behavior depends on which fields are zeroed,
3919 this can be configured by <cf/sensitive/ option for appropriate
3920 fields. By default, just <cf/default lifetime/ (also called <cf/router
3921 lifetime/) is zeroed, which means hosts cannot use the router as a
3922 default router. <cf/preferred lifetime/ and <cf/valid lifetime/ could
3923 also be configured as <cf/sensitive/ for a prefix, which would cause
3924 autoconfigured IPs to be deprecated or even removed.
3925
3926 <tag><label id="radv-propagate-routes">propagate routes <m/switch/</tag>
3927 This option controls propagation of more specific routes, as defined in
3928 <rfc id="4191">. If enabled, all routes exported to the RAdv protocol,
3929 with the exception of the trigger prefix, are added to advertisments as
3930 additional options. The lifetime and preference of advertised routes can
3931 be set individually by <cf/ra_lifetime/ and <cf/ra_preference/ route
3932 attributes, or per interface by <cf/route lifetime/ and
3933 <cf/route preference/ options. Default: disabled.
3934
3935 Note that the RFC discourages from sending more than 17 routes and
3936 recommends the routes to be configured manually.
3937 </descrip>
3938
3939 <p>Interface specific options:
3940
3941 <descrip>
3942 <tag><label id="radv-iface-max-ra-interval">max ra interval <m/expr/</tag>
3943 Unsolicited router advertisements are sent in irregular time intervals.
3944 This option specifies the maximum length of these intervals, in seconds.
3945 Valid values are 4-1800. Default: 600
3946
3947 <tag><label id="radv-iface-min-ra-interval">min ra interval <m/expr/</tag>
3948 This option specifies the minimum length of that intervals, in seconds.
3949 Must be at least 3 and at most 3/4 * <cf/max ra interval/. Default:
3950 about 1/3 * <cf/max ra interval/.
3951
3952 <tag><label id="radv-iface-min-delay">min delay <m/expr/</tag>
3953 The minimum delay between two consecutive router advertisements, in
3954 seconds. Default: 3
3955
3956 <tag><label id="radv-iface-managed">managed <m/switch/</tag>
3957 This option specifies whether hosts should use DHCPv6 for IP address
3958 configuration. Default: no
3959
3960 <tag><label id="radv-iface-other-config">other config <m/switch/</tag>
3961 This option specifies whether hosts should use DHCPv6 to receive other
3962 configuration information. Default: no
3963
3964 <tag><label id="radv-iface-link-mtu">link mtu <m/expr/</tag>
3965 This option specifies which value of MTU should be used by hosts. 0
3966 means unspecified. Default: 0
3967
3968 <tag><label id="radv-iface-reachable-time">reachable time <m/expr/</tag>
3969 This option specifies the time (in milliseconds) how long hosts should
3970 assume a neighbor is reachable (from the last confirmation). Maximum is
3971 3600000, 0 means unspecified. Default 0.
3972
3973 <tag><label id="radv-iface-retrans-timer">retrans timer <m/expr/</tag>
3974 This option specifies the time (in milliseconds) how long hosts should
3975 wait before retransmitting Neighbor Solicitation messages. 0 means
3976 unspecified. Default 0.
3977
3978 <tag><label id="radv-iface-current-hop-limit">current hop limit <m/expr/</tag>
3979 This option specifies which value of Hop Limit should be used by
3980 hosts. Valid values are 0-255, 0 means unspecified. Default: 64
3981
3982 <tag><label id="radv-iface-default-lifetime">default lifetime <m/expr/ [sensitive <m/switch/]</tag>
3983 This option specifies the time (in seconds) how long (since the receipt
3984 of RA) hosts may use the router as a default router. 0 means do not use
3985 as a default router. For <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
3986 Default: 3 * <cf/max ra interval/, <cf/sensitive/ yes.
3987
3988 <tag><label id="radv-iface-default-preference">default preference low|medium|high</tag>
3989 This option specifies the Default Router Preference value to advertise
3990 to hosts. Default: medium.
3991
3992 <tag><label id="radv-iface-route-lifetime">route lifetime <m/expr/ [sensitive <m/switch/]</tag>
3993 This option specifies the default value of advertised lifetime for
3994 specific routes; i.e., the time (in seconds) for how long (since the
3995 receipt of RA) hosts should consider these routes valid. A special value
3996 0xffffffff represents infinity. The lifetime can be overriden on a per
3997 route basis by the <ref id="rta-ra-lifetime" name="ra_lifetime"> route
3998 attribute. Default: 3 * <cf/max ra interval/, <cf/sensitive/ no.
3999
4000 For the <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
4001 If <cf/sensitive/ is enabled, even the routes with the <cf/ra_lifetime/
4002 attribute become sensitive to the trigger.
4003
4004 <tag><label id="radv-iface-route-preference">route preference low|medium|high</tag>
4005 This option specifies the default value of advertised route preference
4006 for specific routes. The value can be overriden on a per route basis by
4007 the <ref id="rta-ra-preference" name="ra_preference"> route attribute.
4008 Default: medium.
4009
4010 <tag><label id="radv-prefix-linger-time">prefix linger time <m/expr/</tag>
4011 When a prefix or a route disappears, it is advertised for some time with
4012 zero lifetime, to inform clients it is no longer valid. This option
4013 specifies the time (in seconds) for how long prefixes are advertised
4014 that way. Default: 3 * <cf/max ra interval/.
4015
4016 <tag><label id="radv-route-linger-time">route linger time <m/expr/</tag>
4017 When a prefix or a route disappears, it is advertised for some time with
4018 zero lifetime, to inform clients it is no longer valid. This option
4019 specifies the time (in seconds) for how long routes are advertised
4020 that way. Default: 3 * <cf/max ra interval/.
4021
4022 <tag><label id="radv-iface-rdnss-local">rdnss local <m/switch/</tag>
4023 Use only local (interface-specific) RDNSS definitions for this
4024 interface. Otherwise, both global and local definitions are used. Could
4025 also be used to disable RDNSS for given interface if no local definitons
4026 are specified. Default: no.
4027
4028 <tag><label id="radv-iface-dnssl-local">dnssl local <m/switch/</tag>
4029 Use only local DNSSL definitions for this interface. See <cf/rdnss local/
4030 option above. Default: no.
4031 </descrip>
4032
4033 <p>Prefix specific options
4034
4035 <descrip>
4036 <tag><label id="radv-prefix-skip">skip <m/switch/</tag>
4037 This option allows to specify that given prefix should not be
4038 advertised. This is useful for making exceptions from a default policy
4039 of advertising all prefixes. Note that for withdrawing an already
4040 advertised prefix it is more useful to advertise it with zero valid
4041 lifetime. Default: no
4042
4043 <tag><label id="radv-prefix-onlink">onlink <m/switch/</tag>
4044 This option specifies whether hosts may use the advertised prefix for
4045 onlink determination. Default: yes
4046
4047 <tag><label id="radv-prefix-autonomous">autonomous <m/switch/</tag>
4048 This option specifies whether hosts may use the advertised prefix for
4049 stateless autoconfiguration. Default: yes
4050
4051 <tag><label id="radv-prefix-valid-lifetime">valid lifetime <m/expr/ [sensitive <m/switch/]</tag>
4052 This option specifies the time (in seconds) how long (after the
4053 receipt of RA) the prefix information is valid, i.e., autoconfigured
4054 IP addresses can be assigned and hosts with that IP addresses are
4055 considered directly reachable. 0 means the prefix is no longer
4056 valid. For <cf/sensitive/ option, see <ref id="radv-trigger" name="trigger">.
4057 Default: 86400 (1 day), <cf/sensitive/ no.
4058
4059 <tag><label id="radv-prefix-preferred-lifetime">preferred lifetime <m/expr/ [sensitive <m/switch/]</tag>
4060 This option specifies the time (in seconds) how long (after the
4061 receipt of RA) IP addresses generated from the prefix using stateless
4062 autoconfiguration remain preferred. For <cf/sensitive/ option,
4063 see <ref id="radv-trigger" name="trigger">. Default: 14400 (4 hours),
4064 <cf/sensitive/ no.
4065 </descrip>
4066
4067 <p>RDNSS specific options:
4068
4069 <descrip>
4070 <tag><label id="radv-rdnss-ns">ns <m/address/</tag>
4071 This option specifies one recursive DNS server. Can be used multiple
4072 times for multiple servers. It is mandatory to have at least one
4073 <cf/ns/ option in <cf/rdnss/ definition.
4074
4075 <tag><label id="radv-rdnss-lifetime">lifetime [mult] <m/expr/</tag>
4076 This option specifies the time how long the RDNSS information may be
4077 used by clients after the receipt of RA. It is expressed either in
4078 seconds or (when <cf/mult/ is used) in multiples of <cf/max ra
4079 interval/. Note that RDNSS information is also invalidated when
4080 <cf/default lifetime/ expires. 0 means these addresses are no longer
4081 valid DNS servers. Default: 3 * <cf/max ra interval/.
4082 </descrip>
4083
4084 <p>DNSSL specific options:
4085
4086 <descrip>
4087 <tag><label id="radv-dnssl-domain">domain <m/address/</tag>
4088 This option specifies one DNS search domain. Can be used multiple times
4089 for multiple domains. It is mandatory to have at least one <cf/domain/
4090 option in <cf/dnssl/ definition.
4091
4092 <tag><label id="radv-dnssl-lifetime">lifetime [mult] <m/expr/</tag>
4093 This option specifies the time how long the DNSSL information may be
4094 used by clients after the receipt of RA. Details are the same as for
4095 RDNSS <cf/lifetime/ option above. Default: 3 * <cf/max ra interval/.
4096 </descrip>
4097
4098 <sect1>Attributes
4099 <label id="radv-attr">
4100
4101 <p>RAdv defines two route attributes:
4102
4103 <descrip>
4104 <tag><label id="rta-ra-preference">enum ra_preference</tag>
4105 The preference of the route. The value can be <it/RA_PREF_LOW/,
4106 <it/RA_PREF_MEDIUM/ or <it/RA_PREF_HIGH/. If the attribute is not set,
4107 the <ref id="radv-iface-route-preference" name="route preference">
4108 option is used.
4109
4110 <tag><label id="rta-ra-lifetime">int ra_lifetime</tag>
4111 The advertised lifetime of the route, in seconds. The special value of
4112 0xffffffff represents infinity. If the attribute is not set, the
4113 <ref id="radv-iface-route-lifetime" name="route lifetime">
4114 option is used.
4115 </descrip>
4116
4117 <sect1>Example
4118 <label id="radv-exam">
4119
4120 <p><code>
4121 ipv6 table radv_routes; # Manually configured routes go here
4122
4123 protocol static {
4124 ipv6 { table radv_routes; };
4125
4126 route 2001:0DB8:4000::/48 unreachable;
4127 route 2001:0DB8:4010::/48 unreachable;
4128
4129 route 2001:0DB8:4020::/48 unreachable {
4130 ra_preference = RA_PREF_HIGH;
4131 ra_lifetime = 3600;
4132 };
4133 }
4134
4135 protocol radv {
4136 propagate routes yes; # Propagate the routes from the radv_routes table
4137 ipv6 { table radv_routes; export all; };
4138
4139 interface "eth2" {
4140 max ra interval 5; # Fast failover with more routers
4141 managed yes; # Using DHCPv6 on eth2
4142 prefix ::/0 {
4143 autonomous off; # So do not autoconfigure any IP
4144 };
4145 };
4146
4147 interface "eth*"; # No need for any other options
4148
4149 prefix 2001:0DB8:1234::/48 {
4150 preferred lifetime 0; # Deprecated address range
4151 };
4152
4153 prefix 2001:0DB8:2000::/48 {
4154 autonomous off; # Do not autoconfigure
4155 };
4156
4157 rdnss 2001:0DB8:1234::10; # Short form of RDNSS
4158
4159 rdnss {
4160 lifetime mult 10;
4161 ns 2001:0DB8:1234::11;
4162 ns 2001:0DB8:1234::12;
4163 };
4164
4165 dnssl {
4166 lifetime 3600;
4167 domain "abc.com";
4168 domain "xyz.com";
4169 };
4170 }
4171 </code>
4172
4173
4174 <sect>RIP
4175 <label id="rip">
4176
4177 <sect1>Introduction
4178 <label id="rip-intro">
4179
4180 <p>The RIP protocol (also sometimes called Rest In Pieces) is a simple protocol,
4181 where each router broadcasts (to all its neighbors) distances to all networks it
4182 can reach. When a router hears distance to another network, it increments it and
4183 broadcasts it back. Broadcasts are done in regular intervals. Therefore, if some
4184 network goes unreachable, routers keep telling each other that its distance is
4185 the original distance plus 1 (actually, plus interface metric, which is usually
4186 one). After some time, the distance reaches infinity (that's 15 in RIP) and all
4187 routers know that network is unreachable. RIP tries to minimize situations where
4188 counting to infinity is necessary, because it is slow. Due to infinity being 16,
4189 you can't use RIP on networks where maximal distance is higher than 15
4190 hosts.
4191
4192 <p>BIRD supports RIPv1 (<rfc id="1058">), RIPv2 (<rfc id="2453">), RIPng (<rfc
4193 id="2080">), and RIP cryptographic authentication (<rfc id="4822">).
4194
4195 <p>RIP is a very simple protocol, and it has a lot of shortcomings. Slow
4196 convergence, big network load and inability to handle larger networks makes it
4197 pretty much obsolete. It is still usable on very small networks.
4198
4199 <sect1>Configuration
4200 <label id="rip-config">
4201
4202 <p>RIP configuration consists mainly of common protocol options and interface
4203 definitions, most RIP options are interface specific. RIPng (RIP for IPv6)
4204 protocol instance can be configured by using <cf/rip ng/ instead of just
4205 <cf/rip/ as a protocol type.
4206
4207 <p>RIP needs one IPv4 channel. RIPng needs one IPv6 channel. If no channel is
4208 configured, appropriate channel is defined with default parameters.
4209
4210 <code>
4211 protocol rip [ng] [&lt;name&gt;] {
4212 infinity &lt;number&gt;;
4213 ecmp &lt;switch&gt; [limit &lt;number&gt;];
4214 interface &lt;interface pattern&gt; {
4215 metric &lt;number&gt;;
4216 mode multicast|broadcast;
4217 passive &lt;switch&gt;;
4218 address &lt;ip&gt;;
4219 port &lt;number&gt;;
4220 version 1|2;
4221 split horizon &lt;switch&gt;;
4222 poison reverse &lt;switch&gt;;
4223 check zero &lt;switch&gt;;
4224 update time &lt;number&gt;;
4225 timeout time &lt;number&gt;;
4226 garbage time &lt;number&gt;;
4227 ecmp weight &lt;number&gt;;
4228 ttl security &lt;switch&gt;; | tx only;
4229 tx class|dscp &lt;number&gt;;
4230 tx priority &lt;number&gt;;
4231 rx buffer &lt;number&gt;;
4232 tx length &lt;number&gt;;
4233 check link &lt;switch&gt;;
4234 authentication none|plaintext|cryptographic;
4235 password "&lt;text&gt;";
4236 password "&lt;text&gt;" {
4237 id &lt;num&gt;;
4238 generate from "&lt;date&gt;";
4239 generate to "&lt;date&gt;";
4240 accept from "&lt;date&gt;";
4241 accept to "&lt;date&gt;";
4242 from "&lt;date&gt;";
4243 to "&lt;date&gt;";
4244 algorithm ( keyed md5 | keyed sha1 | hmac sha1 | hmac sha256 | hmac sha384 | hmac sha512 );
4245 };
4246 };
4247 }
4248 </code>
4249
4250 <descrip>
4251 <tag><label id="rip-infinity">infinity <M>number</M></tag>
4252 Selects the distance of infinity. Bigger values will make
4253 protocol convergence even slower. The default value is 16.
4254
4255 <tag><label id="rip-ecmp">ecmp <M>switch</M> [limit <M>number</M>]</tag>
4256 This option specifies whether RIP is allowed to generate ECMP
4257 (equal-cost multipath) routes. Such routes are used when there are
4258 several directions to the destination, each with the same (computed)
4259 cost. This option also allows to specify a limit on maximum number of
4260 nexthops in one route. By default, ECMP is enabled if supported by
4261 Kernel. Default value of the limit is 16.
4262
4263 <tag><label id="rip-iface">interface <m/pattern/ [, <m/.../] { <m/options/ }</tag>
4264 Interface definitions specify a set of interfaces on which the
4265 protocol is activated and contain interface specific options.
4266 See <ref id="proto-iface" name="interface"> common options for
4267 detailed description.
4268 </descrip>
4269
4270 <p>Interface specific options:
4271
4272 <descrip>
4273 <tag><label id="rip-iface-metric">metric <m/num/</tag>
4274 This option specifies the metric of the interface. When a route is
4275 received from the interface, its metric is increased by this value
4276 before further processing. Valid values are 1-255, but values higher
4277 than infinity has no further meaning. Default: 1.
4278
4279 <tag><label id="rip-iface-mode">mode multicast|broadcast</tag>
4280 This option selects the mode for RIP to use on the interface. The
4281 default is multicast mode for RIPv2 and broadcast mode for RIPv1.
4282 RIPng always uses the multicast mode.
4283
4284 <tag><label id="rip-iface-passive">passive <m/switch/</tag>
4285 Passive interfaces receive routing updates but do not transmit any
4286 messages. Default: no.
4287
4288 <tag><label id="rip-iface-address">address <m/ip/</tag>
4289 This option specifies a destination address used for multicast or
4290 broadcast messages, the default is the official RIP (224.0.0.9) or RIPng
4291 (ff02::9) multicast address, or an appropriate broadcast address in the
4292 broadcast mode.
4293
4294 <tag><label id="rip-iface-port">port <m/number/</tag>
4295 This option selects an UDP port to operate on, the default is the
4296 official RIP (520) or RIPng (521) port.
4297
4298 <tag><label id="rip-iface-version">version 1|2</tag>
4299 This option selects the version of RIP used on the interface. For RIPv1,
4300 automatic subnet aggregation is not implemented, only classful network
4301 routes and host routes are propagated. Note that BIRD allows RIPv1 to be
4302 configured with features that are defined for RIPv2 only, like
4303 authentication or using multicast sockets. The default is RIPv2 for IPv4
4304 RIP, the option is not supported for RIPng, as no further versions are
4305 defined.
4306
4307 <tag><label id="rip-iface-version-only">version only <m/switch/</tag>
4308 Regardless of RIP version configured for the interface, BIRD accepts
4309 incoming packets of any RIP version. This option restrict accepted
4310 packets to the configured version. Default: no.
4311
4312 <tag><label id="rip-iface-split-horizon">split horizon <m/switch/</tag>
4313 Split horizon is a scheme for preventing routing loops. When split
4314 horizon is active, routes are not regularly propagated back to the
4315 interface from which they were received. They are either not propagated
4316 back at all (plain split horizon) or propagated back with an infinity
4317 metric (split horizon with poisoned reverse). Therefore, other routers
4318 on the interface will not consider the router as a part of an
4319 independent path to the destination of the route. Default: yes.
4320
4321 <tag><label id="rip-iface-poison-reverse">poison reverse <m/switch/</tag>
4322 When split horizon is active, this option specifies whether the poisoned
4323 reverse variant (propagating routes back with an infinity metric) is
4324 used. The poisoned reverse has some advantages in faster convergence,
4325 but uses more network traffic. Default: yes.
4326
4327 <tag><label id="rip-iface-check-zero">check zero <m/switch/</tag>
4328 Received RIPv1 packets with non-zero values in reserved fields should
4329 be discarded. This option specifies whether the check is performed or
4330 such packets are just processed as usual. Default: yes.
4331
4332 <tag><label id="rip-iface-update-time">update time <m/number/</tag>
4333 Specifies the number of seconds between periodic updates. A lower number
4334 will mean faster convergence but bigger network load. Default: 30.
4335
4336 <tag><label id="rip-iface-timeout-time">timeout time <m/number/</tag>
4337 Specifies the time interval (in seconds) between the last received route
4338 announcement and the route expiration. After that, the network is
4339 considered unreachable, but still is propagated with infinity distance.
4340 Default: 180.
4341
4342 <tag><label id="rip-iface-garbage-time">garbage time <m/number/</tag>
4343 Specifies the time interval (in seconds) between the route expiration
4344 and the removal of the unreachable network entry. The garbage interval,
4345 when a route with infinity metric is propagated, is used for both
4346 internal (after expiration) and external (after withdrawal) routes.
4347 Default: 120.
4348
4349 <tag><label id="rip-iface-ecmp-weight">ecmp weight <m/number/</tag>
4350 When ECMP (multipath) routes are allowed, this value specifies a
4351 relative weight used for nexthops going through the iface. Valid
4352 values are 1-256. Default value is 1.
4353
4354 <tag><label id="rip-iface-auth">authentication none|plaintext|cryptographic</tag>
4355 Selects authentication method to be used. <cf/none/ means that packets
4356 are not authenticated at all, <cf/plaintext/ means that a plaintext
4357 password is embedded into each packet, and <cf/cryptographic/ means that
4358 packets are authenticated using some cryptographic hash function
4359 selected by option <cf/algorithm/ for each key. The default
4360 cryptographic algorithm for RIP keys is Keyed-MD5. If you set
4361 authentication to not-none, it is a good idea to add <cf>password</cf>
4362 section. Default: none.
4363
4364 <tag><label id="rip-iface-pass">password "<m/text/"</tag>
4365 Specifies a password used for authentication. See <ref id="proto-pass"
4366 name="password"> common option for detailed description.
4367
4368 <tag><label id="rip-iface-ttl-security">ttl security [<m/switch/ | tx only]</tag>
4369 TTL security is a feature that protects routing protocols from remote
4370 spoofed packets by using TTL 255 instead of TTL 1 for protocol packets
4371 destined to neighbors. Because TTL is decremented when packets are
4372 forwarded, it is non-trivial to spoof packets with TTL 255 from remote
4373 locations.
4374
4375 If this option is enabled, the router will send RIP packets with TTL 255
4376 and drop received packets with TTL less than 255. If this option si set
4377 to <cf/tx only/, TTL 255 is used for sent packets, but is not checked
4378 for received packets. Such setting does not offer protection, but offers
4379 compatibility with neighbors regardless of whether they use ttl
4380 security.
4381
4382 For RIPng, TTL security is a standard behavior (required by <rfc
4383 id="2080">) and therefore default value is yes. For IPv4 RIP, default
4384 value is no.
4385
4386 <tag><label id="rip-iface-tx-class">tx class|dscp|priority <m/number/</tag>
4387 These options specify the ToS/DiffServ/Traffic class/Priority of the
4388 outgoing RIP packets. See <ref id="proto-tx-class" name="tx class"> common
4389 option for detailed description.
4390
4391 <tag><label id="rip-iface-rx-buffer">rx buffer <m/number/</tag>
4392 This option specifies the size of buffers used for packet processing.
4393 The buffer size should be bigger than maximal size of received packets.
4394 The default value is 532 for IPv4 RIP and interface MTU value for RIPng.
4395
4396 <tag><label id="rip-iface-tx-length">tx length <m/number/</tag>
4397 This option specifies the maximum length of generated RIP packets. To
4398 avoid IP fragmentation, it should not exceed the interface MTU value.
4399 The default value is 532 for IPv4 RIP and interface MTU value for RIPng.
4400
4401 <tag><label id="rip-iface-check-link">check link <m/switch/</tag>
4402 If set, the hardware link state (as reported by OS) is taken into
4403 consideration. When the link disappears (e.g. an ethernet cable is
4404 unplugged), neighbors are immediately considered unreachable and all
4405 routes received from them are withdrawn. It is possible that some
4406 hardware drivers or platforms do not implement this feature.
4407 Default: yes.
4408 </descrip>
4409
4410 <sect1>Attributes
4411 <label id="rip-attr">
4412
4413 <p>RIP defines two route attributes:
4414
4415 <descrip>
4416 <tag><label id="rta-rip-metric">int rip_metric</tag>
4417 RIP metric of the route (ranging from 0 to <cf/infinity/). When routes
4418 from different RIP instances are available and all of them have the same
4419 preference, BIRD prefers the route with lowest <cf/rip_metric/. When a
4420 non-RIP route is exported to RIP, the default metric is 1.
4421
4422 <tag><label id="rta-rip-tag">int rip_tag</tag>
4423 RIP route tag: a 16-bit number which can be used to carry additional
4424 information with the route (for example, an originating AS number in
4425 case of external routes). When a non-RIP route is exported to RIP, the
4426 default tag is 0.
4427 </descrip>
4428
4429 <sect1>Example
4430 <label id="rip-exam">
4431
4432 <p><code>
4433 protocol rip {
4434 ipv4 {
4435 import all;
4436 export all;
4437 };
4438 interface "eth*" {
4439 metric 2;
4440 port 1520;
4441 mode multicast;
4442 update time 12;
4443 timeout time 60;
4444 authentication cryptographic;
4445 password "secret" { algorithm hmac sha256; };
4446 };
4447 }
4448 </code>
4449
4450
4451 <sect>RPKI
4452 <label id="rpki">
4453
4454 <sect1>Introduction
4455
4456 <p>The Resource Public Key Infrastructure (RPKI) is mechanism for origin
4457 validation of BGP routes (RFC 6480). BIRD supports only so-called RPKI-based
4458 origin validation. There is implemented RPKI to Router (RPKI-RTR) protocol (RFC
4459 6810). It uses some of the RPKI data to allow a router to verify that the
4460 autonomous system announcing an IP address prefix is in fact authorized to do
4461 so. This is not crypto checked so can be violated. But it should prevent the
4462 vast majority of accidental hijackings on the Internet today, e.g. the famous
4463 Pakastani accidental announcement of YouTube's address space.
4464
4465 <p>The RPKI-RTR protocol receives and maintains a set of ROAs from a cache
4466 server (also called validator). You can validate routes (RFC 6483) using
4467 function <cf/roa_check()/ in filter and set it as import filter at the BGP
4468 protocol. BIRD should re-validate all of affected routes after RPKI update by
4469 RFC 6811, but we don't support it yet! You can use a BIRD's client command
4470 <cf>reload in <m/bgp_protocol_name/</cf> for manual call of revalidation of all
4471 routes.
4472
4473 <sect1>Supported transports
4474 <p>
4475 <itemize>
4476 <item>Unprotected transport over TCP uses a port 323. The cache server
4477 and BIRD router should be on the same trusted and controlled network
4478 for security reasons.
4479 <item>SSHv2 encrypted transport connection uses the normal SSH port
4480 22.
4481 </itemize>
4482
4483 <sect1>Configuration
4484
4485 <p>We currently support just one cache server per protocol. However you can
4486 define more RPKI protocols generally.
4487
4488 <code>
4489 protocol rpki [&lt;name&gt;] {
4490 roa4 { table &lt;tab&gt;; };
4491 roa6 { table &lt;tab&gt;; };
4492 remote &lt;ip&gt; | "&lt;domain&gt;" [port &lt;num&gt;];
4493 port &lt;num&gt;;
4494 refresh [keep] &lt;num&gt;;
4495 retry [keep] &lt;num&gt;;
4496 expire [keep] &lt;num&gt;;
4497 transport tcp;
4498 transport ssh {
4499 bird private key "&lt;/path/to/id_rsa&gt;";
4500 remote public key "&lt;/path/to/known_host&gt;";
4501 user "&lt;name&gt;";
4502 };
4503 }
4504 </code>
4505
4506 <p>Alse note that you have to specify the ROA channel. If you want to import
4507 only IPv4 prefixes you have to specify only roa4 channel. Similarly with IPv6
4508 prefixes only. If you want to fetch both IPv4 and even IPv6 ROAs you have to
4509 specify both channels.
4510
4511 <sect2>RPKI protocol options
4512 <p>
4513 <descrip>
4514 <tag>remote <m/ip/ | "<m/hostname/" [port <m/num/]</tag> Specifies
4515 a destination address of the cache server. Can be specified by an IP
4516 address or by full domain name string. Only one cache can be specified
4517 per protocol. This option is required.
4518
4519 <tag>port <m/num/</tag> Specifies the port number. The default port
4520 number is 323 for transport without any encryption and 22 for transport
4521 with SSH encryption.
4522
4523 <tag>refresh [keep] <m/num/</tag> Time period in seconds. Tells how
4524 long to wait before next attempting to poll the cache using a Serial
4525 Query or a Reset Query packet. Must be lower than 86400 seconds (one
4526 day). Too low value can caused a false positive detection of
4527 network connection problems. A keyword <cf/keep/ suppresses updating
4528 this value by a cache server.
4529 Default: 3600 seconds
4530
4531 <tag>retry [keep] <m/num/</tag> Time period in seconds between a failed
4532 Serial/Reset Query and a next attempt. Maximum allowed value is 7200
4533 seconds (two hours). Too low value can caused a false positive
4534 detection of network connection problems. A keyword <cf/keep/
4535 suppresses updating this value by a cache server.
4536 Default: 600 seconds
4537
4538 <tag>expire [keep] <m/num/</tag> Time period in seconds. Received
4539 records are deleted if the client was unable to successfully refresh
4540 data for this time period. Must be in range from 600 seconds (ten
4541 minutes) to 172800 seconds (two days). A keyword <cf/keep/
4542 suppresses updating this value by a cache server.
4543 Default: 7200 seconds
4544
4545 <tag>transport tcp</tag> Unprotected transport over TCP. It's a default
4546 transport. Should be used only on secure private networks.
4547 Default: tcp
4548
4549 <tag>transport ssh { <m/SSH transport options.../ }</tag> It enables a
4550 SSHv2 transport encryption. Cannot be combined with a TCP transport.
4551 Default: off
4552 </descrip>
4553
4554 <sect3>SSH transport options
4555 <p>
4556 <descrip>
4557 <tag>bird private key "<m>/path/to/id_rsa</m>"</tag>
4558 A path to the BIRD's private SSH key for authentication.
4559 It can be a <cf><m>id_rsa</m></cf> file.
4560
4561 <tag>remote public key "<m>/path/to/known_host</m>"</tag>
4562 A path to the cache's public SSH key for verification identity
4563 of the cache server. It could be a path to <cf><m>known_host</m></cf> file.
4564
4565 <tag>user "<m/name/"</tag>
4566 A SSH user name for authentication. This option is a required.
4567 </descrip>
4568
4569 <sect1>Examples
4570 <sect2>BGP origin validation
4571 <p>Policy: Don't import <cf/ROA_INVALID/ routes.
4572 <code>
4573 roa4 table r4;
4574 roa6 table r6;
4575
4576 protocol rpki {
4577 debug all;
4578
4579 roa4 { table r4; };
4580 roa6 { table r6; };
4581
4582 # Please, do not use rpki-validator.realmv6.org in production
4583 remote "rpki-validator.realmv6.org" port 8282;
4584
4585 retry keep 5;
4586 refresh keep 30;
4587 expire 600;
4588 }
4589
4590 filter peer_in_v4 {
4591 if (roa_check(r4, net, bgp_path.last) = ROA_INVALID) then
4592 {
4593 print "Ignore invalid ROA ", net, " for ASN ", bgp_path.last;
4594 reject;
4595 }
4596 accept;
4597 }
4598
4599 protocol bgp {
4600 debug all;
4601 local as 65000;
4602 neighbor 192.168.2.1 as 65001;
4603 ipv4 {
4604 import filter peer_in_v4;
4605 export none;
4606 };
4607 }
4608 </code>
4609
4610 <sect2>SSHv2 transport encryption
4611 <p>
4612 <code>
4613 roa4 table r4;
4614 roa6 table r6;
4615
4616 protocol rpki {
4617 debug all;
4618
4619 roa4 { table r4; };
4620 roa6 { table r6; };
4621
4622 remote 127.0.0.1 port 2345;
4623 transport ssh {
4624 bird private key "/home/birdgeek/.ssh/id_rsa";
4625 remote public key "/home/birdgeek/.ssh/known_hosts";
4626 user "birdgeek";
4627 };
4628
4629 # Default interval values
4630 }
4631 </code>
4632
4633
4634 <sect>Static
4635 <label id="static">
4636
4637 <p>The Static protocol doesn't communicate with other routers in the network,
4638 but instead it allows you to define routes manually. This is often used for
4639 specifying how to forward packets to parts of the network which don't use
4640 dynamic routing at all and also for defining sink routes (i.e., those telling to
4641 return packets as undeliverable if they are in your IP block, you don't have any
4642 specific destination for them and you don't want to send them out through the
4643 default route to prevent routing loops).
4644
4645 <p>There are three classes of definitions in Static protocol configuration --
4646 global options, static route definitions, and per-route options. Usually, the
4647 definition of the protocol contains mainly a list of static routes.
4648 Static routes have no specific attributes.
4649
4650 <p>Global options:
4651
4652 <descrip>
4653 <tag><label id="static-check-link">check link <m/switch/</tag>
4654 If set, hardware link states of network interfaces are taken into
4655 consideration. When link disappears (e.g. ethernet cable is unplugged),
4656 static routes directing to that interface are removed. It is possible
4657 that some hardware drivers or platforms do not implement this feature.
4658 Default: off.
4659
4660 <tag><label id="static-igp-table">igp table <m/name/</tag>
4661 Specifies a table that is used for route table lookups of recursive
4662 routes. Default: the same table as the protocol is connected to.
4663 </descrip>
4664
4665 <p>Route definitions (each may also contain a block of per-route options):
4666
4667 <sect1>Regular routes; MPLS switching rules
4668
4669 <p>There exist several types of routes; keep in mind that <m/prefix/ syntax is
4670 <ref id="type-prefix" name="dependent on network type">.
4671
4672 <descrip>
4673 <tag>route <m/prefix/ via <m/ip/|<m/"interface"/ [mpls <m/num/[/<m/num/[/<m/num/[...]]]]</tag>
4674 Next hop routes may bear one or more <ref id="route-next-hop" name="next hops">.
4675 Every next hop is preceded by <cf/via/ and configured as shown.
4676
4677 <tag>route <m/prefix/ recursive <m/ip/ [mpls <m/num/[/<m/num/[/<m/num/[...]]]]</tag>
4678 Recursive nexthop resolves the given IP in the configured IGP table and
4679 uses that route's next hop. The MPLS stacks are concatenated; on top is
4680 the IGP's nexthop stack and on bottom is this route's stack.
4681
4682 <tag>route <m/prefix/ blackhole|unreachable|prohibit</tag>
4683 Special routes specifying to silently drop the packet, return it as
4684 unreachable or return it as administratively prohibited. First two
4685 targets are also known as <cf/drop/ and <cf/reject/.
4686 </descrip>
4687
4688 <p>When the particular destination is not available (the interface is down or
4689 the next hop of the route is not a neighbor at the moment), Static just
4690 uninstalls the route from the table it is connected to and adds it again as soon
4691 as the destination becomes adjacent again.
4692
4693 <sect1>Route Origin Authorization
4694
4695 <p>The ROA config is just <cf>route <m/prefix/ max <m/int/ as <m/int/</cf> with no nexthop.
4696
4697 <sect1>Flowspec
4698 <label id="flowspec-network-type">
4699
4700 <p>The flow specification are rules for routers and firewalls for filtering
4701 purpose. It is described by <rfc id="5575">. There are 3 types of arguments:
4702 <m/inet4/ or <m/inet6/ prefixes, bitmasks matching expressions and numbers
4703 matching expressions.
4704
4705 Bitmasks matching is written using <m/value/<cf>/</cf><m/mask/ or
4706 <cf/!/<m/value/<cf>/</cf><m/mask/ pairs. It means that <cf/(/<m/data/ <cf/&/
4707 <m/mask/<cf/)/ is or is not equal to <m/value/.
4708
4709 Numbers matching is a matching sequence of numbers and ranges separeted by a
4710 commas (<cf/,/) (e.g. <cf/10,20,30/). Ranges can be written using double dots
4711 <cf/../ notation (e.g. <cf/80..90,120..124/). An alternative notation are
4712 sequence of one or more pairs of relational operators and values separated by
4713 logical operators <cf/&&/ or <cf/||/. Allowed relational operators are <cf/=/,
4714 <cf/!=/, <cf/</, <cf/<=/, <cf/>/, <cf/>=/, <cf/true/ and <cf/false/.
4715
4716 <sect2>IPv4 Flowspec
4717
4718 <p><descrip>
4719 <tag><label id="flow-dst">dst <m/inet4/</tag>
4720 Set a matching destination prefix (e.g. <cf>dst 192.168.0.0/16</cf>).
4721 Only this option is mandatory in IPv4 Flowspec.
4722
4723 <tag><label id="flow-src">src <m/inet4/</tag>
4724 Set a matching source prefix (e.g. <cf>src 10.0.0.0/8</cf>).
4725
4726 <tag><label id="flow-proto">proto <m/numbers-match/</tag>
4727 Set a matching IP protocol numbers (e.g. <cf/proto 6/).
4728
4729 <tag><label id="flow-port">port <m/numbers-match/</tag>
4730 Set a matching source or destination TCP/UDP port numbers (e.g.
4731 <cf>port 1..1023,1194,3306</cf>).
4732
4733 <tag><label id="flow-dport">dport <m/numbers-match/</tag>
4734 Set a mating destination port numbers (e.g. <cf>dport 49151</cf>).
4735
4736 <tag><label id="flow-sport">sport <m/numbers-match/</tag>
4737 Set a matching source port numbers (e.g. <cf>sport = 0</cf>).
4738
4739 <tag><label id="flow-icmp-type">icmp type <m/numbers-match/</tag>
4740 Set a matching type field number of an ICMP packet (e.g. <cf>icmp type
4741 3</cf>)
4742
4743 <tag><label id="flow-icmp-code">icmp code <m/numbers-match/</tag>
4744 Set a matching code field number of an ICMP packet (e.g. <cf>icmp code
4745 1</cf>)
4746
4747 <tag><label id="flow-tcp-flags">tcp flags <m/bitmask-match/</tag>
4748 Set a matching bitmask for TCP header flags (aka control bits) (e.g.
4749 <cf>tcp flags 0x03/0x0f;</cf>). The maximum length of mask is 12 bits
4750 (0xfff).
4751
4752 <tag><label id="flow-length">length <m/numbers-match/</tag>
4753 Set a matching packet length (e.g. <cf>length > 1500;</cf>)
4754
4755 <tag><label id="flow-dscp">dscp <m/numbers-match/</tag>
4756 Set a matching DiffServ Code Point number (e.g. <cf>length > 1500;</cf>).
4757
4758 <tag><label id="flow-fragment">fragment <m/fragmentation-type/</tag>
4759 Set a matching type of packet fragmentation. Allowed fragmentation
4760 types are <cf/dont_fragment/, <cf/is_fragment/, <cf/first_fragment/,
4761 <cf/last_fragment/ (e.g. <cf>fragment is_fragment &&
4762 !dont_fragment</cf>).
4763 </descrip>
4764
4765 <p><code>
4766 protocol static {
4767 flow4;
4768
4769 route flow4 {
4770 dst 10.0.0.0/8;
4771 port > 24 && < 30 || 40..50,60..70,80 && >= 90;
4772 tcp flags 0x03/0x0f;
4773 length > 1024;
4774 dscp = 63;
4775 fragment dont_fragment, is_fragment || !first_fragment;
4776 };
4777 }
4778 </code>
4779
4780 <sect2>Differences for IPv6 Flowspec
4781
4782 <p>Flowspec IPv6 are same as Flowspec IPv4 with a few exceptions.
4783 <itemize>
4784 <item>Prefixes <m/inet6/ can be specified not only with prefix length,
4785 but with prefix <cf/offset/ <m/num/ too (e.g.
4786 <cf>::1234:5678:9800:0000/101 offset 64</cf>). Offset means to don't
4787 care of <m/num/ first bits.
4788 <item>IPv6 Flowspec hasn't mandatory any flowspec component.
4789 <item>In IPv6 packets, there is a matching the last next header value
4790 for a matching IP protocol number (e.g. <cf>next header 6</cf>).
4791 <item>It is not possible to set <cf>dont_fragment</cf> as a type of
4792 packet fragmentation.
4793 </itemize>
4794
4795 <p><descrip>
4796 <tag><label id="flow6-dst">dst <m/inet6/ [offset <m/num/]</tag>
4797 Set a matching destination IPv6 prefix (e.g. <cf>dst
4798 ::1c77:3769:27ad:a11a/128 offset 64</cf>).
4799
4800 <tag><label id="flow6-src">src <m/inet6/ [offset <m/num/]</tag>
4801 Set a matching source IPv6 prefix (e.g. <cf>src fe80::/64</cf>).
4802
4803 <tag><label id="flow6-next-header">next header <m/numbers-match/</tag>
4804 Set a matching IP protocol numbers (e.g. <cf>next header != 6</cf>).
4805
4806 <tag><label id="flow6-label">label <m/bitmask-match/</tag>
4807 Set a 20-bit bitmask for matching Flow Label field in IPv6 packets
4808 (e.g. <cf>label 0x8e5/0x8e5</cf>).
4809 </descrip>
4810
4811 <p><code>
4812 protocol static {
4813 flow6 { table myflow6; };
4814
4815 route flow6 {
4816 dst fec0:1122:3344:5566:7788:99aa:bbcc:ddee/128;
4817 src 0000:0000:0000:0001:1234:5678:9800:0000/101 offset 63;
4818 next header = 23;
4819 sport > 24 && < 30 || = 40 || 50,60,70..80;
4820 dport = 50;
4821 tcp flags 0x03/0x0f, !0/0xff || 0x33/0x33;
4822 fragment !is_fragment || !first_fragment;
4823 label 0xaaaa/0xaaaa && 0x33/0x33;
4824 };
4825 }
4826 </code>
4827
4828 <sect1>Per-route options
4829 <p>
4830 <descrip>
4831 <tag><label id="static-route-bfd">bfd <m/switch/</tag>
4832 The Static protocol could use BFD protocol for next hop liveness
4833 detection. If enabled, a BFD session to the route next hop is created
4834 and the static route is BFD-controlled -- the static route is announced
4835 only if the next hop liveness is confirmed by BFD. If the BFD session
4836 fails, the static route is removed. Note that this is a bit different
4837 compared to other protocols, which may use BFD as an advisory mechanism
4838 for fast failure detection but ignores it if a BFD session is not even
4839 established.
4840
4841 This option can be used for static routes with a direct next hop, or
4842 also for for individual next hops in a static multipath route (see
4843 above). Note that BFD protocol also has to be configured, see
4844 <ref id="bfd" name="BFD"> section for details. Default value is no.
4845
4846 <tag><label id="static-route-filter"><m/filter expression/</tag>
4847 This is a special option that allows filter expressions to be configured
4848 on per-route basis. Can be used multiple times. These expressions are
4849 evaluated when the route is originated, similarly to the import filter
4850 of the static protocol. This is especially useful for configuring route
4851 attributes, e.g., <cf/ospf_metric1 = 100;/ for a route that will be
4852 exported to the OSPF protocol.
4853 </descrip>
4854
4855 <sect1>Example static config
4856
4857 <p><code>
4858 protocol static {
4859 ipv4 { table testable; }; # Connect to a non-default routing table
4860 check link; # Advertise routes only if link is up
4861 route 0.0.0.0/0 via 198.51.100.130; # Default route
4862 route 10.0.0.0/8 # Multipath route
4863 via 198.51.100.10 weight 2
4864 via 198.51.100.20 bfd # BFD-controlled next hop
4865 via 192.0.2.1;
4866 route 203.0.113.0/24 unreachable; # Sink route
4867 route 10.2.0.0/24 via "arc0"; # Secondary network
4868 route 192.168.10.0/24 via 198.51.100.100 {
4869 ospf_metric1 = 20; # Set extended attribute
4870 }
4871 route 192.168.10.0/24 via 198.51.100.100 {
4872 ospf_metric2 = 100; # Set extended attribute
4873 ospf_tag = 2; # Set extended attribute
4874 bfd; # BFD-controlled route
4875 }
4876 }
4877 </code>
4878
4879
4880 <chapt>Conclusions
4881 <label id="conclusion">
4882
4883 <sect>Future work
4884 <label id="future-work">
4885
4886 <p>Although BIRD supports all the commonly used routing protocols, there are
4887 still some features which would surely deserve to be implemented in future
4888 versions of BIRD:
4889
4890 <itemize>
4891 <item>Opaque LSA's
4892 <item>Route aggregation and flap dampening
4893 <item>Multicast routing protocols
4894 <item>Ports to other systems
4895 </itemize>
4896
4897
4898 <sect>Getting more help
4899 <label id="help">
4900
4901 <p>If you use BIRD, you're welcome to join the bird-users mailing list
4902 (<HTMLURL URL="mailto:bird-users@network.cz" name="bird-users@network.cz">)
4903 where you can share your experiences with the other users and consult
4904 your problems with the authors. To subscribe to the list, visit
4905 <HTMLURL URL="http://bird.network.cz/?m_list" name="http://bird.network.cz/?m_list">.
4906 The home page of BIRD can be found at <HTMLURL URL="http://bird.network.cz/" name="http://bird.network.cz/">.
4907
4908 <p>BIRD is a relatively young system and it probably contains some bugs. You can
4909 report any problems to the bird-users list and the authors will be glad to solve
4910 them, but before you do so, please make sure you have read the available
4911 documentation and that you are running the latest version (available at
4912 <HTMLURL URL="ftp://bird.network.cz/pub/bird" name="bird.network.cz:/pub/bird">).
4913 (Of course, a patch which fixes the bug is always welcome as an attachment.)
4914
4915 <p>If you want to understand what is going inside, Internet standards are a good
4916 and interesting reading. You can get them from
4917 <HTMLURL URL="ftp://ftp.rfc-editor.org/" name="ftp.rfc-editor.org"> (or a
4918 nicely sorted version from <HTMLURL URL="ftp://atrey.karlin.mff.cuni.cz/pub/rfc"
4919 name="atrey.karlin.mff.cuni.cz:/pub/rfc">).
4920
4921 <p><it/Good luck!/
4922
4923 </book>
4924
4925 <!--
4926 LocalWords: GPL IPv GateD BGPv RIPv OSPFv Linux sgml html dvi sgmltools Pavel
4927 LocalWords: linuxdoc dtd descrip config conf syslog stderr auth ospf bgp Mbps
4928 LocalWords: router's eval expr num birdc ctl UNIX if's enums bool int ip GCC
4929 LocalWords: len ipaddress pxlen netmask enum bgppath bgpmask clist gw md eth
4930 LocalWords: RTS printn quitbird iBGP AS'es eBGP RFC multiprotocol IGP Machek
4931 LocalWords: EGP misconfigurations keepalive pref aggr aggregator BIRD's RTC
4932 LocalWords: OS'es AS's multicast nolisten misconfigured UID blackhole MRTD MTU
4933 LocalWords: uninstalls ethernets IP binutils ANYCAST anycast dest RTD ICMP rfc
4934 LocalWords: compat multicasts nonbroadcast pointopoint loopback sym stats
4935 LocalWords: Perl SIGHUP dd mm yy HH MM SS EXT IA UNICAST multihop Discriminator txt
4936 LocalWords: proto wildcard Ondrej Filip
4937 -->