git.ipfire.org Git - thirdparty/haproxy.git/log


commit | commitdiff | tree

Willy Tarreau [Sun, 11 Nov 2012 19:38:30 +0000 (20:38 +0100)]

BUG: raw_sock: also consider ENOTCONN in addition to EAGAIN

A failed send() may return ENOTCONN when the connection is not yet established.
On Linux, we generally see EAGAIN but on OpenBSD we clearly have ENOTCONN, so
let's ensure we poll for write when we encounter this error.
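As a rough illustration of the rule described above, the check below treats ENOTCONN like EAGAIN after a failed send(). The helper name `send_should_poll` is hypothetical and is not taken from the haproxy source; it only sketches the decision, not the actual raw_sock code.

```c
#include <errno.h>
#include <stdbool.h>

/* Hypothetical helper (not haproxy's code): decide whether a failed
 * send() means "not ready yet, poll for write" rather than a hard error. */
static bool send_should_poll(int err)
{
    /* EAGAIN is what Linux usually reports; ENOTCONN is what OpenBSD
     * reports while the connection is not yet established. */
    return err == EAGAIN || err == ENOTCONN;
}
```

A caller would pass `errno` right after send() returned -1 and subscribe to write events when the helper returns true.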

commit | commitdiff | tree

Willy Tarreau [Sun, 11 Nov 2012 15:43:45 +0000 (16:43 +0100)]

REORG: fd: centralize the processing of speculative events

Speculative events are independent of the poller, so they can be
centralized in fd.c.

commit | commitdiff | tree

Willy Tarreau [Sun, 11 Nov 2012 15:05:19 +0000 (16:05 +0100)]

REORG: fd: move the fd state management from ev_sepoll

ev_sepoll already provides everything needed to manage FD events
by only manipulating the speculative I/O list. Nothing there is
sepoll-specific so move all this to fd.

commit | commitdiff | tree

Willy Tarreau [Sun, 11 Nov 2012 14:02:54 +0000 (15:02 +0100)]

REORG: fd: move the speculative I/O management from ev_sepoll

The speculative I/O will need to be ported to all pollers, so move
this to fd.c.

commit | commitdiff | tree

Willy Tarreau [Sun, 11 Nov 2012 16:08:32 +0000 (17:08 +0100)]

MEDIUM: fd: don't unset fdtab[].updated upon delete

We must not remove the .updated flag otherwise we risk having to
reallocate a new updt entry if the same fd is reused.

commit | commitdiff | tree

William Lallemand [Sun, 11 Nov 2012 16:30:56 +0000 (17:30 +0100)]

MINOR: log-format: check number of arguments in cfgparse.c

Exit with an error if there is a second argument in the 'log-format' and
'unique-id-format' options. This is helpful when we forget to escape
spaces.
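The argument-count check can be pictured with the small sketch below. It assumes a parser that splits a configuration line into an `args` array terminated by an empty string or NULL; the function name and that layout are assumptions for illustration, not the code in cfgparse.c.

```c
#include <stddef.h>

/* Hypothetical check: args[0] is the keyword and args[1] the format string.
 * Assumption: the array ends with an empty string or a NULL pointer.
 * Returns 0 when exactly one argument follows the keyword, -1 otherwise. */
static int check_single_format_arg(const char *const *args)
{
    if (args[1] == NULL || *args[1] == '\0')
        return -1;      /* no format string at all */
    if (args[2] != NULL && *args[2] != '\0')
        return -1;      /* second argument, e.g. caused by an unescaped space */
    return 0;
}
```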

commit | commitdiff | tree

Cyril Bonté [Sun, 11 Nov 2012 12:38:27 +0000 (13:38 +0100)]

DOC: compression: add some details and clean up the formatting

commit 82fe75c1 provided useful details in its log message. We should report
part of them in the documentation to know which algorithms are available.

This patch also makes some formatting cleanups (including a line outside the
compression scope, which exceeded 80 chars).

commit | commitdiff | tree

Cyril Bonté [Sat, 10 Nov 2012 18:27:47 +0000 (19:27 +0100)]

BUILD: report zlib support in haproxy -vv

Depending on build options, compression algorithms are not always supported.
"haproxy -vv" now reports whether zlib is supported and lists the supported
compression algorithms.

commit | commitdiff | tree

Willy Tarreau [Sat, 10 Nov 2012 16:49:37 +0000 (17:49 +0100)]

BUILD: compression: remove a build warning

gcc emits this warning while building free_zlib() :
src/compression.c: In function `free_zlib':
src/compression.c:403: warning: 'pool' might be used uninitialized in this function

This is not a bug as the pool cannot take other values, but let's
pre-initialize it to NULL to fix the warning.

commit | commitdiff | tree

William Lallemand [Fri, 9 Nov 2012 16:05:39 +0000 (17:05 +0100)]

MINOR: compression: maximum compression rate limit

This patch adds input and output rate calculation to the HTTP compression
feature.

Compression can be limited with a maximum rate value in kilobytes per
second. The rate is set with the global 'maxcomprate' option. You can
change this value dynamically with 'set rate-limit http-compression
global' on the UNIX socket.

commit | commitdiff | tree

William Lallemand [Fri, 9 Nov 2012 11:33:10 +0000 (12:33 +0100)]

MINOR: compression: tune.comp.maxlevel

This option allows you to set the maximum compression level usable by
the compression algorithm. It affects CPU usage.

commit | commitdiff | tree

Finn Arne Gangstad [Fri, 9 Nov 2012 20:02:36 +0000 (21:02 +0100)]

BUG: http: revert broken optimisation from 82fe75c1a79dac933391501b9d293bce34513755

This optimisation causes haproxy to time out requests that result
in two TCP packets, one packet containing the header, and one
packet containing the actual data. This is a very typical type
of response from a lot of servers.

[Willy: I suspect the fix might have an impact on the compression code
which I'm not sure completely handles calls with 0 bytes to forward]

commit | commitdiff | tree

Willy Tarreau [Fri, 9 Nov 2012 17:27:26 +0000 (18:27 +0100)]

OPTIM: stream_interface: disable reading when CF_READ_DONTWAIT is set

CF_READ_DONTWAIT was designed to avoid getting an EAGAIN upon recv() when
very few data are expected. It prevents the reader from looping over
recv(). Unfortunately with speculative I/O, it is very common that the
same event has the time to be called twice before the task handles the
data and disables the recv(). This is because not all tasks are always
processed at once.

Instead of leaving the buffer free-wheeling and doing an EAGAIN, we
disable reading as soon as the first recv() succeeds. This way we're
sure that only the next wakeup of the task will re-enable it if needed.

Doing so has totally removed the EAGAIN we were seeing till now (30% of
recv).

commit | commitdiff | tree

Willy Tarreau [Tue, 6 Nov 2012 01:34:46 +0000 (02:34 +0100)]

MAJOR: sepoll: make the poller totally event-driven

At the moment sepoll is not 100% event-driven, because a call to fd_set()
on an event which is already being polled will not change its state.

This causes issues with OpenSSL because if some I/O processing is interrupted
after clearing the I/O event (eg: read all data from a socket, can't put it
all into the buffer), then there is no way to call the SSL_read() again once
the buffer releases some space.

The only real solution is to go 100% event-driven. The principle is to use
the spec list as an event cache and that each time an I/O event is reported
by epoll_wait(), this event is automatically scheduled for addition to the
spec list for future calls until the consumer explicitly asks for polling
or stopping.

Doing this is a bit tricky because sepoll used to provide a substantial
number of optimizations such as event merging. These optimizations have
been maintained : a dedicated update list is affected when events change,
but not the event list, so that updates may cancel themselves without any
side effect such as displacing events. A specific case was considered for
handling newly created FDs as soon as they are detected from within the
poll loop. This ensures that their read or write operation will always be
attempted as soon as possible, thus reducing the number of poll loops and
process_session wakeups. This is especially true for newly accepted fds
which immediately perform their first recv() call.

Two new flags were added to the fdtab[] struct to tag the fact that a file
descriptor already exists in the update list. One flag indicates that a
file descriptor is new and has just been created (fdtab[].new) and the other
one indicates that a file descriptor is already referenced by the update list
(fdtab[].updated). Even if the FD state changes during operations or if the
fd is closed and replaced, it's not an issue because the update flag remains
and is easily spotted during list walks. The flag must absolutely reflect the
presence of the fd in the update list in order to avoid overflowing the update
list with more events than there are distinct fds.
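The role of the "updated" flag can be sketched as follows. This is a simplified model under stated assumptions: the table, list, and function names are invented for the example, and a real poller would apply the new polling state where the comment indicates. It shows why the list can never hold more entries than there are distinct fds.

```c
#include <stdbool.h>

#define SKETCH_MAXFD 16

/* Simplified stand-in for the per-fd table; only the two flags matter here. */
struct fd_sketch {
    bool is_new;   /* fd was just created (the commit calls this .new) */
    bool updated;  /* fd is already referenced by the update list */
};

static struct fd_sketch fd_tab[SKETCH_MAXFD];
static int update_list[SKETCH_MAXFD];
static int nb_updates;

/* Queue fd for a state update unless it is already queued. */
static void queue_update(int fd)
{
    if (fd_tab[fd].updated)
        return;                     /* already listed: no duplicate entry */
    fd_tab[fd].updated = true;
    update_list[nb_updates++] = fd;
}

/* Walk the list once, clear the flags, and empty the list.
 * Returns the number of entries processed. */
static int flush_updates(void)
{
    int done = nb_updates;

    for (int i = 0; i < nb_updates; i++) {
        int fd = update_list[i];
        fd_tab[fd].updated = false;
        fd_tab[fd].is_new = false;
        /* a real poller would apply the fd's new polling state here */
    }
    nb_updates = 0;
    return done;
}
```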

Note that this change also recovers the small performance loss introduced
by its connection counter-part and goes even beyond.

commit | commitdiff | tree

Willy Tarreau [Mon, 5 Nov 2012 19:00:43 +0000 (20:00 +0100)]

BUG/MAJOR: always clear the CO_FL_WAIT_* flags after updating polling flags

The CO_FL_WAIT_* flags were not cleared after updating polling flags.
This means that any caller of these functions that did not clear it
would enable polling instead of speculative I/O. This happens during
the stream interface update call which is performed from the session
handler for example.

As of now it's not a problem yet because speculative I/O and polling
are handled the same way. However with upcoming changes it does cause
some deadlocks because enabling read processing on a file descriptor
where everything was already read will do nothing until something new
happens on this FD.

The correct fix consists in clearing the flags while leaving the update
functions.

This fix does not need any backport as it was introduced with recent
connection changes (dev12) and not triggered until last commit.

commit | commitdiff | tree

Willy Tarreau [Mon, 5 Nov 2012 16:52:26 +0000 (17:52 +0100)]

MAJOR: connection: remove the CO_FL_CURR_*_POL flag

This is the first step of a series of changes aiming at making the
polling totally event-driven. This first change consists in only
remembering at the connection level whether an FD was enabled or not,
regardless of the fact it was being polled or cached. From now on, an
EAGAIN will always be considered as a change so that the pollers are
able to manage a cache and to flush it based on such events. One of
the noticeable effects is that conn_fd_handler() is called once more
per session (6 instead of 5 min) but other update functions are called
less often.

Note that the performance loss caused by this change at the moment is
quite significant, around 2.5%, but the change is needed to have SSL
working correctly in all situations, even when data were read from the
socket and stored in the invisible cache, waiting for some room in the
channel's buffer.

commit | commitdiff | tree

Willy Tarreau [Mon, 5 Nov 2012 23:14:25 +0000 (00:14 +0100)]

BUG/MINOR: session: mark the handshake as complete earlier

There is a small waste of CPU cycles when no handshake is required on an
accepted connection, because we had to perform one call to conn_fd_handler()
to mark the connection CONNECTED and to call process_session() again to say
that nothing happened.

By marking the connection CONNECTED when there is no pending handshake, we
avoid this extra call to process_session().

commit | commitdiff | tree

Willy Tarreau [Thu, 8 Nov 2012 13:49:17 +0000 (14:49 +0100)]

OPTIM: session: don't process the whole session when only timers need a refresh

Having a global expiration timer for a task means that the tasks are regularly
woken up (at least after each expiration timer). It's totally useless and
counterproductive to process the whole session upon each such wakeup, and it's fairly
easy to detect such wakeups, so let's just update the task's timer and return
to sleep when this happens.

For 100k concurrent connections with 10s of timeouts, this can save 10k wakeups
per second, which is not bad.
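The figure quoted above follows from simple arithmetic, assuming timer expirations are spread evenly over the timeout period. The helper name is hypothetical.

```c
/* Wakeups per second caused purely by timer expiry, assuming the
 * expirations are evenly distributed over the timeout period. */
static unsigned long timer_wakeups_per_sec(unsigned long connections,
                                           unsigned long timeout_sec)
{
    return timeout_sec ? connections / timeout_sec : 0;
}
```

With 100000 connections and a 10 second timeout this gives 10000 wakeups per second, which matches the "10k" in the commit message.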

commit | commitdiff | tree

William Lallemand [Wed, 7 Nov 2012 15:12:57 +0000 (16:12 +0100)]

MEDIUM: compression: limit RAM usage

With the global maxzlibmem option, you are able to control the maximum
amount of RAM usable for HTTP compression.

A test is done before each zlib allocation. If there isn't enough memory
available, the test fails and so does the zlib initialization, so data
won't be compressed.
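A budget check of this kind can be sketched as below. The variable and function names are assumptions made for the example and plain malloc() stands in for whatever allocator is really used; the point is only that an allocation is refused once the configured limit would be exceeded.

```c
#include <stddef.h>
#include <stdlib.h>

static size_t zmem_limit;   /* hypothetical limit in bytes, 0 means unlimited */
static size_t zmem_used;    /* bytes currently accounted for */

/* Allocate only if the request fits in the remaining budget. */
static void *zmem_alloc(size_t size)
{
    void *p;

    if (zmem_limit && (size > zmem_limit || zmem_used > zmem_limit - size))
        return NULL;            /* over budget: caller skips compression */
    p = malloc(size);
    if (p)
        zmem_used += size;
    return p;
}

/* Release a block and give its size back to the budget. */
static void zmem_free(void *p, size_t size)
{
    if (!p)
        return;
    free(p);
    zmem_used -= size;
}
```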

commit | commitdiff | tree

William Lallemand [Wed, 7 Nov 2012 14:00:23 +0000 (15:00 +0100)]

MINOR: compression: init before deleting headers

Init the compression algorithm before modifying the response headers. So
if the compression init fails, the headers won't be modified.

commit | commitdiff | tree

William Lallemand [Wed, 7 Nov 2012 12:21:47 +0000 (13:21 +0100)]

MINOR: compression: try init in cfgparse.c

Try to init and deinit the algorithm in the configuration parser and
exit with error if it doesn't work.

commit | commitdiff | tree

William Lallemand [Tue, 30 Oct 2012 13:30:39 +0000 (14:30 +0100)]

MEDIUM: use pool for zlib

Don't use the zlib allocator anymore, 5 pools are used for the zlib
compression. Their sizes depend on the window size and the memLevel in
deflateInit2.

commit | commitdiff | tree

William Lallemand [Wed, 7 Nov 2012 15:54:34 +0000 (16:54 +0100)]

MINOR: compression: memlevel and windowsize

The window size and the memlevel of the zlib are now configurable using
global options tune.zlib.memlevel and tune.zlib.windowsize.

It affects the memory consumption of the zlib.
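To give an idea of how these two settings drive memory usage, the sketch below applies the estimate documented in zlib's zconf.h for deflate: (1 << (windowBits+2)) + (1 << (memLevel+9)) bytes, plus a few kilobytes for small objects that are ignored here. The function name is hypothetical.

```c
#include <stddef.h>

/* Approximate per-stream deflate memory, following the formula given in
 * zlib's zconf.h (small fixed overhead not included). */
static size_t deflate_mem_estimate(int window_bits, int mem_level)
{
    return ((size_t)1 << (window_bits + 2)) + ((size_t)1 << (mem_level + 9));
}
```

With zlib's defaults (window bits 15, memLevel 8) the estimate is 128 kB + 128 kB = 256 kB per stream.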

commit | commitdiff | tree

William Lallemand [Wed, 31 Oct 2012 10:19:18 +0000 (11:19 +0100)]

BUILD: remove dependency to zlib.h

The build was dependent on the zlib.h header, regardless of the USE_ZLIB
option. The fix consists of several #ifdef in the source code.

It removes the overhead of the zstream structure in the session when you
don't use the option.

commit | commitdiff | tree

William Lallemand [Tue, 30 Oct 2012 14:52:53 +0000 (15:52 +0100)]

CLEANUP: use struct comp_ctx instead of union

Replace union comp_ctx by struct comp_ctx.

Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of
compression.h functions.

commit | commitdiff | tree

David BERARD [Fri, 2 Nov 2012 23:11:31 +0000 (00:11 +0100)]

DOC: Change is_ssl acl to ssl_fc acl in example

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 21:41:31 +0000 (22:41 +0100)]

BUG/MINOR: session: ensure that we don't retry connection if some data were sent

With extra-large buffers, it is possible that a lot of data are sent upon
connection establishment before the session is notified. The issue is how
to handle a send() error after some data were actually sent.

At the moment, only a connection error is reported, causing a new connection
attempt and send() to restart after the last data. We absolutely don't want
to retry the connect() if at least one byte was sent, because those data are
lost.

The solution consists in reporting exactly what happens, which is :
- a successful connection attempt
- a read/write error on the channel

That way we go on with sess_establish(), the response analysers are called
and report the appropriate connection state for the error (typically a server
abort while waiting for a response). This mechanism also guarantees that we
won't retry since it's a success. The logs also report the correct connect
time.

Note that 1.4 is not directly affected because it only attempts one send(),
so it cannot detect a send() failure here and distinguish it from a failed
connection attempt. So no backport is needed. Also, this is just a safety belt
we're taking, since this issue should not happen anymore since previous commit.

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 22:27:14 +0000 (23:27 +0100)]

BUG/MINOR: stream_interface: don't loop over ->snd_buf()

It is stupid to loop over ->snd_buf() because the snd_buf() itself already
loops and stops when system buffers are full. By looping over it again,
we lose the information of the full buffers and perform one useless syscall.

Furthermore, this causes issues when dealing with large uploads while waiting
for a connection to establish, as it can report a server reject of some data
as a connection abort, which is wrong.

1.4 does not have this issue as it loops at most twice (once for each buffer
half) and exits as soon as system buffers are full. So no backport is needed.

commit | commitdiff | tree

Finn Arne Gangstad [Mon, 29 Oct 2012 20:43:01 +0000 (21:43 +0100)]

MINOR: compression: Enable compression for IE6 w/SP2, IE7 and IE8

Some old browsers that have a user-agent starting with "Mozilla/4" do
not support compression correctly, so disable compression for those.

Internet explorer 6 after Windows XP service pack 2, IE 7, and IE 8,
do however support compression and still have a user agent starting
with Mozilla/4, so we try to enable compression for those.

MSIE has a user-agent of this form:
Mozilla/4.0 (compatible; MSIE <version>; ...)

98% of MSIE 6 SP2 user agents start with
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1
The remaining 2% have additional flags before "SV1".

This simplified matching, looking for MSIE at exactly position 25
and SV1 at exactly position 51, gives a few false negatives, so sometimes
a compression opportunity is lost.

A test against 3 hours of traffic to around 3000 news sites worldwide
gives less than 0.007% (70ppm) missed compression opportunities.
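The fixed-position matching described above could look roughly like the sketch below. It is an illustration built from the offsets given in the message (MSIE at position 25, SV1 at position 51), not a copy of the haproxy code; the function name and the exact treatment of the version digit are assumptions.

```c
#include <stdbool.h>
#include <string.h>

/* Hypothetical check: may responses be compressed for this User-Agent? */
static bool ua_may_compress(const char *ua)
{
    size_t len = strlen(ua);

    /* Agents not starting with "Mozilla/4" are not restricted by this rule. */
    if (len < 9 || memcmp(ua, "Mozilla/4", 9) != 0)
        return true;

    /* "Mozilla/4.0 (compatible; MSIE <version>; ...": MSIE sits at offset 25. */
    if (len < 31 || memcmp(ua + 25, "MSIE ", 5) != 0)
        return false;

    /* IE 7 and IE 8: the major version digit sits at offset 30. */
    if (ua[30] == '7' || ua[30] == '8')
        return true;

    /* IE 6 only when the SP2 marker "SV1" is found at offset 51. */
    if (ua[30] == '6' && len >= 54 && memcmp(ua + 51, "SV1", 3) == 0)
        return true;

    return false;
}
```

An IE 6 agent carrying extra flags before "SV1" fails the last test, which is the false-negative case the message mentions.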

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 20:56:59 +0000 (21:56 +0100)]

MEDIUM: stick-table: allocate the table key of size buffer size

Keys are copied from samples to stick_table_key. If a key is larger
than the stick_table_key, we have an overflow. In practice it does not
happen because it requires :
   1) a configuration with tune.bufsize larger than BUFSIZE (common)
   2) a stick-table configured with keys strictly larger than buffers
   3) extraction of data larger than BUFSIZE (eg: using payload())

Points 2 and 3 don't make any sense for a real world configuration. That
said, the issue needs to be fixed. The solution consists in allocating the
key with the same size as the global buffer size, just like the samples.
This fixes the issue.

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 19:44:36 +0000 (20:44 +0100)]

MEDIUM: remove remains of BUFSIZE in HTTP auth and sample conversions

Sample conversions rely on two alternative buffers which were previously
allocated as static bufs of size BUFSIZE. Now they're initialized to the
global buffer size. It was the same for HTTP authentication. Note that it
seems that none of them was prone to any mistake when dealing with the
buffer size, but better stay on the safe side by maintaining the old
assumption that a trash buffer is always "large enough".

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 15:51:55 +0000 (16:51 +0100)]

MEDIUM: make the trash be a chunk instead of a char *

The trash is used everywhere to store the results of temporary strings
built out of s(n)printf, or as a storage for a chunk when chunks are
needed.

Using global.tune.bufsize is not the most convenient thing either.

So let's replace trash with a chunk and directly use it as such. We can
then use trash.size as the natural way to get its size, and get rid of
many intermediary chunks that were previously used.

The patch is huge because it touches many areas but it makes the code
a lot more clear and even outlines places where trash was used without
being that obvious.

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 15:14:26 +0000 (16:14 +0100)]

CLEANUP: replace chunk_printf() with chunk_appendf()

This function's naming was misleading as it is used to append data
at the end of a string, causing some surprises when used for the
first time!

Add a chunk_printf() function which does what its name suggests.
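The difference between the two behaviours can be sketched as follows. The `sketch_chunk` structure and the `sketch_chunk_*` functions are stand-ins written for this example; they mirror the semantics described in the message (append versus overwrite) but are not haproxy's implementation.

```c
#include <stdarg.h>
#include <stdio.h>

struct sketch_chunk {
    char *str;   /* storage area */
    int size;    /* total bytes available in str */
    int len;     /* bytes currently used, excluding the trailing NUL */
};

/* Format at the current end of the chunk. Returns the number of bytes
 * added, or -1 if the result would not fit (the chunk is left unchanged). */
static int sketch_chunk_vappendf(struct sketch_chunk *chk, const char *fmt, va_list ap)
{
    int room, ret;

    if (!chk->str || chk->len >= chk->size)
        return -1;
    room = chk->size - chk->len;
    ret = vsnprintf(chk->str + chk->len, (size_t)room, fmt, ap);
    if (ret < 0 || ret >= room) {
        chk->str[chk->len] = '\0';   /* roll back a truncated write */
        return -1;
    }
    chk->len += ret;
    return ret;
}

/* Append formatted text after the existing contents. */
static int sketch_chunk_appendf(struct sketch_chunk *chk, const char *fmt, ...)
{
    va_list ap;
    int ret;

    va_start(ap, fmt);
    ret = sketch_chunk_vappendf(chk, fmt, ap);
    va_end(ap);
    return ret;
}

/* Replace the contents with formatted text, as the name printf suggests. */
static int sketch_chunk_printf(struct sketch_chunk *chk, const char *fmt, ...)
{
    va_list ap;
    int ret;

    chk->len = 0;
    va_start(ap, fmt);
    ret = sketch_chunk_vappendf(chk, fmt, ap);
    va_end(ap);
    return ret;
}
```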

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 12:23:11 +0000 (13:23 +0100)]

MINOR: chunk: add a function to reset a chunk

This is a first step towards no longer constantly reinitializing chunks.
It replaces the old chunk_reset() which was not properly named as it
used to drop everything and was only used by chunk_destroy(). It has
been renamed chunk_drop().

commit | commitdiff | tree

Willy Tarreau [Mon, 29 Oct 2012 12:27:23 +0000 (13:27 +0100)]

CLEANUP: completely remove trashlen

Commit c919dc66 did not remove the trashlen assignment.

commit | commitdiff | tree

Yuxans Yao [Fri, 19 Oct 2012 02:36:09 +0000 (10:36 +0800)]

MINOR: log: add '%Tl' to log-format

The '%Tl' is similar to '%T', but using local timezone.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 23:36:34 +0000 (01:36 +0200)]

BUG: compression: disable auto-close and enable MSG_MORE during transfer

We don't want the lower layer to forward a close while we're compressing,
and we want the system to fuse outgoing TCP segments using MSG_MORE as
much as possible to save round trips that can emerge from sending short
packets with a PUSH flag.

A test on a remote busy DSL line, consisting in compressing on the fly
a 100MB file full of zeroes, only showed a transfer rate of a few kB/s
due to these round trips.
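On systems that provide it, MSG_MORE is simply an extra flag passed to send(). The sketch below shows one way to compute the flags; the helper and the `SKETCH_MSG_MORE` macro are hypothetical, and MSG_MORE is treated as optional because it is not available on every platform.

```c
#include <sys/types.h>
#include <sys/socket.h>

/* Fall back to 0 where the platform has no MSG_MORE (assumption: the
 * code then just sends without coalescing hints). */
#ifdef MSG_MORE
#define SKETCH_MSG_MORE MSG_MORE
#else
#define SKETCH_MSG_MORE 0
#endif

/* Flags for send(): ask the kernel to hold back a partial segment when
 * the caller knows more data will follow shortly. */
static int send_flags(int more_data_expected)
{
    return more_data_expected ? SKETCH_MSG_MORE : 0;
}
```

A caller would use it as `send(fd, buf, len, send_flags(more))`, setting `more` while compressed output is still being produced.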

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 22:34:28 +0000 (00:34 +0200)]

MINOR: compression: add an offload option to remove the Accept-Encoding header

This is used when it is desired that backend servers don't compress
(eg: because of buggy implementations).

commit | commitdiff | tree

Willy Tarreau [Fri, 10 Feb 2012 19:37:26 +0000 (20:37 +0100)]

BUILD: make it possible to specify ZLIB path

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 18:40:13 +0000 (20:40 +0200)]

DOC: update document describing relations between internal entities

Connections have left the stream interface. fdtab[] has been represented.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 18:10:28 +0000 (20:10 +0200)]

MAJOR: session: detach the connections from the stream interfaces

We will need to be able to switch server connections on a session and
to keep idle connections. In order to achieve this, the preliminary
requirement is that the connections can survive the session and be
detached from them.

Right now they're still allocated at exactly the same place, so when
there is a session, there are always 2 connections. We could soon
improve on this by allocating the outgoing connection only during a
connect().

This current patch touches a lot of code and intentionally does not
change any functionality. Performance tests show no regression (even
a very minor improvement). The doc has not yet been updated.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 15:35:22 +0000 (17:35 +0200)]

CLEANUP: remove trashlen

trashlen is a copy of global.tune.bufsize, so let's stop using it as
a duplicate, fall back to the original bufsize, it's less confusing
this way.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 17:57:58 +0000 (19:57 +0200)]

BUG/MEDIUM: tcp: transparent bind to the source only when address is set

Thomas Heil reported that health checks did not work anymore when a backend
or server has "usesrc clientip". This is because the source address is not
set and tcp_bind_socket() tries to bind to that address anyway.

The solution consists in explicitly clearing the source address in the checks
and to make tcp_bind_socket() avoid binding when the address is not set. This
also has an indirect benefit that a useless bind() syscall will be avoided
when using "source 0.0.0.0 usesrc clientip" in health checks.
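The "bind only when an address is set" logic can be sketched as below. All three function names are hypothetical, and treating an all-zero address as "not set" is an assumption of this sketch rather than something confirmed by the message.

```c
#include <stdbool.h>
#include <string.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>

/* Mark an address as unset. */
static void addr_clear(struct sockaddr_storage *ss)
{
    memset(ss, 0, sizeof(*ss));
    ss->ss_family = AF_UNSPEC;
}

/* An address counts as set when it has a known family and is non-zero. */
static bool addr_is_set(const struct sockaddr_storage *ss)
{
    switch (ss->ss_family) {
    case AF_INET:
        return ((const struct sockaddr_in *)ss)->sin_addr.s_addr != htonl(INADDR_ANY);
    case AF_INET6:
        return !IN6_IS_ADDR_UNSPECIFIED(&((const struct sockaddr_in6 *)ss)->sin6_addr);
    default:
        return false;
    }
}

/* Bind fd to src only when src is set; otherwise skip the syscall. */
static int bind_source_if_set(int fd, const struct sockaddr_storage *src)
{
    socklen_t len;

    if (!addr_is_set(src))
        return 0;               /* nothing to bind to */
    len = (src->ss_family == AF_INET) ? sizeof(struct sockaddr_in)
                                      : sizeof(struct sockaddr_in6);
    return bind(fd, (const struct sockaddr *)src, len);
}
```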

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 17:47:23 +0000 (19:47 +0200)]

MINOR: tools: add a clear_addr() function to unset an address

This will be used to unset a from address.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 14:04:28 +0000 (16:04 +0200)]

BUG/MEDIUM: command-line option -D must have precedence over "debug"

From the beginning it has been said that -D must always be used on the
command line from startup scripts so that haproxy does not accidentally
stay in foreground when loaded from init script... Except that this has
not been true for a long time now.

The fix is easy and must be backported to 1.4, which is also affected.

commit | commitdiff | tree

Emeric Brun [Fri, 26 Oct 2012 11:35:33 +0000 (13:35 +0200)]

MINOR: ssl: checks the consistency of a private key with the corresponding certificate

commit | commitdiff | tree

Emeric Brun [Fri, 26 Oct 2012 10:58:00 +0000 (12:58 +0200)]

MINOR: ssl: add 'crt' statement on server.

crt: client certificate to send

commit | commitdiff | tree

Emeric Brun [Mon, 22 Oct 2012 12:11:22 +0000 (14:11 +0200)]

MINOR: ssl: add pattern and ACLs fetches 'ssl_c_notbefore', 'ssl_c_notafter', 'ssl_f_notbefore' and 'ssl_f_notafter'

ssl_c_notbefore: start date of client cert (string, eg: "121022182230Z" for YYMMDDhhmmss[Z])
ssl_c_notafter: end date of client cert (string, eg: "121022182230Z" for YYMMDDhhmmss[Z])
ssl_f_notbefore: start date of frontend cert (string, eg: "121022182230Z" for YYMMDDhhmmss[Z])
ssl_f_notafter: end date of frontend cert (string, eg: "121022182230Z" for YYMMDDhhmmss[Z])
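For readers who want to consume these strings, the sketch below splits a "YYMMDDhhmmss[Z]" value into fields. The structure and function names are hypothetical. Expanding the two-digit year with the 50-year pivot is an assumption borrowed from the X.509 UTCTime convention (RFC 5280); the message itself does not state how the year should be expanded.

```c
#include <stdio.h>
#include <string.h>

struct cert_time {
    int year, mon, day, hour, min, sec;
};

/* Parse "YYMMDDhhmmss" with an optional trailing 'Z'.
 * Returns 0 on success, -1 on malformed input. No range checks are done. */
static int parse_cert_time(const char *s, struct cert_time *out)
{
    int yy;

    if (strlen(s) < 12)
        return -1;
    if (sscanf(s, "%2d%2d%2d%2d%2d%2d", &yy, &out->mon, &out->day,
               &out->hour, &out->min, &out->sec) != 6)
        return -1;
    /* Assumed UTCTime pivot: 50..99 means 19xx, 00..49 means 20xx. */
    out->year = (yy >= 50) ? 1900 + yy : 2000 + yy;
    return 0;
}
```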

commit | commitdiff | tree

Emeric Brun [Mon, 22 Oct 2012 10:22:55 +0000 (12:22 +0200)]

MINOR: ssl: add pattern and ACLs fetches 'ssl_c_key_alg' and 'ssl_f_key_alg'

ssl_c_key_alg: algo used to encrypt the client's cert key (ex: rsaEncryption)
ssl_f_key_alg: algo used to encrypt the frontend's cert key (ex: rsaEncryption)

commit | commitdiff | tree

Emeric Brun [Fri, 19 Oct 2012 16:15:40 +0000 (18:15 +0200)]

MINOR: ssl: add pattern and ACLs 'ssl_c_sig_alg' and 'ssl_f_sig_alg'

ssl_c_sig_alg: client cert signature algo (string). Ex: "RSA-SHA1"
ssl_f_sig_alg: frontend cert signature algo (string). Ex: "RSA-SHA1"

commit | commitdiff | tree

Emeric Brun [Wed, 17 Oct 2012 15:39:35 +0000 (17:39 +0200)]

MINOR: ssl: add pattern and ACLs fetches 'ssl_c_s_dn', 'ssl_c_i_dn', 'ssl_f_s_dn' and 'ssl_f_i_dn'

ssl_c_s_dn : client cert subject DN (string)
ssl_c_i_dn : client cert issuer DN (string)
ssl_f_s_dn : frontend cert subject DN (string)
ssl_f_i_dn : frontend cert issuer DN (string)

Return either the full DN without params, or just the DN entry (first param) or
its specific occurrence (second param).

commit | commitdiff | tree

Emeric Brun [Wed, 17 Oct 2012 13:03:11 +0000 (15:03 +0200)]

MINOR: ssl: add pattern and ACLs fetches 'ssl_c_version' and 'ssl_f_version'

ssl_c_version : version of the cert presented by the client (integer)
ssl_f_version : version of the cert presented by the frontend (integer)

commit | commitdiff | tree

Willy Tarreau [Mon, 22 Oct 2012 15:58:39 +0000 (17:58 +0200)]

MINOR: ssl: add pattern and ACLs fetches 'ssl_c_serial' and 'ssl_f_serial'

ssl_c_serial: serial of the certificate presented by the client.
ssl_f_serial: serial of the certificate presented by the frontend.

commit | commitdiff | tree

Emeric Brun [Tue, 16 Oct 2012 12:59:28 +0000 (14:59 +0200)]

MINOR: ssl: add pattern fetch 'ssl_fc_session_id'

This fetch returns the SSL ID of the front connection. Useful to stick
on a given client.

commit | commitdiff | tree

Emeric Brun [Tue, 16 Oct 2012 12:13:26 +0000 (14:13 +0200)]

MINOR: ssl: add pattern and ACLs fetches 'ssl_fc_protocol', 'ssl_fc_cipher', 'ssl_fc_use_keysize' and 'ssl_fc_alg_keysize'

Some front connection fetches :
- ssl_fc_protocol = protocol name (string)
- ssl_fc_cipher = cipher name (string)
- ssl_fc_use_keysize = symmetric cipher key size used in bits (integer)
- ssl_fc_alg_keysize = symmetric cipher key size supported in bits (integer)

commit | commitdiff | tree

Emeric Brun [Thu, 18 Oct 2012 13:59:43 +0000 (15:59 +0200)]

MINOR: conf: rename all ssl modules fetches using prefix 'ssl_fc' and 'ssl_c'

SSL fetches were renamed :
ssl_fc_* = Front Connection (attributes of the connection itself)
ssl_c_* = Client side certificate

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 13:05:35 +0000 (15:05 +0200)]

BUILD: fix coexistence of openssl and zlib

The crappy zlib and openssl libs both define a free_func as a different typedef.
That's a very clever idea to use such a generic name in general purpose libraries,
really... The zlib one is easier to redefine than openssl's, so let's only fix this
one.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 12:50:26 +0000 (14:50 +0200)]

BUG/MINOR: http: compression should consider all Accept-Encoding header values

Commit 82fe75c1 came with a minor bug limiting the check to the first
accept-encoding header value only.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 09:36:40 +0000 (11:36 +0200)]

MINOR: compression: optimize memLevel to improve byte rate

Decreasing the deflateInit2's memLevel parameter from 9 to 8 does not
affect the compression ratio and increases the compression speed by 12%.
Lower values do not increase transfer speed but decrease the compression
ratio so it looks like 8 is optimal.

commit | commitdiff | tree

Willy Tarreau [Fri, 26 Oct 2012 00:11:25 +0000 (02:11 +0200)]

MINOR: compression: automatically disable compression for older browsers

A number of older browsers have many issues with compressed contents. It
happens that all these older browsers announce themselves as "Mozilla/4"
and that despite not being all broken, the amount of working browsers
announcing themselves this way compared to all other ones is so tiny
that it's not worth wasting cycles trying to adapt to every specific
one.

So let's simply disable compression for these older browsers.

More information on this very detailed article :

http://zoompf.com/2012/02/lose-the-wait-http-compression

commit | commitdiff | tree

William Lallemand [Tue, 23 Oct 2012 08:25:10 +0000 (10:25 +0200)]

MEDIUM: HTTP compression (zlib library support)

This commit introduces HTTP compression using the zlib library.

http_response_forward_body has been modified to call the compression
functions.

This feature includes 3 algorithms: identity, gzip and deflate:

  * identity: this is mostly for debugging, and it was useful for
  developing the compression feature. With Content-Length in input, it
  builds each chunk from the data available in the current buffer.
  With chunks in input, it rechunks: the output chunks will be
  bigger or smaller depending on the size of the input chunk and the
  size of the buffer. Identity does not apply any change to the data.

  * gzip: same as identity, but applying a gzip compression. The data
  are deflated using the Z_NO_FLUSH flag in zlib. When there is no more
  data in the input buffer, it flushes the data in the output buffer
  (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in
  input, or when there is no more data to read, it writes the end of
  data with Z_FINISH and the ending chunk.

  * deflate: same as gzip, but with deflate algorithm and zlib format.
  Note that this algorithm has ambiguous support on many browsers and
  no support at all from recent ones. It is strongly recommended not
  to use it for anything else than experimentation.

You can't choose the compression level at the moment, it will be set to
Z_BEST_SPEED (1), as tests have shown very little benefit in terms of
compression ratio when going above for HTML contents, at the cost of
a massive CPU impact.

Compression will be activated depending on the Accept-Encoding request
header. With identity, it does not take care of that header.

To build HAProxy with zlib support, use USE_ZLIB=1 in the make
parameters.

This work was initially started by David Du Colombier at Exceliance.

commit | commitdiff | tree

Willy Tarreau [Thu, 25 Oct 2012 17:04:45 +0000 (19:04 +0200)]

CLEANUP: http: rename HTTP_MSG_DATA_CRLF state

This state's name is confusing as it is only used with chunked encoding
and makes newcomers think it's also related to the content-length. Let's
call it CHUNK_CRLF to clear any doubt on this.

commit | commitdiff | tree

Willy Tarreau [Thu, 25 Oct 2012 22:58:22 +0000 (00:58 +0200)]

OPTIM: tools: inline hex2i()

This tiny function was not inlined because initially it was not used much.
However it's been used in the chunk parser for a while and it became
one of the biggest CPU-cycle eaters there. By inlining it, the chunk parser
speed was increased by 74 %. We're almost 3 times faster than the original
with just the last 4 commits.

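As an illustration of the kind of function being inlined, the sketch below converts one hex digit and uses it to read a chunk size. Both names are hypothetical and the code is not haproxy's hex2i(); overflow handling is deliberately left out.

```c
/* Value of one hexadecimal digit, or -1 if c is not a hex digit.
 * Declared static inline so the compiler can expand it in the parser loop. */
static inline int hex_digit_value(int c)
{
    if (c >= '0' && c <= '9')
        return c - '0';
    if (c >= 'a' && c <= 'f')
        return c - 'a' + 10;
    if (c >= 'A' && c <= 'F')
        return c - 'A' + 10;
    return -1;
}

/* Read a hexadecimal chunk size from the start of s, stopping at the
 * first non-hex character. Returns -1 if s has no leading hex digit.
 * Overflow is not checked in this sketch. */
static long parse_hex_size(const char *s)
{
    long value = 0;
    int d;

    if (hex_digit_value((unsigned char)*s) < 0)
        return -1;
    while ((d = hex_digit_value((unsigned char)*s)) >= 0) {
        value = value * 16 + d;
        s++;
    }
    return value;
}
```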
commit | commitdiff | tree

Willy Tarreau [Thu, 25 Oct 2012 22:49:52 +0000 (00:49 +0200)]

OPTIM: http: inline http_parse_chunk_size() and http_skip_chunk_crlf()

These functions are not that long and the compiler inlines them well. Doing
so has sped up the chunked encoding parser by 41% !

Note that http_forward_trailers was also declared static because it's not
exported.

commit | commitdiff | tree

Willy Tarreau [Thu, 25 Oct 2012 22:21:52 +0000 (00:21 +0200)]

OPTIM: channel: inline channel_forward's fast path

Most calls to channel_forward() are performed with short byte counts and
are already optimized in channel_forward() taking just a few instructions.
Thus it's a waste of CPU cycles to call a function for this, let's just
inline the short byte count case and fall back to the common one for
remaining situations.

Doing so has increased the chunked encoding parser's performance by 12% !


Cyril Bonté [Wed, 24 Oct 2012 22:01:06 +0000 (00:01 +0200)]

MEDIUM: http: accept IPv6 values with (s)hdr_ip acl

Commit ceb4ac9c states that IPv6 values are accepted by "hdr_ip" acl,
but the code didn't allow it. This patch provides the ability to accept IPv6
values.


Cyril Bonté [Wed, 24 Oct 2012 21:47:47 +0000 (23:47 +0200)]

BUG/MAJOR: fix a segfault on option http_proxy and url_ip acl

url2sa() mistakenly uses "addr" as a reference. This causes a segfault when
option http_proxy or url_ip are used.

This bug was introduced in haproxy 1.5 and doesn't need to be backported.


Cyril Bonté [Tue, 23 Oct 2012 19:28:31 +0000 (21:28 +0200)]

BUG/MEDIUM: acls using IPv6 subnets patterns incorrectly match IPs

Some tests revealed that IPs not in the range of IPv6 subnets incorrectly
matched (for example "acl BUG src 2804::/16" applied to a src IP "127.0.0.1").

This is because the acl_match_ip() function applies a mask in host byte
order, whereas it should be in network byte order.
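
To show why the byte order of the mask matters, here is a minimal sketch of an IPv6 prefix comparison done on 32-bit words. It does not reproduce acl_match_ip(); the function name v6_prefix_match_sketch is hypothetical. Addresses are stored in network byte order, so each mask word is converted with htonl() before being applied. Without that conversion, a little-endian host would test the wrong bits of each word.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdint.h>
#include <string.h>

/* Returns 1 if the first <bits> bits of addr and net are equal, else 0.
 * Both addresses are in network byte order, as filled in by inet_pton().
 */
static int v6_prefix_match_sketch(const struct in6_addr *addr,
                                  const struct in6_addr *net,
                                  unsigned int bits)
{
    uint32_t a[4], n[4];
    int i;

    memcpy(a, addr, sizeof(a));
    memcpy(n, net, sizeof(n));

    for (i = 0; i < 4 && bits > 0; i++) {
        unsigned int take = bits >= 32 ? 32 : bits;
        /* mask built in host order, then moved to network order */
        uint32_t host_mask = take == 32 ? 0xFFFFFFFFu
                                        : (uint32_t)~(0xFFFFFFFFu >> take);
        uint32_t mask = htonl(host_mask);

        if ((a[i] ^ n[i]) & mask)
            return 0;
        bits -= take;
    }
    return 1;
}
```

With this version, an IPv4-mapped 127.0.0.1 does not match 2804::/16, while 2804:1::1 does.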


Willy Tarreau [Mon, 22 Oct 2012 21:17:18 +0000 (23:17 +0200)]

MEDIUM: cli: allow the stats socket to be bound to a specific set of processes

Using "stats bind-process", it becomes possible to indicate to haproxy which
process will get the incoming connections to the stats socket. It will also
silence the warning when nbproc > 1.


Willy Tarreau [Mon, 22 Oct 2012 20:47:55 +0000 (22:47 +0200)]

BUG/MAJOR: connection: risk of crash on certain tricky close scenario

In some circumstances, if the connection to the server is aborted while
some data were planned to be sent and the poller reported an ability to
send, then conn_fd_handler() would still call conn->data->send(), causing
the data layer to dereference the now NULL conn->xprt and crash.

So we have to check for conn->xprt validity before calling the data
layer.

This issue was introduced after 1.5-dev12 so it does not need any backport
and does not affect any released version.

Special thanks go to Cristian Ditoiu who once again provided amazing help
to troubleshoot this bug !


Willy Tarreau [Mon, 22 Oct 2012 17:32:55 +0000 (19:32 +0200)]

MEDIUM: listener: provide a fallback for accept4() when not supported

It happens that on some systems, the libc is recent enough to permit
building with accept4() but the kernel does not support it. The result
is then a disaster since no connection is accepted. We now detect this
and automatically fall back to accept() and fcntl() when this happens.
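
A fallback of this kind can be sketched as below. This is a hedged illustration and not the code from the commit: the function and variable names are hypothetical, and it assumes that an unsupporting kernel reports ENOSYS, which is the usual errno for a missing system call on Linux. The real listener code may check other error codes as well.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE /* exposes accept4() with glibc */
#endif
#include <errno.h>
#include <fcntl.h>
#include <sys/socket.h>
#include <unistd.h>

/* Set once the kernel has told us accept4() is not implemented. */
static int accept4_unusable_sketch;

/* Accepts a connection and returns a non-blocking fd, or -1 with errno set. */
static int accept_nonblock_sketch(int lfd, struct sockaddr *addr,
                                  socklen_t *len)
{
    int fd;

    if (!accept4_unusable_sketch) {
        fd = accept4(lfd, addr, len, SOCK_NONBLOCK);
        if (fd >= 0 || errno != ENOSYS)
            return fd;                 /* success, or an ordinary error */
        accept4_unusable_sketch = 1;   /* libc has it, the kernel does not */
    }

    /* Fallback: plain accept(), then switch the fd to non-blocking mode. */
    fd = accept(lfd, addr, len);
    if (fd >= 0 && fcntl(fd, F_SETFL, O_NONBLOCK) < 0) {
        close(fd);
        return -1;
    }
    return fd;
}
```

Remembering the failure in a flag avoids paying for a failing system call on every subsequent accept.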


Emeric Brun [Wed, 17 Oct 2012 13:34:03 +0000 (15:34 +0200)]

MINOR: sample: export 'sample_get_trash_chunk(void)'

This will be used on external fetch modules.


Emeric Brun [Wed, 17 Oct 2012 11:38:19 +0000 (13:38 +0200)]

MINOR: acl: add parse and match primitives to use binary type on ACLs

Binary ACL match patterns can now be entered as hex digit strings.


Emeric Brun [Wed, 17 Oct 2012 11:36:06 +0000 (13:36 +0200)]

MINOR: sample: manage binary to string type conversion in stick-table and samples.

Binary type is converted to a null-terminated hex string.
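
A conversion of this kind can be sketched as follows. The function name bin2hex_sketch is hypothetical and the use of upper-case digits is an assumption; the commit message does not say which case is produced.

```c
#include <stddef.h>

/* Writes the hex representation of in[0..len-1] into out, followed by a
 * terminating NUL. Returns the string length, or -1 if out is too small.
 */
static int bin2hex_sketch(const unsigned char *in, size_t len,
                          char *out, size_t outsz)
{
    static const char digits[] = "0123456789ABCDEF"; /* case is assumed */
    size_t i;

    if (outsz < 2 * len + 1)
        return -1;

    for (i = 0; i < len; i++) {
        out[2 * i]     = digits[in[i] >> 4];   /* high nibble first */
        out[2 * i + 1] = digits[in[i] & 0x0F];
    }
    out[2 * len] = '\0';
    return (int)(2 * len);
}
```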


Dmitry Sivachenko [Sun, 21 Oct 2012 14:10:25 +0000 (18:10 +0400)]

DOC: make it clear what the HTTP request size is

Please consider the following patch for configuration.txt to clarify meaning
of bufsize, maxrewrite and the size of HTTP request which can be processed.


Willy Tarreau [Sat, 20 Oct 2012 08:38:09 +0000 (10:38 +0200)]

BUG/MEDIUM: http: set DONTWAIT on data when switching to tunnel mode

Jaroslaw Bojar diagnosed an issue when haproxy switches to tunnel mode
after a transfer. The response data are sent with the MSG_MORE flag,
causing them to be needlessly queued in the kernel. In order to fix this,
we set the CF_NEVER_WAIT flag on the channels when switching to tunnel
mode.

One issue remained with client-side keep-alive : if the response is sent
before the end of the request, it suffers the same issue for the same
reason. This is easily addressed by setting the CF_SEND_DONTWAIT flag
on the channel when the response has been parsed and we're waiting for
the other side.

The same issue is present in 1.4 so the fix must be backported.


Willy Tarreau [Fri, 19 Oct 2012 18:52:18 +0000 (20:52 +0200)]

MINOR: ssl: improve socket behaviour upon handshake abort.

While checking haproxy's SSL stack with www.ssllabs.com, it appeared that
immediately closing upon a failed handshake caused a TCP reset to be emitted.
This is because OpenSSL does not consume pending data in the socket buffers.
One side effect is that if the reset packet is lost, the client might not get
it. So now when a handshake fails, we try to clean the socket buffers before
closing, resulting in a clean FIN instead of an RST.
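
The idea of emptying the receive buffer before closing can be sketched like this. It is a minimal illustration under stated assumptions, not the code from the commit: drain_pending_sketch is a hypothetical name, and the sketch simply reads until the socket has nothing more to deliver right now.

```c
#include <errno.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Reads and discards whatever is pending on fd without blocking.
 * Returns the number of bytes discarded.
 */
static long drain_pending_sketch(int fd)
{
    char buf[4096];
    long total = 0;

    for (;;) {
        ssize_t r = recv(fd, buf, sizeof(buf), MSG_DONTWAIT);

        if (r > 0) {
            total += r;
            continue;
        }
        if (r < 0 && errno == EINTR)
            continue;
        break; /* 0: peer closed; <0: nothing left (EAGAIN) or an error */
    }
    return total;
}
```

A caller would run this just before close(fd) on a failed handshake, so that no unread data is left in the socket when it is closed.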


Willy Tarreau [Fri, 19 Oct 2012 17:49:09 +0000 (19:49 +0200)]

MEDIUM: sample: pass an empty list instead of a null for fetch args

ACL and sample fetches use args list and it is really not convenient to
check for null args everywhere. Now for empty args we pass a constant
list of end of lists. It will allow us to remove many useless checks.
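
The benefit of passing an empty list rather than a null pointer can be sketched with a sentinel-terminated array. All names below are hypothetical and the structure is much simpler than HAProxy's argument type; the point is only that consumers can iterate without a null check.

```c
/* Hypothetical argument types; STOP marks the end of a list. */
enum arg_type_sketch { ARGT_STOP_SKETCH = 0, ARGT_INT_SKETCH };

struct arg_sketch {
    enum arg_type_sketch type;
    long value;
};

/* One shared constant list holding only the end marker. */
static const struct arg_sketch empty_args_sketch[1] = {
    { ARGT_STOP_SKETCH, 0 }
};

/* Callers normalize once; everything downstream can skip null checks. */
static const struct arg_sketch *args_or_empty_sketch(const struct arg_sketch *args)
{
    return args ? args : empty_args_sketch;
}

/* Example consumer: walks the list until the end marker. */
static int count_args_sketch(const struct arg_sketch *args)
{
    int n = 0;

    while (args[n].type != ARGT_STOP_SKETCH)
        n++;
    return n;
}
```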


Willy Tarreau [Fri, 19 Oct 2012 14:47:23 +0000 (16:47 +0200)]

MINOR: sample: accept fetch keywords without parentheses

Fetch keywords which support arguments do not support being called
without parentheses even if all arguments are optional. Let's fix
this to allow fetch keywords without parentheses as is already done
in ACLs.


Willy Tarreau [Fri, 19 Oct 2012 13:18:06 +0000 (15:18 +0200)]

MINOR: chunk: provide string compare functions

It's sometimes needed to be able to compare a zero-terminated string with a
chunk, so we now have two functions to do that, one strcmp() equivalent and
one strcasecmp() equivalent.
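
Comparing a length-delimited chunk with a zero-terminated string can be sketched as below. The struct layout and the name chunk_strcmp_sketch are assumptions made for illustration, not the definitions added by the commit. A case-insensitive variant would follow the same shape with tolower() applied to both bytes.

```c
/* Hypothetical chunk: a byte area with an explicit length, no trailing NUL. */
struct chunk_sketch {
    const char *str;
    int len;
};

/* strcmp()-like comparison: returns 0 if equal, <0 if the chunk sorts
 * before the string, >0 if it sorts after it.
 */
static int chunk_strcmp_sketch(const struct chunk_sketch *chk, const char *s)
{
    const char *p = chk->str;
    int left = chk->len;

    while (left > 0) {
        int diff;

        if (*s == '\0')
            return 1;           /* string ended first: chunk is longer */
        diff = (unsigned char)*p - (unsigned char)*s;
        if (diff)
            return diff;
        p++;
        s++;
        left--;
    }
    return *s ? -1 : 0;         /* chunk ended first, or both ended */
}
```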


Willy Tarreau [Fri, 19 Oct 2012 12:34:30 +0000 (14:34 +0200)]

BUG: ssl: fix ssl_sni ACLs to correctly process regular expressions

ssl_sni_reg was using acl_parse_str which is wrong since we're parsing
a regex. Additionally, neither _end nor _reg may be looked up.


Willy Tarreau [Thu, 18 Oct 2012 16:57:14 +0000 (18:57 +0200)]

MEDIUM: ssl: add support for the "npn" bind keyword

The ssl_npn match could not work by itself because clients do not use
the NPN extension unless the server advertises the protocols it supports.
Thanks to Simone Bordet for the explanations on how to get it right.


Willy Tarreau [Thu, 18 Oct 2012 13:11:52 +0000 (15:11 +0200)]

BUILD: ssl: fix shctx build on older compilers

gcc < 3 breaks on shctx because of the missing arg in the lock macros.
We don't need the arg at all, it's not used.


Cyril Bonté [Mon, 15 Oct 2012 17:44:24 +0000 (19:44 +0200)]

DOC: fix minor typo on http-send-name-header

The "server" keyword documentation had a reference on "http-send-server-name"
instead of "http-send-name-header".


Willy Tarreau [Mon, 15 Oct 2012 11:19:06 +0000 (13:19 +0200)]

MINOR: ssl: add 'ssl_npn' sample/acl to extract TLS/NPN information

This may be used to distinguish between SPDY versions for example.


Willy Tarreau [Sun, 14 Oct 2012 12:05:16 +0000 (14:05 +0200)]

DOC: document relations between internal entities

Links between internal structures have been represented in
entities.fig. The pdf and svg versions were added too.


Willy Tarreau [Sat, 13 Oct 2012 12:33:58 +0000 (14:33 +0200)]

OPTIM: connection: pack the struct target

The struct target contains one int and one pointer, causing it to be
64-bit aligned on 64-bit platforms. By marking it "packed", we can
save 8 bytes in struct connection and as many in struct session on
such platforms.
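
The effect of packing can be checked with a small sketch. The struct and field names are hypothetical stand-ins for the real struct target, and __attribute__((packed)) is a GCC/Clang extension. On a typical 64-bit platform the plain version occupies 16 bytes because of padding after the int, while the packed one occupies 12, so an enclosing struct can place another 4-byte field right after it.

```c
#include <stddef.h>

/* One int and one pointer: padded to pointer alignment on 64-bit. */
struct target_plain_sketch {
    int type;
    void *ptr;
};

/* Same fields, no padding and byte alignment (GCC/Clang extension). */
struct target_packed_sketch {
    int type;
    void *ptr;
} __attribute__((packed));

/* Enclosing structs, to show where the saving comes from. */
struct holder_plain_sketch {
    struct target_plain_sketch t;
    int flags;
};

struct holder_packed_sketch {
    struct target_packed_sketch t;
    int flags;   /* can sit directly after the packed member */
};
```

One trade-off of this approach is that the pointer inside the packed struct may be misaligned, so the compiler has to generate accesses that tolerate this.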


Willy Tarreau [Sat, 13 Oct 2012 09:22:24 +0000 (11:22 +0200)]

OPTIM: session: reorder struct session fields

A reordering of the struct session fields has increased overall performance
by almost 1% due to better cache usage.


Willy Tarreau [Sat, 13 Oct 2012 09:09:14 +0000 (11:09 +0200)]

CLEANUP: session: remove term_trace which is not used anymore

This field was used to trace precisely where a session was terminated
but it did not survive code rearchitecture and was not used at all
anymore. Let's get rid of it.


Willy Tarreau [Sat, 13 Oct 2012 08:05:56 +0000 (10:05 +0200)]

OPTIM: channel: reorganize struct members to improve cache efficiency

Now that the buffer is moved out of the channel, it is possible to move
the pointer earlier in the struct and reorder some fields. This new
ordering improves overall performance by 2%, mainly saved in the HTTP
parsers and data transfers.


Willy Tarreau [Fri, 12 Oct 2012 21:49:43 +0000 (23:49 +0200)]

MAJOR: channel: replace the struct buffer with a pointer to a buffer

With this commit, we now separate the channel from the buffer. This will
allow us to replace buffers on the fly without touching the channel. Since
nobody is supposed to keep a reference to a buffer anymore, doing so is not
a problem and will also permit some copy-less data manipulation.

Interestingly, these changes have shown a 2% performance increase on some
workloads, probably due to a better cache placement of data.


Willy Tarreau [Fri, 12 Oct 2012 21:58:13 +0000 (23:58 +0200)]

CLEANUP: acl: use 'chn' instead of 'b' to name channel pointers

As with previous patches, this naming is confusing.


Willy Tarreau [Fri, 12 Oct 2012 21:56:57 +0000 (23:56 +0200)]

CLEANUP: stream_interface: use 'chn' instead of 'b' to name channel pointers

As with previous patches, this naming is confusing.


Willy Tarreau [Fri, 12 Oct 2012 21:53:39 +0000 (23:53 +0200)]

CLEANUP: tcp: use 'chn' instead of 'buf' or 'b' for channel pointer names

Same as previous patches, avoid confusion in local variable names.


Willy Tarreau [Fri, 12 Oct 2012 21:11:02 +0000 (23:11 +0200)]

CLEANUP: channel: use 'chn' instead of 'buf' as local variable names

It's too confusing to see buf->buf everywhere where the first buf is
a channel. Let's fix this now.


Willy Tarreau [Fri, 12 Oct 2012 20:51:15 +0000 (22:51 +0200)]

CLEANUP: http: use 'chn' to name channel variables, not 'buf'

These "buf" were confusing as they were really referring to channels. In
most places, a buffer was really all that was needed, so a struct buffer
was used instead. It is possible that the performance has slightly increased
by the removal of pointer offset in many pointer operations by directly
using the buffer pointer instead of the channel pointer.


Willy Tarreau [Fri, 12 Oct 2012 20:40:39 +0000 (22:40 +0200)]

REORG: http: rename msg->buf to msg->chn since it's a channel

It's extremely confusing to have all those msg->buf->buf everywhere after
the extraction of the buffer from the channel. Let's clean this up.


Willy Tarreau [Fri, 12 Oct 2012 18:17:54 +0000 (20:17 +0200)]

MEDIUM: log: report SSL ciphers and version in logs using logformat %sslc/%sslv

These two new log-format tags report the SSL protocol version (%sslv) and the
SSL ciphers (%sslc) used for the connection with the client. For instance, to
append this information just after the client's IP/port address information
on an HTTP log line, use the following configuration :

log-format %Ci:%Cp\ %sslv:%sslc\ [%t]\ %ft\ %b/%s\ %Tq/%Tw/%Tc/%Tr/%Tt\ %st\ %B\ %cc\ \ %cs\ %tsc\ %ac/%fc/%bc/%sc/%rc\ %sq/%bq\ %hr\ %hs\ %{+Q}r

It will report a line such as the following one :

Oct 12 20:47:30 haproxy[9643]: 127.0.0.1:43602 TLSv1:AES-SHA [12/Oct/2012:20:47:30.303] stick2~ stick2/s1 7/0/12/0/19 200 145 - - ---- 0/0/0/0/0 0/0 "GET /?t=0 HTTP/1.0"


Willy Tarreau [Fri, 12 Oct 2012 17:48:16 +0000 (19:48 +0200)]

MINOR: log: make lf_text use a const char *

lf_text() should use a const char * otherwise it makes it more complex
to use data coming from const strings.


Willy Tarreau [Fri, 12 Oct 2012 16:01:49 +0000 (18:01 +0200)]

MEDIUM: log: add a new LW_XPRT flag to pin the transport layer

This flag will have to be set on log tags which require transport layer
information. They will prevent the conn_xprt_close() call from releasing
the transport layer too early.

Mirror of https://github.com/haproxy/haproxy.git