Ronnie Sahlberg [Wed, 10 Oct 2007 20:16:36 +0000 (06:16 +1000)]
simplify election handling
make sure we read and update the flags from all remote nodes before we
reach the first codepath that can call do_recovery()
since during do_recovery() we need to know what the flags are.
Ronnie Sahlberg [Tue, 9 Oct 2007 23:42:32 +0000 (09:42 +1000)]
add a --single-public-ip argument to ctdbd to specify the ip address
used in single public ip address mode.
when using this argument, --public-interface must also be used.
add a vnn structure to the ctdb context to describe the single public ip
address
update the killtcp control in the daemon so that if a socketpair that is to
be killed does not match a normal public address, it checks whether the
destination address matches the single public ip address and, if so, uses
that vnn structure from the ctdb context.
this allows killtcp to also kill connections to the single public ip
instead of only connections to normal public addresses
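a minimal sketch of the lookup order described above, assuming illustrative structure and helper names (ctdb_context, single_ip_vnn and killtcp_find_vnn are stand-ins here, not the daemon's exact symbols):

    #include <stdbool.h>
    #include <stddef.h>
    #include <netinet/in.h>

    /* minimal stand-ins for the daemon's structures; names are assumptions */
    struct ctdb_vnn { struct sockaddr_in public_address; };
    struct ctdb_context {
        struct ctdb_vnn *vnns;          /* normal public addresses */
        size_t num_vnns;
        struct ctdb_vnn *single_ip_vnn; /* NULL unless --single-public-ip was given */
    };

    static bool same_ipv4(const struct sockaddr_in *a, const struct sockaddr_in *b)
    {
        return a->sin_addr.s_addr == b->sin_addr.s_addr;
    }

    /* pick the vnn a killtcp request should use for this destination address */
    static struct ctdb_vnn *killtcp_find_vnn(struct ctdb_context *ctdb,
                                             const struct sockaddr_in *dst)
    {
        for (size_t i = 0; i < ctdb->num_vnns; i++) {
            if (same_ipv4(&ctdb->vnns[i].public_address, dst)) {
                return &ctdb->vnns[i];
            }
        }
        /* no normal public address matched: fall back to the single public ip */
        if (ctdb->single_ip_vnn != NULL &&
            same_ipv4(&ctdb->single_ip_vnn->public_address, dst)) {
            return ctdb->single_ip_vnn;
        }
        return NULL; /* not an address this node manages, nothing to kill */
    }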
Ronnie Sahlberg [Mon, 8 Oct 2007 04:05:22 +0000 (14:05 +1000)]
add an initial test version of an ip multiplex tool that allows us
to have one single public ip address for the entire cluster.
this ip address is attached to lo on all nodes but only the recmaster
will respond to arp requests for this address.
the recmaster then runs an ipmux process that will pass any incoming
packets to this ip address on to the other nodes in the cluster based on
the ip address of the client host
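a minimal sketch of one way the ipmux process could pick a destination node from the client's source address; the modulo hash here is an illustrative assumption, not necessarily what the tool implements:

    #include <stdint.h>
    #include <arpa/inet.h>
    #include <netinet/in.h>

    /* choose which cluster node should handle a packet, based only on the
     * client's source ip; a stable hash keeps one client on one node */
    static uint32_t ipmux_pick_node(const struct sockaddr_in *client,
                                    uint32_t num_nodes)
    {
        uint32_t ip = ntohl(client->sin_addr.s_addr);
        return ip % num_nodes; /* node index in the range 0..num_nodes-1 */
    }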
to use this feature one must
1, have one fixed ip address in the customer's network permanently
attached to an interface
2, set CTDB_PUBLIC_INTERFACE=
to specify on which interface the clients attach to the node
3, set CTDB_SINGLE_PUBLIC_IP=ip-address
to specify which ip address should be the "single public ip address"
to test with only a single client, attach several ip addresses to
the client and ping the public address from the client with different -I
options. look in a network trace to see which node the packet is
passed on to.
Ronnie Sahlberg [Sun, 7 Oct 2007 23:47:20 +0000 (09:47 +1000)]
add a function in the ctdb tool to determine whether the local node is
the recmaster or not.
return 0 if the node is the recmaster and 1 (true) if it is not or if
we could not communicate with the ctdb daemon.
call it 'isnotrecmaster' to cope with the case where the tool could not bind to
the socket to talk to the daemon; the tool will then automatically return an
error and exit code 1.
thus the tool will only return 0 if it could talk successfully to the
local daemon and the local daemon confirms this node is the recmaster
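a sketch of that exit-code convention, with placeholder helpers standing in for the real controls that ask the local ctdbd for the node's pnn and the current recmaster:

    #include <stdio.h>

    /* placeholders: the real tool queries the local daemon over its socket;
     * a negative value here models "could not talk to the daemon" */
    static int get_local_pnn(void)  { return -1; }
    static int get_recmaster(void)  { return -1; }

    /* exit 0 only when the daemon is reachable AND confirms we are the
     * recmaster; every failure path collapses to the same exit code 1 */
    int main(void)
    {
        int pnn = get_local_pnn();
        int recmaster = get_recmaster();

        if (pnn < 0 || recmaster < 0 || pnn != recmaster) {
            return 1; /* not recmaster, or daemon unreachable */
        }
        return 0;     /* this node is the recmaster */
    }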
Ronnie Sahlberg [Mon, 24 Sep 2007 00:52:26 +0000 (10:52 +1000)]
when we have a public ip address mismatch (i.e. we hold addresses we
shouldn't or we are not holding addresses we should)
we must first freeze the local node before we set the recovery mode
Andrew Tridgell [Mon, 24 Sep 2007 00:00:14 +0000 (10:00 +1000)]
no longer wait at startup for services to become available, instead
set the node initially unhealthy and let the status monitoring bring the node online.
This fixes a problem with winbindd, where it refused to start because secrets.tdb was not populated
but we could not populate ctdbd, because the net command would not run while ctdbd was still doing startup
and thus frozen
(This used to be ctdb commit 3a001b793dd76fb96addf1e2ccb74da326fbcfbc)
Ronnie Sahlberg [Fri, 21 Sep 2007 05:19:33 +0000 (15:19 +1000)]
in ctdb_control_persistent_store() we must talloc_steal() the pointer to
c to prevent it from being immediately freed (and our persistent store
state with it) if we need to wait asynchronously for other nodes before
we can reply back to the client
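a minimal sketch of that ownership move, assuming placeholder shapes for the control and state structures; only talloc_steal()/talloc_zero() are the real talloc API:

    #include <talloc.h>

    /* placeholder shapes for the sketch; the real structures live in ctdbd */
    struct ctdb_req_control { int dummy; };
    struct persistent_state {
        struct ctdb_req_control *c; /* the request we must answer later */
    };

    static void defer_reply(TALLOC_CTX *mem_ctx, struct ctdb_req_control *c)
    {
        struct persistent_state *state = talloc_zero(mem_ctx, struct persistent_state);
        if (state == NULL) {
            return;
        }
        /* without this, the caller frees c as soon as we return, taking our
         * pending-reply state down with it */
        state->c = talloc_steal(state, c);
        /* ... register the async callback that later replies via state->c ... */
    }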
Ronnie Sahlberg [Fri, 14 Sep 2007 00:16:36 +0000 (10:16 +1000)]
let each node verify that it has a correct assignment of public ip
addresses (i.e. it holds those it should hold and it doesn't hold
any of those it shouldn't hold)
if an inconsistency is found, mark the local node as recovery mode
active
and wait for the recovery master to trigger a full blown recovery
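a sketch of the consistency check, with an illustrative flat list standing in for ctdb's vnn structures and ctdb_sys_have_ip():

    #include <stdbool.h>
    #include <stddef.h>

    struct public_ip { bool should_hold; bool held; };

    enum recovery_mode { RECOVERY_NORMAL, RECOVERY_ACTIVE };

    /* any mismatch (holding an address we should not, or missing one we
     * should hold) puts the local node into recovery mode active so the
     * recmaster can trigger a full recovery */
    static enum recovery_mode verify_ip_assignment(const struct public_ip *ips,
                                                   size_t n)
    {
        for (size_t i = 0; i < n; i++) {
            if (ips[i].held != ips[i].should_hold) {
                return RECOVERY_ACTIVE;
            }
        }
        return RECOVERY_NORMAL;
    }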
Ronnie Sahlberg [Thu, 13 Sep 2007 04:51:37 +0000 (14:51 +1000)]
when a ctdb_takeover_run has failed we must make sure that
need_takeover_run is set to true, or else we might forget to rerun it
during the next recovery.
otherwise, need_takeover_run is only set to true when the node flags for
a remote node and the local node differ.
it is possible that a takeover run fails, leaving the reassignment of
ip addresses incomplete, but that by the time we get back to the test in
monitor_cluster() the node flags of all nodes have converged
and match each other again, causing
monitor_cluster() to fail to realize that a takeover run is needed.
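a tiny sketch of the rule, with a placeholder for the takeover call; the point is only that a failure must leave the flag set regardless of whether the node flags later converge:

    #include <stdbool.h>

    static bool need_takeover_run;

    static int ctdb_takeover_run_stub(void) { return -1; } /* placeholder failure */

    static void attempt_takeover(void)
    {
        if (ctdb_takeover_run_stub() != 0) {
            /* keep the flag set so the next recovery retries; do not rely on
             * flag mismatches between nodes to retrigger it */
            need_takeover_run = true;
        } else {
            need_takeover_run = false;
        }
    }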
Andrew Tridgell [Wed, 12 Sep 2007 03:23:36 +0000 (13:23 +1000)]
- set arp_ignore to prevent replying to arp requests for addresses on loopback
- put removed IPs on loopback with scope host
- check for nul strings in ethtool call
Andrew Tridgell [Wed, 12 Sep 2007 03:22:31 +0000 (13:22 +1000)]
- don't allow the registration of clients with IPs we don't hold
- change some debug levels to make tracking of IP release problems easier
(This used to be ctdb commit 5f9aed62adaf87750f953412c55b29c58e4bb6c0)
Ronnie Sahlberg [Sun, 9 Sep 2007 21:20:44 +0000 (07:20 +1000)]
change the signature of ctdb_sys_have_ip() to also return:
a bool that specifies whether the ip was held by a loopback adaptor or
not
the name of the interface where the ip was held
when we release an ip address from an interface, move the ip address
over to the loopback interface
when we release an ip address after we have moved it onto loopback,
use 60.nfs to kill off the server side (the local part) of the tcp
connection so that the tcp connections don't survive a
failover/failback
61.nfstickle: since we kill the tcp connections when we release an ip
address, we no longer need to restart the nfs service in 61.nfstickle
update ctdb_takeover to use the new signature for ctdb_sys_have_ip
when we add a tcp connection to kill in ctdb_killtcp_add_connection(),
check if either the source or destination address matches a known public
address
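a sketch of what the extended ctdb_sys_have_ip() shape could look like, and of the source-or-destination check in ctdb_killtcp_add_connection(); the parameter layout and the stub body are assumptions based on the description above:

    #include <stdbool.h>
    #include <stddef.h>
    #include <netinet/in.h>

    /* besides "do we hold this ip", also report whether it sits on a loopback
     * adaptor and on which interface it was found (assumed out-parameters) */
    static bool ctdb_sys_have_ip(struct sockaddr_in addr,
                                 bool *is_loopback, char **iface_name)
    {
        (void)addr;              /* stub: the real code inspects the interfaces */
        *is_loopback = false;
        *iface_name = NULL;
        return false;
    }

    /* only queue a tcp connection for killing if one endpoint is an address
     * this node actually manages */
    static bool connection_is_ours(struct sockaddr_in src, struct sockaddr_in dst)
    {
        bool lo;
        char *iface;
        return ctdb_sys_have_ip(src, &lo, &iface) ||
               ctdb_sys_have_ip(dst, &lo, &iface);
    }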
Ronnie Sahlberg [Fri, 7 Sep 2007 22:09:02 +0000 (08:09 +1000)]
set /proc/sys/net/ipv4/conf/all/arp_filter to 1 by default when
10.interfaces starts up
this setting makes the system respond to ARP requests only on the NIC
to which the ip address is tied, and adds to the
"principle of least surprise" when using multihomed servers
Ronnie Sahlberg [Fri, 7 Sep 2007 06:45:19 +0000 (16:45 +1000)]
ctdb ip must loop over all connected nodes to pull the public ip list
and merge the lists into one big list, since with the deassociation between a node
and a public ip address the /etc/ctdb/public_addresses files can
differ between nodes and no single node knows about all public addresses that a
cluster can use
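a sketch of the union the tool has to build, with a placeholder fetch standing in for the per-node control that pulls each node's public ip list:

    #include <stdbool.h>
    #include <stddef.h>
    #include <string.h>

    #define MAX_IPS 256

    struct ip_list { const char *addr[MAX_IPS]; size_t count; };

    /* placeholder for the per-node query; returns how many addresses it wrote */
    static size_t fetch_node_ips(int node, const char **out) { (void)node; (void)out; return 0; }

    /* merge every connected node's list into one deduplicated list, since no
     * single node's public_addresses file covers the whole cluster */
    static void merge_all_public_ips(const int *nodes, size_t num_nodes,
                                     struct ip_list *merged)
    {
        merged->count = 0;
        for (size_t n = 0; n < num_nodes; n++) {
            const char *ips[MAX_IPS];
            size_t cnt = fetch_node_ips(nodes[n], ips);
            for (size_t i = 0; i < cnt; i++) {
                bool seen = false;
                for (size_t j = 0; j < merged->count; j++) {
                    if (strcmp(merged->addr[j], ips[i]) == 0) {
                        seen = true; /* already contributed by another node */
                        break;
                    }
                }
                if (!seen && merged->count < MAX_IPS) {
                    merged->addr[merged->count++] = ips[i];
                }
            }
        }
    }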
Ronnie Sahlberg [Thu, 6 Sep 2007 22:52:56 +0000 (08:52 +1000)]
60.nfs:
we must always restart the lockmanager when the cluster has been
reconfigured and ip addresses have changed. this is to make sure we get a
clusterwide grace period for nfs locking.
if we don't do this and only restart locking on the nodes that were
directly affected, a different client can take out a conflicting lock
from a different node before the affected clients have had a chance to
reclaim all the locks lost during the reconfigure.
the grace period on the rhel5 kernel has been increased to 90 seconds!
statd-callout:
we must restart lockmanager to ensure a clusterwide grace period for
nfs. this makes locking "more correct" for nfs clients and prevents
other clients/nodes from taking out a conflicting lock while a different
client/node tries to reclaim lost locks.
This makes it "almost consistent" for NFS clients but there is still
the possibility that a cifs client can take out a conflicting lock
before an nfs client has had a chance to reclaim an existing lock.
This can not be solved with anything less than making the kernel nfs
lock manager "samba aware" and making samba aware of the internal state
of the kernel lock manager so that they can cooperate.
we can not just stop/start the lockmanager back to back on rhel5 since
if they are stopped/started too close to each other, then when the new
lockmanager sends out statd notifications upon starting up, two things
can happen:
1, the new lockmanager sends out notification BEFORE it has registered with
the portmapper, leading to:
lockmanager starts
lockmanager sends notification to the client
client tries to recover the lock and tries to portmap the lockmanager
port on the server.
server is not (yet) registered with the portmapper and responds
"no such program" to the client's request to discover where the lockmanager
is.
client then just completely gives up reclaiming the lock and doesn't
even reattempt the portmapper call after some timeout.
==> lock reclaim failed.
2, if they are started back to back and a client tries to reclaim the
lock, the lockmanager sometimes sends two responses back to back
to the client: one with status NLM_GRANTED (== you got the lock
reclaimed) and one with status NLM_DENIED (== you could not get the lock
reclaimed).
this confuses the client and leads to the server thinking that the
client does have the lock and the client thinking it has not got the
lock, and orphaned locks result.
We also send out additional notification messages of different formats
to allow more legacy clients to interoperate with locking.