git.ipfire.org Git - thirdparty/mdadm.git/log

Dan Williams [Thu, 2 Oct 2008 01:50:43 +0000 (18:50 -0700)]

fname_as_uuid: print uuids msb first

The sha1 routines store the uuids in little endian byte-order, so always
print from msb to lsb. This allows imsm containers to be assembled with
-As.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
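
As a rough illustration of the byte-order point above, here is a minimal
sketch. It assumes the uuid is held as 16 bytes forming four 32-bit words,
each stored least-significant byte first; the helper name uuid_msb_first()
is hypothetical and this is not mdadm's actual routine.

```c
#include <stdio.h>

/* Hypothetical helper (not mdadm's own function): format a 16-byte uuid
 * whose four 32-bit words are each stored least-significant byte first.
 * Every word is emitted from msb to lsb, words separated by ':'.
 * 'out' must hold 36 bytes (32 hex digits, 3 colons, NUL). */
char *uuid_msb_first(const unsigned char uuid[16], char out[36])
{
    char *p = out;

    for (int word = 0; word < 4; word++) {
        if (word > 0)
            *p++ = ':';
        /* walk each word backwards so the msb is printed first */
        for (int byte = 3; byte >= 0; byte--)
            p += sprintf(p, "%02x", uuid[word * 4 + byte]);
    }
    return out;
}
```

Printing byte by byte keeps the output identical on little-endian and
big-endian hosts, which is the property a name derived from the uuid needs.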

Dan Williams [Thu, 2 Oct 2008 01:50:43 +0000 (18:50 -0700)]

mdmon: periodically retry to create the socket

If initial socket creation fails with EROFS, set a periodic alarm to wake up
the manager and retry. Include a kernel patch that will wake us up if
the mount flags are changed.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Thu, 2 Oct 2008 01:49:53 +0000 (18:49 -0700)]

sysfs_open leaks devnum2devname() result

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Sun, 28 Sep 2008 19:12:08 +0000 (12:12 -0700)]

non-trivial warn_unused_result fix, prepare_update

If an allocation fails in ->prepare_update we need to catch it in
->process_update.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Sun, 28 Sep 2008 19:12:08 +0000 (12:12 -0700)]

non-trivial warn_unused_result fixes, activate_spare

Both super-ddf and super-intel ignore memory allocation failures during
->activate_spare. Fix these up by cancelling the activation.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Sun, 28 Sep 2008 19:12:08 +0000 (12:12 -0700)]

non-trivial warn_unused_result fixes, write_init_super_ddf

When a write fails just move on to the next disk.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
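
The "move on to the next disk" policy might look roughly like the sketch
below. The function names and the flat array of file descriptors are
assumptions made for illustration; the real write_init_super_ddf() works on
its own disk list.

```c
#include <stddef.h>
#include <unistd.h>

/* Returns 0 once all len bytes reached fd, -1 on any failed write. */
static int write_fully(int fd, const char *buf, size_t len)
{
    while (len > 0) {
        ssize_t n = write(fd, buf, len);

        if (n <= 0)
            return -1;      /* error or no progress: give up on this fd */
        buf += n;
        len -= (size_t)n;
    }
    return 0;
}

/* Hypothetical helper: write buf to every disk, skipping disks whose
 * write fails, and return how many disks were written successfully. */
int write_to_each_disk(const int *fds, size_t nfds, const char *buf, size_t len)
{
    int written = 0;

    for (size_t i = 0; i < nfds; i++) {
        if (write_fully(fds[i], buf, len) != 0)
            continue;       /* failed write: move on to the next disk */
        written++;
    }
    return written;
}
```

Returning the count lets a caller decide whether enough disks received the
data; that decision is outside what the commit message describes.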

Dan Williams [Sun, 28 Sep 2008 19:12:07 +0000 (12:12 -0700)]

trivial warn_unused_result squashing

Made the mistake of recompiling the F9 mdadm rpm which has a patch to
remove -Werror and add "-Wp,-D_FORTIFY_SOURCE -O2" which turns on lots
of errors:

config.c:568: warning: ignoring return value of asprintf
Assemble.c:411: warning: ignoring return value of asprintf
Assemble.c:413: warning: ignoring return value of asprintf
super0.c:549: warning: ignoring return value of posix_memalign
super0.c:742: warning: ignoring return value of posix_memalign
super0.c:812: warning: ignoring return value of posix_memalign
super1.c:692: warning: ignoring return value of posix_memalign
super1.c:1039: warning: ignoring return value of posix_memalign
super1.c:1155: warning: ignoring return value of posix_memalign
super-ddf.c:508: warning: ignoring return value of posix_memalign
super-ddf.c:645: warning: ignoring return value of posix_memalign
super-ddf.c:696: warning: ignoring return value of posix_memalign
super-ddf.c:715: warning: ignoring return value of posix_memalign
super-ddf.c:1476: warning: ignoring return value of posix_memalign
super-ddf.c:1603: warning: ignoring return value of posix_memalign
super-ddf.c:1614: warning: ignoring return value of posix_memalign
super-ddf.c:1842: warning: ignoring return value of posix_memalign
super-ddf.c:2013: warning: ignoring return value of posix_memalign
super-ddf.c:2140: warning: ignoring return value of write
super-ddf.c:2143: warning: ignoring return value of write
super-ddf.c:2147: warning: ignoring return value of write
super-ddf.c:2150: warning: ignoring return value of write
super-ddf.c:2162: warning: ignoring return value of write
super-ddf.c:2169: warning: ignoring return value of write
super-ddf.c:2172: warning: ignoring return value of write
super-ddf.c:2176: warning: ignoring return value of write
super-ddf.c:2181: warning: ignoring return value of write
super-ddf.c:2686: warning: ignoring return value of posix_memalign
super-ddf.c:2690: warning: ignoring return value of write
super-ddf.c:3070: warning: ignoring return value of posix_memalign
super-ddf.c:3254: warning: ignoring return value of posix_memalign
bitmap.c:128: warning: ignoring return value of posix_memalign
mdmon.c:94: warning: ignoring return value of write
mdmon.c:221: warning: ignoring return value of pipe
mdmon.c:327: warning: ignoring return value of write
mdmon.c:330: warning: ignoring return value of chdir
mdmon.c:335: warning: ignoring return value of dup
monitor.c:415: warning: rv may be used uninitialized in this function

...some of these like the write() ones are not so trivial so save those
fixes for the next patch.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
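
For the posix_memalign warnings listed above, a checked call could be
wrapped along these lines. The wrapper name alloc_aligned_zeroed() is
hypothetical and the zeroing is an extra convenience, not something the
commit message specifies.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical wrapper: allocate len bytes aligned to 'align' and zero
 * them. Returns NULL on failure so callers have a single value to test. */
void *alloc_aligned_zeroed(size_t align, size_t len)
{
    void *buf = NULL;

    /* posix_memalign() reports failure through its return value (an
     * error number) rather than through errno, so it must be checked. */
    if (posix_memalign(&buf, align, len) != 0)
        return NULL;
    memset(buf, 0, len);
    return buf;
}
```

A caller then tests the returned pointer, for example
`if ((buf = alloc_aligned_zeroed(512, 1024)) == NULL) return 1;`.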

Dan Williams [Sun, 28 Sep 2008 19:12:07 +0000 (12:12 -0700)]

imsm: determine failed indexes from the most up-to-date disk

load_imsm_disk() currently notices if spares missed their activation
update, but we allow a stale failed disk back in to the array because its
serial number is clobbered in the most up-to-date disk.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Sun, 28 Sep 2008 19:12:07 +0000 (12:12 -0700)]

imsm: manage a list of missing disks

If a drive is removed while mdmon is not running we need a way to
identify what is missing and mark that disk as failed in the metadata.
At ->load_super() time create a list of missing disks defined as a disk
that is marked in-sync yet does not appear in super->disks.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
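
The definition of a missing disk given above can be sketched as a simple
set difference. The struct meta_disk type, the use of serial strings as
identity, and the function name are all assumptions for illustration; the
imsm code keeps its own structures.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a disk entry recorded in the metadata. */
struct meta_disk {
    const char *serial;
    int in_sync;
};

/* Collect every metadata disk that is marked in-sync but whose serial is
 * not among the disks actually present. 'missing' must have room for
 * n_meta pointers. Returns the number of missing disks found. */
size_t collect_missing(const struct meta_disk *meta, size_t n_meta,
                       const char *const *present, size_t n_present,
                       const struct meta_disk **missing)
{
    size_t n_missing = 0;

    for (size_t i = 0; i < n_meta; i++) {
        int found = 0;

        if (!meta[i].in_sync)
            continue;       /* only in-sync disks can be "missing" */
        for (size_t j = 0; j < n_present; j++) {
            if (strcmp(meta[i].serial, present[j]) == 0) {
                found = 1;
                break;
            }
        }
        if (!found)
            missing[n_missing++] = &meta[i];
    }
    return n_missing;
}
```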

Dan Williams [Sun, 28 Sep 2008 19:12:07 +0000 (12:12 -0700)]

imsm: fix mpb_size calculation in write_super_imsm

Spotted a thinko... raid devices are dynamically sized, disks are not.
The space for disks is always mpb->num_disks * sizeof(struct imsm_disk).

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
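
The size rule above amounts to a fixed-size term for the disks plus a
per-device sum. The sketch below passes all sizes in as parameters because
the real structure layouts are not reproduced here; the function name and
signature are hypothetical.

```c
#include <stddef.h>

/* Hypothetical calculation: the header and each disk entry have a fixed
 * size, while every raid device contributes its own (variable) size. */
size_t mpb_size_sketch(size_t header_size, size_t disk_size, unsigned num_disks,
                       const size_t *dev_sizes, unsigned num_devs)
{
    /* disks: always num_disks * sizeof(one disk entry) */
    size_t total = header_size + (size_t)num_disks * disk_size;

    /* raid devices: dynamically sized, so add each one individually */
    for (unsigned i = 0; i < num_devs; i++)
        total += dev_sizes[i];
    return total;
}
```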

Dan Williams [Sun, 28 Sep 2008 19:12:06 +0000 (12:12 -0700)]

imsm: enable checkpointing of migration (resync/rebuild)

When the array is shut down, or when mdadm --wait-clean is called, any
active resync process will be idled allowing mdmon to record the current
resync position.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Sun, 28 Sep 2008 19:12:06 +0000 (12:12 -0700)]

Extend --wait-clean to checkpoint resync

Root file systems backed by external metadata arrays need to be
explicitly checkpointed near the time the rootfs is marked readonly as
userspace will not have an opportunity to react to the final shutdown of
the array.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Sun, 28 Sep 2008 19:12:06 +0000 (12:12 -0700)]

--wait-clean: shorten timeout

Set the safemode timeout to a small value to get the array marked clean as
soon as possible. We don't write 'clean' directly as it may cause mdmon to
miss a 'write-pending' event.

Include a couple fixes to sysfs_set_safemode():
1/ 0 pad the milliseconds field
2/ workaround input truncation in the kernel

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
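
Fix 1/ matters because an unpadded fraction changes the value: 50 ms
written as "0.50" reads as half a second. A minimal sketch of the padded
formatting follows, assuming the delay is written as decimal seconds; the
helper name is hypothetical and the kernel-side truncation workaround of
fix 2/ is not shown.

```c
#include <stdio.h>

/* Hypothetical helper: render a delay given in milliseconds as decimal
 * seconds with a zero-padded three-digit fraction, e.g. 50 -> "0.050". */
char *format_safemode_delay(unsigned long ms, char *buf, size_t len)
{
    snprintf(buf, len, "%lu.%03lu", ms / 1000, ms % 1000);
    return buf;
}
```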

Dan Williams [Sun, 28 Sep 2008 19:12:06 +0000 (12:12 -0700)]

monitor: protect against CONFIG_LBD=n

md/resync_start reports different terminal values depending on kernel
configuration (~0UL versus ~0ULL). Make detection of the
resync-complete state more robust by comparing against array size.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
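
The more robust check can be sketched as a comparison against the array
size instead of against one particular all-ones constant. The function name
is hypothetical, and the sketch assumes both values are in the same units.

```c
/* Hypothetical check: treat resync as complete once the reported start
 * position is at or beyond the end of the array. This holds whether the
 * kernel reports a 32-bit or a 64-bit all-ones terminal value. */
int resync_is_complete(unsigned long long resync_start,
                       unsigned long long array_sectors)
{
    return resync_start >= array_sectors;
}
```

Both terminal values are far larger than any array this sketch would be
asked about, so the comparison gives the same answer on either kernel
configuration.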

Dan Williams [Sun, 28 Sep 2008 19:12:03 +0000 (12:12 -0700)]

imsm: trust sector reservation from metadata

On ich6r the option-rom appears to reserve only 432 sectors rather than
the 418+4096 of newer implementations. For compatibility trust the
metadata in these cases.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Wed, 24 Sep 2008 12:58:02 +0000 (05:58 -0700)]

sysfs: dprintf when we fail to write a sysfs file

When arrays do not start up correctly it would be nice to know why. This
requires moving the dprintf definition to mdadm.h.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Wed, 15 Oct 2008 21:15:47 +0000 (14:15 -0700)]

imsm: confirm raid10 layout, fix up handling raid10 failures

1/ near-2 indeed matches how the Windows driver lays out the data
2/ update imsm_check_degraded to check for rebuilding disks in the
raid10 case

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Wed, 15 Oct 2008 20:12:17 +0000 (13:12 -0700)]

imsm: more serial handling fixups

zero-initialize the serial buffer to handle cases where the response is
less than MAX_RAID_SERIAL_LEN.

Tested-by: Jacek Danecki <jacek.danecki@intel.com>
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
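
Zero-initializing before the copy means a short response cannot leave stale
bytes in the tail of the buffer. A minimal sketch, where SERIAL_LEN is a
stand-in for MAX_RAID_SERIAL_LEN (its value of 16 is an assumption here)
and store_serial() is a hypothetical name:

```c
#include <string.h>

#define SERIAL_LEN 16   /* stand-in for MAX_RAID_SERIAL_LEN; value assumed */

/* Hypothetical helper: clear the whole buffer, then copy at most
 * SERIAL_LEN bytes of the response into it. */
unsigned char *store_serial(unsigned char dst[SERIAL_LEN],
                            const void *rsp, size_t rsp_len)
{
    memset(dst, 0, SERIAL_LEN);     /* short responses leave zero padding */
    if (rsp_len > SERIAL_LEN)
        rsp_len = SERIAL_LEN;       /* never overrun the buffer */
    memcpy(dst, rsp, rsp_len);
    return dst;
}
```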

NeilBrown [Thu, 18 Sep 2008 07:27:49 +0000 (17:27 +1000)]

Updates version numbers for 3.0-devel1 release.

NeilBrown [Thu, 18 Sep 2008 07:05:02 +0000 (17:05 +1000)]

Don't try to set_array_info when -I finds new devices for an array.

When -I gets a new device for a container and tries to incrementally
assemble the container array, it calls sysfs_set_array to create the
array without first checking if it already exists. This produces
unpleasant error messages.

So check first.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 18 Sep 2008 06:43:59 +0000 (16:43 +1000)]

Remove .sock file when removing .pid file for mdmon

NeilBrown [Thu, 18 Sep 2008 06:21:08 +0000 (16:21 +1000)]

Add support for assembling specific subarrays.

This normally isn't needed as --incremental does all the work.
But it is needed to recognise member= and container= in mdadm.conf

NeilBrown [Thu, 18 Sep 2008 06:12:28 +0000 (16:12 +1000)]

Use common code to report MD_UUID for --detail --export

As we need to be able to extract a UUID from any superblock
for matching, use that as the MD_UUID as it will probably be
used for array matching too.

NeilBrown [Thu, 18 Sep 2008 06:11:40 +0000 (16:11 +1000)]

Report uuid in --detail --brief for ddf and intel

The uuid is slightly fictitious but needed for array matching.

NeilBrown [Thu, 18 Sep 2008 06:08:10 +0000 (16:08 +1000)]

Use uuid as /dev name when assembling array of uncertain origin.

If we aren't sure that the array belongs to 'this' host, use the
uuid to choose a name to avoid any conflict.

NeilBrown [Thu, 18 Sep 2008 06:07:32 +0000 (16:07 +1000)]

Add uuid support for super-intel.

'imsm' does not provide any real uuid, so we synthesise one
from various stable bits of the superblock.

NeilBrown [Thu, 18 Sep 2008 06:06:41 +0000 (16:06 +1000)]

Allow metadata handler to report that it doesn't record homehost.

For now, this means that the lack of a homehost doesn't always prevent
assembly.
Soon we will allow assembly anyway, but have different messages if
homehost isn't supported.

NeilBrown [Thu, 18 Sep 2008 06:03:08 +0000 (16:03 +1000)]

Don't allow spares when creating 'external' arrays.

It is meaningless when creating the container, and for
subarrays, the container is responsible for assigning
spares.

Also, don't do the 'spare' fiddle for raid5 as we cannot
set up a spare at this point yet. Later maybe just create
the array degraded and let the container sort it out.

NeilBrown [Thu, 18 Sep 2008 06:03:05 +0000 (16:03 +1000)]

Lots of fixes to make incremental assembly of containers work.

So:
mdadm -I /dev/whatever

will (if appropriate) add whatever to a container, then start
any arrays inside the container.

NeilBrown [Thu, 18 Sep 2008 06:01:57 +0000 (16:01 +1000)]

Handle incremental assembly of containers.

mdadm -I /dev/part-of-container

should add that to a container, creating it if needed,
and then try to assemble any arrays in the container.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 18 Sep 2008 06:01:55 +0000 (16:01 +1000)]

Move calls to SET_ARRAY_INFO to common helper.

When we assemble an array, there are three different approaches
depending on whether metadata is internal or external, and on
kernel version.

Move all this to a common helper instead of duplicating in 3 places.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 18 Sep 2008 05:13:32 +0000 (15:13 +1000)]

Factor out add-disk code

The variety of approaches to 'add_disk' are factored out into
a separate function, and Incremental mode benefits by being
closer to supporting the assembly of containers.

Also move the adding-to-array-data-structure step out of sysfs_add_disk
and into add_disk.

And add some tests for --incremental mode to make sure we don't break it.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 18 Sep 2008 05:07:45 +0000 (15:07 +1000)]

Ignore leading zeros in version number information.

--detail sometimes generates leading zeros, which are just noise.
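
One way to ignore such zeros is to skip them before comparing version
strings. The sketch below is an assumption about the approach, not the code
from this commit, and skip_leading_zeros() is a hypothetical name.

```c
#include <ctype.h>

/* Hypothetical helper: step past leading zeros, but keep a single '0'
 * when it is the whole leading number, so "00.90" becomes "0.90". */
const char *skip_leading_zeros(const char *version)
{
    while (version[0] == '0' && isdigit((unsigned char)version[1]))
        version++;
    return version;
}
```

Two version strings can then be compared with strcmp() after both have been
passed through the helper.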

NeilBrown [Thu, 18 Sep 2008 05:05:46 +0000 (15:05 +1000)]

Allow --config in --incremental mode.

NeilBrown [Thu, 18 Sep 2008 05:05:20 +0000 (15:05 +1000)]

Teach --detail about containers and members thereof.

Make --detail on a container more useful by suppressing irrelevant
detail and adding useful detail like a list of member arrays.

Ditto for members of a container: report the name of the container
array.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 18 Sep 2008 05:04:47 +0000 (15:04 +1000)]

Compile fixes, particularly moving more stuff under MDASSEMBLE

Now 'make everything' works again.

NeilBrown [Thu, 18 Sep 2008 04:33:37 +0000 (14:33 +1000)]

Disable compilation with diet-libc

We need posix_memalign (or something similar) which diet-libc does not
provide.

NeilBrown [Thu, 18 Sep 2008 04:10:42 +0000 (14:10 +1000)]

Fix compile warning/error.

gcc said:
error: large integer implicitly truncated to unsigned type

Signed-off-by: NeilBrown <neilb@suse.de>

Dan Williams [Tue, 16 Sep 2008 03:58:43 +0000 (20:58 -0700)]

mdmon: recreate socket/pid file on SIGHUP

Allow mdmon to start while /var/run/mdadm is readonly. Later a SIGHUP
can trigger mdmon to drop its pid and socket once /var/run/mdadm is
writable. Of course one needs the pid to send a HUP; that can be stored
in a distribution specific rw-init directory... For now, rely on a
killall -HUP mdmon to get the files dumped.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:43 +0000 (20:58 -0700)]

ping_manager() to prevent 'add' before 'remove' completes

It is currently possible to remove a device and re-add it without the
manager noticing, i.e. without detecting a mismatch between mdstat->devcnt
and container->devcnt.  Introduce ping_manager() to arrange for
mdmon to run manage_container() prior to mdadm dropping the exclusive
open() on the container.  Despite these precautions sysfs_read() may
still fail.  If this happens invalidate container->devcnt to ensure
manage_container() runs at the next event.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:43 +0000 (20:58 -0700)]

sysfs: detect disks that are in the process of being removed

When removing a disk there is a window where the 'slot' attribute of
md/dev-$name will return -EBUSY to read attempts. When this happens
look at the 'block' link: if it is removed then we can be sure the
device has been removed, versus some other error.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
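
The decision described above can be sketched as follows. The enum, the
function name, and the choice to pass in the errno from the failed 'slot'
read are assumptions for illustration; sysfs_read() itself is structured
differently.

```c
#include <errno.h>
#include <stdio.h>
#include <sys/stat.h>

enum slot_result { SLOT_OK, SLOT_DEVICE_REMOVED, SLOT_OTHER_ERROR };

/* Hypothetical helper. read_errno is 0 if reading 'slot' succeeded,
 * otherwise the errno from the failed read. dev_dir is the device's
 * sysfs directory (the md/dev-$name directory in the text above). */
enum slot_result classify_slot_read(int read_errno, const char *dev_dir)
{
    char path[4096];
    struct stat st;

    if (read_errno == 0)
        return SLOT_OK;
    if (read_errno != EBUSY)
        return SLOT_OTHER_ERROR;

    /* EBUSY: look at the 'block' link to tell removal from other errors */
    snprintf(path, sizeof(path), "%s/block", dev_dir);
    if (lstat(path, &st) != 0 && errno == ENOENT)
        return SLOT_DEVICE_REMOVED;
    return SLOT_OTHER_ERROR;
}
```

lstat() is used so that the link itself is examined rather than its target.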

Dan Williams [Tue, 16 Sep 2008 03:58:43 +0000 (20:58 -0700)]

monitor: clean up some debug messages

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:43 +0000 (20:58 -0700)]

mdmon: resume rebuild

If we started a degraded array that was previously rebuilding we may
have enough information to resume the rebuild without a trip through the
monitor.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

imsm: allow a failed disk to be readded

Allow the following sequence to rebuild the array
mdadm --fail /dev/md/r1 /dev/disk
mdadm --remove /dev/imsm /dev/disk
mdadm --add /dev/imsm /dev/disk

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

'mdadm --wait-clean' wait for array to be marked clean

For use in distro shutdown scripts with a RAID root file system.
Returns immediately if the array is 'readonly', or not an externally
managed array. It is up to the distro's scripts to make sure no new
writes hit the device after this returns 'true'.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

Add ping_monitor() to mdadm --wait

The action we are waiting for may not be complete until the monitor has
had a chance to take action on the result.

The following script can now remove the device on the first attempt,
versus a few attempts with the original Wait():
#!/bin/bash
#export MDADM_NO_MDMON=1
export IMSM_DEVNAME_AS_SERIAL=1
./mdadm -Ss
./mdadm --zero-superblock /dev/loop[0-3]
echo 2 > /proc/sys/dev/raid/speed_limit_max
./mdadm --create /dev/imsm /dev/loop[0-3] -n 4 -e imsm -a md
./mdadm --create /dev/md/r1 /dev/loop[0-3] -n 4 -l 5 --force -a mdp
./mdadm --fail /dev/md/r1 /dev/loop3
./mdadm --wait /dev/md/r1
x=0
while  ! ./mdadm --remove /dev/imsm /dev/loop3 > /dev/null 2>&1
do
        x=$((x+1))
done
echo "removed after $x attempts"
./mdadm --add /dev/imsm /dev/loop3

Include 2 small cleanups:
* remove the almost open coded fd2devnum() in Wait() by introducing a
  new utility routine stat2devnum()
* teach connect_monitor() to parse the container device from a subarray
  string

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

monitor: don't mark dirty on resync complete

...instead look at array state to determine if the array is consistent

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

monitor: mark clean on active-idle

This also handles the case where 'clean' is set directly.

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

Honor safemode_delay at Create() and Incremental() time

Signed-off-by: Dan Williams <dan.j.williams@intel.com>

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

imsm: use ->getinfo_super() in ->container_content()

* allows container_content() to pick up the safemode_delay
* removes some duplicate code
* fixes an endian bug setting info->array.chunk_size

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
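
An endian bug of the kind mentioned in the last bullet typically comes from
using an on-disk little-endian field without converting it. The sketch
below decodes a 16-bit little-endian value byte by byte; the idea that the
chunk size derives from a strip length counted in 512-byte blocks is an
assumption for illustration, as are both function names.

```c
#include <stdint.h>

/* Decode a 16-bit little-endian on-disk value on any host. */
uint16_t le16_to_host(const unsigned char raw[2])
{
    return (uint16_t)(raw[0] | (raw[1] << 8));
}

/* Hypothetical conversion: a strip length stored as a little-endian
 * count of 512-byte blocks, returned as a chunk size in bytes. */
uint32_t chunk_size_bytes(const unsigned char blocks_per_strip_le[2])
{
    return (uint32_t)le16_to_host(blocks_per_strip_le) * 512u;
}
```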

Dan Williams [Tue, 16 Sep 2008 03:58:42 +0000 (20:58 -0700)]

Allow metadata handlers to communicate desired safemode delay via mdinfo

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
