git.ipfire.org Git - thirdparty/mdadm.git/log

Adam Kwolek [Thu, 14 Apr 2011 07:50:17 +0000 (17:50 +1000)]

FIX: Use successfully loaded metadata only

A return value greater than 0 means error. On error we exit the loop
with an empty super-block pointer while the sd pointer is still valid,
so the existing check does not detect the failure.
We must not continue in that state: it leads to a crash with a core
file when the metadata handler tries to access the non-existent
super-block.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
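
The failure mode described above can be sketched roughly as follows. This is an illustration only, not the mdadm code: `struct dev_sketch`, `load_super_stub()` and `first_loaded()` are hypothetical names, and the only convention taken from the message is that a return value greater than 0 means error.

```c
#include <stddef.h>

/* Hypothetical device entry; `sb` stays NULL until metadata is loaded. */
struct dev_sketch {
    int   result;  /* what the stub loader returns: <0 skip, 0 ok, >0 error */
    void *sb;      /* super-block pointer, NULL when nothing was loaded */
};

/* Hypothetical loader: sets d->sb only on success (return value 0). */
static int load_super_stub(struct dev_sketch *d)
{
    static int dummy_sb;
    if (d->result == 0)
        d->sb = &dummy_sb;
    return d->result;
}

/* Return the first device whose metadata loaded, or NULL otherwise.
 * Testing only the device pointer after the loop would miss the case
 * where the loop ended on an error (> 0) with d->sb still NULL. */
static struct dev_sketch *first_loaded(struct dev_sketch *devs, size_t n)
{
    int rv = 1;
    struct dev_sketch *d = NULL;

    for (size_t i = 0; i < n; i++) {
        d = &devs[i];
        rv = load_super_stub(d);
        if (rv >= 0)   /* stop on success (0) and on error (> 0) */
            break;
    }
    if (rv != 0 || d == NULL || d->sb == NULL)
        return NULL;
    return d;
}
```

The point of the sketch is the final test: both the return value and the super-block pointer are checked before the device is used.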

Piergiorgio Sartor [Thu, 14 Apr 2011 07:28:31 +0000 (17:28 +1000)]

RAID-6 check standalone fix component list parsing

Fix the parsing of the component list, i.e. skipping the "spare" one.

I also added a check in case the array is degraded.

Signed-off-by: NeilBrown <neilb@suse.de>

Jonathan Liu [Tue, 12 Apr 2011 08:28:01 +0000 (18:28 +1000)]

Monitor: avoid NULL dereference with 0.90 metadata

0.90 arrays do not report the metadata type in /proc/mdstat, so
we cannot assume that mse->metadata_version is non-NULL.

So add an appropriate check.

This adds an additional check missed by commit
eb28e119b03fd5149886ed516fa4bb006ad3602e.

Signed-off-by: NeilBrown <neilb@suse.de>
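
A guard of the kind described might look like the sketch below. The struct is a trimmed-down, hypothetical stand-in for an mdstat entry, and the "external:" prefix test is an assumption chosen for illustration; only the need to test `metadata_version` for NULL comes from the message.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for one /proc/mdstat entry. */
struct mdstat_ent_sketch {
    const char *metadata_version;  /* NULL for 0.90 arrays */
};

/* Returns 1 if the entry names external metadata, 0 otherwise.
 * Both NULL tests run before strncmp() ever sees the pointer. */
static int is_external(const struct mdstat_ent_sketch *mse)
{
    return mse != NULL
        && mse->metadata_version != NULL
        && strncmp(mse->metadata_version, "external:", 9) == 0;
}
```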

Adam Kwolek [Mon, 11 Apr 2011 05:00:13 +0000 (15:00 +1000)]

FIX: Raid0 expansion cannot be restarted

When raid0 expansion is restarted, mdadm refuses to assemble the
array because the critical section cannot be restored from the backup file.
mdadm exits with the message:
mdadm: Failed to restore critical section for reshape - sorry.

For raid0 the new level is 0, but the current array level is 4.
Grow_restart() doesn't allow for a level change.

Grow_restart really shouldn't be checking for level changes.
As they are always instantaneous they should never appear
in the metadata so it doesn't mean anything to check for them.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Mike Frysinger [Mon, 11 Apr 2011 04:54:42 +0000 (14:54 +1000)]

mdadm/mdmon: use CFLAGS when linking

People often put flags that control ABI options into CFLAGS (like -mcpu)
and don't duplicate them in LDFLAGS because most build systems nowadays
(like autotools) use both when linking. So make that work with mdadm's
custom build system too.

Signed-off-by: Mike Frysinger <vapier@gentoo.org>
Signed-off-by: NeilBrown <neilb@suse.de>

Mike Frysinger [Mon, 11 Apr 2011 04:54:27 +0000 (14:54 +1000)]

mdadm: respect --syslog in monitor mode

A few places don't accept syslog as a monitor mode, so fix that.

Signed-off-by: Mike Frysinger <vapier@gentoo.org>
Signed-off-by: NeilBrown <neilb@suse.de>

Mike Frysinger [Mon, 11 Apr 2011 04:54:18 +0000 (14:54 +1000)]

mdadm: add missing --syslog option to monitor help

Signed-off-by: Mike Frysinger <vapier@gentoo.org>
Signed-off-by: NeilBrown <neilb@suse.de>

Mike Frysinger [Mon, 11 Apr 2011 04:54:16 +0000 (14:54 +1000)]

move .man targets from "all" to "man" - and "everything"

These .man files are never installed, nor generally used, so don't force
people who generally want to build/install mdadm to build them up.

Signed-off-by: Mike Frysinger <vapier@gentoo.org>
Signed-off-by: NeilBrown <neilb@suse.de>

Adam Kwolek [Wed, 6 Apr 2011 02:40:31 +0000 (12:40 +1000)]

imsm: fix: report aligned component size value

OROM can create array with chunk size not aligned.
To resolve this problem in mdadm, metadata handler has to report
component size aligned value for mdadm operations
while metadata value stays unchanged.

Do not correct alignment for raid1 and in error case.

Correction allows check in analyse_change() (Grow.c:905) to pass.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
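
One way to picture the reported value is rounding the component size down to a whole number of chunks. The sketch below assumes both quantities are in the same unit and that a zero chunk stands for the error case; the function name and signature are hypothetical, not the metadata handler's.

```c
/* Round a component size down to a multiple of the chunk size.
 * RAID1 has no chunks, and a zero chunk is treated as "nothing to
 * align", so both cases return the size unchanged. */
static unsigned long long aligned_component_size(unsigned long long size,
                                                 unsigned long long chunk,
                                                 int level)
{
    if (level == 1 || chunk == 0)
        return size;
    return size - (size % chunk);
}
```

For example, a size of 1000 with a chunk of 128 would be reported as 896, while the value stored in the metadata is left alone.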

Adam Kwolek [Wed, 6 Apr 2011 02:40:04 +0000 (12:40 +1000)]

imsm: FIX: Check array alignment before expansion

OROM can create an array that is not properly aligned.
Expansion cannot be run in such cases. This is detected in analyse_change(),
which is too late: the metadata is already in migration state
by the time expansion turns out to be impossible.
The problem has to be detected before the metadata is updated,
for all arrays in the reshaped container.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Adam Kwolek [Wed, 6 Apr 2011 02:38:50 +0000 (12:38 +1000)]

imsm: Warn user about reboot risk

The current check-pointing implementation doesn't allow a reshape of boot arrays to be interrupted,
because the checkpoint restore has to be done before the system starts.
There is a problem with passing the backup file name to an array that is assembled automatically at boot time,
especially when scan mode is used.

Until IMSM check-pointing is implemented, a warning about the reboot risk
should be placed in mdadm.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Tue, 5 Apr 2011 11:43:52 +0000 (21:43 +1000)]

restripe: make sure zero buffer is always large enough.

If restripe is called to restore stripes of one size and then
save stripes with a larger chunk size, the 'zero' buffer will not
be large enough and a double-degraded RAID6 will over-run the buffer.

So record the current size of the zero buffer and use it when deciding
if we need to allocate a new buffer.

Reported-by: Brad Campbell <lists2009@fnarfbargle.com>
Signed-off-by: NeilBrown <neilb@suse.de>
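
The idea of recording the buffer size can be sketched as below. The names `zero`, `zero_size` and `get_zero_buffer()` are chosen for the illustration and are not claimed to match restripe.c.

```c
#include <stdlib.h>
#include <string.h>

static char  *zero;       /* shared all-zero buffer */
static size_t zero_size;  /* size the buffer was last allocated with */

/* Return a zero-filled buffer of at least chunk_size bytes, or NULL if
 * allocation fails. The buffer is replaced whenever a larger chunk size
 * is requested than the one it was created for. */
static char *get_zero_buffer(size_t chunk_size)
{
    if (zero == NULL || chunk_size > zero_size) {
        free(zero);
        zero = calloc(1, chunk_size);
        zero_size = zero ? chunk_size : 0;
    }
    return zero;
}
```

Without the recorded size, a buffer allocated for a small chunk would be reused for a larger one, which is the over-run the message describes.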

Czarnowska, Anna [Mon, 4 Apr 2011 23:29:45 +0000 (09:29 +1000)]

Create: fix size after setting default chunk

When -e option is given then the first validate_geometry
sets default chunk. Size must be rounded there and do_default_chunk
needs to be set to 0 so that we don't repeat the message below.

If we start without st then what we find on the first disk determines
the st and sets chunk. So after running
validate_geometry on the first disk we need to fix the size too.
At this point chunk should always be set but it is safer to keep the check.

Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Czarnowska, Anna [Wed, 30 Mar 2011 10:28:11 +0000 (11:28 +0100)]

Create: check for UnSet when looking at chunk

A default chunk size of 0 gets modified to UnSet, so any location that
checks for !chunk really needs to check for (!chunk || chunk == UnSet).

Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
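
The intended test treats both 0 and the UnSet sentinel as "no chunk size given". A minimal sketch, assuming a sentinel value picked only for this example (`UNSET_SKETCH` is not claimed to equal mdadm's UnSet):

```c
/* Assumption: any non-zero sentinel meaning "value not set" will do here. */
#define UNSET_SKETCH 0xfffe

/* Returns 1 when no usable chunk size has been given. */
static int chunk_is_unspecified(int chunk)
{
    return chunk == 0 || chunk == UNSET_SKETCH;
}
```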

Adam Kwolek [Mon, 28 Mar 2011 11:56:49 +0000 (13:56 +0200)]

FIX: After discarding array give chance monitor to remove it

When raid0 expansion occurs, a takeover operation is used.
After the backward takeover the monitor remains in memory.

This happens because the just-removed active array remains in mdmon structures.
If there are no other monitored arrays, mdmon has to finish its work.

The problem was introduced in the patch (2011.03.22):
mdmon: Stop keeping track of RAID0 (and LINEAR) arrays.
Prior to that patch, mdmon was kicked via replace_array(), where
wakeup_monitor() was called.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 4 Apr 2011 23:16:57 +0000 (09:16 +1000)]

Monitor: avoid NULL dereference with 0.90 metadata

0.90 arrays do not report the metadata type in /proc/mdstat, so
we cannot assume that mse->metadata_version is non-NULL.

So add an appropriate check.

Reported-by: Eugene <hdejin@yahoo.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Piergiorgio Sartor [Mon, 4 Apr 2011 23:16:55 +0000 (09:16 +1000)]

RAID-6 check standalone code cleanup

Major change is code cleanup and simplification.
Furthermore, a better error handling and a couple
of bug fixes.
Last but not least, the command line parameters are
changed from "bytes" to "stripes", which is more
convenient, I guess.

Signed-off-by: NeilBrown <neilb@suse.de>

Piergiorgio Sartor [Mon, 4 Apr 2011 22:56:41 +0000 (08:56 +1000)]

RAID-6 check standalone md device

Allow RAID-6 check to be passed only the
MD device, start and length.
The three parameters are mandatory.

All necessary information is collected using
the "sysfs_read()" call.
Furthermore, if "length" is "0", then the check
is performed until the end of the array.

Some checks are done, for example if the md device
is really a RAID-6. Nevertheless I guess it is not
bullet proof...

Next patch will include the "suspend" action.
My idea is to do it "per stripe", please let me
know if you've some better options.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 4 Apr 2011 22:44:54 +0000 (08:44 +1000)]

Split some of util.c into a new lib.c

Some of util.c is dependent on lots of other code, some of it
is stand-alone.
Move some of the stand-alone stuff into a new lib.c so it can be used
by smaller utilities.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 4 Apr 2011 22:40:49 +0000 (08:40 +1000)]

split name/number maps into separate file.

This reduced some interdependencies between files.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 4 Apr 2011 22:21:03 +0000 (08:21 +1000)]

Move WaitClean from sysfs to Monitor.c

It might not really belong in Monitor, but it really doesn't
belong in sysfs.c, and fits well with Wait()

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 28 Mar 2011 02:30:29 +0000 (13:30 +1100)]

Release 3.2.1

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 28 Mar 2011 02:24:04 +0000 (13:24 +1100)]

test: Don't use dev6 and dev7 together in a non-multipath test

dev6 and dev7 refer to the same storage and are used for
multipath testing. So using them both in any other test will
be confusing. So change 11spare-migration test 5 to use
dev10 rather than dev7

Signed-off-by: NeilBrown <neilb@suse.de>

Hawrylewicz Czarnowski, Przemyslaw [Sun, 27 Mar 2011 23:42:07 +0000 (10:42 +1100)]

imsm: reading of UEFI variables needs an update

Content of EFI variable is stored in "data" file. Moreover size of data
provided by given variable can be initially validated by reading value of
"size" file.
Function read_efi_variable() has been introduced to simplify the code.

Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Hawrylewicz Czarnowski, Przemyslaw [Sun, 27 Mar 2011 23:41:35 +0000 (10:41 +1100)]

imsm: remove OEM table from detection of OROM and EFI.

OEM table does not suit our needs so it cannot be used.
This patch removes feature added in commit 8a0bf4f378c8b.

Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Sun, 27 Mar 2011 23:41:09 +0000 (10:41 +1100)]

tests: Make sure config file is empty when required.

We need to have no config at all for this test so
make sure it is empty.

Reported-by: Anna Czarnowska <anna.czarnowska@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Czarnowska, Anna [Thu, 24 Mar 2011 21:43:44 +0000 (21:43 +0000)]

tests: use $config to store test config path

We also need to tell Monitor where to look for Policy in 11spare-migration tests

Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 24 Mar 2011 03:21:58 +0000 (14:21 +1100)]

open_dev_excl: allow device to be read-only.

For many operations we don't need a writable device. So if
opening O_RDWR fails in open_dev_excl, then try again O_RDONLY.

If we really needed write, a subsequent operation will fail. But
if we didn't, we succeed when otherwise we wouldn't have.

Signed-off-by: NeilBrown <neilb@suse.de>
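
The fallback can be sketched with plain open(2) calls. This is a simplified illustration (the function name is hypothetical and any retry-on-busy handling is left out); note that O_EXCL without O_CREAT is only meaningful for block devices on Linux.

```c
#include <fcntl.h>

/* Try an exclusive read-write open first; if that fails, retry
 * read-only. Returns a file descriptor, or -1 if both attempts fail. */
static int open_excl_rw_or_ro(const char *path)
{
    int fd = open(path, O_RDWR | O_EXCL);

    if (fd < 0)
        fd = open(path, O_RDONLY | O_EXCL);
    return fd;
}
```

A caller that later needs to write simply sees that write fail on the read-only descriptor, which matches the behaviour the message describes.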

NeilBrown [Thu, 24 Mar 2011 01:45:23 +0000 (12:45 +1100)]

tests: use /tmp/mdadm.conf rather than /etc/mdadm.conf.

Modifying /etc/mdadm.conf for testing is just wrong.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 24 Mar 2011 01:00:55 +0000 (12:00 +1100)]

Merge branch 'master' into devel-3.2

Conflicts:
Incremental.c
Manage.c
ReadMe.c
inventory
mdadm.8.in
mdadm.spec
mdassemble.8
mdmon.8

Krzysztof Wojcik [Wed, 23 Mar 2011 23:15:01 +0000 (10:15 +1100)]

FIX: imsm: Do not change serial if disk failed

This patch rolls back one change connected with mdadm-OROM
compatibility:
adding ':0' at the end of the disk serial number if the disk is
detected as failed.
The current mdadm implementation does not distinguish two
cases in which a disk is marked as failed:
1. The disk has really failed (disconnected, broken)
2. The disk was just marked as failed by mdadm using the "-f" option

The second case is not yet fully handled and compatible with
the IMSM standard.
Changing the serial number of an existing, operational disk causes
problems in the "thunderdome" and "load_super" functions, which use
serial numbers for disk comparison and searching.
The change must be reverted until full support is
developed.

Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Krzysztof Wojcik [Wed, 23 Mar 2011 23:11:58 +0000 (10:11 +1100)]

FIX: Tests: raid0->raid10 without degradation

raid0->raid10 transition needs at least 2 spare devices.
After level changing to raid10 recovery is triggered on
failed (missing) disks. At the end of recovery process
we have fully operational (not degraded) raid10 array.

Initially it was possible to migrate raid0->raid10
without triggering recovery (resulting in a degraded raid10).
Now it is not possible.
This patch adapts the tests to mdadm's new behavior.

Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Krzysztof Wojcik [Wed, 23 Mar 2011 15:04:20 +0000 (16:04 +0100)]

FIX: imsm: Rebuild does not start on second failed disk

Problem:
If we have an array with two failed disks and the array is in degraded
state (currently possible only for raid10 with 2 degraded mirrors) and
we have two spare devices in the container, the recovery process should be
triggered on both failed disks. It is not.
Recovery is triggered only for the first failed disk.
The second failed disk remains unchanged although a spare drive exists
in the container and is ready for recovery.

Root cause:
mdmon does not check if the array is degraded after recovery of first
drive is completed.

Resolution:
Check if current number of disks in the array equals target number of disks.
If not, trigger degradation check and then recovery process.

Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Wed, 23 Mar 2011 04:43:19 +0000 (15:43 +1100)]

Release mdadm-3.1.5

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Wed, 23 Mar 2011 04:42:35 +0000 (15:42 +1100)]

Incr: don't exclude 'active' devices from auto inclusion in a container.

For containers, it is always appropriate to include a device in the
container.
Whether it should then be included in an array is a separate question.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Wed, 23 Mar 2011 04:42:24 +0000 (15:42 +1100)]

--stop: separate 'is busy' test for 'did it stop properly'.

Stopping an md array requires that there is no other user of it.
However with udev and udisks and such there can be transient other
users of md devices which can interfere with stopping the array.

If there are transient users, we really want "mdadm --stop" to wait a
little while and retry.
However if the array is genuinely in-use (e.g. mounted), then we
don't want to wait at all - we want to fail immediately.

So before trying to stop, re-open device with O_EXCL. If this fails
then the device is probably in use, so give up.

If it succeeds, but a subsequent STOP_ARRAY fails, then it is possibly
a transient failure, so try again for a few seconds.

Signed-off-by: NeilBrown <neilb@suse.de>
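
The two-phase logic (fail at once if the device is busy, otherwise retry the stop for a while) can be sketched with callbacks standing in for the real operations. All names below are hypothetical; the fakes exist only so the control flow can be exercised without an md device.

```c
#include <errno.h>

/* Hypothetical steps: each returns 0 on success, non-zero on failure. */
typedef int (*step_fn)(void *ctx);

/* Returns 0 on success, -EBUSY if the exclusive re-open fails (device
 * genuinely in use), or -EAGAIN if stopping failed max_tries times. */
static int stop_with_retry(step_fn open_excl, step_fn stop_array,
                           void *ctx, int max_tries)
{
    if (open_excl(ctx) != 0)
        return -EBUSY;          /* e.g. mounted: do not wait at all */
    for (int i = 0; i < max_tries; i++) {
        if (stop_array(ctx) == 0)
            return 0;
        /* a real caller would sleep briefly here before retrying */
    }
    return -EAGAIN;
}

/* Fakes for trying the sketch. */
static int always_ok(void *ctx)   { (void)ctx; return 0; }
static int always_busy(void *ctx) { (void)ctx; return -1; }
static int fail_n_times(void *ctx)
{
    int *left = ctx;            /* number of failures still to simulate */
    if (*left > 0) {
        (*left)--;
        return -1;
    }
    return 0;
}
```

With `fail_n_times` primed to fail twice and five tries allowed, the stop succeeds on the third attempt; with `always_busy` as the open step, the function returns immediately without trying to stop.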

NeilBrown [Wed, 23 Mar 2011 00:07:27 +0000 (11:07 +1100)]

Assemble: improve efficacy of -Af in assembling degraded dirty arrays.

If a degraded dirty array has some superblocks which are clean and
others that are dirty, and the dirty ones are newer by precisely '1'
in the event count, then the current code to force the array to be
clean will not work.
We need to make sure to find a superblock with most recent event count
and force that one to be 'clean'.

Reported-by: A J Wyborny <ajwyborny@gmail.com>
Signed-off-by: NeilBrown <neilb@suse.de>
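
Selecting the superblock to force clean comes down to finding the highest event count. A minimal sketch, with the event counts passed as a plain array (a hypothetical simplification of the per-device data):

```c
#include <stddef.h>

/* Return the index of the entry with the highest event count, or -1 if
 * the list is empty. Ties resolve to the first such entry. */
static int most_recent(const unsigned long long *events, size_t n)
{
    int best = -1;

    for (size_t i = 0; i < n; i++)
        if (best < 0 || events[i] > events[(size_t)best])
            best = (int)i;
    return best;
}
```

In the scenario from the message, the clean superblocks would carry count N and the dirty ones N+1, so the chosen index is always one of the dirty, newer ones.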

Labun, Marcin [Wed, 23 Mar 2011 01:05:53 +0000 (12:05 +1100)]

super-intel: enable loading metadata from non-IMSM compliant disks

Honor ignore_hw_compat to load metadata from disk attached to non-IMSM
controller or when there are no IMSM OROM/EFI capabilities.
Used only for guessing and examining metadata format.

Signed-off-by: Marcin Labun <marcin.labun@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Labun, Marcin [Wed, 23 Mar 2011 01:04:46 +0000 (12:04 +1100)]

examine: allows to examine a disk metadata on non-metadata compliant systems

Allow for loading metadata from disk attached to non-metadata compliant
system. Affects mdadm --examine and guess_super.

Added ignore_hw_compat in supertype to pass information to load_super
handler. If ignore_hw_compat is set the handler should load metadata
also from disks that do not comply with metadata requirements (i.e. disk is not
attached to native controller, etc).

Signed-off-by: Marcin Labun <marcin.labun@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Adam Kwolek [Wed, 23 Mar 2011 01:02:28 +0000 (12:02 +1100)]

man mdadm: Add note about auto-assembly during array reshape

Add note to man that auto-assembly cannot be used for reshaped arrays.

Revisions: NeilBrown

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Adam Kwolek [Wed, 23 Mar 2011 00:45:03 +0000 (11:45 +1100)]

man mdadm: add information for MDADM_EXPERIMENTAL flag

Update man for MDADM_EXPERIMENTAL flag.

Minor revisions by Mathias Burén <mathias.buren@gmail.com> and Neil Brown.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Tue, 22 Mar 2011 03:47:55 +0000 (14:47 +1100)]

Monitor: handle v.quick removal of devices better.

If a device fails and then is removed before Monitor sees
the failure, GET_DISK_INFO returns nothing so Monitor relies
on mdstat info where '_' is incorrectly interpreted as 'a spare'.

We should treat '_' as 'removed' - that is safer.

Without this, a v.quick fail+remove gets reported as 'Failed' then
'SpareActive'.

Signed-off-by: NeilBrown <neilb@suse.de>
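
The safer interpretation can be sketched as a small mapping from the per-slot character in the mdstat status pattern to a state. The enum and function names are hypothetical; only the 'U' and '_' characters are taken from /proc/mdstat.

```c
enum slot_state_sketch { SLOT_ACTIVE, SLOT_REMOVED, SLOT_UNKNOWN };

/* Map one character of a pattern such as "[U_U]" to a slot state. */
static enum slot_state_sketch slot_from_mdstat(char c)
{
    switch (c) {
    case 'U': return SLOT_ACTIVE;
    case '_': return SLOT_REMOVED;  /* treated as removed, not as a spare */
    default:  return SLOT_UNKNOWN;
    }
}
```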

NeilBrown [Mon, 21 Mar 2011 23:32:09 +0000 (10:32 +1100)]

ddf: fix up detection of failed/missing devices.

If a device hasn't been found yet we can still tell if it is
expected to be working, and we must do so to make sure
'working_disks' is correct.

Signed-off-by: NeilBrown <neilb@suse.de>

Piergiorgio Sartor [Mon, 21 Mar 2011 23:09:38 +0000 (10:09 +1100)]

restripe: allow test code to have an offset on each device.

If device name ends :number, e.g.
/dev/sda0:1234

then assume the RAID data starts that many sectors from start of
device.

Signed-off-by: NeilBrown <neilb@suse.de>
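
Parsing the ":number" suffix might look like the sketch below. The function name is hypothetical, and it only accepts a suffix made of decimal digits; anything else leaves the name untouched.

```c
#include <stdlib.h>
#include <string.h>

/* Split "path:number" in place. On return `name` holds only the path
 * and *offset holds the number of sectors to skip (0 when there is no
 * well-formed ":number" suffix). */
static void split_offset(char *name, unsigned long long *offset)
{
    char *colon = strrchr(name, ':');

    *offset = 0;
    if (colon != NULL && colon[1] >= '0' && colon[1] <= '9') {
        char *end;
        unsigned long long v = strtoull(colon + 1, &end, 10);

        if (*end == '\0') {   /* the whole suffix was numeric */
            *offset = v;
            *colon = '\0';
        }
    }
}
```

Given "/dev/sda0:1234", the name becomes "/dev/sda0" and the offset 1234.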

NeilBrown [Tue, 22 Mar 2011 06:23:17 +0000 (17:23 +1100)]

mdmon: Stop keeping track of RAID0 (and LINEAR) arrays.

Tracking RAID0 arrays doesn't really work. There is no need,
and there are some sysfs files which won't exist when the array
appears and then won't be opened when the level is changed.

So simply ignore RAID0 and LINEAR arrays - don't add them when they
appear and if an array we are monitoring turns into one of these,
discard it promptly.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Tue, 22 Mar 2011 05:10:22 +0000 (16:10 +1100)]

mdmon: don't wait for O_EXCL when shutting down.

If mdmon is shutting down because there are no devices
left to look at, then don't wait 5 seconds for an O_EXCL open,
as that can block progress of --grow.

Only wait for O_EXCL if we received a signal.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Tue, 22 Mar 2011 03:52:37 +0000 (14:52 +1100)]

mdmon: allow manage_member to cope with ->container becoming NULL.

As monitor() can set ->container to NULL, we need to be careful
about dereferencing it.
So take a copy in manage_member, return if it is NULL, and only
use the copy.

Signed-off-by: NeilBrown <neilb@suse.de>
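
The pattern is to read the pointer once and use only that snapshot afterwards. A minimal single-threaded sketch with hypothetical types follows; real threaded code would also need to make sure the compiler does not re-read the field, which this sketch does not address.

```c
#include <stddef.h>

struct container_sketch { int id; };
struct array_sketch { struct container_sketch *container; };

/* Returns the id of the container that was used, or -1 if the array
 * had already been detached. Only the local copy `c` is dereferenced. */
static int manage_member_sketch(struct array_sketch *a)
{
    struct container_sketch *c = a->container;  /* one snapshot */

    if (c == NULL)
        return -1;
    return c->id;
}
```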

NeilBrown [Tue, 22 Mar 2011 03:52:36 +0000 (14:52 +1100)]

Grow: increase raid_disks before adding specific spares.

When we add spares that have been targeted at a specific slot,
we need raid_disks to be bigger than the slot number.
But currently we don't increase raid_disks until after we add
these spares.

So introduce an early increase of raid_disks to allow the spares
to be added.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 21 Mar 2011 23:09:30 +0000 (10:09 +1100)]

test: call "udevadm settle" after stopping array.

If we don't do this, then the unlink from /dev might happen
after the next step in the test creates something in /dev,
and device names seem to go missing.

Signed-off-by: NeilBrown <neilb@suse.de>

Piergiorgio Sartor [Mon, 21 Mar 2011 02:52:44 +0000 (13:52 +1100)]

RAID-6 check standalone

Hi Neil,

please find attached a patch, to mdadm-3.2 base, including
a standalone version of the raid-6 check.

This is basically a re-working (and hopefully improvement)
of the already implemented check in "restripe.c".

I split the check function into "collect" and "stats",
so that the second one could be easily replaced.
The API is also simplified.

The command line options are reduced, since the only level
is raid-6, but the ":offset" option is included.

The output reports the block/stripe rotation, P/Q errors
and the possible HDD (or unknown).

BTW, the patch applies also to the already patched "restripe.c",
including the last ":offset" patch (which is not yet in git).

Other item is that due to "sysfs.c" linking (see below) the
"Makefile" needed some changes, I hope this is not a problem.

Next steps (TODO list, if you like) would be:

1) Add the "sysfs.c" code in order to retrieve the HDDs info
from the MD device. It is already linked, together with the
whole (mdadm) universe, since it seems it cannot live alone.
I'll need some advice or hint on how to use it. I checked
"sysfs.c", but before I dig deep into it maybe better to
have some advice (maybe just one function call will do it).

2) Add the suspend lo/hi control. Fellow John Robinson was
suggesting to look into "Grow.c", which I did, but I guess
the same story as 1) is valid: better to have some hint on
where to look before wasting time.

3) Add a repair option (future). This should have different
levels, like "all", "disk", "stripe". That is, fix everything
(more or less like "repair"), fix only if a disk is clearly
having problems, fix each stripe which has clearly a problem
(but maybe different stripes may belong to different HDDs).

So, for the point 1) and 2) would be nice to have some more
detail on where to look what. Point 3) we will discuss later.

Thanks, please consider for inclusion,

bye,

pg

Signed-off-by: NeilBrown <neilb@suse.de>

Labun, Marcin [Sun, 20 Mar 2011 04:47:33 +0000 (15:47 +1100)]

platform_intel: support EFI SCU OEM variable

RstScuV and RstScuO variable names are supported.
First try reading from RstScuV, when it fails try RstScuO.

Signed-off-by: Marcin Labun <marcin.labun@intel.com>
Tested-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
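
The try-one-then-the-other order can be sketched with a reader callback. The callback type and both function names are hypothetical; only the two variable names and their order come from the message.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical reader: returns 0 on success, non-zero on failure. */
typedef int (*read_var_fn)(const char *name, void *buf, size_t len);

/* Read from RstScuV first; fall back to RstScuO if that fails. */
static int read_scu_variable(read_var_fn read_var, void *buf, size_t len)
{
    if (read_var("RstScuV", buf, len) == 0)
        return 0;
    return read_var("RstScuO", buf, len);
}

/* Fake reader for trying the sketch: only "RstScuO" exists. */
static int fake_only_rstscuo(const char *name, void *buf, size_t len)
{
    if (strcmp(name, "RstScuO") != 0)
        return -1;
    memset(buf, 0, len);
    return 0;
}
```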

Adam Kwolek [Sun, 20 Mar 2011 04:47:31 +0000 (15:47 +1100)]

imsm: FIX: indicate that metadata has to be written

When adding spare disks to raid0, spare metadata is not written.
This is due to the exit from sync_metadata() on an empty updates_pending flag.

When mdmon is absent indicate sync_metadata() to flush changes to disks.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Adam Kwolek [Sun, 20 Mar 2011 04:47:17 +0000 (15:47 +1100)]

FIX: Add spare throws exception (v2)

sync_metadata() requires st->sb to be loaded, otherwise exception is
generated. This fails expansion, because spares cannot be added.

The metadata update uses the tst pointer instead of st; this is better than
loading the anchor for st as I proposed previously.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Krzysztof Wojcik [Fri, 18 Mar 2011 01:42:17 +0000 (12:42 +1100)]

Retry writing 'inactive' state during stopping array

Issue observed:
Sporadically, stopping arrays with the "mdadm -Ss" command does not succeed.
Cause:
Writing "inactive" to the array state does not succeed because the array is busy
(accessed by udev, blkid etc.)
Resolution:
If writing 'inactive' fails, wait and retry (because it is possibly
a transient failure)

Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

Adam Kwolek [Fri, 18 Mar 2011 01:32:16 +0000 (12:32 +1100)]

FIX: ping_monitor() usage causes memory leaks

When devnum2devname() is used to produce the input for ping_monitor(),
the returned string pointer should be passed to free() to release the memory.
This is not done in several places. This use case should have its own function
to avoid the memory leak.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
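
A wrapper of the kind suggested keeps the allocation and the free() in one place. In the sketch below every function is a hypothetical stand-in (none of them is the mdadm implementation); the only point is the ownership pattern.

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-in for a helper that returns a malloc()ed name. */
static char *devnum_to_name_sketch(int devnum)
{
    char *name = malloc(32);

    if (name != NULL)
        snprintf(name, 32, "md%d", devnum);
    return name;
}

/* Hypothetical stand-in for the ping call; it only inspects the name. */
static int ping_by_name_sketch(const char *name)
{
    return name[0] == 'm' ? 0 : -1;
}

/* Wrapper that owns the temporary string, so callers cannot leak it. */
static int ping_by_devnum_sketch(int devnum)
{
    char *name = devnum_to_name_sketch(devnum);
    int rv;

    if (name == NULL)
        return -1;
    rv = ping_by_name_sketch(name);
    free(name);
    return rv;
}
```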

NeilBrown [Fri, 18 Mar 2011 01:31:45 +0000 (12:31 +1100)]

Manage: fix the mess I made in earlier patch.

When I separated the 'native metadata' case more cleanly from the
"external metadata" case for adding a drive, I left some 'external'
code in the 'native' case, and didn't copy it to the 'external' case.

When - in the external case - we add to super, we must check for
mdmon first, so we know whether to do the metadata update ourselves
or not, then afterwards call either flush_metadata_updates (to send
to mdmon) or sync_metadata (to do it directly).

Reported-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Thu, 17 Mar 2011 02:35:10 +0000 (13:35 +1100)]

--stop: separate 'is busy' test for 'did it stop properly'.

Stopping an md array requires that there is no other user of it.
However with udev and udisks and such there can be transient other
users of md devices which can interfere with stopping the array.

If there are transient users, we really want "mdadm --stop" to wait a
little while and retry.
However if the array is genuinely in-use (e.g. mounted), then we
don't want to wait at all - we want to fail immediately.

So before trying to stop, re-open device with O_EXCL. If this fails
then the device is probably in use, so give up.

If it succeeds, but a subsequent STOP_ARRAY fails, then it is possibly
a transient failure, so try again for a few seconds.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Tue, 15 Mar 2011 05:31:20 +0000 (16:31 +1100)]

Fix regression when using 'grow' to add a bitmap.

When we allowed a devlist to accompany some --grow modes - but not
--bitmap - we made --bitmap always fail, instead of failing only if a device
was given to add.
As 'devs_found' includes the md device, we need to compare against
'1'.

Signed-off-by: NeilBrown <neilb@suse.de>
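
The off-by-one is easy to see in a tiny sketch. The function name is hypothetical; the only fact used is that devs_found counts the md device itself as well as any devices listed after it.

```c
/* Adding a bitmap takes no extra devices, so the only acceptable count
 * is the md device alone. Comparing against 0 would reject every call. */
static int bitmap_grow_args_ok(int devs_found)
{
    return devs_found <= 1;
}
```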

NeilBrown [Tue, 15 Mar 2011 04:35:04 +0000 (15:35 +1100)]

Merge branch 'master' into devel-3.2

Conflicts:
Manage.c
managemon.c
super-ddf.c
super-intel.c

NeilBrown [Tue, 15 Mar 2011 04:24:03 +0000 (15:24 +1100)]

mdadm.man: added encouragement to shrink filesystem before array.

Suggestion by Rory Jaffe <rsjaffe@gmail.com> to make the danger
of shrinking, and the recommended avoidance technique, more explicit.

Signed-off-by: NeilBrown <neilb@suse.de>

NeilBrown [Mon, 14 Mar 2011 07:56:16 +0000 (18:56 +1100)]

ddf: implement remove_from_super

This is needed to remove devices from mdmon's knowledge when the
device is removed from the md container.

Now that ddf has a remove_from_super we don't need the code
that allows some personalities not to implement this.

Signed-off-by: NeilBrown <neilb@suse.de>

Labun, Marcin [Tue, 15 Mar 2011 04:09:31 +0000 (15:09 +1100)]

IMSM: Fix problem in mdmon monitor of using removed disk in imsm container.

Manager thread shall pass the information to monitor thread (mdmon)
that some devices are removed from container. Otherwise, monitor
(mdmon) might use such devices (spares) to rebuild the array that has
gone degraded.

This problem happens for imsm containers, since a list of the
container disks is maintained in intel_super structure. When array
goes degraded, the list is searched to find a spare disk to start
rebuild. Without this fix the rebuild could be started on the spare
device that was a member of the container, but has been removed from
it.

New super type function handler has been introduced to prepare
metadata format specific information about removed devices.

int (*remove_from_super)(struct supertype *st, mdu_disk_info_t *dinfo)

The message prepared in remove_from_super is later processed by
process_update handler in monitor thread.

Signed-off-by: Marcin Labun <marcin.labun@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 04:09:24 +0000 (15:09 +1100)]

DDF Allow a RAID1 to be 'partially optimal'.

If a RAID1 is meant to have more than 2 devices and, while it doesn't
have that many, it still has more than 1, then according to the
DDF spec it is "partially optimal" rather than "degraded".
So make that so.

Signed-off-by: NeilBrown <neilb@suse.de>
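
The rule in the message above might be expressed roughly as below. This is a sketch under stated assumptions: the enum names and the function ddf_raid1_state are hypothetical and do not reflect the DDF spec's actual encodings or the code in super-ddf.c, and treating zero working devices as "failed" is an assumption not stated in the commit.

```c
#include <assert.h>

/* Hypothetical state names; the DDF spec defines its own encodings. */
enum raid1_state {
	R1_OPTIMAL,
	R1_PARTIALLY_OPTIMAL,
	R1_DEGRADED,
	R1_FAILED
};

/*
 * working:    number of devices currently in sync
 * raid_disks: number of devices the RAID1 is meant to have
 */
enum raid1_state ddf_raid1_state(int working, int raid_disks)
{
	assert(raid_disks >= 2 && working >= 0 && working <= raid_disks);
	if (working == raid_disks)
		return R1_OPTIMAL;
	if (working > 1)		/* fewer than intended, but still redundant */
		return R1_PARTIALLY_OPTIMAL;
	if (working == 1)
		return R1_DEGRADED;
	return R1_FAILED;		/* assumption: no working device left */
}
```

With a 2-device RAID1 the "partially optimal" branch can never be reached, so the distinction only matters for mirrors meant to have 3 or more devices.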

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 04:02:49 +0000 (15:02 +1100)]

ddf: remove failed devices that are no longer in use.

The DDF spec requires we have a phys disk record for every physically
attached device. But it isn't clear what that means in the case
of soft raid in a general purpose Linux computer.
So remove phys disk records for any failed device that is not
active in any array.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 03:57:46 +0000 (14:57 +1100)]

ddf: set Rebuilding flag when adding devices to a degraded array

This is a bit fragile, but DDF has weird rules that we aren't really
set up to handle properly.

When we add a device to a degraded array it must be a spare, so
mark it as Rebuilding.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 03:54:46 +0000 (14:54 +1100)]

ddf: use correct loop variable in activate_spare

Using 'i' when you mean 'j' just shows how silly it is to use
variables named 'i' and 'j'.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 03:53:00 +0000 (14:53 +1100)]

ddf: Don't consider 'dl' entries with state_fd < 0

These have been marked as invalid (recently failed) so
don't trust the major/minor associated with them.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 03:51:12 +0000 (14:51 +1100)]

managemon: Don't do spare assignment while any updates are pending.

Spare assignment requires full knowledge of array state. A pending
update might modify that state (such as a pending spare assignment)
so don't try while there are updates pending.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 15 Mar 2011 03:48:20 +0000 (14:48 +1100)]

Manage/external: for external metadata, add_to_super needs lock on container.

add_to_super could use information from the current superblock (ddf
does), so add_to_super for external metadata should be called with
the O_EXCL lock held on the container to ensure the update is complete
before any other process tries to make any changes (like adding
another device to array).

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

Adam Kwolek [Mon, 14 Mar 2011 14:09:29 +0000 (15:09 +0100)]

imsm: FIX: existing backup file fails unit tests

During normal test execution, the backup file is deleted after the test finishes.
If a test is interrupted or broken, the backup file can remain for the next run.
When a backup file exists before a unit test run, suites 12 and 13 fail.

To avoid this, remove the backup file before grow is executed.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:56:16 +0000 (18:56 +1100)]

ddf: implement remove_from_super

This is needed to remove devices from mdmon's knowledge when the
device is removed from the md container.

Now that ddf has a remove_from_super we don't need the code
that allows some personalities not to implement this.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:54:21 +0000 (18:54 +1100)]

ddf: zero space_list in ddf_activate_spare.

Currently ->space_list is uninitialised here, which is obviously bad.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:49:57 +0000 (18:49 +1100)]

Merge branch 'master' into devel-3.2

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:47:47 +0000 (18:47 +1100)]

ddf: set vcnum correctly when creating a new virtual device in conflist

We weren't setting ->vcnum at all when an array was added. This
meant that a subsequent device failure could be assigned to the
wrong array.

Reported-by: Albert Pauw <albert.pauw@gmail.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:45:26 +0000 (18:45 +1100)]

ddf: teach set_disk to cope with new or changed devices.

When set_disk is called, we need to check if the disk has changed or
recently appeared, and update everything properly if it has.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:32:38 +0000 (18:32 +1100)]

ddf: free_super should free add_list as well.

It is possible there is data and even an open file descriptor
on 'add_list' - so it must be freed too.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:30:34 +0000 (18:30 +1100)]

ddf: minor activate_super fixes.

1/ ignore devices with "state_fd < 0" as these have been removed.
2/ Set update 'length' properly and clear 'space'.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 14 Mar 2011 07:24:01 +0000 (18:24 +1100)]

monitor: close recovery_fd when closing state_fd

These should be open or closed together.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

Krzysztof Wojcik [Mon, 14 Mar 2011 07:21:21 +0000 (18:21 +1100)]

Warn the user about too small array size

If a single-disk RAID0 or RAID1 array is created, the user may preserve data on
the disk. If the given array size covers all partitions on the disk, all data will be
available on the created array. If the array size is too small (does not cover
all partitions), the data will not be accessible.
This patch introduces a warning message during array creation if the given size
is too small. The user may interrupt the creation process to avoid data loss.

Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

Labun, Marcin [Mon, 14 Mar 2011 07:18:46 +0000 (18:18 +1100)]

platform_intel: find OROM based on Intel AHCI and SAS driver device id

We use the PCI device id exposed by the AHCI and ISCU (SAS controller)
drivers to find the OROM version table.
This way there is no need to maintain a list of AHCI and ISCU device ids
in mdadm. The consequence is that mdadm can find the OROM properties
when the AHCI or SAS drivers are loaded in the system.

Signed-off-by: Marcin Labun <marcin.labun@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

Adam Kwolek [Mon, 14 Mar 2011 07:17:53 +0000 (18:17 +1100)]

imsm: FIX: Store checkpoint in per disk units

As last_checkpoint is a counter in per-disk units, checkpoints
should be stored in the same units.
Restoring from a checkpoint should recalculate the checkpoint into an
array position (reshape_progress).

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
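
The unit conversion mentioned above could look something like this. It is only a sketch: the function names are hypothetical, and it assumes that an array position equals the per-disk position multiplied by the number of data disks, which the commit message does not spell out and which may differ from what super-intel.c actually does.

```c
#include <assert.h>

/*
 * Assumption (not from the commit): array position is the per-disk
 * position scaled by the number of data disks.
 */
unsigned long long per_disk_to_array(unsigned long long per_disk,
				     int data_disks)
{
	assert(data_disks > 0);
	return per_disk * (unsigned long long)data_disks;
}

/* Inverse conversion, used when storing a checkpoint in per-disk units. */
unsigned long long array_to_per_disk(unsigned long long array_pos,
				     int data_disks)
{
	assert(data_disks > 0);
	return array_pos / (unsigned long long)data_disks;
}
```

Keeping the stored checkpoint and last_checkpoint in the same units means the conversion happens in exactly one place, when reshape_progress is computed on restore.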

commit | commitdiff | tree

Adam Kwolek [Mon, 14 Mar 2011 07:17:52 +0000 (18:17 +1100)]

FIX: Last_checkpoint has to be initialized in per disk units

last_checkpoint is a variable that tracks the sync_complete sysfs entry.
sync_complete is a per-disk counter, so initialization when starting from a
checkpoint has to take this into account and convert the reshape position properly.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

Adam Kwolek [Thu, 10 Mar 2011 14:05:54 +0000 (15:05 +0100)]

FIX: Last checkpoint is not initialized on reshape restart

When a reshape is restarted and the active array in mdmon is being initialized,
mdmon has to know the last checkpoint; otherwise the reshape will be restarted
from position '0'.
When a reshaped array is assembled, mdadm stores reshape_position in sysfs
and runs mdmon. Initialize last_checkpoint in the active array structure
to the value present in sysfs when the reshaped array starts.

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

Adam Kwolek [Thu, 10 Mar 2011 07:30:42 +0000 (08:30 +0100)]

FIX: Unfreeze array on success only

Unfreeze the array on success only.
rv is initialized from the restart variable, so there are 2 cases:
1. Regular reshape start (rv == restart == 0):
   a real error (returned by reshape) can leave the container frozen.
   If the array is not touched by reshape, it can be unfrozen.
2. Reshape restart:
   even an untouched array under reshape is left unfrozen.
   If reshape is started, do not unfreeze the array on error either.

This allows the user to take array repair action
(mdmon will not change array state).

Signed-off-by: Adam Kwolek <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Thu, 10 Mar 2011 07:14:43 +0000 (18:14 +1100)]

ddf: Failed should suppress Online and others.

so the notes say, so make it so.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Thu, 10 Mar 2011 06:37:04 +0000 (17:37 +1100)]

Merge branch 'master' into devel-3.2

Conflicts:
Grow.c
Manage.c
managemon.c
mdadm.8.in
util.c

commit | commitdiff | tree

NeilBrown [Mon, 22 Nov 2010 08:35:25 +0000 (19:35 +1100)]

Manage: be more careful about --add attempts.

If an --add is requested and a re-add looks promising but fails or
cannot possibly succeed, then don't try the add. This avoids
inadvertently turning devices into spares when an array is failed but
the devices seem to actually work.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Mon, 22 Nov 2010 08:35:25 +0000 (19:35 +1100)]

ddf: remove duplicate container_member setting.

We were setting ->container_member twice in ddf get_info.
Once to currentconf->vcnum,
once to atoi(st->subarray).

Both should be the same.
For consistency with super-intel, use the first.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 30 Nov 2010 05:25:26 +0000 (16:25 +1100)]

Fix warning about host-endian bitmaps.

Hostendian bitmaps should be warned about on all arch's.
And fix a speeling mistake.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 30 Nov 2010 05:34:25 +0000 (16:34 +1100)]

Grow: give useful message when adding bitmap gives EBUSY.

If adding a bitmap fails with EBUSY, then it is because the array is
currently resyncing/recovering/reshaping.
As this is non-obvious, give a message explaining the fact.

Signed-off-by: NeilBrown <neilb@suse.de>
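
A rough illustration of the idea above, mapping EBUSY to an explanatory message, is given below. The function name bitmap_add_error and the message wording are hypothetical and are not taken from Grow.c; only the link between EBUSY and an ongoing resync, recovery or reshape comes from the commit message.

```c
#include <assert.h>
#include <errno.h>
#include <string.h>

/*
 * Hypothetical helper: pick a message for a failed attempt to add a bitmap.
 * err is the errno value left by the failing call.
 */
const char *bitmap_add_error(int err)
{
	assert(err > 0);
	if (err == EBUSY)
		/* wording is illustrative, not mdadm's actual text */
		return "cannot add bitmap while array is "
		       "resyncing, recovering or reshaping";
	return strerror(err);	/* fall back to the generic description */
}
```

The generic strerror() text for EBUSY ("Device or resource busy" on Linux) does not tell the user what is busy, which is why a special case is useful here.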

commit | commitdiff | tree

NeilBrown [Tue, 30 Nov 2010 05:46:01 +0000 (16:46 +1100)]

Assemble: add --update=no-bitmap

This allows an array with a corrupt internal bitmap to be assembled
without the bitmap.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 30 Nov 2010 05:56:01 +0000 (16:56 +1100)]

Assemble: call remove_partitions later.

We shouldn't call remove_partitions until we have made a really firm
decision to include the device into the array.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 30 Nov 2010 07:35:36 +0000 (18:35 +1100)]

mdmon: don't copy an invalid chunk_size

As chunk_size in mdstat_ent is never set, we shouldn't copy
it into a->info.array.
In fact, it is safest to get rid of the field altogether.

Reported-by: "Kwolek, Adam" <adam.kwolek@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Tue, 30 Nov 2010 22:55:35 +0000 (09:55 +1100)]

ddf: fail creation of new subarray with same name as old.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Wed, 1 Dec 2010 00:03:28 +0000 (11:03 +1100)]

Create: report failure if array cannot be started.

We weren't checking the result of writing 'active' to array_state.

Signed-off-by: NeilBrown <neilb@suse.de>

commit | commitdiff | tree

NeilBrown [Wed, 1 Dec 2010 00:58:32 +0000 (11:58 +1100)]

Grow: disallow placing backup file on array being reshaped.

The tests here aren't perfect, but they could catch some cases.

Signed-off-by: NeilBrown <neilb@suse.de>
