]> git.ipfire.org Git - thirdparty/xfsprogs-dev.git/log
thirdparty/xfsprogs-dev.git
5 months agoxfs_scrub_fail: return the failure status of the mailer program
Darrick J. Wong [Fri, 12 Jan 2024 02:07:05 +0000 (18:07 -0800)] 
xfs_scrub_fail: return the failure status of the mailer program

We should return the exit code of the mailer program sending the scrub
failure reports, since that's much more important to anyone watching the
system.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub_fail: fix sendmail detection
Darrick J. Wong [Fri, 12 Jan 2024 02:07:05 +0000 (18:07 -0800)] 
xfs_scrub_fail: fix sendmail detection

This script emails the results of failed scrub runs to root.  We
shouldn't be hardcoding the path to the mailer program because distros
can change the path according to their whim.  Modify this script to use
command -v to find the program.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: fix pathname escaping across all service definitions
Darrick J. Wong [Fri, 12 Jan 2024 02:07:05 +0000 (18:07 -0800)] 
xfs_scrub: fix pathname escaping across all service definitions

systemd services provide an "instance name" that can be associated with
a particular invocation of a service.  This allows service users to
invoke multiple copies of a service, each with a unique string.  For
xfs_scrub, we pass the mountpoint of the filesystem as the instance
name.  However, systemd services aren't supposed to have slashes in
them, so we're supposed to escape them.

The canonical escaping scheme for pathnames is defined by the
systemd-escape --path command.  Unfortunately, we've been adding our own
opinionated sauce for years, to work around the fact that --path didn't
exist in systemd before January 2017.  The special sauce is incorrect,
and we no longer care about systemd of 7 years past.

Clean up this mess by following the systemd escaping scheme throughout
the service units.  Now we can use the '%f' specifier in them, which
makes things a lot less complicated.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub_all: escape service names consistently
Darrick J. Wong [Fri, 12 Jan 2024 02:07:05 +0000 (18:07 -0800)] 
xfs_scrub_all: escape service names consistently

This program is not consistent as to whether or not it escapes the
pathname that is being used as the xfs_scrub service instance name.
Fix it to be consistent, and to fall back to direct invocation if
escaping doesn't work.  The escaping itself is also broken, but we'll
fix that in the next patch.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agodebian: install scrub services with dh_installsystemd
Darrick J. Wong [Fri, 12 Jan 2024 02:07:05 +0000 (18:07 -0800)] 
debian: install scrub services with dh_installsystemd

Use dh_installsystemd to handle the installation and activation of the
scrub systemd services.  This requires bumping the compat version to 11.
Note that the services are /not/ activated on installation.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: don't report media errors for space with unknowable owner
Darrick J. Wong [Fri, 12 Jan 2024 02:07:04 +0000 (18:07 -0800)] 
xfs_scrub: don't report media errors for space with unknowable owner

On filesystems that don't have the reverse mapping feature enabled, the
GETFSMAP call cannot tell us much about the owner of a space extent --
we're limited to static fs metadata, free space, or "unknown".  In this
case, nothing is corrupt, so str_corrupt is not an appropriate logging
function.  Relax this to str_info so that the user sees a notice that
media errors have been found so that the user knows something bad
happened even if the directory tree walker cannot find the file owning
the space where the media error was found.

Filesystems with rmap enabled are never supposed to return OWN_UNKNOWN
from a GETFSMAP report, so continue to report that as a corruption.
This fixes a regression in xfs/556.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: flush stdout after printing to it
Darrick J. Wong [Fri, 12 Jan 2024 02:07:04 +0000 (18:07 -0800)] 
xfs_scrub: flush stdout after printing to it

Make sure we flush stdout after printf'ing to it, especially before we
start any operation that could take a while to complete.  Most of scrub
already does this, but we missed a couple of spots.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: update copyright years for scrub/ files
Darrick J. Wong [Fri, 12 Jan 2024 02:07:04 +0000 (18:07 -0800)] 
xfs_scrub: update copyright years for scrub/ files

Update the copyright years in the scrub/ source code files.  This isn't
required, but it's helpful to remind myself just how long it's taken to
develop this feature.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: add missing license and copyright information
Darrick J. Wong [Fri, 12 Jan 2024 02:07:04 +0000 (18:07 -0800)] 
xfs_scrub: add missing license and copyright information

These files are missing the required SPDX license and copyright
information.  Add them.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
5 months agoxfs_scrub: fix author and spdx headers on scrub/ files
Darrick J. Wong [Fri, 12 Jan 2024 02:07:04 +0000 (18:07 -0800)] 
xfs_scrub: fix author and spdx headers on scrub/ files

Fix the spdx tags to match current practice, and update the author
contact information.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agolibxfs: fix krealloc to allow freeing data
Darrick J. Wong [Fri, 12 Jan 2024 02:07:03 +0000 (18:07 -0800)] 
libxfs: fix krealloc to allow freeing data

A recent refactoring to xfs_idata_realloc in the kernel made it depend
on krealloc returning NULL if the new size is zero.  The xfsprogs
wrapper instead aborts, so we need to make it follow the kernel
behavior.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
5 months agoxfs_scrub: try to use XFS_SCRUB_IFLAG_FORCE_REBUILD
Darrick J. Wong [Wed, 20 Dec 2023 16:53:47 +0000 (08:53 -0800)] 
xfs_scrub: try to use XFS_SCRUB_IFLAG_FORCE_REBUILD

Now that we have a FORCE_REBUILD flag to the scrub ioctl, try to use
that over the (much noisier) error injection knob, which may or may not
even be enabled in the kernel config.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: don't retry unsupported optimizations
Darrick J. Wong [Wed, 20 Dec 2023 16:53:46 +0000 (08:53 -0800)] 
xfs_scrub: don't retry unsupported optimizations

If the kernel says it doesn't support optimizing a data structure, we
should mark it done and move on.  This is much better than requeuing the
repair, in which case it will likely keep failing.  Eventually these
requeued repairs end up in the single-threaded last resort at the end of
phase 4, which makes things /very/ slow.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_scrub: handle spurious wakeups in scan_fs_tree
Darrick J. Wong [Wed, 20 Dec 2023 16:53:46 +0000 (08:53 -0800)] 
xfs_scrub: handle spurious wakeups in scan_fs_tree

Coverity reminded me that the pthread_cond_wait can wake up and return
without the predicate variable (sft.nr_dirs > 0) actually changing.
Therefore, one has to retest the condition after each wakeup.

Coverity-id: 1554280
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_io: support passing the FORCE_REBUILD flag to online repair
Darrick J. Wong [Wed, 20 Dec 2023 16:53:46 +0000 (08:53 -0800)] 
xfs_io: support passing the FORCE_REBUILD flag to online repair

Add CLI options to the scrubv and repair commands so that the user can
pass FORCE_REBUILD to force the kernel to rebuild metadata.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_io: extract control number parsing routines
Darrick J. Wong [Wed, 20 Dec 2023 16:53:46 +0000 (08:53 -0800)] 
xfs_io: extract control number parsing routines

Break out the parts of parse_args that extract control numbers from the
CLI arguments, so that the function isn't as long.  This isn't all that
exciting now, but the scrub vectorization speedups will introduce a new
ioctl.  For the new command that comes with that, we'll want the control
number parsing helpers.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_io: collapse trivial helpers
Darrick J. Wong [Wed, 20 Dec 2023 16:53:45 +0000 (08:53 -0800)] 
xfs_io: collapse trivial helpers

Simply the call chain by having parse_args set the scrub ioctl
parameters in the caller's object.  The parse_args callers can then
invoke the ioctl directly, eliminating one function and one indirect
call.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_io: set exitcode = 1 on parsing errors in scrub/repair command
Darrick J. Wong [Wed, 20 Dec 2023 16:53:45 +0000 (08:53 -0800)] 
xfs_io: set exitcode = 1 on parsing errors in scrub/repair command

Set exitcode to 1 if there is an error parsing the CLI arguments to the
scrub or repair commands, like we do most other places in xfs_io.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_mdrestore: refactor progress printing and sb fixup code
Darrick J. Wong [Wed, 20 Dec 2023 16:53:45 +0000 (08:53 -0800)] 
xfs_mdrestore: refactor progress printing and sb fixup code

Now that we've fixed the dissimilarities between the two progress
printing callsites, refactor them into helpers.  Do the same for the
duplicate code that clears sb_inprogress from the primary superblock
after the copy succeeds.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_mdrestore: fix missed progress reporting
Darrick J. Wong [Wed, 20 Dec 2023 16:53:45 +0000 (08:53 -0800)] 
xfs_mdrestore: fix missed progress reporting

Currently, the progress reporting only triggers when the number of bytes
read is exactly a multiple of a megabyte.  This isn't always guaranteed,
since AG headers can be 512 bytes in size.  Fix the algorithm by
recording the number of megabytes we've reported as being read, and emit
a new report any time the bytes_read count, once converted to megabytes,
doesn't match.

Fix the v2 code to emit one final status message in case the last
extent restored is more than a megabyte.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
5 months agoxfs_mdrestore: EXTERNALLOG is a compat value, not incompat
Darrick J. Wong [Wed, 20 Dec 2023 16:53:45 +0000 (08:53 -0800)] 
xfs_mdrestore: EXTERNALLOG is a compat value, not incompat

Fix this check to look at the correct header field.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
5 months agoxfs_mdrestore: emit newlines for fatal errors
Darrick J. Wong [Wed, 20 Dec 2023 16:53:44 +0000 (08:53 -0800)] 
xfs_mdrestore: emit newlines for fatal errors

Spit out a newline after a fatal error message.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
5 months agoxfs_mdrestore: fix uninitialized variables in mdrestore main
Darrick J. Wong [Wed, 20 Dec 2023 16:53:44 +0000 (08:53 -0800)] 
xfs_mdrestore: fix uninitialized variables in mdrestore main

Coverity complained about the "is fd a file?" flags being uninitialized.
Clean this up.

Coverity-id: 1554270
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
5 months agoxfs_metadump.8: update for external log device options
Darrick J. Wong [Wed, 20 Dec 2023 16:53:44 +0000 (08:53 -0800)] 
xfs_metadump.8: update for external log device options

Update the documentation to reflect that we can metadump external log
device contents.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
5 months agoxfs_db: report the device associated with each io cursor
Darrick J. Wong [Wed, 20 Dec 2023 16:53:44 +0000 (08:53 -0800)] 
xfs_db: report the device associated with each io cursor

When db is reporting on an io cursor, have it print out the device
that the cursor is pointing to.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_copy: actually do directio writes to block devices
Darrick J. Wong [Wed, 20 Dec 2023 16:53:43 +0000 (08:53 -0800)] 
xfs_copy: actually do directio writes to block devices

Not sure why block device targets don't get O_DIRECT in !buffered mode,
but it's misleading when the copy completes instantly only to stall
forever due to fsync-on-close.  Adjust the "write last sector" code to
allocate a properly aligned buffer.

In removing the onstack buffer for EOD writes, this also corrects the
buffer being larger than necessary -- the old code declared an array of
32768 pointers, whereas all we really need is an aligned 32768-byte
buffer.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agoxfs_copy: distinguish short writes to EOD from runtime errors
Darrick J. Wong [Wed, 20 Dec 2023 16:53:43 +0000 (08:53 -0800)] 
xfs_copy: distinguish short writes to EOD from runtime errors

Detect short writes to the end of the destination device and report
them.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
5 months agolibxfs: don't UAF a requeued EFI
Darrick J. Wong [Wed, 20 Dec 2023 16:53:43 +0000 (08:53 -0800)] 
libxfs: don't UAF a requeued EFI

In the kernel, commit 8ebbf262d4684 ("xfs: don't block in busy flushing
when freeing extents") changed the allocator behavior such that AGFL
fixing can return -EAGAIN in response to detection of a deadlock with
the transaction busy extent list.  If this happens, we're supposed to
requeue the EFI so that we can roll the transaction and try the item
again.

If a requeue happens, we should not free the xefi pointer in
xfs_extent_free_finish_item or else the retry will walk off a dangling
pointer.  There is no extent busy list in userspace so this should
never happen, but let's fix the logic bomb anyway.

We should have ported kernel commit 0853b5de42b47 ("xfs: allow extent
free intents to be retried") to userspace, but neither Carlos nor I
noticed this fine detail. :(

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
5 months agolibfrog: move 64-bit division wrappers to libfrog
Darrick J. Wong [Wed, 20 Dec 2023 16:53:43 +0000 (08:53 -0800)] 
libfrog: move 64-bit division wrappers to libfrog

We want to keep the rtgroup unit conversion functions as static inlines,
so share the div64 functions via libfrog instead of libxfs_priv.h.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Chandan Babu R <chandanbabu@kernel.org>
6 months agolibxfs: split out a libxfs_dev structure from struct libxfs_init
Christoph Hellwig [Mon, 11 Dec 2023 16:37:42 +0000 (17:37 +0100)] 
libxfs: split out a libxfs_dev structure from struct libxfs_init

Most of the content of libxfs_init is members duplicated for each of the
data, log and RT devices.  Split those members into a separate
libxfs_dev structure.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: stash away the device fd in struct xfs_buftarg
Christoph Hellwig [Mon, 11 Dec 2023 16:37:41 +0000 (17:37 +0100)] 
libxfs: stash away the device fd in struct xfs_buftarg

Cache the open file descriptor for each device in the buftarg
structure and remove the now unused dev_map infrastructure.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agoxfs_repair: remove various libxfs_device_to_fd calls
Christoph Hellwig [Mon, 11 Dec 2023 16:37:40 +0000 (17:37 +0100)] 
xfs_repair: remove various libxfs_device_to_fd calls

A few places in xfs_repair call libxfs_device_to_fd to get the data
device fd from the data device dev_t stored in the libxfs_init
structure.  Just use the file descriptor stored right there directly.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: pass the device fd to discard_blocks
Christoph Hellwig [Mon, 11 Dec 2023 16:37:39 +0000 (17:37 +0100)] 
libxfs: pass the device fd to discard_blocks

No need to do a dev_t to fd lookup when the caller already has the fd.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: return the opened fd from libxfs_device_open
Christoph Hellwig [Mon, 11 Dec 2023 16:37:38 +0000 (17:37 +0100)] 
libxfs: return the opened fd from libxfs_device_open

So that the caller can stash it away without having to call
xfs_device_to_fd.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: mark libxfs_device_{open,close} static
Christoph Hellwig [Mon, 11 Dec 2023 16:37:37 +0000 (17:37 +0100)] 
libxfs: mark libxfs_device_{open,close} static

libxfs_device_open and libxfs_device_close are only used in init.c.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: remove dead size < 0 checks in libxfs_init
Christoph Hellwig [Mon, 11 Dec 2023 16:37:36 +0000 (17:37 +0100)] 
libxfs: remove dead size < 0 checks in libxfs_init

libxfs_init initializes the device size to 0 at the start of the function
and libxfs_open_device never sets the size to a negativ value.  Remove
these checks as they are dead code.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibfrog: make platform_set_blocksize exit on fatal failure
Christoph Hellwig [Mon, 11 Dec 2023 16:37:35 +0000 (17:37 +0100)] 
libfrog: make platform_set_blocksize exit on fatal failure

platform_set_blocksize has a fatal argument that is currently only
used to change the printed message.  Make it actually fatal similar to
other libfrog platform helpers to simplify the caller.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: remove the setblksize == 1 case in libxfs_device_open
Christoph Hellwig [Mon, 11 Dec 2023 16:37:34 +0000 (17:37 +0100)] 
libxfs: remove the setblksize == 1 case in libxfs_device_open

All callers of libxfs_init always pass an actual sector size or zero in
the setblksize member.  Remove the unreachable setblksize == 1 case.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: making passing flags to libxfs_init less confusing
Christoph Hellwig [Mon, 11 Dec 2023 16:37:33 +0000 (17:37 +0100)] 
libxfs: making passing flags to libxfs_init less confusing

The libxfs_xinit stucture has four different ways to pass flags to
libxfs_init:

 - the isreadonly argument despite it's name contains various LIBXFS_
   flags that go beyond just the readonly flag
 - the isdirect flag contains a single LIBXFS_ flag from the same name
 - the usebuflock is an integer used as bool
 - the bcache_flags member is used to pass flags directly to cache_init()
   for the buffer cache

While there is good arguments for keeping the last one separate, all the
others are rather confusing.  Consolidate them into a single flags member
using flags in the LIBXFS_* namespace.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: merge the file vs device cases in libxfs_init
Christoph Hellwig [Mon, 11 Dec 2023 16:37:32 +0000 (17:37 +0100)] 
libxfs: merge the file vs device cases in libxfs_init

The only special handling for an XFS device on a regular file is that
we skip the checks in check_open.  Simplify perform those conditionally
instead of duplicating the entire sequence.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: pass a struct libxfs_init to libxfs_alloc_buftarg
Christoph Hellwig [Mon, 11 Dec 2023 16:37:31 +0000 (17:37 +0100)] 
libxfs: pass a struct libxfs_init to libxfs_alloc_buftarg

Pass a libxfs_init structure to libxfs_alloc_buftarg instead of three
separate dev_t values.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: pass a struct libxfs_init to libxfs_mount
Christoph Hellwig [Mon, 11 Dec 2023 16:37:30 +0000 (17:37 +0100)] 
libxfs: pass a struct libxfs_init to libxfs_mount

Pass a libxfs_init structure to libxfs_mount instead of three separate
dev_t values.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: rename struct libxfs_xinit to libxfs_init
Christoph Hellwig [Mon, 11 Dec 2023 16:37:29 +0000 (17:37 +0100)] 
libxfs: rename struct libxfs_xinit to libxfs_init

Make the struct name more usual, and remove the libxfs_init_t typedef.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxlog: remove the global libxfs_xinit x structure
Christoph Hellwig [Mon, 11 Dec 2023 16:37:28 +0000 (17:37 +0100)] 
libxlog: remove the global libxfs_xinit x structure

There is no need to export a libxfs_xinit with the somewhat unsuitable
name x from libxlog.  Move it into the tools linking against libxlog
that actually need it.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxlog: don't require a libxfs_xinit structure for xlog_init
Christoph Hellwig [Mon, 11 Dec 2023 16:37:27 +0000 (17:37 +0100)] 
libxlog: don't require a libxfs_xinit structure for xlog_init

xlog_init currently requires a libxfs_args structure to be passed in,
and then clobbers various log-related arguments to it.  There is no
good reason for that as all the required information can be calculated
without it.

Remove the x argument to xlog_init and xlog_is_dirty and the now unused
logBBstart member in struct libxfs_xinit.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxlog: add a helper to initialize a xlog without clobbering the x structure
Christoph Hellwig [Mon, 11 Dec 2023 16:37:26 +0000 (17:37 +0100)] 
libxlog: add a helper to initialize a xlog without clobbering the x structure

xfsprogs has three copies of a code sequence to initialize an xlog
structure from a libxfs_init structure. Factor the code into a helper.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxlog: remove the verbose argument to xlog_is_dirty
Christoph Hellwig [Mon, 11 Dec 2023 16:37:25 +0000 (17:37 +0100)] 
libxlog: remove the verbose argument to xlog_is_dirty

No caller passes a non-zero verbose argument to xlog_is_dirty.
Remove the argument the code keyed off by it.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agoxfs_logprint: move all code to set up the fake xlog into logstat()
Christoph Hellwig [Mon, 11 Dec 2023 16:37:24 +0000 (17:37 +0100)] 
xfs_logprint: move all code to set up the fake xlog into logstat()

Isolate the code that sets up the fake xlog into the logstat() helper to
prepare for upcoming changes.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: remove the volname concept
Christoph Hellwig [Mon, 11 Dec 2023 16:37:23 +0000 (17:37 +0100)] 
libxfs: remove the volname concept

IRIX has the concept of a volume that has data/log/rt subvolumes (that's
where the subvolume name in Linux comes from), but in the current
Linux-only xfsprogs version trying to pretend we do anything with that
it is just utterly confusing.  The volname is basically just a very
obsfucated second way to pass the data device name, so get rid of it
in the libxfs and progs internals.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs/frog: remove latform_find{raw,block}path
Christoph Hellwig [Mon, 11 Dec 2023 16:37:22 +0000 (17:37 +0100)] 
libxfs/frog: remove latform_find{raw,block}path

Stop pretending we try to distinguish between the legacy Unix raw and
block devices nodes.  Linux as the only currently support platform never
had them, but other modern Unix variants like FreeBSD also got rid of
this distinction years ago.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: remove the dead {d,log,rt}path variables in libxfs_init
Christoph Hellwig [Mon, 11 Dec 2023 16:37:21 +0000 (17:37 +0100)] 
libxfs: remove the dead {d,log,rt}path variables in libxfs_init

These variables are only initialized, and then unlink is called if they
were changed from the initial value, which can't happen.  Remove the
variables and the conditional unlink calls.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agolibxfs: remove the unused icache_flags member from struct libxfs_xinit
Christoph Hellwig [Mon, 11 Dec 2023 16:37:20 +0000 (17:37 +0100)] 
libxfs: remove the unused icache_flags member from struct libxfs_xinit

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agoxfs_io/encrypt: support specifying crypto data unit size
Eric Biggers [Fri, 13 Oct 2023 06:26:39 +0000 (23:26 -0700)] 
xfs_io/encrypt: support specifying crypto data unit size

Add an '-s' option to the 'set_encpolicy' command of xfs_io to allow
exercising the log2_data_unit_size field that is being added to struct
fscrypt_policy_v2 (kernel patch:
https://lore.kernel.org/linux-fscrypt/20230925055451.59499-6-ebiggers@kernel.org).

The xfs_io support is needed for xfstests
(https://lore.kernel.org/fstests/20231013061403.138425-1-ebiggers@kernel.org),
which currently relies on xfs_io to access the encryption ioctls.

Signed-off-by: Eric Biggers <ebiggers@google.com>
Reviewed-by: Carlos Maiolino <cem@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Add support for passing log device as an argument
Chandan Babu R [Mon, 6 Nov 2023 13:10:54 +0000 (18:40 +0530)] 
mdrestore: Add support for passing log device as an argument

metadump v2 format allows dumping metadata from external log devices. This
commit allows passing the device file to which log data must be restored from
the corresponding metadump file.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Define mdrestore ops for v2 format
Chandan Babu R [Mon, 6 Nov 2023 13:10:53 +0000 (18:40 +0530)] 
mdrestore: Define mdrestore ops for v2 format

This commit adds functionality to restore metadump stored in v2 format.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Extract target device size verification into a function
Chandan Babu R [Mon, 6 Nov 2023 13:10:52 +0000 (18:40 +0530)] 
mdrestore: Extract target device size verification into a function

A future commit will need to perform the device size verification on an
external log device. In preparation for this, this commit extracts the
relevant portions into a new function. No functional changes have been
introduced.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Introduce mdrestore v1 operations
Chandan Babu R [Mon, 6 Nov 2023 13:10:51 +0000 (18:40 +0530)] 
mdrestore: Introduce mdrestore v1 operations

In order to indicate the version of metadump files that they can work with,
this commit renames read_header(), show_info() and restore() functions to
read_header_v1(), show_info_v1() and restore_v1() respectively.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Replace metadump header pointer argument with a union pointer
Chandan Babu R [Mon, 6 Nov 2023 13:10:50 +0000 (18:40 +0530)] 
mdrestore: Replace metadump header pointer argument with a union pointer

We will need two variants of read_header(), show_info() and restore() helper
functions to support two versions of metadump formats. To this end, A future
commit will introduce a vector of function pointers to work with the two
metadump formats. To have a common function signature for the function
pointers, this commit replaces the first argument of the previously listed
function pointers from "struct xfs_metablock *" with "union
mdrestore_headers *".

Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Add open_device(), read_header() and show_info() functions
Chandan Babu R [Mon, 6 Nov 2023 13:10:49 +0000 (18:40 +0530)] 
mdrestore: Add open_device(), read_header() and show_info() functions

This commit moves functionality associated with opening the target device,
reading metadump header information and printing information about the
metadump into their respective functions. There are no functional changes made
by this commit.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Detect metadump v1 magic before reading the header
Chandan Babu R [Mon, 6 Nov 2023 13:10:48 +0000 (18:40 +0530)] 
mdrestore: Detect metadump v1 magic before reading the header

In order to support both v1 and v2 versions of metadump, mdrestore will have
to detect the format in which the metadump file has been stored on the disk
and then read the ondisk structures accordingly. In a step in that direction,
this commit splits the work of reading the metadump header from disk into two
parts,
1. Read the first 4 bytes containing the metadump magic code.
2. Read the remaining part of the header.

A future commit will take appropriate action based on the value of the magic
code.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Define and use struct mdrestore
Chandan Babu R [Mon, 6 Nov 2023 13:10:47 +0000 (18:40 +0530)] 
mdrestore: Define and use struct mdrestore

This commit collects all state tracking variables in a new "struct mdrestore"
structure. This is done to collect all the global variables in one place
rather than having them spread across the file. A new structure member of type
"struct mdrestore_ops *" will be added by a future commit to support the two
versions of metadump.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Declare boolean variables with bool type
Chandan Babu R [Mon, 6 Nov 2023 13:10:46 +0000 (18:40 +0530)] 
mdrestore: Declare boolean variables with bool type

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agoxfs_db: Add support to read from external log device
Chandan Babu R [Mon, 6 Nov 2023 13:10:45 +0000 (18:40 +0530)] 
xfs_db: Add support to read from external log device

This commit introduces a new function set_log_cur() allowing xfs_db to read
from an external log device. This is required by a future commit which will
add the ability to dump metadata from external log devices.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Define metadump ops for v2 format
Chandan Babu R [Mon, 6 Nov 2023 13:10:44 +0000 (18:40 +0530)] 
metadump: Define metadump ops for v2 format

This commit adds functionality to dump metadata from an XFS filesystem in
newly introduced v2 format.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Define metadump v2 ondisk format structures and macros
Chandan Babu R [Mon, 6 Nov 2023 13:10:43 +0000 (18:40 +0530)] 
metadump: Define metadump v2 ondisk format structures and macros

The corresponding metadump file's disk layout is as shown below,

     |------------------------------|
     | struct xfs_metadump_header   |
     |------------------------------|
     | struct xfs_meta_extent 0     |
     | Extent 0's data              |
     | struct xfs_meta_extent 1     |
     | Extent 1's data              |
     | ...                          |
     | struct xfs_meta_extent (n-1) |
     | Extent (n-1)'s data          |
     |------------------------------|

The "struct xfs_metadump_header" is followed by alternating series of "struct
xfs_meta_extent" and the extent itself.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Rename XFS_MD_MAGIC to XFS_MD_MAGIC_V1
Chandan Babu R [Mon, 6 Nov 2023 13:10:42 +0000 (18:40 +0530)] 
metadump: Rename XFS_MD_MAGIC to XFS_MD_MAGIC_V1

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Introduce metadump v1 operations
Chandan Babu R [Mon, 6 Nov 2023 13:10:41 +0000 (18:40 +0530)] 
metadump: Introduce metadump v1 operations

This commit moves functionality associated with writing metadump to disk into
a new function. It also renames metadump initialization, write and release
functions to reflect the fact that they work with v1 metadump files.

The metadump initialization, write and release functions are now invoked via
metadump_ops->init(), metadump_ops->write() and metadump_ops->release()
respectively.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Introduce struct metadump_ops
Chandan Babu R [Mon, 6 Nov 2023 13:10:40 +0000 (18:40 +0530)] 
metadump: Introduce struct metadump_ops

We will need two sets of functions to implement two versions of metadump. This
commit adds the definition for 'struct metadump_ops' to hold pointers to
version specific metadump functions.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Postpone invocation of init_metadump()
Chandan Babu R [Mon, 6 Nov 2023 13:10:39 +0000 (18:40 +0530)] 
metadump: Postpone invocation of init_metadump()

The metadump v2 initialization function (introduced in a later commit) writes
the header structure into the metadump file. This will require the program to
open the metadump file before the initialization function has been invoked.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Add initialization and release functions
Chandan Babu R [Mon, 6 Nov 2023 13:10:38 +0000 (18:40 +0530)] 
metadump: Add initialization and release functions

Move metadump initialization and release functionality into corresponding
functions. There are no functional changes made in this commit.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Define and use struct metadump
Chandan Babu R [Mon, 6 Nov 2023 13:10:37 +0000 (18:40 +0530)] 
metadump: Define and use struct metadump

This commit collects all state tracking variables in a new "struct metadump"
structure. This is done to collect all the global variables in one place
rather than having them spread across the file. A new structure member of type
"struct metadump_ops *" will be added by a future commit to support the two
versions of metadump.

Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Declare boolean variables with bool type
Chandan Babu R [Mon, 6 Nov 2023 13:10:36 +0000 (18:40 +0530)] 
metadump: Declare boolean variables with bool type

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agomdrestore: Fix logic used to check if target device is large enough
Chandan Babu R [Mon, 6 Nov 2023 13:10:35 +0000 (18:40 +0530)] 
mdrestore: Fix logic used to check if target device is large enough

The device size verification code should be writing XFS_MAX_SECTORSIZE bytes
to the end of the device rather than "sizeof(char *) * XFS_MAX_SECTORSIZE"
bytes.

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agometadump: Use boolean values true/false instead of 1/0
Chandan Babu R [Mon, 6 Nov 2023 13:10:34 +0000 (18:40 +0530)] 
metadump: Use boolean values true/false instead of 1/0

Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Chandan Babu R <chandan.babu@oracle.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agorepair: fix the call to search_rt_dup_extent in scan_bmapbt
Christoph Hellwig [Thu, 9 Nov 2023 16:02:33 +0000 (17:02 +0100)] 
repair: fix the call to search_rt_dup_extent in scan_bmapbt

search_rt_dup_extent expects an RT extent number and not a fsbno.
Convert the units before the call.  Without this we are unlikely
to ever found a legit duplicate extent on the RT subvolume because
the search will always be off the end.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agoxfs_quota: fix missing mount point warning
Pavel Reichl [Wed, 11 Oct 2023 20:50:54 +0000 (22:50 +0200)] 
xfs_quota: fix missing mount point warning

When user have mounted an XFS volume, and defined project in
/etc/projects file that points to a directory on a different volume,
then:
`xfs_quota -xc "report -a" $path_to_mounted_volume'

complains with:
"xfs_quota: cannot find mount point for path \
`directory_from_projects': Invalid argument"

unlike `xfs_quota -xc "report -a"' which works as expected and no
warning is printed.

This is happening because in the 1st call we pass to xfs_quota command
the $path_to_mounted_volume argument which says to xfs_quota not to
look for all mounted volumes on the system, but use only those passed
to the command and ignore all others (This behavior is intended as an
optimization for systems with huge number of mounted volumes). After
that, while projects are initialized, the project's directories on
other volumes are obviously not in searched subset of volumes and
warning is printed.

I propose to fix this behavior by conditioning the printing of warning
only if all mounted volumes are searched.

Signed-off-by: Pavel Reichl <preichl@redhat.com>
Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agodb: fix unsigned char related warnings
Christoph Hellwig [Fri, 3 Nov 2023 16:02:10 +0000 (17:02 +0100)] 
db: fix unsigned char related warnings

Clean up the code in hash.c to use the normal char type for all
high-level code, only casting to uint8_t when calling into low-level
code.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
6 months agoPolish translation update for xfsprogs 6.5.0.
Jakub Bogusz [Wed, 15 Nov 2023 13:51:58 +0000 (14:51 +0100)] 
Polish translation update for xfsprogs 6.5.0.

Signed-off-by: Jakub Bogusz <qboosh@pld-linux.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
7 months agoxfs: adjust the incore perag block_count when shrinking
Darrick J. Wong [Tue, 14 Nov 2023 12:27:44 +0000 (13:27 +0100)] 
xfs: adjust the incore perag block_count when shrinking

Source kernel commit: 6868b8505c807ad9397d78cc4e07cb1cb3582152

If we reduce the number of blocks in an AG, we must update the incore
geometry values as well.

Fixes: 0800169e3e2c9 ("xfs: Pre-calculate per-AG agbno geometry")
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
7 months agoRevert "xfs: switch to multigrain timestamps"
Christian Brauner [Tue, 14 Nov 2023 12:27:31 +0000 (13:27 +0100)] 
Revert "xfs: switch to multigrain timestamps"

Source kernel commit: f798accd5987dc2280e0ba9055edf1124af46a5f

This reverts commit e44df2664746aed8b6dd5245eb711a0ce33c5cf5.

Users reported regressions due to enabling multi-grained timestamps
unconditionally. As no clear consensus on a solution has come up and the
discussion has gone back to the drawing board revert the infrastructure
changes for. If it isn't code that's here to stay, make it go away.

Message-ID: <20230920-keine-eile-c9755b5825db@brauner>
Acked-by: Jan Kara <jack@suse.cz>
Acked-by: Jeff Layton <jlayton@kernel.org>
Signed-off-by: Christian Brauner <brauner@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
7 months agoxfs: fix log recovery when unknown rocompat bits are set
Darrick J. Wong [Tue, 14 Nov 2023 09:48:07 +0000 (10:48 +0100)] 
xfs: fix log recovery when unknown rocompat bits are set

Source kernel commit: 74ad4693b6473950e971b3dc525b5ee7570e05d0

Log recovery has always run on read only mounts, even where the primary
superblock advertises unknown rocompat bits.  Due to a misunderstanding
between Eric and Darrick back in 2018, we accidentally changed the
superblock write verifier to shutdown the fs over that exact scenario.
As a result, the log cleaning that occurs at the end of the mounting
process fails if there are unknown rocompat bits set.

As we now allow writing of the superblock if there are unknown rocompat
bits set on a RO mount, we no longer want to turn off RO state to allow
log recovery to succeed on a RO mount.  Hence we also remove all the
(now unnecessary) RO state toggling from the log recovery path.

Fixes: 9e037cb7972f ("xfs: check for unknown v5 feature bits in superblock write verifier"
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Dave Chinner <dchinner@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
7 months agoxfs: switch to multigrain timestamps
Jeff Layton [Tue, 14 Nov 2023 09:47:49 +0000 (10:47 +0100)] 
xfs: switch to multigrain timestamps

Source kernel commit: e44df2664746aed8b6dd5245eb711a0ce33c5cf5

Enable multigrain timestamps, which should ensure that there is an
apparent change to the timestamp whenever it has been written after
being actively observed via getattr.

Also, anytime the mtime changes, the ctime must also change, and those
are now the only two options for xfs_trans_ichgtime. Have that function
unconditionally bump the ctime, and ASSERT that XFS_ICHGTIME_CHG is
always set.

Acked-by: "Darrick J. Wong" <djwong@kernel.org>
Signed-off-by: Jeff Layton <jlayton@kernel.org>
Message-Id: <20230807-mgctime-v7-11-d1dec143a704@kernel.org>
Signed-off-by: Christian Brauner <brauner@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
7 months agoxfs: allow userspace to rebuild metadata structures
Darrick J. Wong [Tue, 14 Nov 2023 09:39:26 +0000 (10:39 +0100)] 
xfs: allow userspace to rebuild metadata structures

Source kernel commit: 5c83df2e54b6af870e3e02ccd2a8ecd54e36668c

Add a new (superuser-only) flag to the online metadata repair ioctl to
force it to rebuild structures, even if they're not broken.  We will use
this to move metadata structures out of the way during a free space
defragmentation operation.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Dave Chinner <dchinner@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
7 months agoxfs: convert to ctime accessor functions
Jeff Layton [Tue, 14 Nov 2023 09:39:12 +0000 (10:39 +0100)] 
xfs: convert to ctime accessor functions

Source kernel commit: a0a415e34b57368acd262e1172720252c028b936

In later patches, we're going to change how the inode's ctime field is
used. Switch to using accessor functions instead of raw accesses of
inode->i_ctime.

Signed-off-by: Jeff Layton <jlayton@kernel.org>
Reviewed-by: Jan Kara <jack@suse.cz>
Message-Id: <20230705190309.579783-80-jlayton@kernel.org>
Signed-off-by: Christian Brauner <brauner@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agoxfsprogs: Release v6.5.0 v6.5.0
Carlos Maiolino [Thu, 12 Oct 2023 11:23:53 +0000 (13:23 +0200)] 
xfsprogs: Release v6.5.0

Update all the necessary files for a 6.5.0 release.

Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agolibfrog: drop build host crc32 selftest
Krzesimir Nowak [Thu, 28 Sep 2023 11:23:38 +0000 (13:23 +0200)] 
libfrog: drop build host crc32 selftest

CRC selftests running on a build host were useful long time ago, when
CRC support was added to the on-disk support. Now it's purpose is
replaced by fstests. Also mkfs.xfs and xfs_repair have their own
selftests.

On top of that, it adds a dependency on liburcu on the build host for
no reason - liburcu is not used by the crc32 selftest.

Signed-off-by: Krzesimir Nowak <knowak@microsoft.com>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agolibxfs: fix atomic64_t detection on x86 32-bit architectures
Darrick J. Wong [Tue, 12 Sep 2023 19:47:51 +0000 (12:47 -0700)] 
libxfs: fix atomic64_t detection on x86 32-bit architectures

xfsprogs during compilation tries to detect if liburcu supports atomic
64-bit ops on the platform it is being compiled on, and if not it falls
back to using pthread mutex locks.

The detection logic for that fallback relies on _uatomic_link_error()
which is a link-time trick used by liburcu that will cause compilation
errors on archs that lack the required support. That only works for the
generic liburcu code though, and it is not implemented for the
x86-specific code.

In practice this means that when xfsprogs is compiled on 32-bit x86
archs will successfully link to liburcu for atomic ops, but liburcu does
not support atomic64_t on those archs. It indicates this during runtime
by generating an illegal instruction that aborts execution, and thus
causes various xfsprogs utils to be segfaulting.

Fix this by requiring that unsigned longs are at least 64 bits in size,
which /usually/ means that 64-bit atomic counters are supported.  We
can't simply execute the liburcu atomic64_t detection code during
configure instead of only relying on the linker error because that
doesn't work for cross-compiled packages.

Fixes: 7448af588a2e ("libxfs: fix atomic64_t poorly for 32-bit architectures")
Reported-by: Anthony Iliopoulos <ailiop@suse.com>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: Bill O'Donnell <bodonnel@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agoxfs_repair: set aformat and anextents correctly when clearing the attr fork
Darrick J. Wong [Tue, 12 Sep 2023 19:40:04 +0000 (12:40 -0700)] 
xfs_repair: set aformat and anextents correctly when clearing the attr fork

Ever since commit b42db0860e130 ("xfs: enhance dinode verifier"), we've
required that inodes with zero di_forkoff must also have di_aformat ==
EXTENTS and di_naextents == 0.  clear_dinode_attr actually does this,
but then both callers inexplicably set di_format = LOCAL.  That in turn
causes a verifier failure the next time the xattrs of that file are
read by the kernel.  Get rid of the bogus field write.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: Bill O'Donnell <bodonnel@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agoxfs_scrub: actually return errno from check_xattr_ns_names
Darrick J. Wong [Tue, 12 Sep 2023 19:39:58 +0000 (12:39 -0700)] 
xfs_scrub: actually return errno from check_xattr_ns_names

Actually return the error code when extended attribute checks fail.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agolibxfs: use XFS_IGET_CREATE when creating new files
Darrick J. Wong [Tue, 12 Sep 2023 19:39:52 +0000 (12:39 -0700)] 
libxfs: use XFS_IGET_CREATE when creating new files

Use this flag to check that newly allocated inodes are, in fact,
unallocated.  This matches the kernel, and prevents userspace programs
from making latent corruptions worse by unintentionally crosslinking
files.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Reviewed-by: Bill O'Donnell <bodonnel@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agolibfrog: don't fail on XFS_FSOP_GEOM_FLAGS_NREXT64 in xfrog_bulkstat_single5
Darrick J. Wong [Tue, 12 Sep 2023 19:39:47 +0000 (12:39 -0700)] 
libfrog: don't fail on XFS_FSOP_GEOM_FLAGS_NREXT64 in xfrog_bulkstat_single5

This flag is perfectly acceptable for bulkstatting a single file;
there's no reason not to allow it.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agolibfrog: fix overly sleep workqueues
Darrick J. Wong [Tue, 12 Sep 2023 19:39:41 +0000 (12:39 -0700)] 
libfrog: fix overly sleep workqueues

I discovered the following bad behavior in the workqueue code when I
noticed that xfs_scrub was running single-threaded despite having 4
virtual CPUs allocated to the VM.  I observed this sequence:

Thread 1 WQ1 WQ2...N
workqueue_create
<start up>
pthread_cond_wait
<start up>
pthread_cond_wait
workqueue_add
next_item == NULL
pthread_cond_signal

workqueue_add
next_item != NULL
<do not pthread_cond_signal>

<receives wakeup>
<run first item>

workqueue_add
next_item != NULL
<do not pthread_cond_signal>

<run second item>
<run third item>
pthread_cond_wait

workqueue_terminate
pthread_cond_broadcast
<receives wakeup>
<nothing to do, exits>
<wakes up again>
<nothing to do, exits>

Notice how threads WQ2...N are completely idle while WQ1 ends up doing
all the work!  That wasn't the point of a worker pool!  Observe that
thread 1 manages to queue two work items before WQ1 pulls the first item
off the queue.  When thread 1 queues the third item, it sees that
next_item is not NULL, so it doesn't wake a worker.  If thread 1 queues
all the N work that it has before WQ1 empties the queue, then none of
the other thread get woken up.

Fix this by maintaining a count of the number of active threads, and
using that to wake either the sole idle thread, or all the threads if
there are many that are idle.  This dramatically improves startup
behavior of the workqueue and eliminates the collapse case.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agoxfs_db: use directio for device access
Darrick J. Wong [Mon, 25 Sep 2023 21:59:16 +0000 (14:59 -0700)] 
xfs_db: use directio for device access

XFS and tools (mkfs, copy, repair) don't generally rely on the block
device page cache, preferring instead to use directio.  For whatever
reason, the debugger was never made to do this, but let's do that now.

This should eliminate the weird fstests failures resulting from
udev/blkid pinning a cache page while the unmounting filesystem writes
to the superblock such that xfs_db finds the stale pagecache instead of
the post-unmount superblock.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agolibxfs: make platform_set_blocksize optional with directio
Darrick J. Wong [Mon, 25 Sep 2023 21:59:10 +0000 (14:59 -0700)] 
libxfs: make platform_set_blocksize optional with directio

If we're accessing the block device with directio (and hence bypassing
the page cache), then don't fail on BLKBSZSET not working.  We don't
care what happens to the pagecache bufferheads.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agomkfs: add a config file for 6.6 LTS kernels
Darrick J. Wong [Mon, 25 Sep 2023 21:59:36 +0000 (14:59 -0700)] 
mkfs: add a config file for 6.6 LTS kernels

Enable 64-bit extent counts and reverse mapping for 6.6.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agomkfs: enable reverse mapping by default
Darrick J. Wong [Mon, 25 Sep 2023 21:59:30 +0000 (14:59 -0700)] 
mkfs: enable reverse mapping by default

Now that online fsck is feature complete, there's actually a compelling
story for having the reverse mappings enabled.  Turn it on by default.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agomkfs: enable large extent counts by default
Darrick J. Wong [Mon, 25 Sep 2023 21:59:25 +0000 (14:59 -0700)] 
mkfs: enable large extent counts by default

Format filesystems with the large extent counter feature turned on.
We shall now support 64-bit extent counts for the data fork and 32-bit
extent counts for the attr fork.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agoxfs_db: create unlinked inodes
Darrick J. Wong [Mon, 25 Sep 2023 21:59:51 +0000 (14:59 -0700)] 
xfs_db: create unlinked inodes

Create an expert-mode debugger command to create unlinked inodes.
This will hopefully aid in simulation of leaked unlinked inode handling
in the kernel and elsewhere.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Bill O'Donnell <bodonnel@redhat.com>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
8 months agoxfs_db: dump unlinked buckets
Darrick J. Wong [Mon, 25 Sep 2023 21:59:45 +0000 (14:59 -0700)] 
xfs_db: dump unlinked buckets

Create a new command to dump the resource usage of files in the unlinked
buckets.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Bill O'Donnell <bodonnel@redhat.com>
Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
Signed-off-by: Carlos Maiolino <cem@kernel.org>
9 months agoxfs: convert flex-array declarations in xfs attr shortform objects
Darrick J. Wong [Thu, 7 Sep 2023 09:41:01 +0000 (11:41 +0200)] 
xfs: convert flex-array declarations in xfs attr shortform objects

Source kernel commit: f6250e205691a58c81be041b1809a2e706852641

As of 6.5-rc1, UBSAN trips over the ondisk extended attribute shortform
definitions using an array length of 1 to pretend to be a flex array.
Kernel compilers have to support unbounded array declarations, so let's
correct this.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Kees Cook <keescook@chromium.org>
Signed-off-by: Carlos Maiolino <cem@kernel.org>