Currently libxfs has a cache for xfs_inode structures. Unlike in kernelspace,
where the inode cache and the associated page cache for file data are used
for all filesystem operations, the libxfs inode cache is only used in a few
places (a sketch of the common usage pattern follows the list):
- the libxfs init code reads the root and realtime inodes when called from
xfs_db using a special flag, but these inode structures are never referenced
again
- mkfs uses namespace and bmap routines that take the xfs_inode structure
to create the root and realtime inodes, as well as any additional files
specified in the proto file
- the xfs_db attr code uses xfs_inode-based attr routines in the attrset
and attrget commands
- phase6 of xfs_repair uses xfs_inode-based routines for rebuilding
directories and moving files to the lost+found directory.
- phase7 of xfs_repair uses struct xfs_inode to modify the nlink count
of inodes.
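
All of these consumers follow the same short-lived pattern: look up a
struct xfs_inode, use it for one operation, and drop it. The sketch below
illustrates that pattern, assuming libxfs_iget()/libxfs_iput()-style
helpers; the exact signatures differ between xfsprogs releases, so treat
it as illustrative rather than code from this patch.

/*
 * Illustrative sketch only: the short-lived way the consumers above use
 * the incore inode.  Helper signatures are approximate and have changed
 * between xfsprogs releases; the header is the one installed by
 * xfsprogs-devel, in-tree code uses <libxfs.h> instead.
 */
#include <xfs/libxfs.h>

static int
touch_one_inode(struct xfs_mount *mp, xfs_ino_t ino)
{
	struct xfs_inode	*ip;
	int			error;

	/* Read the on-disk inode into an incore xfs_inode. */
	error = libxfs_iget(mp, NULL, ino, 0, &ip, 0);
	if (error)
		return error;

	/* ... one namespace, bmap, attr or nlink operation on ip ... */

	/*
	 * Done.  Nothing ever looks this inode up again, so keeping it
	 * around in a cache only costs memory.
	 */
	libxfs_iput(ip, 0);
	return 0;
}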
So except in repair we never ever reuse a cached inode, and even in repair
the logical inode caching doesn't help:
- in phase 6a we iterate over each inode in the incore inode tree,
and if it's a directory check/rebuild it
- phase6b then updates the "." and ".." entries for directories
that need it, which means we require the backing buffers.
- phase6c moves disconnected inodes to lost_found, which again needs
the backing buffer to actually do anything.
- phase7 then only touches inodes for which we need to reset i_nlink,
which always involves reading, modifying and writing the physical
inode (this read-modify-write cycle is sketched below).
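
A schematic of that cycle, using toy types and hypothetical helper names
rather than real xfsprogs code: whatever phase 6 or 7 wants to change, the
backing inode buffer has to be read and rewritten, so the incore object is
rebuilt from the buffer each time and a logical inode cache saves no I/O.

/*
 * Schematic only: toy types and helper names, not xfsprogs code.
 * It shows why a cached incore inode does not help repair -- the work
 * is always done against the backing inode cluster buffer.
 */
#include <stdint.h>
#include <stdlib.h>

struct toy_buf   { char data[8192]; };	/* backing inode cluster buffer */
struct toy_inode { uint32_t nlink; };	/* small incore representation */

/* Hypothetical stand-ins for the buffer cache and inode pack/unpack. */
static struct toy_buf *read_cluster(uint64_t ino) { (void)ino; return calloc(1, sizeof(struct toy_buf)); }
static void write_cluster(struct toy_buf *bp) { free(bp); }
static struct toy_inode *unpack_inode(struct toy_buf *bp) { (void)bp; return calloc(1, sizeof(struct toy_inode)); }
static void pack_inode(struct toy_buf *bp, struct toy_inode *ip) { (void)bp; (void)ip; }

/* What phase7 conceptually does for an inode with a wrong link count. */
static void reset_nlink(uint64_t ino, uint32_t counted)
{
	struct toy_buf *bp = read_cluster(ino);		/* read the buffer regardless */
	struct toy_inode *ip = unpack_inode(bp);	/* incore inode rebuilt from it */

	ip->nlink = counted;				/* the actual fix */

	pack_inode(bp, ip);				/* the change lives in the buffer... */
	write_cluster(bp);
	free(ip);					/* ...so caching ip gains nothing */
}

int main(void)
{
	reset_nlink(128, 2);	/* arbitrary example inode number and link count */
	return 0;
}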
Given these facts, stop caching the inodes to reduce memory usage,
especially in xfs_repair, where this makes a difference for filesystems
with large inode counts. On the upper end this allows repair to complete
for filesystem / memory combinations where it previously could not.
With this we probably could increase the memory available to the buffer
cache in xfs_repair, but trying to do so I got a bit lost - the current
formula seems too magic to me to make any sense, and simply doubling the
buffer cache size causes us to run out of memory, given that the data cached
in the buffer cache (typically lots of 8k inode buffers and a few 4k other
metadata buffers) is much bigger than the inodes cached in the inode
cache. We probably need a sizing scheme that takes the actual amount
of memory allocated to the buffer cache into account to solve this better.
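
To make that last point concrete, here is a rough sketch of what such a
memory-aware sizing scheme could look like. It is not the formula
xfs_repair uses; the average buffer size and per-entry overhead below are
assumptions for illustration only.

/*
 * Hypothetical sketch of a memory-aware buffer cache sizing scheme,
 * not the formula xfs_repair actually uses.  Derive the number of
 * cache entries from a memory budget and an estimated average buffer
 * size instead of a fixed multiplier.  The constants are assumptions.
 */
#include <stdint.h>
#include <stdio.h>

#define AVG_BUFFER_SIZE		8192ULL	/* assume mostly 8k inode cluster buffers */
#define PER_ENTRY_OVERHEAD	256ULL	/* assumed hash/bookkeeping cost per entry */

static uint64_t
bufcache_entries_for_budget(uint64_t mem_budget_bytes)
{
	return mem_budget_bytes / (AVG_BUFFER_SIZE + PER_ENTRY_OVERHEAD);
}

int main(void)
{
	/* e.g. hand the buffer cache 75% of a 4GB memory budget */
	uint64_t budget = (4ULL << 30) * 3 / 4;

	printf("%llu cache entries for a %llu byte budget\n",
	       (unsigned long long)bufcache_entries_for_budget(budget),
	       (unsigned long long)budget);
	return 0;
}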
Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Dave Chinner <dchinner@redhat.com>
Signed-off-by: Dave Chinner <dchinner@redhat.com>
Signed-off-by: Rich Johnston <rjohnston@sgi.com>