This is part 2 of our series on addressing support for HC SSDs in the software ecosystem.
The demand for larger capacity drives requires the adoption of larger IUs. This creates a challenge for software stacks: how can drives with larger IUs be optimally adopted without any modifications to software applications, and is this even possible? Intel's 2018 white paper on the topic for QLC, titled "Achieving optimal performance & endurance on coarse indirection unit SSDs", suggests that applications should use direct I/O instead of buffered I/O to align writes to the IU, and that applications should allocate their write buffers with allocators that honor explicit alignment requirements, such as libc's posix_memalign() (a sketch of this pattern follows below). Intel's suggestions require software applications to be modified to be aware of the IU, and they leave buffered I/O unsupported. Many workloads need buffered I/O, provided by the Linux page cache, either because of software limitations, as in the case of PostgreSQL, or because of other requirements, such as when working with certain large data set AI workloads.

One way to grow support for large IUs is to require support for larger LBA formats. However, that would first require confidence, gained through I/O introspection, that existing workloads align their writes to the IU; analysis is required to prove this is possible first. Using a larger LBA format is also a non-backward-compatible and non-scalable solution: drive capacities would have to be reduced if smaller LBA formats were to be used, and a complete analysis of the changes required in standards would be needed. With ever larger HC SSDs requiring an ever-increasing LBA size, host software would need to be written to operate dynamically across numerous SSD LBA sizes concurrently. Can anything be done to avoid any software application changes while also supporting buffered I/O, without requiring the industry to move to a new LBA format?
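For reference, the kind of application change Intel's paper calls for looks roughly like the following sketch: allocate an IU-aligned buffer with posix_memalign() and issue IU-sized, IU-aligned direct I/O. The 16 KiB IU and the file path are assumptions for illustration only.

```c
/* Sketch of an IU-aware direct I/O write, assuming a 16 KiB IU. */
#define _GNU_SOURCE
#include <fcntl.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

#define IU_SIZE (16 * 1024)	/* assumed indirection unit */

int main(void)
{
	void *buf;

	/* Buffer alignment matches the IU so O_DIRECT writes stay aligned. */
	if (posix_memalign(&buf, IU_SIZE, IU_SIZE))
		return 1;
	memset(buf, 0, IU_SIZE);

	int fd = open("/mnt/data/file.db", O_WRONLY | O_CREAT | O_DIRECT, 0600);
	if (fd < 0)
		return 1;

	/* Offset and length are both multiples of the IU. */
	ssize_t ret = pwrite(fd, buf, IU_SIZE, 0);

	close(fd);
	free(buf);
	return ret == IU_SIZE ? 0 : 1;
}
```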
Linux supports multiple filesystems, and each filesystem designs how its data is written on disk. To protect against power failure, filesystems have different strategies they can use to ensure writes remain consistent. One of these strategies is to embrace a filesystem journal: the idea is that you write to the journal before considering data written on disk. On journal-based filesystems such as XFS and EXT4, in case of power failure the filesystem can replay the journal for unfinished writes. Copy on Write filesystems, such as btrfs, take a slightly different approach: data is written to a new location and the change is then linked in, and changes are not committed until the last write completes. With this strategy, if a power failure occurs the uncommitted portion of data is lost. The btrfs filesystem already relies on 16 KiB for metadata, so if it could write that entire 16 KiB atomically it could simplify its writes in the future. Regardless of which strategy a filesystem follows, this is purely a filesystem design question; it is up to filesystem developers to implement a solution. In order to design a solution, a filesystem needs to know the minimum I/O size it can use to write atomically in case of power failure. This size is known as the filesystem sector size: the minimum size the filesystem can rely on to write without concern for power failure. The actual layout of how a filesystem writes to its journal or metadata varies by filesystem.
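As a toy illustration of the write-ahead idea only (real filesystems lay out their journals very differently), the sketch below makes a sector-sized journal record durable before the in-place write is allowed to begin; the file names and the 4 KiB sector size are assumptions.

```c
/* Toy write-ahead pattern at sector granularity; illustration only. */
#define _GNU_SOURCE
#include <fcntl.h>
#include <string.h>
#include <unistd.h>

#define SECTOR_SIZE 4096	/* assumed filesystem sector size */

/* Persist a sector-sized journal record before the in-place data write. */
static int journal_then_apply(int journal_fd, int data_fd,
			      const char *record, off_t data_off)
{
	char sector[SECTOR_SIZE] = { 0 };

	strncpy(sector, record, sizeof(sector) - 1);

	/* Step 1: the journal record must be durable first. */
	if (pwrite(journal_fd, sector, SECTOR_SIZE, 0) != SECTOR_SIZE ||
	    fdatasync(journal_fd) != 0)
		return -1;

	/* Step 2: only then may the in-place write start; if power is lost
	 * here, replaying the journal record recovers the update. */
	if (pwrite(data_fd, sector, SECTOR_SIZE, data_off) != SECTOR_SIZE)
		return -1;
	return fdatasync(data_fd);
}

int main(void)
{
	int jfd = open("journal.bin", O_RDWR | O_CREAT, 0600);
	int dfd = open("data.bin", O_RDWR | O_CREAT, 0600);

	if (jfd < 0 || dfd < 0)
		return 1;
	return journal_then_apply(jfd, dfd, "update block 7", 7 * SECTOR_SIZE) ? 1 : 0;
}
```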
The filesystem sector size, then, is the minimum I/O size a filesystem uses when writing its own data: it is the unit the filesystem relies on for writes to the journal and to metadata.
The NVMe parameter Namespace Preferred Write Granularity (NPWG) represents the smallest recommended write granularity. In practice today, most HC NVMe SSDs should be reporting the IU through the NPWG, for two reasons. First, the author of the NVMe specification change that introduced NPWG had the IU in mind as an appropriate value. Second, it's required as part of the Open Compute Project NVMe Cloud Specification. Starting with the v6.15 release of Linux you can query for this with a simple stat --print=%o call on the respective block device.
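The value behind stat --print=%o is st_blksize; a minimal sketch of reading it programmatically follows, with the caveat that on kernels older than v6.15 the value for a block device may reflect the filesystem backing /dev rather than the drive itself. The device path is illustrative.

```c
/* Sketch: read the preferred I/O size that `stat --print=%o` reports. */
#include <stdio.h>
#include <sys/stat.h>

int main(int argc, char **argv)
{
	const char *dev = argc > 1 ? argv[1] : "/dev/nvme0n1";
	struct stat st;

	if (stat(dev, &st) != 0) {
		perror("stat");
		return 1;
	}
	/* On v6.15+ block devices this reports the NPWG-derived granularity. */
	printf("%s: preferred I/O size: %ld bytes\n", dev, (long)st.st_blksize);
	return 0;
}
```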
■ NPWG as IU
The value of NPWG only gives us the IU of a drive. Operating system developers should take care to ensure that NPWG is not used to imply that a write will be atomic. Atomic writes are handled through other NVMe parameters, described next.
■ AWUN for normal operation
The NVMe Atomic Write Unit Normal, AWUN, tells us the controller's atomic write size during normal operation; that is, writes up to this size are atomic to the NVM with respect to other read and write operations. If a write is larger than this size, atomicity is not guaranteed by the controller. Normal here describes the case where we are not considering power failure. Power failure is a very special case for SSDs to support and requires its own separate parameter, described next.
■ AWUPF for power fail consideration
The NVMe parameter Atomic Write Unit Power Fail (AWUPF) represents the maximum write size that is guaranteed to be written atomically to the NVM even in case of power failure.
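For completeness, recent kernels (v6.11 and later, where the atomic write work landed) expose the atomic write limits they derive from these parameters through sysfs. The sketch below reads them, assuming those sysfs attributes are present on your kernel and using an illustrative device name; for NVMe they should reflect the power fail parameters (AWUPF or its per-namespace counterpart NAWUPF) and the atomic boundary.

```c
/* Sketch: print the block layer's advertised atomic write limits. */
#include <stdio.h>

static void print_limit(const char *attr)
{
	char path[256], buf[64];
	FILE *f;

	snprintf(path, sizeof(path), "/sys/block/nvme0n1/queue/%s", attr);
	f = fopen(path, "r");
	if (!f)
		return;		/* attribute not present on this kernel/device */
	if (fgets(buf, sizeof(buf), f))
		printf("%-28s %s", attr, buf);
	fclose(f);
}

int main(void)
{
	print_limit("atomic_write_unit_min_bytes");
	print_limit("atomic_write_unit_max_bytes");
	print_limit("atomic_write_max_bytes");
	print_limit("atomic_write_boundary_bytes");
	return 0;
}
```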
■ Leveraging NVMe parameters for larger filesystem sector sizes
Based on the review in the prior sections we can now evaluate which of these parameters can be used to support larger filesystem sector sizes. There are two mechanisms by which an NVMe drive can allow filesystems to leverage a larger filesystem sector size. The first is a larger LBA format: a larger filesystem sector size follows naturally from a larger LBA format, but that requires all users of the drive to support the larger sector size. The second is a sufficiently large power fail safe write size: allowing users to create filesystems with a filesystem sector size matching the IU size is only possible with support from an SSD whose maximum power fail safe write size is greater than or equal to the IU size.
■ How AWUPF ≥ NPWG = IU is flexible
NVMe drives which follow an AWUPF ≥ NPWG = IU strategy can provide filesystems on large IU SSDs with write protection against power failure. This strategy is also flexible for users, in that they can opt in to a larger sector size at filesystem creation time. A namespace can have its own power fail atomic write value, NAWUPF, in which case the same relation applies as NAWUPF ≥ NPWG = IU; for simplicity we refer to the concept as AWUPF ≥ NPWG = IU throughout. For example, users of an NVMe drive with a 4 KiB LBA format, an AWUPF of 16 KiB and an NPWG of 16 KiB can create a 16 KiB block size XFS filesystem either with the default sector size or with a matching 16 KiB sector size (for example, mkfs.xfs -b size=16k versus mkfs.xfs -b size=16k -s size=16k); the only difference between the two invocations is the sector size.
The flexibility comes from the fact that users don't need to leverage the larger sector size: they can opt in to it only if they wish to and if the filesystem supports it. This puts the power in the hands of users to adopt larger sector sizes when and if they're ready.
■ Determinism of AWUPF ≥ NPWG = IU
Supporting an AWUPF matching the IU also enables users to benefit from large atomics. Hyperscalers have been enabling large atomics for databases for at least 6 years now using custom storage solutions. Given that an API for large atomics was only merged as of v6.13, this raises the question of how hyperscalers were able to support large atomics without a filesystem API for it. The answer lies in manual code vetting of the block layer and filesystem. The point of a filesystem API is to enable the operating system to help provide guarantees over the requirements. While the industry has been relying on ext4 with the bigalloc feature and 16 KiB cluster sizes, software vetting is still required to ensure proper functionality. The operating system API also enables NVMe users to request an error from the kernel if a write does not meet the criteria to be atomic. This matters because, contrary to SCSI, NVMe does not require a special write command for a write to be atomic: as long as your write follows the requirements for being atomic, NVMe will write it atomically. If you want the kernel's assistance with vetting those requirements, you can adopt the atomic write API in your user space application.
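A minimal sketch of adopting that API from user space follows, assuming a kernel and filesystem with RWF_ATOMIC support and a device whose atomic write limits cover a 16 KiB write; the file path and sizes are illustrative only.

```c
/* Sketch: a 16 KiB atomic write using pwritev2() with RWF_ATOMIC. */
#define _GNU_SOURCE
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/uio.h>
#include <unistd.h>

#ifndef RWF_ATOMIC
#define RWF_ATOMIC 0x00000040	/* from include/uapi/linux/fs.h */
#endif

int main(void)
{
	const size_t len = 16 * 1024;	/* one 16 KiB atomic write */
	void *buf;

	/* Write-size-aligned buffer satisfies the O_DIRECT alignment rules. */
	if (posix_memalign(&buf, len, len))
		return 1;
	memset(buf, 0xab, len);

	int fd = open("/mnt/data/redo.log", O_WRONLY | O_DIRECT);
	if (fd < 0)
		return 1;

	struct iovec iov = { .iov_base = buf, .iov_len = len };

	/*
	 * The kernel fails the request instead of silently splitting it if
	 * the write cannot be done atomically (misaligned offset, size
	 * outside the device/filesystem atomic limits, and so on).
	 */
	ssize_t ret = pwritev2(fd, &iov, 1, 0 /* offset, aligned to len */,
			       RWF_ATOMIC);
	if (ret < 0)
		perror("pwritev2(RWF_ATOMIC)");

	close(fd);
	free(buf);
	return ret == (ssize_t)len ? 0 : 1;
}
```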
An important but overlooked requirement for vetting correctness when using atomics is to respect the SSD's hardware atomic boundary sizes. For NVMe there are two values to consider: the Namespace Atomic Boundary Size Normal (NABSN) and the Namespace Atomic Boundary Size Power Fail (NABSPF). One matters for normal operation, the other in case of power failure. For HC SSDs with a 16 KiB IU, both NABSN and NABSPF may also be 16 KiB. An XFS filesystem with a 16 KiB filesystem block size but a 4 KiB sector size on these drives will ensure most writes are aligned to the 16 KiB boundary. Some writes are not aligned, so they cannot take advantage of the atomic writes feature and may incur a read-modify-write, making them less performant. I/O introspection reveals that in XFS these unaligned writes are metadata writes. It also reveals that by leveraging a 16 KiB sector size we get deterministic alignment to NABSN / NABSPF / NPWG. Similarly, I/O introspection of ext4 with the bigalloc feature and 16 KiB cluster sizes reveals that some writes can still be 4 KiB. At this year's LSFMM this came up during the ext4 atomic writes talk, and Ted Ts'o explained that these are due to metadata writes. The topic of how ext4 could support 16 KiB sector sizes also came up, and the path forward to enable that would be for ext4 to eventually support LBS as well.
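To make the boundary requirement concrete: a write is only eligible for hardware atomicity when it does not straddle the device's atomic boundary. The small check below is an illustration only, assuming a 16 KiB boundary as in the example above.

```c
/* Illustration of the atomic boundary rule (NABSN/NABSPF in bytes). */
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

static bool crosses_boundary(uint64_t offset, uint64_t len, uint64_t boundary)
{
	/* First and last byte of the write must land in the same window. */
	return (offset / boundary) != ((offset + len - 1) / boundary);
}

int main(void)
{
	const uint64_t boundary = 16 * 1024;

	/* A 16 KiB write at a 16 KiB aligned offset stays within one window. */
	printf("offset 32 KiB: %s\n",
	       crosses_boundary(32768, 16384, boundary) ? "crosses" : "ok");
	/* A 16 KiB write at a 4 KiB offset straddles a boundary. */
	printf("offset  4 KiB: %s\n",
	       crosses_boundary(4096, 16384, boundary) ? "crosses" : "ok");
	return 0;
}
```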
Supporting a larger sector size through LBS gives a filesystem deterministically aligned writes both for the larger IU and for large atomics. The AWUPF ≥ NPWG = IU strategy empowers users to opt in to larger sector sizes when and if they are ready for them.
This is part 3 of our series on addressing support for HC SSDs in the software ecosystem.
Beyond helping align writes for HC SSDs, are there other benefits today's software and filesystems can gain from large atomics?
We presented our findings last year at the Open Compute Project in the talk titled "Enabling large block sizes to facilitate adoption of large capacity QLC SSDs". We also wrote automation to reproduce our findings on bare metal and in the cloud through kdevops, on both MySQL and PostgreSQL. We define TPS variability as the square of the standard deviation, that is, the variance. We define outliers as TPS values more than 1.5 times the interquartile range (IQR) below the first quartile or above the third quartile. To provide an easy to reproduce baseline we used AWS i4i.4xlarge instances with 4 KiB IU NVMe Nitro drives for our evaluation. A summary of our findings:
LBS relies on the fact that you can simply write one filesystem block atomically, if your hardware supports it. The XFS filesystem first supported large atomics through LBS. Although LBS was written to help support large IU drives, you can still leverage LBS on 4 KiB IU drives to take advantage of large hardware atomics. Leveraging large hardware atomics for MySQL through LBS on 4 KiB IU drives has been discussed in the community. The performance evaluation in the community reveals that if you leverage at least 10 threads for MySQL, then LBS works well on a 4 KiB IU drive. If you use fewer than 10 threads, the MySQL redo log may write 512 bytes at a time followed by periodic syncs, and this can cause a performance regression compared to 4 KiB XFS filesystem workloads. One solution to this problem is to place the redo log in a separate directory on another drive with another filesystem that can write 512 bytes at a time efficiently (for example via MySQL's innodb_log_group_home_dir). However, this would not be suitable if you wanted to leverage snapshotting with all of the MySQL data and the redo log on the same filesystem.

To help with that corner case on 4 KiB IU drives there has been community development effort on supporting atomic writes larger than the filesystem block size; a v9 series supporting this is now out for review. The only caveat with this support is that extent allocations by the filesystem are not deterministic when the atomic write is larger than the filesystem block size. To address this, a software fallback for large atomics is needed, using CoW in case a write is not aligned or the allocation granularity is not guaranteed. One concern raised at LSFMM this year was the possible impact this might have on performance variability; however, performance evaluation so far by community stakeholders seems to yield good results. And so, while this is the case for 4 KiB IU drives, LBS provides a cleaner requirement for 16 KiB IU drives, since you ideally want all writes aligned to 16 KiB. This currently puts the onus of the redo log architecture for fewer than 10 threads on HC SSDs on MySQL development, unless of course you can place the redo log in a separate directory on a separate filesystem. A future possible solution to evaluate would be small file embedding, where a 16 KiB write would include both data and metadata.
Despite the software challenges, most hyperscalers have been supporting large atomic writes in the cloud for databases for at least 6 years now, and have been doing so by leveraging ext4 with the bigalloc feature and 16 KiB cluster sizes for MySQL with direct I/O. LBS provides a unified strategy for filesystems to support large atomics on HC SSDs. The advances with the atomic write API in the Linux kernel, support through LBS for larger sector sizes to provide complete atomic alignment determinism, and the community's recent work on supporting atomic writes larger than the filesystem block size will only improve the situation further. At this year's LSFMM we also started to review how we could leverage large hardware atomics for databases with buffered I/O, given the clear gains observed with PostgreSQL.
How filesystems can leverage large atomics is a topic of future R&D in the community that is ripe for exploration. For example, can small writes be embedded atomically with metadata? Can we avoid the journal for atomic writes?