Linux Copy Fail Risk: The Invisible 4-Byte Root…

The Linux Copy Fail vulnerability allows root escalation in 4 bytes, corrupting only RAM. Discover the impact on Kubernetes and how to mitigate the threat.

In modern computing architectures, on-disk file integrity has always been the final bastion against unauthorized tampering. On April 30, 2026, security researchers revealed the details of the 'Copy Fail' vulnerability (CVE-2026-31431), a deterministic logical flaw in the Linux kernel that subverts this very assumption, allowing an unprivileged local user to gain root access by exclusively corrupting the page cache in RAM, without physically altering files on disk and effectively bypassing traditional file integrity monitoring systems.

The Copy Fail exploit mechanism and the absence of race conditions

On April 29, 2026, technical analyses outlined the exact nature of the flaw: Copy Fail is a deterministic logical vulnerability, devoid of race conditions, residing in the Linux kernel's crypto submodule algif_aead, specifically in the authencesn template. Unlike historical bugs like Dirty COW or Dirty Pipe, which required complex synchronization techniques to win race conditions, this exploit acts deterministically, making the attack extremely reliable and repeatable. The flaw transforms the AF_ALG and splice interfaces into a page cache write primitive.

The vulnerability research team at Xint.io and Theori explained the crucial mechanism of the attack: "An unprivileged local user can write four controlled bytes into the page cache of any readable file on a Linux system, and use that to gain root". The ability to write even just 4 controlled bytes into the page cache of any readable file translates into a devastating attack surface. In practice, the exploit materializes as a mere 732-byte Python script that modifies a setuid binary, such as /usr/bin/su, to obtain root privileges. The scope of the flaw is amplified by its compatibility: the exploit works on essentially every Linux distribution shipped since 2017, with a CVSS score of 7.8.

The reflection on the vulnerability class was synthesized by David Brumley of Bugcrowd, who stated: "Copy Fail is the same class of primitive, in a different subsystem". This underscores how page cache write primitives remain a recurring structural problem in operating system architectures, manifesting in different subsystems over the years.

The attack's invisibility and the limit of monitoring systems

The most insidious angle of the Copy Fail vulnerability lies in its ability to evade traditional detection systems. As highlighted on April 29, 2026, the corrupted page in the page cache is never reported as 'dirty' by the kernel's writeback machine. Consequently, the kernel does not initiate the synchronization process from RAM to disk, leaving the physical file unchanged. Disk-based integrity tools, which calculate hashes and signatures of persistent files, detect no anomaly.

This invisibility also extends to persistence: the modification exists solely in the volatile space of the page cache and completely disappears upon system reboot. For a security analyst, this scenario represents a procedural nightmare: the attack is ephemeral, leaves no on-disk traces, and invalidates the assumptions upon which file integrity monitoring systems like Tripwire or AIDE rely. The compromise occurs in a transitory limbo where privileged code execution coexists with the apparent integrity of the underlying filesystem.

Cross-container impact: the risk for Kubernetes clusters and CI/CD

On April 30, 2026, researchers emphasized another critical dimension of the vulnerability: its cross-container impact. In Linux, the page cache is a kernel-level shared structure and is not isolated by container boundaries. Copy Fail is therefore not merely a simple local privilege escalation vulnerability, but acts as a container escape primitive and a compromise of the Kubernetes node itself. An unprivileged user inside a container can exploit the flaw to write into the page cache related to a file shared with the host or other containers, bypassing isolation boundaries.

A spokesperson for Xint.io highlighted the universality of the threat: "This vulnerability is unique because it has four properties that almost never appear together: it's portable, tiny, stealthy, and cross-container". The combination of these four characteristics makes shared environments, such as Kubernetes clusters and CI/CD runners, exposed to catastrophic risk. In these scenarios, where multiple workloads coexist on the same host machine, the compromise of a single container can rapidly degenerate into total control of the node, leading to the exfiltration of secrets, the manipulation of deploy pipelines, or the installation of persistent backdoors on adjacent machines.

Exploiting the vulnerability requires a local entry point, a condition that is practically common in many modern attack chains: compromised credentials, an unprivileged SSH access obtained via brute force, or a remote application vulnerability allowing code execution. Once initial access is obtained, Copy Fail guarantees unnoticed escalation and boundary traversal.

Bug origins and available mitigations

The roots of the vulnerability lie in a code optimization introduced in August 2017 through source code commit 72548b093ee3. That 'in-place' optimization broke a critical security assumption that went unnoticed for nearly a decade. The discovery of the vulnerability, dubbed copy.fail, is attributed to the Xint Code Research Team, with the initial contribution of researcher Taeyang Lee.

On April 30, 2026, the fix was made available in mainline commit a664bf3d603d. The official patch removes the optimization introduced in 2017, separating the input and output structures and preventing cache pages from ending up in a writable zone. For systems that cannot be updated immediately, practical mitigations exist: it is possible to disable the vulnerable module using the command echo "install algif_aead /bin/false" > /etc/modprobe.d/disable-algif-aead.conf, or block the creation of AF_ALG sockets via seccomp to limit the attack surface at the container level.

Frequently asked questions

How does the Copy Fail vulnerability work on Linux?: Copy Fail (CVE-2026-31431) exploits a deterministic logical flaw in the algif_aead crypto submodule, allowing an unprivileged user to write 4 controlled bytes into the page cache of any readable file to gain root privileges.
Why don't file integrity monitoring tools detect the Copy Fail exploit?: The exploit exclusively corrupts RAM in the page cache without marking the page as 'dirty'. Consequently, the kernel does not synchronize the change to disk, the physical file remains unchanged, and disk-based tools detect no alterations.
What is the impact of Copy Fail on Kubernetes and CI/CD environments?: The impact is cross-container because the page cache is shared among all system processes. This allows an attacker to bypass container boundaries, making Copy Fail a container escape and host node compromise primitive for Kubernetes clusters and CI/CD runners.

This article is a summary based exclusively on the listed sources.

Sources

Sources and references