💾 Archived View for vigrey.com › wiki › polyglot_file.gmi captured on 2024-07-09 at 01:38:04. Gemini links have been rewritten to link to archived content

View Raw

More Information

⬅️ Previous capture (2024-06-19)

➡️ Next capture (2024-08-18)

-=-=-=-=-=-=-

polyglot file

A polyglot file is a file that is more than 1 type of file at the same time. Polyglot files take advantage of the fact that (binary) computer files are all just stored as binary data (bits is bits). Some files expect to start at file offset 0 (the beginning of the file). while other files may try to find specific data inside of the file that says where the file data starts, while other files may be very flexible about where they start.

By carefully aligning file data, it is possible to create a file that is valid as multiple file types.

For instance, the following commands can be run on Linux to create a file that is simultaneously the JPG file "a.jpg" and ZIP file "b.zip". The resulting polyglot file will be called "polyglot.gif.zip".

cat a.jpg b.zip > polyglot-build.gif.zip
zip -F polyglot-build.gif.zip --out polyglot.gif.zip

Some File Formats That Start at Offset 0

Some File Formats That Search for the Starting Offset

Some File Formats That Are at Least Somewhat Flexible About the Starting Offset

Examples of Polyglot Files

Ange Albertini has made many a lot of polyglot files, many of which are featured in issues of the International Journal of "Proof-of-Concept or Get the F**k Out" (PoC||GTFO) are polyglot files, with different issues starting from Issue 0x01 featuring different types of polyglot files. Some examples of PoC||GTFO polyglot files are:

International Journal of "Proof-of-Concept or GTFO"

Others have also contributed polyglot files to PoC||GTFO, including myself in issue 0x18 section 4 in an article titled "Concealing ZIP Files in NES Cartridges".

In my git repository "gb-nes-pdf-html-zip", all of the files in the "bin/" directory are simultaneously Gameboy ROM, NES ROM, PDF, HTML, and ZIP files. As shown below, they all have the same SHA256 sum. This is because they are all bit for bit the exact same file.

$ sha256sum gb-nes-pdf-html-zip/bin/*
7a1ac443458c68ca1c2128da9586c818f98cb6656be92923d54cc2de555e01ff  gb-nes-pdf-html-zip.gb
7a1ac443458c68ca1c2128da9586c818f98cb6656be92923d54cc2de555e01ff  gb-nes-pdf-html-zip.html
7a1ac443458c68ca1c2128da9586c818f98cb6656be92923d54cc2de555e01ff  gb-nes-pdf-html-zip.nes
7a1ac443458c68ca1c2128da9586c818f98cb6656be92923d54cc2de555e01ff  gb-nes-pdf-html-zip.pdf
7a1ac443458c68ca1c2128da9586c818f98cb6656be92923d54cc2de555e01ff  gb-nes-pdf-html-zip.zip

gb-nes-pdf-html-zip git repository

Incoming Wiki Links

{index}