SPDX-FileCopyrightText: 2023 The "Notes on OpenPGP" project
SPDX-License-Identifier: CC-BY-SA-4.0
-->
# Signatures over data
In OpenPGP, a *{term}`data signature`* guarantees the {term}`authenticity<Authentication>` and, implicitly, the integrity of certain data. Typical use cases of {term}`data signatures<DataSignature>` include the {term}`authentication` of software packages and emails.
"{term}`Authenticity<Authentication>`" in this context means that the {term}`data signature` was issued by {term}`the entity controlling the signing key material<CertificateHolder>`. However,
it does not automatically signal if the expected party indeed controls the {term}`signer` {term}`certificate<OpenPGPCertificate>`. OpenPGP does offer mechanisms for *strong {term}`authentication`*, connecting {term}`certificates<OpenPGPCertificate>` to specific {term}`identities<Identity>`. This verifies that the intended communication partner is indeed associated with the cryptographic {term}`identity` behind the {term}`signature<OpenPGPSignaturePacket>`[^sign-auth].
[^sign-auth]: Other signing solutions, like [signify](https://flak.tedunangst.com/post/signify), focus on pure signing without strong {term}`authentication` of the {term}`signer`'s {term}`identity`.
{term}`Data signatures<DataSignature>` can only be issued by {term}`component keys<ComponentKey>` with the *{term}`signing<Signing Key Flag>`* [key flag](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#name-key-flags).
Note that {term}`data signatures<DataSignature>` are distinct from [](/signing_components), which are used to form and maintain {term}`certificates<OpenPGPCertificate>`, as well as to {term}`certify<Certification>` {term}`identities<Identity>` on {term}`certificates<OpenPGPCertificate>`.
(data-signature-types)=
## Signature types
{term}`OpenPGP data signatures<DataSignature>` use one of two [signature types](signature-types):
- [**Binary signature**](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#sigtype-binary) ({term}`type ID<SignatureTypeID>` `0x00`): This is the standard {term}`signature type` for binary data and is typically used for files or data streams. {term}`Binary signatures<BinarySignature>` are calculated over the data without any modifications or transformations.
- [**Text signature**](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#name-signature-of-a-canonical-te) ({term}`type ID<SignatureTypeID>` `0x01`): Used for textual data, such as email bodies. When calculating a {term}`text signature`, the data is first normalized by converting line endings into a canonical form (`<CR><LF>`). This approach mitigates issues caused by platform-specific text encodings. This is especially important for detached and {term}`cleartext signatures<CleartextSignature>`, where the message file might undergo re-encoding between the creation and {term}`verification` of the {term}`signature<OpenPGPSignaturePacket>`.
{term}`Data signatures<DataSignature>` are generated by {term}`hashing<HashDigest>` the message content along with the {term}`metadata` in the {term}`OpenPGP signature packet`, and calculating a {term}`cryptographic signature` over that {term}`hash<HashDigest>`. The resulting {term}`cryptographic signature` is stored in the {term}`signature packet<OpenPGPSignaturePacket>`.
{term}`OpenPGP data signatures<DataSignature>` can be applied in three distinct forms[^sign-modes-gpg]:
- **{term}`Detached<DetachedSignature>`**: The OpenPGP signature exists as a separate entity, independent of the signed data.
- **{term}`Inline<InlineSignature>`**: Both the original data and its corresponding {term}`OpenPGP signature<OpenPGPSignaturePacket>` are encapsulated within an {term}`OpenPGP message`.
- **{term}`Cleartext signature`**: A plaintext message and its {term}`OpenPGP signature<OpenPGPSignaturePacket>` coexist in a combined text format, preserving the readability of the original message.
[^sign-modes-gpg]: These three forms of {term}`signature<OpenPGPSignaturePacket>` application align with GnuPG's `--detach-sign`, `--sign`, and `--clearsign` command options.
A {term}`detached signature` is produced by calculating an {term}`OpenPGP signature<OpenPGPSignaturePacket>` over the data intended for signing. The original data remains unchanged, and the {term}`OpenPGP signature<OpenPGPSignaturePacket>` is stored as a standalone file. A {term}`detached signature` file can be distributed alongside or independent of the original data. The {term}`authenticity<Authentication>` and integrity of the original data file can be {term}`verified<Verification>` by using the {term}`detached signature` file.
This {term}`signature<OpenPGPSignaturePacket>` format is especially useful for signing software releases and other files where it is imperative that the content remains unaltered during the signing process.
An {term}`inline signature` joins the signed data and its corresponding {term}`data signature` into a single {term}`OpenPGP message`.
This method is commonly used for signing or encrypting emails. Most email software capable of handling OpenPGP communications typically uses {term}`inline signatures<InlineSignature>`.
1.**{term}`One-pass signed messages<One-passsignedMessage>`** This is the commonly used format for inline-signed messages. A signer can produce this format in one pass.
2.**{term}`Prefixed signed messages<PrefixedsignedMessage>`** This format predates[^inline-signature-formats] {term}`one-pass signed messages<One-passsignedMessage>` and is conceptually slightly simpler. However, it has no strong benefits and is now rarely used.
1. [**One-pass signature packets**](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#one-pass-sig): These one or more {term}`packets<Packet>` precede the signed data and enable {term}`signature<OpenPGPSignaturePacket>` computation in one pass.
2. [**Literal data packet**](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#lit): This contains the original data (e.g., the body of a message), without additional interpretation or conversion.
3.**{term}`Data signature packets<OpenPGPSignaturePacket>`**: These contain the {term}`cryptographic signature` corresponding to the original data.
Despite its name, a {term}`one-pass signature packet` is not a type of {term}`signature packet<OpenPGPSignaturePacket>`.
Instead, it's a type of auxiliary packet that can be used in conjunction with {term}`signature packets<OpenPGPSignaturePacket>`. Its use allows storing the {term}`signature packets<OpenPGPSignaturePacket>` after the message body.
To produce an {term}`inline signature`, the {term}`signer` processes the entirety of the data by reading from an input file and writing into an output {term}`OpenPGP message` file. As the data is processed, the {term}`signer` simultaneously calculates a {term}`cryptographic signature`. This procedure results in the appending of a {term}`data signature packet` to the output {term}`OpenPGP message` file, where it can be efficiently stored.
For efficient {term}`verification`, an application must understand how to handle the {term}`literal data<LiteralDataPacket>` prior to its reading. This requirement is addressed by the {term}`one-pass signature packets<One-passSignaturePacket>` located at the beginning of {term}`inline-signed<InlineSignature>` messages. These {term}`packets<Packet>` include essential information such as the {term}`fingerprint<OpenPGPFingerprint>` of the {term}`signing key<OpenPGPComponentKey>` and the {term}`hash<HashDigest>` algorithm used for computing the {term}`signature<OpenPGPSignaturePacket>`'s {term}`hash digest`. This setup enables the verifier to process the data correctly and efficiently.
Strictly speaking, knowing just the hash algorithm would be sufficient to begin the verification process. However, having efficient access to the signer's fingerprint or key ID upfront allows OpenPGP software to fetch the signer's certificate(s) before processing the entirety of the - potentially large - signed data. This may, for example, involve downloading the certificate from a keyserver. In case fetching the signer's certificate(s) fails, or requires additional input from the user, it is better to signal the user about this before processing the data.
{term}`Inline-signed<InlineSignature>` messages enable efficient {term}`verification` in *one pass*, structured as follows:
1.**Initiation with {term}`one-pass signature packets<One-pass Signature Packet>`**: These {term}`packets<Packet>` begin the {term}`verification` process. They include the {term}`signer`'s {term}`key ID`/{term}`fingerprint<OpenPGPFingerprint>`, essential for identifying the appropriate {term}`public key<OpenPGPCertificate>` for signature {term}`validation`.
2.**Processing the {term}`literal data packet`**: This step involves {term}`hashing<HashDigest>` the literal data, preparing it for {term}`signature<OpenPGPSignaturePacket>` {term}`verification`.
3.**{term}`Verifying<Verification>` {term}`signature packets<OpenPGPSignaturePacket>`**: Located at the end of the message, these {term}`packets<Packet>` are checked against the previously calculated {term}`hash digest`.
Important to note, the {term}`signer`'s {term}`public key<OpenPGPCertificate>`, critical for the final {term}`verification` step, is not embedded in the message. Verifiers must acquire this {term}`key` externally (e.g., from a {term}`key server`) to authenticate the {term}`signature<OpenPGPSignaturePacket>` successfully.
A {term}`prefixed signed message` consists of {term}`signature packet(s)<signaturepacket>` followed by the message. This inline signature format serves the same function as a {term}`one-pass signed message`. For the verifier, the two formats are equally convenient. However, on the signer's side, it takes more computational work to generate a {term}`prefixed signed message`.
In this format, the signature packets are stored ahead of the message itself:
1.**{term}`Data signature packets<OpenPGPSignaturePacket>`**: These one or more packets contain the {term}`cryptographic signature` corresponding to the original data.
2. [**Literal data packet**](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#lit): This contains the original data (e.g., the body of a message), without additional interpretation or conversion.
Compared to a {term}`one-pass signed message`, there are no {term}`one-pass signature packets<One-passSignaturePacket>` in this format, and the (otherwise equivalent) {term}`signature packet(s)<signaturepacket>` are stored ahead of the signed data.
For verification, this is equally convenient as the one-pass signed message form.
However, when a signer creates a {term}`prefixed signed message`, the signed data must be processed twice:
- once reading it to calculate the cryptographic signature, and
- once more to store the data in the generated OpenPGP message, after the signature packet(s).
This {term}`signature<CleartextSignature>` consists of two parts: a message ("hello world") and an ASCII-armored {term}`OpenPGP signature<OpenPGPSignaturePacket>`. The message is immediately comprehensible to a human reader, while the {term}`signature<OpenPGPSignaturePacket>` block allows for the message's {term}`authenticity<Authentication>` {term}`verification` via OpenPGP software.
{term}`Cleartext signatures<CleartextSignature>` combine the advantages of both {term}`detached<DetachedSignature>` and {term}`inline signatures<InlineSignature>`:
- **Self-contained format**: {term}`Cleartext signatures<Cleartextsignature>` enable the message and its {term}`signature<OpenPGPSignaturePacket>` to be stored as a single file.
- **Human readability**: The message within a {term}`cleartext signature` remains accessible in a plain text format. This eliminates the need for specialized software to read the message content.
These features are particularly beneficial in scenarios where signed messages are managed semi-manually and where existing system infrastructure offers limited or no native support for OpenPGP in the workflow[^arch-certifications].
[^arch-certifications]: An illustrative example is the workflow adopted by Arch Linux to {term}`certify<Certification>` {term}`User IDs<UserID>` of new packagers. This process relies on [cleartext signed statements from existing packagers](https://gitlab.archlinux.org/archlinux/archlinux-keyring/-/blob/master/.gitlab/issue_templates/New%20Packager%20Key.md?ref_type=heads&plain=1#L33-46). These signed statements are stored as attachments in an issue tracking system for later inspection. The advantage of this approach lies in the convenience of having the message and signature in a single file, which simplifies manual handling. Based on the vouches in these {term}`cleartext signed<CleartextSignature>` messages and an [email confirmation from the new packager](https://gitlab.archlinux.org/archlinux/archlinux-keyring/-/wikis/workflows/verify-a-packager-key), the main key operators can issue {term}`OpenPGP third-party certifications<Third-partyIdentityCertification>`.
The {term}`cleartext signature framework` includes specific text normalization procedures to ensure the integrity and clarity of the message:
- **Escaping dashes**: The framework implements a method of [dash-escaped text](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#name-dash-escaped-text) within the message. Dash-escaping ensures that the parser correctly distinguishes between the armor headers, which are part of the {term}`signature<OpenPGPSignaturePacket>`'s structure, and any lines in the message that happen to start with a dash.
- **Normalization of line endings**: Consistent with the approach for any other [text signature](data-signature-types), a {term}`cleartext signature` is calculated on the text with normalized line endings (`<CR><LF>`). This ensures that the {term}`signature<OpenPGPSignaturePacket>` remains valid regardless of the text format of the receiving {term}`implementation<OpenPGPImplementation>`.
Despite their widespread adoption, {term}`cleartext signatures<CleartextSignature>` have their limitations and are sometimes viewed as a "legacy method"[^csf-gnupg]. The {term}`RFC` details the [pitfalls of cleartext signatures](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#name-issues-with-the-cleartext-s), such as incompatibility with semantically meaningful whitespace, challenges with large messages, and security vulnerabilities related to misleading Hash header manipulations. Given these issues, safer alternatives like {term}`inline<InlineSignature>` and {term}`detached signature` forms are advised.