
How to Remove Metadata from PDF Files

Remove supported PDF document properties, understand what the cleaner does not sanitize, and verify the result.

metapeel Editorial Team | Updated Jun 28, 2026

Where PDF metadata can live

The familiar fields shown in a PDF reader—title, author, subject, keywords, creator application, producer, creation date, and modification date—are commonly exposed through the document information dictionary. A PDF may also contain an XMP metadata packet and application-specific custom data.

Beyond metadata, PDF files can include annotations, attachments, form values, JavaScript actions, layers, comments, signatures, and content that was visually covered but not securely redacted. The Adobe PDF technical references describe these as structures distinct from the document properties, so clearing the author field neither inspects nor changes any of them.
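
As a rough illustration of how separate these structures are, the sketch below scans the raw bytes of a PDF for name tokens that usually accompany each one. It is a heuristic, not a parser: the function names and labels are hypothetical, the token list is a small selection of standard PDF keys, and anything stored inside a compressed object stream will not be seen.

```typescript
// Heuristic sketch: search the raw bytes of a PDF for name tokens that usually
// signal a given structure. Tokens inside compressed object streams are not
// visible to this scan, so an empty result is not proof of absence.

interface StructureMarker {
  label: string;    // hypothetical label used only in this sketch
  tokens: string[]; // standard PDF keys or packet markers to look for
}

const STRUCTURE_MARKERS: StructureMarker[] = [
  { label: "infoDictionary", tokens: ["/Info"] },
  { label: "xmpMetadata", tokens: ["<?xpacket", "/Metadata"] },
  { label: "annotations", tokens: ["/Annots"] },
  { label: "attachments", tokens: ["/EmbeddedFiles", "/Filespec"] },
  { label: "formFields", tokens: ["/AcroForm"] },
  { label: "javascript", tokens: ["/JavaScript", "/JS"] },
  { label: "layers", tokens: ["/OCProperties"] },
  { label: "signatures", tokens: ["/ByteRange"] },
];

// Map each byte to one character so binary data survives the conversion.
function bytesToBinaryString(bytes: Uint8Array): string {
  const parts: string[] = [];
  for (let i = 0; i < bytes.length; i++) {
    parts.push(String.fromCharCode(bytes[i]));
  }
  return parts.join("");
}

// Return the labels of every marker group with at least one token present.
function detectStructureMarkers(bytes: Uint8Array): string[] {
  const text = bytesToBinaryString(bytes);
  const found: string[] = [];
  for (const marker of STRUCTURE_MARKERS) {
    if (marker.tokens.some((token) => text.indexOf(token) !== -1)) {
      found.push(marker.label);
    }
  }
  return found;
}
```

A hit from a scan like this is a prompt to open the file in a full PDF tool and review that structure. A miss only means the token was not visible in the uncompressed bytes.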

How to remove supported PDF metadata

1. Keep the original. Work from a copy, especially if the document is signed, encrypted, form-enabled, or part of a legal workflow.
2. Open the PDF tool. Select the file in the browser. metapeel reads it locally with the pdf-lib library; the document is not uploaded to a metapeel processing server.
3. Review the detected properties. Check title, author, subject, keywords, creator, producer, and the date fields shown.
4. Create and download the cleaned copy. The current PDF path clears the whole supported property set at once; it does not selectively preserve unchecked fields.
5. Open and inspect the output. Confirm that pages render correctly and that the targeted document properties no longer contain the original values.

The exact boundary of the current PDF cleaner

What this cleaning path targets

• Title, author, subject, keywords, creator application, and producer values exposed through the pdf-lib document API.
• Creation and modification date values handled by the same API; the output receives neutral epoch date values rather than preserving the original timestamps.
• A new downloadable PDF copy produced in the browser.

What still needs separate review

• Every XMP packet, custom property, application-private object, or historical/incremental object that another forensic parser might inspect.
• Annotations, comments, attachments, form values, JavaScript, layers, embedded media, or hidden optional content.
• Text or images that were covered visually instead of properly redacted.
• File names, operating-system attributes, cloud history, email headers, or records held by the sharing platform.

The implementation uses the documented pdf-lib PDFDocument property methods. That makes the supported property set auditable, and it also sets a firm limit on what this guide can claim: metapeel should not claim to remove every hidden element in a PDF.
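
A minimal sketch of what clearing that property set can look like follows. It is not metapeel's actual code. The interface lists the pdf-lib PDFDocument setters by name, and a loaded PDFDocument is expected to satisfy it structurally. The helper name, the choice of empty strings, and the reading of "neutral epoch" as the Unix epoch are all assumptions made for illustration.

```typescript
// Structural subset of the pdf-lib PDFDocument setters used for cleaning.
// A pdf-lib PDFDocument instance is expected to satisfy this shape.
interface StandardPropertyWriter {
  setTitle(title: string): void;
  setAuthor(author: string): void;
  setSubject(subject: string): void;
  setKeywords(keywords: string[]): void;
  setCreator(creator: string): void;
  setProducer(producer: string): void;
  setCreationDate(date: Date): void;
  setModificationDate(date: Date): void;
}

// Assumption: "neutral epoch" means the Unix epoch, 1970-01-01T00:00:00Z.
const NEUTRAL_DATE = new Date(0);

// Hypothetical helper: overwrite the whole supported property set in one pass.
// Assumption: text fields are replaced with empty values rather than deleted.
function clearStandardProperties(doc: StandardPropertyWriter): void {
  doc.setTitle("");
  doc.setAuthor("");
  doc.setSubject("");
  doc.setKeywords([]);
  doc.setCreator("");
  doc.setProducer("");
  doc.setCreationDate(NEUTRAL_DATE);
  doc.setModificationDate(NEUTRAL_DATE);
}

// Usage with pdf-lib (shown as comments so the sketch stays dependency-free):
//   import { PDFDocument } from "pdf-lib";
//   const doc = await PDFDocument.load(inputBytes);
//   clearStandardProperties(doc);
//   const cleanedBytes = await doc.save();
```

Because the helper calls only these eight setters, its reach is easy to audit. Nothing in it touches XMP packets, annotations, attachments, or page content.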

Metadata removal is not PDF redaction

If a PDF contains a name in visible text, an image, a comment, an attachment, or text underneath a black rectangle, metadata cleaning will not make that information safe. Redaction changes or removes page content; sanitization reviews active and hidden structures; document-property cleaning handles descriptive fields. Treat them as separate checks.

Adobe's guidance on removing sensitive PDF content separates redaction from finding and removing hidden information. For a high-risk document, follow an approved redaction/sanitization process and validate the final export independently.

How to verify the cleaned PDF

1. Reopen the cleaned copy in metapeel and compare the detected standard properties with the original.
2. Open Document Properties in a desktop PDF reader and check title, author, subject, keywords, creator, producer, and dates.
3. Use a second metadata parser for sensitive sharing; different tools can expose different structures.
4. Review annotations, attachments, forms, layers, signatures, and redaction separately.
5. Open every page and confirm that the saved copy still renders and behaves as expected.

Verification should be scoped: “the standard author and date properties are cleared” is testable. “Nothing hidden remains anywhere in the PDF” requires a much broader examination.
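
A scoped check of this kind can be expressed as a comparison of two property snapshots. The sketch below assumes the snapshots have already been read from the original and cleaned files. The type and function names are hypothetical, and the field list mirrors the standard properties named in this guide.

```typescript
// Snapshot of the standard document properties discussed in this guide.
interface StandardProperties {
  title?: string;
  author?: string;
  subject?: string;
  keywords?: string;
  creator?: string;
  producer?: string;
  creationDate?: Date;
  modificationDate?: Date;
}

type PropertyName = keyof StandardProperties;

const PROPERTY_NAMES: PropertyName[] = [
  "title", "author", "subject", "keywords",
  "creator", "producer", "creationDate", "modificationDate",
];

// Reduce a value to a comparable string; missing values become "".
function normalizeProperty(value: string | Date | undefined): string {
  if (value === undefined) return "";
  return value instanceof Date ? value.toISOString() : value.trim();
}

// Return the fields whose original, non-empty value survives in the cleaned copy.
function findResidualProperties(
  original: StandardProperties,
  cleaned: StandardProperties,
): PropertyName[] {
  const residual: PropertyName[] = [];
  for (const name of PROPERTY_NAMES) {
    const before = normalizeProperty(original[name]);
    const after = normalizeProperty(cleaned[name]);
    if (before !== "" && before === after) {
      residual.push(name);
    }
  }
  return residual;
}
```

With pdf-lib, the snapshots could plausibly be filled from getters such as `getTitle()` and `getCreationDate()`. Loading with the `updateMetadata: false` option should keep the library from rewriting the producer and modification date while reading. An empty result supports only the narrow claim that the listed fields changed.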

What if the PDF came from Word, Excel, or PowerPoint?

Clean the editable source file separately before sharing it. OOXML and ODF documents have their own package metadata, comments, tracked changes, hidden sheets, and embedded content. metapeel's document tool clears supported core/application properties, but it does not replace Microsoft's Document Inspector or an equivalent application-native review.
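
For OOXML files, the core properties conventionally sit in a `docProps/core.xml` part inside the package. The sketch below assumes that part has already been extracted as text and pulls out a few common fields with simple pattern matching. The helper name is hypothetical, the `dc:`, `cp:`, and `dcterms:` prefixes are the usual ones but are treated here as an assumption, and XML entities are left undecoded.

```typescript
// Element names conventionally found in docProps/core.xml. The prefixes are
// the usual ones, but a producer could bind different prefixes to the same
// namespaces, so this sketch may miss fields in unusual files.
const CORE_PROPERTY_TAGS: string[] = [
  "dc:title",
  "dc:creator",
  "cp:lastModifiedBy",
  "dcterms:created",
  "dcterms:modified",
];

interface CoreProperty {
  tag: string;
  value: string; // raw element text; XML entities are not decoded
}

// Hypothetical helper: list the non-empty core properties present in the XML text.
function readCoreProperties(coreXml: string): CoreProperty[] {
  const found: CoreProperty[] = [];
  for (const tag of CORE_PROPERTY_TAGS) {
    // Allow optional attributes on the opening tag, e.g. xsi:type on dates.
    const pattern = new RegExp(
      "<" + tag + "(?:\\s[^>]*)?>([\\s\\S]*?)</" + tag + ">",
    );
    const match = pattern.exec(coreXml);
    if (match !== null && match[1].trim() !== "") {
      found.push({ tag: tag, value: match[1].trim() });
    }
  }
  return found;
}
```

This covers only a handful of package properties. Comments, tracked changes, hidden sheets, and embedded content live in other parts of the package and need the application-native review described above.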

For the distinction across all file types, return to the file metadata field guide. If the source includes photos, inspect those separately with the photo metadata workflow before placing them into a document.

Sources and primary references

These links point to format specifications, official documentation, or the implementation library used by metapeel.

• PDF Reference and Adobe Extensions to the PDF Specification (Adobe). Technical reference for PDF objects, document information, annotations, actions, and other structures discussed in this guide.
• PDFDocument API (pdf-lib). API documentation for the browser library metapeel uses to read and rewrite standard PDF title, author, subject, keyword, creator, producer, and date properties.
• Removing sensitive content from PDFs (Adobe Acrobat Help). Official guidance distinguishing metadata removal from redaction and the broader removal of hidden information.
• Remove hidden data and personal information with Document Inspector (Microsoft Support). Useful when a PDF originated from Word, Excel, or PowerPoint and the editable source file will also be shared.

