Skip to content
PDF Word Excel
← Back to Blog

Hidden Metadata in Documents: What's Inside Your PDF or Word File That You Can't See

By Vinay Kumar

Years ago there was a famous case where a government department published a Word document with the names of internal reviewers still embedded in the file's revision history. They had been removed from the visible text, but they were sitting quietly in the metadata, invisible to anyone who didn't know to look.

It happens more often than you'd think. The reason is simple: documents carry a lot of information you can't see in the main view.

What's actually inside a Word file

Author name (usually the name on the laptop's user account). Company name. Original file path. Time spent editing. Track-changes history if not deleted. Comments, even if hidden. Document properties like "based on template" with the template's location revealed.

Sometimes — and this is the embarrassing one — fragments of other documents you've had open recently, depending on the version of Word and the operating system.

What's inside a PDF

Author. Producer (the software that created it). Creation date and modification date. Sometimes a list of fonts used, which can hint at where the document was made. JavaScript actions if added. Hidden form fields with default values. Layered content where annotations sit on top of the original — and the original is still in the file.

Photos pasted into PDFs sometimes still carry their own GPS data and original timestamps.

What's inside a photo

EXIF data: the camera or phone model, the exact GPS coordinates, the date and time, sometimes even the camera settings. If you took the photo with a phone, those coordinates can pinpoint your home.

Most social media platforms strip this data automatically before posting. Email and direct file sharing usually do not. So a photo sent over WhatsApp loses its location, but a photo emailed as an attachment keeps everything.

When metadata matters

Sharing a document publicly. Submitting a document to a competitor or external party. Sending a CV to a company (you don't necessarily want them to know you also reviewed five other CVs an hour ago, all visible in the file's recent path history). Posting documents online. Sending photos of children, especially with their location embedded.

How to clean metadata

In Word: File → Info → Inspect Document. Run the inspector and remove what it finds.

In PDF: many editors have a "document properties" dialog where you can clear the author and other fields. For thorough cleaning, use a tool that explicitly strips PDF metadata. Even better, print the PDF to a new PDF — the new file will have only what you want and lose most extras.

For photos: most phone galleries have an "options" menu where you can disable location data or remove it before sharing. On computers, Windows lets you right-click a photo, go to Properties → Details → Remove Properties. Mac has similar options.

A simple rule

If you're sharing a document outside your circle of close colleagues or family, run it through a metadata clean-up first. Two minutes of effort, no embarrassing leaks later.