A single scanned page at 300 DPI weighs roughly 3 to 8 MB depending on the content. A 20-page scanned contract hits 60 to 160 MB. A photo-heavy annual report can exceed 200 MB. Meanwhile, Gmail caps attachments at 25 MB, Outlook at 20 MB, and most web upload forms top out at 10 to 50 MB. The math does not work — and that mismatch between how documents are created and how they need to be shared is why PDF compression is one of the most commonly searched-for technical tasks on the internet.
The problem is that most people reach for the first free online tool that appears in search results. They upload a PDF containing tax returns, medical records, legal contracts, or internal business documents to a server operated by a company whose privacy policy they have never read. The compression works, but the privacy cost is invisible and real.
There is a better approach: understanding why PDFs get large in the first place, knowing what compression actually does at a technical level, and using tools that compress locally so your documents never leave your device.
The Anatomy of a Bloated PDF
PDFs are not monolithic files — they are structured containers that hold multiple types of data, each contributing to the overall file size in different ways. Understanding these components helps you make informed decisions about compression and avoid quality loss where it matters.
Embedded Images
Images are the single largest contributor to PDF file size in the vast majority of cases. When you scan a document, what you get is not text in a container — it is a photograph of text, stored at whatever resolution the scanner used. A single 300 DPI color scan of a letter-sized page produces an uncompressed image of approximately 25 MB. Most scanners apply some JPEG compression, bringing that down to 3 to 8 MB per page, but that is still enormous compared to actual text.
Reports, presentations, and marketing materials compound this problem by embedding high-resolution photographs and graphics. Many document creation tools embed images at their original resolution — so a 12-megapixel photo displayed in a 2-inch box inside the PDF is still stored at full 12-megapixel resolution, consuming 5 to 10 MB for a single image that occupies a fraction of the page.
Font Embedding
PDF files can embed the fonts they use so that documents render identically on any system, regardless of what fonts are installed. A single font family (regular, bold, italic, bold-italic) can add 200 KB to 1 MB to the file. Documents that use many typefaces — common in designed reports and presentations — can carry several megabytes of font data alone.
Font subsetting, where only the characters actually used in the document are embedded, reduces this significantly. Well-made PDF creation tools subset by default, but older software and some export workflows embed entire font families unnecessarily.
Metadata and Hidden Layers
PDF editing history can leave behind invisible objects. Every time you edit a PDF and re-save it, some editors append the changes rather than rewriting the file from scratch. This means a document that has been revised 15 times may contain 15 versions’ worth of deleted objects, still present in the file but not rendered. Adobe Acrobat calls this “linearization overhead” and provides a “Save As” (not “Save”) option to strip it out — but most users never think to do this.
Layer data from design tools, ICC color profiles for print production, embedded JavaScript (common in fillable forms), and XMP metadata all add bulk that is invisible during normal viewing.
Redundant Objects
When PDFs are created by merging multiple source documents, repeated elements like logos, headers, and common graphics may be stored multiple times rather than referenced once. A 50-page report with the same company logo on every page might contain 50 separate copies of that logo image, each consuming 100 KB or more.
How PDF Compression Works (Without Destroying Quality)
Compression algorithms for PDF fall into two broad categories, and understanding the difference is essential for choosing the right settings.
Lossless Compression
Lossless compression reduces file size without altering any data. The decompressed output is bit-for-bit identical to the original. Techniques include:
- Flate/Deflate compression (the same algorithm used in ZIP files) applied to text streams and simple graphics
- Object stream consolidation, where multiple small objects are packed into a single compressed stream
- Duplicate resource elimination, where identical images or fonts referenced multiple times are stored once and pointed to from multiple locations
- Metadata stripping, removing editing history, thumbnail previews, and other non-essential data
Lossless compression typically achieves 10% to 40% reduction, depending on how well-optimized the original PDF was. For text-heavy documents with minimal imagery, this is often sufficient.
Lossy Compression
Lossy compression achieves much higher reduction ratios — 50% to 95% — by reducing the quality of embedded images. The text remains crisp (text is stored as vector data in well-made PDFs, unaffected by image compression), but photographs and scanned content are re-encoded at lower resolution or higher JPEG compression.
The key insight is that lossy compression on images inside a PDF does not mean the document looks bad. A scanned page at 300 DPI downsampled to 150 DPI is still perfectly legible on screen and acceptable for most printing. A photograph displayed at 2 inches wide in the PDF does not need to be stored at 4000x3000 pixels — 600x450 pixels is visually identical at that display size.
The visible quality loss depends entirely on the compression level chosen and the type of content in the document. Text-heavy scanned documents tolerate aggressive compression well. Photo portfolios and design proofs require gentler settings.
Compressing PDFs Locally on Your Device
The privacy argument for local compression is straightforward: when you upload a PDF to an online compression service, you are transmitting the document to a third-party server. Even services that claim to delete files after processing have had breaches, policy changes, and employee access incidents. If your document contains financial data, medical records, legal contracts, personal identification, or confidential business information, uploading it to a server you do not control is an unnecessary risk.
ThinPDF compresses PDF files entirely on-device. The document never leaves your iPhone or iPad — no upload, no cloud processing, no third-party access.
Step 1: Select Your PDF
Open ThinPDF and choose the file you want to compress. You can pick from the Files app, iCloud Drive, Dropbox, Google Drive, or any file provider configured on your device. The app displays the current file size so you know your starting point.
Step 2: Choose Your Compression Level
ThinPDF provides multiple compression tiers, each representing a different balance between file size and image quality:
- Light compression reduces file size by 20% to 40% with virtually no visible quality change. Use this for documents you intend to print at high quality or that contain important photographic detail.
- Medium compression achieves 40% to 70% reduction. Images are slightly softer at extreme zoom but look identical at normal viewing distance. This setting works for the vast majority of sharing and archival purposes.
- Maximum compression pushes reduction to 70% to 90%. Image quality degrades noticeably if you zoom in, but text remains sharp and the document is perfectly usable for on-screen reading, email attachments, and web uploads.
Step 3: Preview the Result
Before saving, review the compressed document to verify that the quality meets your needs. ThinPDF shows a side-by-side comparison of file sizes so you can see exactly how much space you saved. If the compression was too aggressive, step back to a lighter setting. If you need it smaller, try a higher level.
Step 4: Save and Share
Save the compressed file back to your preferred location or share it directly through the iOS share sheet — email, Messages, AirDrop, Slack, or any other app. The smaller file size means faster uploads, no bounced emails, and less bandwidth consumption for recipients.
Practical Scenarios and Recommended Settings
Different situations call for different compression approaches. Here are the most common use cases with recommended settings:
Email Attachments
Target: Under 20 MB (Outlook) or 25 MB (Gmail). Use medium to maximum compression. For a 40 MB scanned document, medium compression will typically bring it under 20 MB. If it is still too large, maximum compression will almost certainly get you there.
Insurance and Legal Documentation
Target: Readable and complete, with privacy maintained. Scanned insurance policies, claims documentation, and legal filings often need to be emailed to agents, adjusters, or attorneys. Medium compression preserves all detail while keeping files manageable. Always compress locally for these sensitive documents. If you are assembling a home inventory for insurance purposes, our guide on how to create a home inventory for insurance explains how to document your belongings systematically.
Academic Submissions
Target: Under the portal’s upload limit (often 10 to 50 MB). University submission systems, journal portals, and grant application forms frequently impose strict size limits. A thesis with embedded figures, charts, and photographs can easily exceed these limits. Medium compression usually does the job; light compression if the document contains important visual data like microscopy images or detailed charts.
Business Reports and Presentations
Target: Easy to distribute, fast to download. Quarterly reports with charts, product photos, and infographics commonly reach 30 to 100 MB. Compressing before distribution saves bandwidth for every recipient. Medium compression is the default recommendation.
Archival and Long-Term Storage
Target: Minimal footprint without information loss. For documents you want to store for years — tax returns, contracts, property records — light compression provides meaningful size reduction with zero quality compromise. The space savings compound as your archive grows.
The Technical Details: What Happens During Compression
For those who want to understand what is happening under the hood, PDF compression tools typically perform some combination of these operations:
Image resampling. High-resolution images are downsampled to a lower DPI. A 300 DPI scan resampled to 150 DPI contains one-quarter the pixel data, resulting in roughly 75% size reduction for that image. The visible quality difference at screen viewing distance is negligible.
JPEG re-encoding. Images already stored as JPEG inside the PDF can be re-encoded at a higher compression ratio (lower quality factor). Moving from JPEG quality 95 to quality 75 typically halves the image file size with minimal visible degradation. Going below quality 50 introduces noticeable artifacts.
Color space conversion. Some compression tools convert images from CMYK (used for print) to RGB (used for screen display), which reduces the data per pixel by 25%. This is appropriate for documents that will only be viewed on screen but not for those intended for professional printing.
Stream compression. All data streams in the PDF (text, graphics commands, metadata) are compressed using Flate/Deflate encoding. This is lossless and typically happens automatically.
Object deduplication. Redundant resources — duplicate images, repeated fonts — are consolidated so each unique resource is stored only once.
Common Mistakes When Compressing PDFs
Compressing an Already-Compressed File
Applying compression twice does not meaningfully reduce file size further and can degrade quality. If a PDF has already been compressed, running it through compression again will re-encode the already-compressed images, introducing additional artifacts with little size benefit. Compress once, at the right level.
Using Maximum Compression for Print Documents
If a document needs to be printed at high quality — architectural drawings, medical imaging, photography portfolios — maximum compression will produce visible artifacts. Use light compression or skip compression entirely for print-critical materials.
Forgetting to Check the Result
Always open the compressed PDF and review it before sending or archiving. Scroll through every page. Zoom into images and diagrams. Check that no content has been corrupted or made illegible. This takes 60 seconds and prevents embarrassing or problematic outcomes.
Uploading Sensitive Documents to Online Compressors
This point bears repeating because the temptation is strong. Online compressors are fast and free. They are also a privacy liability for any document containing personal, financial, medical, or legal information. Even services with good intentions can be breached. Local compression eliminates this risk category entirely.
Keeping PDF Sizes Small from the Start
Prevention is always more efficient than remediation. These practices produce smaller PDFs at creation time:
Scan at the right resolution. 200 DPI is sufficient for text documents viewed on screen. 300 DPI is appropriate for documents that will be printed. Higher resolutions (400+ DPI) are only necessary for photographic or archival scanning and produce dramatically larger files.
Use grayscale or black-and-white for text documents. A color scan of a black-and-white letter is three times larger than necessary. Most scanners offer grayscale and black-and-white modes that produce significantly smaller files with no information loss for text-only content.
Crop and straighten before saving. Unnecessary margins, skewed pages, and border artifacts all increase file size. Cropping to the actual content area before saving the PDF eliminates wasted data.
Combine then compress. If you need to merge multiple PDFs into a single document, do the merge first and then compress the combined file. Compressing each file individually and then merging often produces a larger result because the merger cannot deduplicate resources across the separately compressed files.
Export from source applications wisely. When creating PDFs from Word, PowerPoint, or design tools, use the “Reduce File Size” or “Minimum Size” export preset if available. These presets apply sensible compression at creation time, avoiding the need for post-processing.
Beyond Compression: Other PDF Workflows
PDF compression is often part of a larger document workflow. If you regularly work with PDFs on your iPhone, you might also find these guides useful:
- How to compare PDF documents side by side covers tools for reviewing document revisions, which is especially useful before compressing and archiving a final version.
- How to convert photos to PDF on iPhone explains how to create clean PDFs from photos and scans, including tips for keeping the output size manageable from the start.
- How to save any web page as a PDF in Safari is useful for archiving web content as PDFs — and those PDFs often benefit from compression before long-term storage.
For a broader look at protecting your documents and digital privacy on Apple devices, see our complete best privacy and security apps for iPhone and Mac roundup. And if you are looking to manage passwords for all the services you share documents through, our guide on managing passwords securely with KeePass covers the foundation of keeping your credentials organized and encrypted.
Things Most Guides Do Not Tell You
Re-compressing JPEG images inside a PDF is lossy every time. Each round of JPEG compression introduces new artifacts. If you compress a PDF, then later decide you need it smaller, go back to the original uncompressed version and compress from there rather than re-compressing the already-compressed copy.
“Lossless” compression has limits. If a PDF is already well-optimized (created by modern software with default compression), lossless techniques might only shave off 5% to 10%. Do not expect dramatic results without some image quality tradeoff.
File size and page count are not linearly related. A 5-page PDF with embedded photographs can be larger than a 100-page text-only PDF. The content type matters far more than the page count.
Compressed PDFs are still searchable. Image compression does not affect the text layer in PDFs that have been OCR-processed. The searchable text remains intact regardless of how aggressively the images are compressed.
macOS Preview can compress PDFs natively. On a Mac, open a PDF in Preview, go to File > Export, and choose “Reduce File Size” from the Quartz Filter dropdown. The compression is aggressive and not configurable, but it works in a pinch and runs entirely locally. For more control over compression levels, a dedicated tool gives you better results.
The right compression approach depends on what the document contains, where it needs to go, and how sensitive its contents are. For most people, medium compression applied locally covers 90% of scenarios — shrinking files enough to share easily while keeping quality perfectly acceptable for any on-screen use.