[pypdf] Add new project integration by MR-SS · Pull Request #15789 · google/oss-fuzz

MR-SS · 2026-06-22T18:07:23Z

Adds fuzzing integration for pypdf, a widely used Python PDF library with millions of downloads.

This integration implements a comprehensive Atheris fuzzer that exercises:

PDF parsing via PdfReader
Text extraction (in both layout and plain modes)
Image extraction and stream decoding
PDF metadata parsing
PDF writing via PdfWriter

The build.sh script automatically fetches a diverse set of highly complex PDFs (from Mozilla's pdf.js test suite) to use as a seed corpus, ensuring high code coverage. Standard robustness exceptions (like ValueError, TypeError, AttributeError, OverflowError) are ignored to focus the fuzzer purely on Native crashes, OOM, and Timeouts.

Proof of Value:
During local testing with infra/helper.py, this fuzzer successfully discovered multiple unhandled edge cases and crashes in the upstream library within minutes, including:

An unhandled AttributeError during dictionary casting
An unhandled OverflowError during startxref parsing
An Infinite Loop (Timeout) during text extraction of a malformed content stream

These findings have been reported to pypdf

google-cla · 2026-06-22T18:07:34Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

github-actions · 2026-06-22T18:08:43Z

MR-SS is integrating a new project:
- Main repo: https://github.com/py-pdf/pypdf
- Criticality score: 0.56828

MR-SS added 2 commits June 22, 2026 21:30

[pypdf] Add new project integration

8089e02

[pypdf] Add new project integration

f8ee8c2

MR-SS added 2 commits June 22, 2026 21:39

new changes

5583a99

Add required Google copyright headers

ba0a0ce

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pypdf] Add new project integration#15789

[pypdf] Add new project integration#15789
MR-SS wants to merge 4 commits into
google:masterfrom
MR-SS:pypdf-integration

MR-SS commented Jun 22, 2026

Uh oh!

google-cla Bot commented Jun 22, 2026

Uh oh!

github-actions Bot commented Jun 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

MR-SS commented Jun 22, 2026

Uh oh!

google-cla Bot commented Jun 22, 2026

Uh oh!

github-actions Bot commented Jun 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant