Rewrite `decode.py` to always pick a valid codec by StanFromIreland · Pull Request #19 · python/library-fuzzers

StanFromIreland · 2026-03-26T20:27:25Z

I don't quite get the original idea behind the fuzzer, as I assume currently it is failing with a LookupError on an invalid codec most of the time, never reaching any actual decoding. Instead, I suggest we drop the dictionary and pick a known codec. The codec rejection path is quite simple, I don't really think it is worth spending time fuzzing it.

sethmlarson

Since we're using FuzzerInput[0] directly as an integer this means potentially we'd start missing codecs if there are more than 256 of them. How many codecs are there today, are we at risk of getting close to that number? If so: maybe we take two bytes for the index, add an assert in there than len(ALL_CODECS) < 0xFFFF and call that good?

StanFromIreland added 2 commits March 26, 2026 20:20

Rewrite decode.py fuzzer

3dbffe5

drop old approach altogether

2fa45bf

StanFromIreland requested a review from a team March 26, 2026 20:27

sethmlarson reviewed Apr 6, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rewrite `decode.py` to always pick a valid codec#19

Rewrite `decode.py` to always pick a valid codec#19
StanFromIreland wants to merge 2 commits intopython:mainfrom
StanFromIreland:decode

StanFromIreland commented Mar 26, 2026

Uh oh!

sethmlarson left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

StanFromIreland commented Mar 26, 2026

Uh oh!

sethmlarson left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants