Python tools to work with the USPTO weekly Patent Official Gazettes.
An example use, is the conversion of a zipped HTML + GIF assets of the gazette directly to a zipped tabular CSV file.
Additional uses include extraction of the exemplary claims and/or corresponding drawings.
The repository includes some light-weight example data and documentation to get started.
The source includes simple examples and stand alone command line programs.
This repository is an active project. Anticipated revisions and extensions include :
- better document specific fields, such as the identfier
- analysis for why using
filed_byversusassigned_towhen generating CSV's - pipreqs a requirements file (or use current Anaconda
[base]environment) - extend HTML parsing to additional fields, such as CPC classes
- include some minimal testing using both subset and full gazette data
- note about the
not_issuedpatents in the gazettes - post-processing tools for formatting query results
- comparsion of methods to query exemplary claims
- tools to post-process exemplary claims
- comparison of methods to query patent drawings
- additional features and functionality to add
