Is there a way to download all of the patents?
1
1
Entering edit mode
6.1 years ago
Wayne ★ 1.0k

Hey everyone,

I am looking to do some clustering analysis on patents.... and of course it would be great to be able to create a document corpus that consistent of all patents year by year ( or certainly a good chunk).

Idea is to analyze all emerging areas around certain fields in genomics and bioinformatics, but I don't want to limit the analysis by including arbitrary (non ML derived ) categories...I'm looking to create the categories myself.

Does anyone know how to access/download patents in high-throughput?

Thanks so much !!

Patents database scraping publications • 4.9k views
ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode

The whole reason for doing this is to analyze bioinformatics patents... I don't want to take a category view though I want to cluster all patents and find new clusters relating to bioinformatics...

ADD REPLY
1
Entering edit mode

If only that were mentioned somewhere in the question... I'll reopen it now.

ADD REPLY
1
Entering edit mode
6.1 years ago
h.mon 35k

Look at this repository to get some ideas and code on how to download patents from Google Patents database:

https://github.com/yarden/iv_patents_and_universities

By the way, searching for "patents" on GitHub gives me, among the top hits:

https://github.com/iamlemec/patents

Parse, match (within and Compustat), and analyze USPTO patent data

https://github.com/google/patents-public-data

Patent analysis using the Google Patents Public Datasets on BigQuery

https://github.com/hopped/uspto-patents-parsing-tools

Some parsing tools for USPTO patents google.com/googlebooks/uspto-patents.html

https://github.com/55minutes/python-epo-ops-client

Python Client for the European Patent Office's Open Patent Services API

But the best is:

https://github.com/antiboredom/patent-generator

Transforms literary/philosophical texts into patent applications

ADD COMMENT

Login before adding your answer.

Traffic: 2069 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6