docyx / pc-part-dataset

PC Part Dataset

A dataset of PC parts scraped from PCPartPicker.

Part count: 61,246

Last updated: May 14, 2024

Download

The parts are available in JSON, JSON Lines, and CSV formats. You can find them in the ./data directory.
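As a rough illustration of consuming the JSON Lines files, here is a minimal TypeScript sketch. The names parseJsonLines, loadParts, and Part are made up for this example, and the file path in the usage comment is an assumption about how ./data is laid out; check the directory for the actual file names and API.md for the actual properties.

```typescript
import { readFileSync } from "node:fs";

// A part is treated as a loosely typed record here; the real property
// names for each category are described in API.md.
type Part = Record<string, unknown>;

// Parse JSON Lines text: one JSON object per non-empty line.
function parseJsonLines(text: string): Part[] {
  return text
    .split(/\r?\n/)
    .filter((line) => line.trim().length > 0)
    .map((line) => JSON.parse(line) as Part);
}

// Read a JSON Lines file from disk and parse it.
function loadParts(path: string): Part[] {
  return parseJsonLines(readFileSync(path, "utf8"));
}

// Usage (hypothetical path, adjust to the actual layout of ./data):
// const cpus = loadParts("./data/jsonl/cpu.jsonl");
// console.log(cpus.length);
```

JSON Lines can be read line by line, which makes it convenient for large files; the plain JSON files can instead be loaded with a single JSON.parse call.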

JSON Property Descriptions

Check out API.md for JSON property descriptions of each product category.

Running the Scraper

Warning: Make sure to turn on your VPN before running this scraper!

  1. Run npm install
  2. Run npm run start
  3. Wait about 5-10 minutes for the scraper to finish.
  4. The output is written to a directory named data-staging.

If you don't want to scrape every part type, you can select individual part types by passing their IDs as arguments. You can find the list of part type IDs in API.md.

Example usage:

# Only get CPUs and video cards
npm run start -- cpu video-card
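To make the argument handling concrete, here is a hedged TypeScript sketch of how part type IDs passed after `--` could be validated. This is not the scraper's actual code: selectPartTypes and KNOWN_PART_TYPES are hypothetical names, and the list contains only the two IDs used in the example above (the full list is in API.md).

```typescript
// Hypothetical subset of part type IDs; the real list is in API.md.
const KNOWN_PART_TYPES = ["cpu", "video-card"] as const;
type PartType = (typeof KNOWN_PART_TYPES)[number];

// Return the part types to scrape: every known type when no arguments
// are given, otherwise the requested ones (deduplicated, order kept).
function selectPartTypes(args: string[]): PartType[] {
  if (args.length === 0) {
    return KNOWN_PART_TYPES.slice();
  }
  const known: readonly string[] = KNOWN_PART_TYPES;
  const unknown = args.filter((arg) => !known.includes(arg));
  if (unknown.length > 0) {
    throw new Error(`Unknown part type(s): ${unknown.join(", ")}`);
  }
  return Array.from(new Set(args)) as PartType[];
}

// In a Node.js entry point the arguments would come from
// process.argv.slice(2), e.g. ["cpu", "video-card"] for the command above.
```

Rejecting unknown IDs up front avoids spending several minutes scraping before a typo is noticed.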

package.json Scripts

License

MIT