Rawcomps

Open Data

Rawcomps Open Data

The sold-comp and card-identity datasets that power rawcomps.com are open for non-commercial reuse under CC-BY 4.0. Each dataset is served as JSON from /data/<slug>/. Schemas + per-row variable definitions live on the per-dataset pages.

Google Dataset Search indexes the schema.org/Dataset metadata embedded on each page below; you can search Rawcomps datasets directly at datasetsearch.research.google.com.

comp_v2 — end-to-end comp aggregates

Per-card sold-comp aggregates with confidence tier (GREEN / YELLOW / RED), source breakdown, raw and graded summary stats, and the honesty flags surfaced on /cards/[slug]. The deep end-to-end pipeline.

Rows
9,013
Update cadence
On re-tier (manual)
License
CC-BY 4.0
Schema + download →
cards — master card identity index

Every baseball card identity catalogued in Rawcomps: canonical (sport, year, set_code, card_number, variation) plus a stable slug and card_id. Drives every routable /cards/[slug] page on the site.

Rows
108,522
Update cadence
On corpus update (weekly-ish)
License
CC-BY 4.0
Schema + download →

License

Rawcomps datasets are released under the Creative Commons Attribution 4.0 International (CC-BY 4.0) license. You may share, adapt, and build commercially on top of the data, provided you credit Rawcomps and link back. The sold-comp source rows themselves (auction-archive listings, PSA APR records, etc.) belong to their respective auction houses and graders; we license only the aggregated dataset.

Citation

If you cite Rawcomps datasets in academic or editorial work:

Rawcomps. (2026). Rawcomps Open Data — comp_v2,
walker_comps, cards. https://rawcomps.com/data
Accessed YYYY-MM-DD.
Open Data — Rawcomps · Rawcomps