The sold-comp and card-identity datasets that power rawcomps.com are open for reuse under CC-BY 4.0. Each dataset is served as JSON from /data/<slug>/. Schemas and per-row variable definitions live on the per-dataset pages.
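As a minimal sketch of the access pattern above, the following Python builds a /data/<slug>/ URL and loads the JSON with the standard library. The helper names are hypothetical, the slug you pass is an assumption (check the per-dataset pages for the exact values), and the shape of the returned JSON is not specified here, so the function simply returns whatever the endpoint decodes to.

```python
import json
from urllib.request import urlopen

BASE_URL = "https://rawcomps.com"


def dataset_url(slug: str) -> str:
    """Build the JSON URL for a dataset, following the /data/<slug>/ pattern."""
    return f"{BASE_URL}/data/{slug.strip('/')}/"


def fetch_dataset(slug: str, timeout: float = 30.0):
    """Download and decode one dataset. Performs a network request."""
    with urlopen(dataset_url(slug), timeout=timeout) as resp:
        return json.load(resp)
```

Calling fetch_dataset("cards") would request https://rawcomps.com/data/cards/, assuming "cards" is a valid slug.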
Google Dataset Search indexes the schema.org/Dataset metadata embedded on each page below; you can search Rawcomps datasets directly at datasetsearch.research.google.com.
Per-card sold-comp aggregates with confidence tier (GREEN / YELLOW / RED), source breakdown, raw and graded summary stats, and the honesty flags surfaced on /cards/[slug]. Produced by the deep end-to-end pipeline.
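One common way to use the confidence tier is to count or filter rows by it before doing any analysis. The sketch below assumes each row is a dict and that the tier lives in a field called confidence_tier; that field name is hypothetical, so check the per-dataset schema for the real one. The sample rows are made up for illustration.

```python
from collections import Counter

# The three confidence tiers named in the dataset description.
TIERS = ("GREEN", "YELLOW", "RED")


def tier_counts(rows, tier_field="confidence_tier"):
    """Count rows per tier; missing or unexpected values go under 'UNKNOWN'."""
    counts = Counter()
    for row in rows:
        tier = row.get(tier_field)
        counts[tier if tier in TIERS else "UNKNOWN"] += 1
    return counts


def keep_tiers(rows, allowed=("GREEN",), tier_field="confidence_tier"):
    """Return only the rows whose tier is in `allowed`."""
    return [row for row in rows if row.get(tier_field) in allowed]


# Made-up rows for illustration only; not real Rawcomps data.
sample = [
    {"slug": "example-card-a", "confidence_tier": "GREEN"},
    {"slug": "example-card-b", "confidence_tier": "RED"},
    {"slug": "example-card-c", "confidence_tier": "GREEN"},
    {"slug": "example-card-d"},  # tier missing
]
```

Defaulting to GREEN only is a conservative choice for this sketch; pass allowed=("GREEN", "YELLOW") to widen the filter.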
Per-card three-anchor verdicts (max / min / recent sale) and matched_comps from the free-source walker fleet. Wide coverage across the corpus, growing daily as the walker chain runs.
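To make the three anchors concrete, here is a small sketch that picks the max, min, and most recent sale out of a list of comps. It illustrates the idea only and is not the walker's actual logic; the price and sold_on field names are hypothetical, and the sample comps are made up.

```python
from datetime import date


def three_anchors(comps):
    """Return the highest, lowest, and most recent sale, or None if empty."""
    if not comps:
        return None
    return {
        "max": max(comps, key=lambda c: c["price"]),
        "min": min(comps, key=lambda c: c["price"]),
        "recent": max(comps, key=lambda c: c["sold_on"]),
    }


# Made-up comps for illustration only; not real sales.
sample_comps = [
    {"price": 40.0, "sold_on": date(2025, 3, 1)},
    {"price": 55.0, "sold_on": date(2025, 1, 15)},
    {"price": 32.5, "sold_on": date(2025, 6, 9)},
]
```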
Every baseball card identity catalogued in Rawcomps: canonical (sport, year, set_code, card_number, variation) plus a stable slug and card_id. Drives every routable /cards/[slug] page on the site.
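The identity record described above could be modelled roughly as follows. The field names come from the description; the Python types, the CardIdentity class, and the helper functions are assumptions made for this sketch, and the sample row is made up.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass(frozen=True)
class CardIdentity:
    sport: str
    year: int
    set_code: str
    card_number: str
    variation: Optional[str]
    slug: str
    card_id: str

    def canonical_key(self):
        """The canonical (sport, year, set_code, card_number, variation) tuple."""
        return (self.sport, self.year, self.set_code,
                self.card_number, self.variation)

    def page_path(self):
        """Site path of the card page for this identity."""
        return f"/cards/{self.slug}"


_FIELD_NAMES = [f.name for f in fields(CardIdentity)]


def from_row(row):
    """Build a CardIdentity from a dict, ignoring keys it does not know."""
    return CardIdentity(**{name: row.get(name) for name in _FIELD_NAMES})


def index_by_slug(rows):
    """Map slug -> CardIdentity for quick lookup."""
    return {row["slug"]: from_row(row) for row in rows}


# Made-up row for illustration only; not a real catalogue entry.
sample_row = {
    "sport": "baseball", "year": 1999, "set_code": "EXAMPLE",
    "card_number": "12", "variation": None,
    "slug": "example-1999-set-12", "card_id": "c-0001",
    "extra_field": 1,  # unknown keys are dropped by from_row
}
```

Keying the index on slug mirrors how the site routes card pages; card_id would work equally well if you join against other tables.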
Rawcomps datasets are released under the Creative Commons Attribution 4.0 International (CC-BY 4.0) license. You may share, adapt, and build commercially on top of the data, provided you credit Rawcomps and link back. The sold-comp source rows themselves (auction-archive listings, PSA APR records, etc.) belong to their respective auction houses and graders; we license only the aggregated dataset.
Citation
If you cite Rawcomps datasets in academic or editorial work:
Rawcomps. (2026). Rawcomps Open Data — comp_v2,
walker_comps, cards. https://rawcomps.com/data
Accessed YYYY-MM-DD.