Data & Research

Publishing industry data, AI research, and legal trackers

Citation-grade primary-source datasets for journalists, authors, publishers, and researchers. Every number links to its original source; every case links to its court docket.

How we build these

Every dataset on this site is sourced from primary research — industry-association surveys (Authors Guild, BookBub Partners, BISG, BookNet Canada, ALLi), academic working papers (NBER, Stanford HAI), and court filings (CourtListener dockets, court PDFs, official press releases). We do not paraphrase rulings — we quote them. Dollar amounts are sourced; where unverifiable, we say so explicitly.

Pages are updated on a cadence appropriate to the underlying data: the statistics page is refreshed quarterly when new surveys land; the lawsuits tracker is refreshed monthly while major cases are active. Each page has a visible "Last verified" date and a changelog.

If you cite something from one of these pages, please cite the underlying primary source first. We're aggregators; the surveys and courts are the sources.

Working with this data?

We build marketing reports for authors and publishers operating in this landscape. If you need the workflow done for one of your titles, here's how.

See a sample report