Data-Dancing: How big is self-publishing?

Image - iStockphoto: ErnestoG
Image – iStockphoto: ErnestoG

‘Shock and awe.’

I’m here with the same question asked by my colleague, Bookseller editor Philip Jones in his companion piece to this one: How big is the market for self-published titles? 

Philip Jones
Philip Jones

“The question is a simple one for which there is no simple answer,” Jones writes. “In fact, there are lots of complicated answers.”

We want your answers, complications and all. To that end, we’ve launched a short survey for your input to, as Jones puts it, “find out what the hive-mind thinks.” Results are to come on Tuesday (16th June), so don’t dally, let us hear from you. Jones:

There are two things we will learn from this. The first is whether we can — between us — come up with a consensus both for the UK market and the much bigger one in the States. The second is whether we can come up with a methodology: do we look at value and volume? Do we include print? Do we assume Amazon is the dominant retailer, and if so do we assume that its hourly updated bestselling charts are an accurate reflection of sales? Or are they algorithmically challenged — as many suspect?

My sense is that the numbers will have wide variances, as will the jottings behind them. But I also think the size of the marketplace will shock and awe. That is why I am doing this.

And if you’ll tango with us Friday (12th June) for our weekly #FutureChat, you’ll find another chance to join the “hive-mind” of our digital publishing community here at The Bookseller’s The FutureBook and weigh in on the perceived size and value of the self-publishing market.

 This story was written as the walkup to our #FutureChat from The Bookseller’s The FutureBook of Friday 12 June. Join us each Friday at 4:00 p.m. London (BST), 3:00 p.m. GMT, 5:00 p.m. Rome (CEST), 11:00 a.m. New York (ET), 10:00 a.m. Chicago (CT), 9:00 a.m. Denver (MT), 8:00 a.m. Los Angeles (PT), 5:00 a.m. Honolulu (HAST).

Not that other parts of this business are easy to measure. In his recent New Insights on eBook Unit Sales from Nielsen, Michael Cader at Publishers Lunch had a couple of explanatory paragraphs about how the traditional publishing market action is reported. You may want to hold onto something in case of vertigo while reading this:

PubTrack Digital reports actual unit ebook sales, in the same way that Bookscan reports actual unit print book sales. While Bookscan reports weekly — getting sales data direct from retailers — PubTrack Digital reports monthly, and gets sales on a delayed basis from participating publishers, since retailers have been unwilling to share their ebook sales. So PTD runs on a three-month delay, as publishers reconcile their ebook sales with accounts.

Bookscan is incomplete for every title’s sales in that it covers the reporting sales outlets only, but it includes every title sold somewhere at retail. PTD is complete for each title — the publishers report all their ebook sales at all of their retailer accounts  — but incomplete for the business as a whole, since it only represents reporting publishers. 25 publishers participate in PTD right now, but that group includes all of the largest trade publishers. PTD does include some publishers who do not currently report sales to the AAP (F+W, Lonely Planet, Sourcebooks and Time, Inc. — plus religious publishers B&H and Baker). Primarily, PTD is missing most of the distributed clients reported to the AAP by Brookings, the ECPA, Ingram, Perseus, Random House, S&S, and University of Chicago Press.

I trust you’ve committed that to memory.

Suffice it to say, easy and clear metrics are not the book-publishing world’s bragging point. And Jones’ article sets out the key elements that make it impossible to get a verified quantification of the self-publishing side of the business. In essence, as he writes:

The reason self-published titles are not tracked is that they are predominantly sold digitally, and no e-book retailer that matters releases its sales data to Nielsen (or another provider). The fact that Amazon uses its own Standard Identification Number (ASIN)—which like the old ISBN is a 10-digit number—is immaterial to the larger issue of having no data. Could Nielsen accommodate the ASIN within its system if Amazon chose to supply its data that way: of course it could.

He adds, helpfully, that the perennial and worthy debate about “why Amazon does not helps indie writers by making them visible in places outside of Amazon” is not what we’re on about this time.

No, what we’re setting out to do here is to get our heads around the question, if not a definitive answer, of “how big is this market now?”

And to that end, as Jones says, it’s my brief to collect some input on the US market, the largest and most mature of the self-publishing realms to date.

elephant-650631_640 by e-smile
Image – Pixabay: e-smile

What we learn very quickly in this exercise is that, as in high-grade tango, the responses you get contain some push and some pull, always. Every response comes with some basic viewpoint, perspective, stance. And that’s fine. You just want to keep it in mind. To mix my similes (most elephants don’t tango well), we’re all going to get drunk in a hurry if we take a shot every time somebody mentions those blind men.

Or, as a friend has said recently, many people in the business will tell that your concept of the self-publishing market’s size is wrong, wrong, wrong — but they have no better numbers to offer, themselves.

I’m grateful to everyone who has been in touch with me in response to my inquiries. And rather than do them the disservice of trying to present a tangoing pachyderm cobbled together from all of them (cubism, it’s been done), I’m going to offer them separately to you. This gives each of my US-market responders a chance to make his or her point, and have it stand on its own.

Let’s start with one of the best-known numbers-dancing teams in the US.

On the dance floor now: Hugh Howey and Data Guy

Hugh Howey
Hugh Howey

The author Hugh Howey (pictured) and his still-unnamed “Data Guy” (um, not pictured) who produce the quarterly reports may not have numbers that everyone agrees with, but, they certainly do have numbers. This, in itself, is more than many other pundits can claim. In terms of that perspective/viewpoint element, theirs is tied to a mission of demonstrating for authors how viable self-publishing may be, financially, when compared to traditional publishing. This is an intent that Howey and Monsieur Guy are completely open about. Their project analyses sales data that is “scraped” electronically from the sales pages of Amazon ebook bestsellers  (200,000 titles for the most recent report in May).

And both of them were kind enough to jump in when I asked what they might have in terms of our question this week.

Question: What is your estimate for how big the US market for self-published books is (value and volume)?

Data Guy: US Indie sales in 2014:

  • 185 million ebooks
  • 9 million print books
  • audiobooks TBD (tackling this soon)

In other words:

  • 14% of the books of any format sold in the US in 2014 were indie (ebooks).

For adult fiction, the indie share (127 million ebooks) is proportionately larger:

  • 24% of all adult fiction books of any format sold in the US in 2014 were indie (ebooks).

In dollar terms, US Indie sales in 2014 were:

  • $459 million in consumer spending on indie ebooks.
  • $294 million in indie author earnings (with online retailers earning $165 million on those indie sales).

Question: Can you detail (in brief) how you came to these numbers?

Image - Pixabay: Kyzlikova
Image – Pixabay: Kyzlikova

Data Guy: The answers come from merging the data from AAP, PubTrack, and AuthorEarnings. It’s the classic parable of the three blind men and the elephant [drink]: each can only describe the part they are touching. But by pooling their knowledge, they can “see” the whole beast.

The AuthorEarnings [AE] methodology gives us highly detailed visibility into the relative ebook market shares (in both units and dollars) for each sector of publishing (Big 5, Small/Medium Publisher, Indie Published, and Amazon Pub Imprints). The AAP and Nielsen PubTrack, on the other hand, provide aggregate absolute numbers… but for only one and a half of those sectors (Big 5 + a subset of Small/Medium Publisher). Thus by tagging AAP & PubTrack-participating publishers in our AE data, I was able to recalibrate our AE rank-to-sales curve to match the AAP’s monthly dollar sales and PubTrack’s 2014 total units. That yielded equally accurate absolute numbers for the other, non-AAP-measured ebook categories: non-participating Small/Medium Publishers, Indie Published books, and those from Amazon Pub Imprints.

Read More

By Porter Ander­son

The FutureBook:  #FutureChat Friday — Data-Dancing: How big is self-publishing?

Read the full post at:


Join The Bookseller’s The FutureBook #FutureChat each Friday at 4 p.m. London (BST), 5 p.m. Rome (CEST), 11 a.m. New York (ET), 10 a.m. Chicago (CT), 9 a.m. Denver (MT), 8 a.m. Los Angeles (PT), 5 a.m. Honolulu (HAST).


600 x 200 TEP this one

Bookseller Mktg & Publicity 600 x 180


Banner big

Matera 600 x 200

600 X 200 NINC

Leave a Reply