Twitter’s spam problem
There’s been an increasing amount of reporting about the underground economy that powers the sale of retweets and followers on Twitter, leading to some questions as to how popular the service truly is. With reports of 500 million registered accounts, what is the actual number of users on the service?
To figure things out, I tried to get a sense of how widespread the problem is and I used the top 25 accounts on Twitter, as reported by Twitter counter:
|Name||Account||Number of followers|
This provided me with a good sample set of anywhere between 13 and 37 million users, or roughly 2-7% of Twitter’s 500 million registered accounts. Considering Twitter recently reporting that it had 200 million active users, we should be able to observe similar patterns in that data set, or 4-14 million active users.
To assess the percentage of active users, I used a tool called the Fake Follower checker by StatusPeople. When given a Twitter account, the system reports how many users it thinks are fake, how many are inactive accounts (ie. could be real accounts but do not have much activity) and how many are active accounts. Throwing our data sample at this, we got the following percentages:
or, in terms of real numbers, this is what it looks like:
|Account||Number of followers||Fake followers||Inactive||Good|
Looking through this data, we find that, on average, 42.44% of users are fake, while another 30.12% are inactive. This means that only 27.44% of users in that population are active on the service. If we were to look at Twitter’s 500 million registered accounts, this would mean that a bit over 137 million users are active on the service, leaving us with a question mark around the other 63 million active users Twitter is reporting. One could assume that, as you move further down the list of users, the number of inactive accounts drop and the number of fake followers also does. Or could it be that the 63 million missing users in that count represent the population of fake accounts on the service?
We could try to look at it through the lens of inactive users. Twitter is reporting that roughly 40% of their registered accounts are inactive (since they have reported 500 million accounts and 200 million active users). This would mean that, among the top 25 most followed accounts, we have a more active population of followers, as only 30% are reported inactive. However, this also means that there is a 25% divergence between the two numbers. So if we were to apply the same logic to the Fake followers numbers, we may see that 31.83% of Twitter’s users could be fake, or roughly 159 million accounts.
… and if those accounts were acting as active accounts, we could be looking at an active population on Twitter of only roughly 40 million real users.
Why Twitter followers matter
One may think that looking at Twitter followers is a ridiculous endeavor. However, in a world where anything from reputation to credit worthiness is increasingly based on such information, it is important that we look at the source data and ensure its integrity. Today, one can buy Twitter followers by the thousands for mere penny per user. The fake accounts that plague the service are undermining it, creating an environment where one cannot necessarily trust the audience. If Twitter’s future is based on delivering audiences to advertisers, the company needs to do a better job culling its user base and going after those fake follower accounts.
Will Twitter clean up its act? At the current time, it seems hard to tell. The company has not made any indication that it sees the underground economy of follower sales and spam accounts as anything to be too concerned about. If anything, the company may actually be disincentivized from doing anything about it, as those fake followers make the service appear a lot busier than it actually is and appear to be narrowing the gap in userbase between Twitter and Facebook.
But as it keeps delaying dealing with the issue, the problem will become larger and more ingrained, making it more difficult to deal with further down the line. Already, fake account creators have now figured ways to create profiles that appear legitimate on first pass. The reason for their working so hard to do so is that better looking fake profiles represent better sales for fake followers vendors (the murky world of followers sales has already divided itself into bottom feeders who will sell any kind of followers and “premium” vendors who claim to provide a guarantee that the followers they add are real.) The market is speaking but it is the same kind of murky market that created the spam that destroyed usenet forums, the spam that made it more difficult to operate email services, and the kind of dark SEO techniques that have gotten the likes of Google and Microsoft to spend millions to fight. Social spam, as it already exists on Twitter, will continue to grow and unless the company addresses the problem quickly, it may be the one thing that sinks it.