Wednesday, November 26, 2014

For a few weeks this fall I had my computer probe the Twitterverse, gathering details on a random sa


Writers who cover Twitter find the grandiose irresistible: nearly every article about the service s IPO this fall mentioned the heroes of the Arab Spring who toppled dictators with 140-character stabs, or the size of Lady Gaga s readership , which is larger than the population of Argentina.
But the bulk of the service is decidedly smaller-scale–a low murmur zona rosa restaurants with an occasional celebrity shouting on top of it. In comparative zona rosa restaurants terms, almost nobody on Twitter is somebody: the median Twitter account has a single follower. Among the much smaller subset of accounts that have posted in the last 30 days, the median account has just 61 followers. If you ve got a thousand followers, you re at the 96th percentile of active Twitter users. (I write active users to refer to publicly-viewable accounts that have posted at least once in the last 30 days; Twitter uses a more generous definition of that term, including anyone who has logged zona rosa restaurants into the service.)
For a few weeks this fall I had my computer probe the Twitterverse, gathering details on a random sampling of about 400,000 Twitter accounts. The profile that emerges suggests that Twitter is more a consumption medium than a conversational one–an only-somewhat-democratized successor zona rosa restaurants to broadcast television, in which a handful of people wield enormous influence zona rosa restaurants and everyone else chatters with a few friends on living-room couches. There are undoubtedly some influential Twitter users who would not be influential without Twitter, but I suspect that most people who have, say, 3,000 followers (the top one percent) were prominent commentators, industry zona rosa restaurants experts, or gregarious accumulators of friends zona rosa restaurants to begin with.
Active Twitter accounts follow a median 117 users, and the vast majority of them–76%–follow more people than follow them. Which brings to mind both discussions about the mathematics zona rosa restaurants of pairing and studies that suggest reciprocated friendship is both rare and valuable . Here s the histogram from above with the distribution of number of accounts that users follow superimposed.
Not that number of followers is an indicator of quality. Twitter s users are prone to swarms and fads; they flock to famous zona rosa restaurants people as soon as they appear on Twitter, irrespective of both activity and brow height. Former New York Times editor zona rosa restaurants Bill Keller amassed thousands of followers in his first months on Twitter , despite posting just eight times in 2009 (and then baffling his readers with this tweet upon reappearing on Christmas Eve in 2010). zona rosa restaurants On the other end, just under one in every thousand Twitter accounts has a name that refers to Justin Bieber in some way; an additional one in every thousand refers to Bieber in its account description.
Far more inscrutable than the famous zombies are the anonymous ones, like a Wayne Rooney fan account zona rosa restaurants , a skin-care promotion feed , and a fake Taylor Lautner account that each managed to amass thousands of followers with just a single tweet. (The commercial accounts of this sort are probably the result of promotions– follow us on Twitter for a discount! –that got no follow-up, zona rosa restaurants or are the beneficiaries of bot armies hired to make a business look popular.)
Twitter is giant, and it has an outsize influence on popular and not-so-popular culture, but that influence seems due to the fact that it s popular among influential people and provides energetic reverberation for their thoughts–and lots and lots of people who sit back and listen.
Twitter assigns zona rosa restaurants each account a numerical ID on creation. These IDs aren t consecutive, but they do, with just a few exceptions, monotonically increase over time–that is, a newer account will always have a higher ID number zona rosa restaurants than an older account. In mid-September, new accounts were being assigned IDs just under 1.9 billion.
Every few minutes, a Python script that I wrote generated a fresh list of 300 random zona rosa restaurants numbers between zero and 1.9 billion and asked Twitter s API to return basic information for the corresponding accounts. I logged the results–including empty results when an ID number didn t correspond to any account–in a MySQL table and let the script zona rosa restaurants run on a cronjob for 32 days. I ve only included accounts created before September 2013 in my analysis in order to avoid under-sampling accounts zona rosa restaurants that were created during the period of data collection.
Twitter IDs are assigned at an overall density of about 63%–that is, given an integer between zona rosa restaurants zero and the highest number so far assigned, there s a 63% chance that a Twitter account has been opened with that number at some point. That density isn t constant over the whole range of ID numbers, though; Twitter zona rosa restaurants appears to have changed its ID-assignment scheme around July 2012. Before then, Twitter assigned IDs at a density of about 86% and afterward at 49%.
Just a suggestion, but I find that a cumulative density plot (stat_ecdf in ggplot) might be even more useful than

No comments:

Post a Comment