Image credit: Kevin Dooley on Flickr
This is a follow-up post to this essay on accusations of censorship on Reddit and the unpredictable consequences of algorithmic quirks.
Reddit is the self-described “front page of the internet.” Millions of users rely on Reddit to keep them informed on a wide range of topics from world news to gaming developments to the latest in pictures of cute dogs (or, often as not, reposts of pictures of cute dogs). But what happens when the front page fails us, and how do Reddit administrators respond?
In the aftermath of the Reddit debacle surrounding /r/news deletions of posts about the Orlando shooting, Reddit has rolled out new changes to the algorithm that ranks posts on /r/all. /r/all is the frontest of Reddit front pages; it is the algorithmic ranking of popular posts from all subreddits that 1) choose to be featured on the front page and 2) haven’t been quarantined based on questionable (read: bigoted) content. The exact details of Reddit’s ranking algorithm are complicated and unnecessary for this brief discussion, but it’s essentially a combination of when something was posted, how many upvotes it has received, and how recently it’s been upvoted. Time + Attention = Rank. You can read more about it in this (slightly dated) explainer.
According to Reddit CEO Steve Huffman, /u/spez, the new algorithm has been in the works for a while but was implemented earlier than intended given recent events. The quick and dirty of the incident is: /r/news mods deleted a large number of posts and comments on the Orlando shooting, leaving a huge chasm on the front page that was filled with reports from the Donald Trump subreddit /r/The_Donald. Redditors were (rightfully) outraged that the incident was only being reported on by a widely-despised, non news-related subreddit.
Huffman announced the changes shortly after the /r/The_Donald incident in a post that he called a “town hall” about /r/all. In it, he argued that “/r/all is a reflection of what is happening online in general. It is culturally important and drives many conversations around the world.” Leaving aside the grandiosity of this statement, Reddit is undoubtedly an important source for the millions of users who depend on the site to stay up to date on new, popular online content.
The changes to the algorithm are, to hear Huffman describe them, relatively simple. The goal is to “prevent any one community from dominating the listing… as a community is represented more and more often in the listing, the hotness of its posts will be increasingly lessened. This results in more variety in /r/all.” In other words, /r/The_Donald, or any other subreddit, can’t completely take over the front page.
For most users, Reddit’s algorithm is a black box. Content goes in, content comes out, and what happens in-between is neither comprehensible nor relevant. However, when the input and output of Reddit’s black box changes significantly, everyone notices.
Initial reactions in Huffman’s “town hall” thread were largely positive and hopeful. But in the following days and weeks, some redditors began noticing a major, frustrating change: not enough new content. An undoubtedly unintended consequence of the change was greater stability on /r/all, with posts hanging around longer and new content rising to the top much more slowly.
Also, porn. Lots of porn. The insurgence of porn on the first few pages of /r/all is likely due to two factors: 1) porn is really popular on Reddit and 2) there are so many porn subreddits that the possibility of any one of them overtaking /r/all and, subsequently, getting pushed down the listing is less likely.
Of course, porn is one of those things where even a small uptick is very noticeable. So just how much of /r/all is porn, relative to other things? This morning I did a rough analysis of the first five pages (top 100 posts) of /r/all. Here’s a general breakdown of the number of posts related to each of these popular topics:
Pets: 6
Sports: 6
Gifs (funny, interesting, reaction): 12
Gaming: 11
News: 3
Politics: 3
Porn: 12
It was a lot of porn. Porn’s only real competitors are gaming and gifs, and “gifs” comprises so many different topics it hardly seems fair to count it as a category. The amount of porn content on /r/all has gotten so large that some users have asked for a safe-for-work version of /r/all so they can browse during office hours.
So the algorithm change that was supposed to prevent large subreddits like /r/the_donald from dominating the front pages of /r/all has instead paved the way for a mass of porn and significantly reduced the amount of news content, while also reducing the frequency with which content changes. If most of /r/all’s news content comes from the large subreddits like /r/news, /r/worldnews, and /r/politics, then the new algorithm makes it less likely for multiple posts from these large subreddits on various topics to make it to the top pages. Meanwhile, because there are so many porn subreddits, it’s easy for that content to rise to the top.
In trying to develop some conclusions about this phenomenon, I spoke with my media studies colleague Nick Hanford. He was struck by Huffman’s description of the changes to the algorithm, and of Reddit generally, as constructing some sort of Platonic Ideal of the internet; a “true” representation of what is “really” happening at any given moment online. This is certainly one of the failings of Reddit’s /r/all algorithm, both before and after the change; it cannot hope to accomplish the lofty aims set by Huffman because the internet is a many-sided prism, and it cannot be reflected on a flat page of ranked, numbered posts. My internet may have significantly more recipes or cat gifs than yours, and “what is happening online in general” as Huffman puts it will vary greatly from region to region, not to mention various subcultural groups. /r/all can never really be /r/all, it is necessarily /r/some.
Perhaps the biggest takeaway from this story is best summed up by Mr. Hanford: “PORN RULES ALL, ESPECIALLY SO-CALLED DEMOCRACIES.”*
Britney is on Twitter
*Nick Hanford agreed to be quoted for this essay on condition that he be quoted in caps lock.