As a violent mob incited by President Donald Trump stormed the US Capitol on January 6, halting the procedure in Congress to formally certify Joe Biden as president-elect, a Redditor with the username Adam Lynch began a thread on the subreddit r/DataHoarder — a forum dedicated to hoarding data that might be erased or deleted. “Archiving videos before potential removal from various websites…” it began.
Within minutes, the thread was so inundated with Twitter links, Snapchat uploads, and other videos that the link was shut down by Mega, a New Zealand–based cloud storage service. Since being reopened, the Reddit thread has received over 2,000 comments with detailed data from the incident.
Lynch, who asked not to be identified by their real name because they have received death threats for their work, is Canadian and was shocked to see the images from Washington. Lynch said they felt an urgency to archive data as soon as possible because they had seen videos, posts, and livestreams get quickly taken down by both platforms and users afraid of repercussions in the aftermath of the Black Lives Matter protests last summer.
“I knew I had to start immediately,” Lynch said.
Livestreams were turned off by platforms and broadcast news networks during the attack on the Capitol, and companies like Facebook, YouTube, Twitch, and Twitter have since systematically removed posts that violated policies against violent or incendiary content. As Redditors send in content, Lynch has spent hours each day uploading it to Mega as well as to offline hard drives for backup.
“If it weren’t for the Mega thread, I am very confident a substantial part of this would not be kept,” Lynch says. But many others are also working to protect information before it disappears. An Instagram account, @homegrownterrorists, garnered about 242,000 followers, crowdsourcing efforts to identify members of the mob. (The account was briefly deactivated and cleared of posts; it was reactivated and started posting ordinary links to news articles on January 8. The account holder did not respond to a request for comment.) The journalism site Bellingcat, which specializes in investigations based on publicly available online material, invited the public to contribute to a publicly editable Google spreadsheet of links, and the Woke collective is protecting livestreams from being erased by publishing them on its own YouTube and Twitch accounts. Other firms, like European search engine Intelligence X, are also collecting and storing data.
These efforts are notable for their broad reach, says Gabriella Coleman, an anthropologist at McGill University who studies the politics and ethics of hacking. “Places like Reddit were really central in the past [for doxxing, i.e. revealing people’s identifying information] and continue to be because you get subreddits and threads where everybody is contributing to particular efforts,” Coleman says. “The difference now is that people share that information on Twitter and once that person is identified, that information is far more visible. It used to just be [hacktivist group] Anonymous that did that.”
Coleman says that Anonymous’s efforts were once considered extreme, but with each passing protest, doxxing has become more mainstream. “Of course, you’ve also got groups like Bellingcat who are like amateur professionals when it comes to open source intelligence formalized into an organization,” Coleman says. “But you’re continuing to see masses of people come together online [and doxx].”
That creates ethical quandaries. The data now being archived could haunt people in the photos for years afterward, even if they later renounce or pay criminal penalties for their actions. On r/DataHoarder, for instance, someone asked, “Do you think it’s ethical to preserve content that features someone who now wants the content to no longer be public?”
I asked Lynch whether it was two-faced for them to ask me to protect their anonymity when they were busy exposing members of the mob.
Lynch says their conscience is clear. “I believe people have the right to protest and share their voice,” they say. “If they [mob members] wanted to protect their identity they could have easily worn a mask or not livestreamed. But they didn’t wear a ski mask, not even a covid mask.”
“I think certainly a lot of this is context dependent,” Coleman says. “If you are engaging in an activity that is meant to call attention to the activity itself and don’t take precautions to hide your identity, it’s understandable how there will be people who will take that information and make it public.”
Lynch, who plans to ultimately submit the data they’re collecting to the Library of Congress,, believes they are preserving history. “We can only hoard what the world gives us,” they say. “We’re just librarians.”