Blog

In version 1.12.0, we paid down some technical debt, changed directions on web harvesting and SFM ELK, and added new features to make managing collections an...

March 20, 2018

Command-line tools for wrangling Twitter data

A Jupyter notebook that demonstrates various command-line tools for manipulating Twitter data.

January 02, 2018

Collecting Facebook data with the Graph API

In which I provide an introduction to the Facebook Graph API and f(b)arc, a tool for collecting Facebook data.

December 13, 2017

Web archiving and/or/vs social media API archiving

In which I discuss the differences between collecting social media via web capture and APIs.

November 06, 2017

Vulnerabilities in the U.S. Digital Registry, Twitter, and the Internet Archive

In which I demonstrate vulnerabilities in the U.S. Digital Registry, Twitter, and the Internet Archive to manipulate web history.

November 04, 2017

Suspended U.S. government Twitter accounts

In which I look at suspended Twitter accounts listed in the U.S. Digital Registry, including some recently tweeting in Russian.

October 23, 2017

Announcing SFM Version 1.11

A minor release with two (hopefully) useful enhancements.

October 11, 2017

A Peek at 251,077,140 #election2016 tweets

With the tweets from the 2016 U.S. presidential election collection now added to TweetSets, we take a look at the top tweeters, mentions, hashtags, and URLs.

September 14, 2017

Where to get Twitter data for academic research

Justin Littman explains the options for acquiring Twitter data for academic research.

September 08, 2017

Building Social Network Visualizations using SFM Data

Soomin Park provides an introduction to network analysis and visualization with instructions for SFM and demonstrations with Gephi, Kumu, Cytoscape, and igra...

August 29, 2017

Announcing SFM Version 1.10

A description of the final release of SFM under our grant from the NHPRC.

August 10, 2017

Finding Twitter handles with Scrapy

Yonah Bromberg Gaber shares a proof-of-concept using Scrapy for finding Twitter handles in websites.

July 26, 2017

SFM for Archivists: Data Model Overview and Implications for Preservation

In the fifth post in the SFM for Archivists series, Chris Prom digs into the SFM data model and the implications for preservation.

July 19, 2017

SFM for Archivists: Establishing a Policy Basis for Access

In the fourth post in the SFM for Archivists series, Chris Prom offers three basic access scenarios for social media collections.

July 12, 2017

SFM for Archivists: Research Consultation Service

In the third post in the SFM for Archivists series, Chris Prom explores possibilities for a social media research consulation service.

July 06, 2017

SFM for Archivists: Shaping Local Services

In the second post in the SFM for Archivists series, Chris Prom discusses strategies for shaping local collections.

June 28, 2017

SFM for Archivists: Use Cases for Institution-Led Collecting

In the first post in the SFM for Archivists series, Chris Prom outlines a few potential institution-led collecting efforts that repositories may wish to cons...

June 23, 2017

Announcing SFM Version 1.9

On the eve of the one year anniversary of version 1.0, the SFM team released version 1.9.

June 20, 2017

Iranian Election 2017

Middle East & North Africa Librarian Amanda Hannoosh Steinberg shares her experiences creating curated research collections for Middle East Studies, includin...

June 19, 2017

Announcing Social Feed Manager: Guide for Building Social Media Archives

We're pleased to share this report by Christopher Prom of University of Illinois, which assesses Social Feed Manager, offers thoughts for how it can support ...

June 01, 2017

New resource: bibliography on ethics and legal aspects of social media collecting

A selected list of resources for consideration when embarking on social media collecting.

May 23, 2017

A List of Twitter Handles for Members of Congress

We collect tweets from almost every member of Congress, and we provide the list of handles here.

May 18, 2017

Implications of changes in Twitter’s Developer Policy

On May 18, Twitter announced changes to its Developer Policy that has significant impacts for researchers and archivists. The goal of this blog post is to de...

May 15, 2017

Announcing SFM Version 1.8

In version 1.8 of Social Feed Manager, we treated ourselves to a handful of enhancements and fixes.

May 08, 2017

A Day of Collecting with Social Feed Manager

Here's what we collected on May 4, 2017 with SFM.

April 12, 2017

Collecting by Geographic Location

SFM provides the opportunity to collect useful metadata about the geographic location of tweets provided by the Twitter API.

April 06, 2017

Announcing SFM Version 1.7

A quick blog post for a quick release.

March 31, 2017

Faculty Research Salon

On March 28, two dozen George Washington University faculty gathered over lunch to discuss social media research.

March 31, 2017

On Extended Tweets

This post describes the impact of recent changes made by Twitter to allow extended tweets on the REST and Streaming APIs.

March 22, 2017

Announcing SFM Version 1.6

In between GW’s spring break and late season snow storm, we squeaked in version 1.6 of Social Feed Manager.

March 20, 2017

Collection development guidelines for social media archiving

We're pleased to share some recently-developed guidelines on building social media archives.

March 15, 2017

Releasing datasets to Dataverse

Instructions are now available for exporting a dataset from SFM and publishing to Dataverse.

February 07, 2017

Announcing SFM Version 1.5

We've made incremental improvements to several key features in version 1.5.

December 21, 2016

Announcing SFM Version 1.4

Under the wire, version 1.4 is our 8th release of SFM for 2016.

November 30, 2016

2016 United States Presidential Election Tweet Ids dataset released

The SFM team released a dataset of 280 million tweet ids related to the 2016 U.S. presidential election.

November 23, 2016

Recipes for processing Twitter data with jq

To help our future selves (and you), we've documented some of the tips and tricks for processing Twitter data with jq.

November 15, 2016

Announcing SFM Version 1.3

The highlights of version 1.3 of Social Feed Manager.

November 10, 2016

On retweets, replies, quotes & favorites: A guide for researchers

A Jupyter notebook that explores the affordances of the Twitter API for retweets, replies, quotes, and favorites.

November 01, 2016

Your SFM (content) exit strategy

This blog post describes how to get your content OUT of SFM.

October 20, 2016

Announcing SFM Version 1.2

The SFM team has overcome the confluence of a series of conferences, vacations, holidays, nasty colds, and other deadlines to get version 1.2 wrapped up.

October 12, 2016

Anatomy of a Social Media Harvester

Version 1.2 of Social Feed Manager will contain a passel of improvements to our social media harvesters.

September 07, 2016

Announcing SFM Version 1.1

In between summer vacations and public transit hurdles, we've cranked out release version 1.1 of Social Feed Manager.

September 07, 2016

When is a Collection not an Archive?

The Twitter API is a powerful tool for archiving, but its use poses particular challenges for archivists.

July 11, 2016

Extracting URLs from #PulseNightclub for seeding web archiving

Last week, Internet Archive put out a call for URL nominations for a Pulse Nightclub web collection. This blog post describes how I extracted 200,094 unique ...

June 20, 2016

Announcing SFM Version 1.0

We're pleased to release version 1.0 of Social Feed Manager, which includes the functionality to select, create, manage, and explore social media collections.

June 17, 2016

SFM at Archives Unleashed 2.0 and the Saving The Web symposium

This week, GW Libraries' software development librarians working on Social Feed Manager participated in the Archives Unleashed 2.0 hackathon hosted at the Li...

June 13, 2016

#PulseNightclub

It is heartbreaking that our first use of SFM to capture a breaking event was for the shooting at the Pulse Nightclub. My thoughts go out to the families, t...

May 04, 2016

Another Try at Harvesting the Twitter Streaming API to WARC files

We'e abandoning record segmentation for harvesting the Twitter Streaming API to WARC files and trying a new approach.

April 26, 2016

Weibo API Guide

This is a guide for programmers and researchers who intend to use Weibo's API. Since the current official documentation hasn't been updated for a long time, ...

March 14, 2016

Tracking Changes to Harvests in Social Feed Manager

In her blog post, “Social Media for Good: the Series, Episode 2”, DPC’s Sara Day Thomson explains: New work also reveals the heightened importance of archive...

January 13, 2016

An Experiment with Social Feed Manager and the ELK Stack

The latest in our social media harvesting experiments for the Social Feed Manager project involves analysis, discovery, and visualization of social media con...

December 15, 2015

Harvesting the Twitter Streaming API to WARC files

The Twitter Streaming API is very powerful, allowing harvesting tweets not readily available from the other APIs.

October 28, 2015

Social Media Harvesting Techniques

Social Feed Manager (SFM) is a tool developed by the Scholarly Technology Group for harvesting social media to support research and build archives. As part o...