Library Carpentry: Tools for Librarians and Humanists

George Washington University Libraries

Postponed, new dates TBA

Instructors: Laura Wrubel, Dolsy Smith, Leah Richardson

Helpers: Dan Kerchner

General Information

Who: The course is for anyone in the GWU community interested in learning more about processing and analyzing text-based data. This workshop is open to GW students, faculty, and staff who have a physical GWorld card enabled for campus access. Attendees MUST be able to tap into Gelman Library in order to attend. You don't need to have any previous knowledge of the tools that will be presented at the workshop.

Where: Room 301/302 Gelman Library, George Washington University 2130 H St NW, Washington, DC 20052. Get directions with OpenStreetMap or Google Maps.

When: Postponed, new dates TBA. Add to your Google Calendar.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below).

Accessibility: We are committed to making this workshop accessible to everybody. For workshops at a physical location, the workshop organizers have checked that:

Materials will be provided in advance of the workshop and large-print handouts are available if needed by notifying the organizers in advance. If we can help making learning easier for you (e.g. sign-language interpreters, lactation facilities) please get in touch (using contact details below) and we will attempt to provide them.

Contact: Please email , or for more information.

Code of Conduct

Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.


Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey


Attendtion: Library Carpentry Cancelled/Postponed

Due to the University's recent cancellation of all in-person events, we will be rescheduling this workshop for the spring. We appreciate your understanding.

Day 1

Before Starting Pre-workshop survey
9:00 Introductions and Set-up
9:30 Jargon Busting
10:00 OpenRefine Part 1
11:00 Break
11:15 Regular Expressions Part 1
12:00 Lunch
1:00 Regular Expressions Part 2
2:00 Break
2:15 Python Part 1
4:30 END

Day 2

09:00 OpenRefine Part 2
10:30 Morning break
10:45 Python Part 2
12:00 Lunch break
1:00 Python Part 2-3
2:30 Afternoon break
2:45 Python Part 3
4:15 Wrap-up
4:30 Post-workshop survey


To participate in a Library Carpentry workshop, you will need access to software as described below. In addition, you will need an up-to-date web browser.

We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.


OpenRefine is a tool to clean up and organize messy data. Please find instructions to install it here. Participants should install the latest stable version of OpenRefine 3.5.0. It is recommended that Windows users install the Windows kit with embedded Java.

Data files

Please download the zipped folder of files for use in the exercises for Python and OpenRefine. Once you have downloaded the zipped file, locate the file on your computer and double click to unzip it.