Data Carpentry: Genomics - GWU

George Washington University

Mar 5-6, 2020

9:00 am - 5:00 pm

Instructors: Jennifer Shelton,

Schuyler Smith

General Information

Data Carpentry develops and teaches workshops on the fundamental data skills needed to conduct research. Its target audience is researchers who have little to no prior computational experience, and its lessons are domain specific, building on learners' existing knowledge to enable them to quickly apply skills learned to their own research. Participants will be encouraged to help one another and to apply what they have learned to their own research problems.

For more information on what we teach and why, please see our paper "Good Enough Practices for Scientific Computing".

Where: Room 301/302
Gelman Library, George Washington University
2130 H St NW, Washington, DC 20052.
Get directions with OpenStreetMap or Google Maps.

When: Mar 5-6, 2020. Add to your Google Calendar.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below).

Accessibility: We are committed to making this workshop accessible to everybody. The workshop organizers have checked that:

Materials will be provided in advance of the workshop and large-print handouts are available if needed by notifying the organizers in advance. If we can help making learning easier for you (e.g. sign-language interpreters, lactation facilities) please get in touch (using contact details below) and we will attempt to provide them.

Contact: Please email or for more information.


All required software for this workshop is freely available, and can be found in the Setup tab.

To participate in this workshop, you will need access to all of these software, as well as a web-browser. Instructors and assistants will not always be able to take the time to help with installations during the lessons, so please be sure to install and test them prior to attending.

We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.

Collaborative Notes

We will use this collaborative document for chatting, taking notes, and sharing URLs and bits of code.


Please be sure to complete these surveys before and after the workshop. These surveys are to help both us and yourselves assess the workshop.

Pre-workshop Survey

Post-workshop Survey

Schedule (tentative)

March 5th

Before starting Pre-workshop surveyJennifer & Schuyler
Morning Project Organization and ManagementJennifer Shelton
Introduction to the Command LineJennifer Shelton
AfternoonData Wrangling and ProcessingSchuyler Smith

March 6th

MorningContinuation: Data Wrangling and ProcessingSchuyler Smith
AfternoonIntroduction to Cloud Computing for GenomicsJennifer Shelton
Post-workshop surveyJennifer & Schuyler