Thursday, February 18, 12:30-1:45 pm
Strozier 107A (Main Floor Instructional Lab)
Preparing “Messy Data” with OpenRefine: A Workshop
The fourth meeting of Digital Scholars for Spring 2016 will be conducted as a workshop, led by Dr. Richard J. Urban of FSU’s School of Information, who will walk us through two tutorials on how to use OpenRefine for digital humanities scholarship, both for gathering and for interpreting unread data sets. Formerly a Google tool for data management, OpenRefine has recently been optimized for understanding, manipulating, and transforming data of any kind, combining extant data sets (such as those that researchers have compiled in Excel spreadsheets) with open data obtained through web services and other external links. From large-scale repositories and networks to small-scale archives and visualizations, most projects constructed or used by digital scholars have benefited from data management with OpenRefine or similar tools.
Participants are encouraged to browse the following resources in advance:
- Padilla, T. (n.d.) Getting started with OpenRefine
- van Hooland, S., Verborgh, R., & De Wilde, M. (2013). Cleaning data with OpenRefine
and to read the following for background:
- Handelman, M. (2015). Digital Humanities as Translation: Visualizing Franz Rosenzweig’s Archive. Transit, 10(1). Retrieved from http://escholarship.org/uc/item/69d0g81v.pdf
- Kaplan, F. (2015). A map for Big Data research in Digital Humanities. Frontiers in Digital Humanities, 2(1).
- van Hooland, S., De Wilde, M., Verborgh, R., Steiner, T., & Van de Walle, R. (2015). Exploring entity recognition and disambiguation for cultural heritage collections. Digital Scholarship in the Humanities, 30(2), 262–279.
Access to OpenRefine will be provided in the Strozier Library Learning Lab; thus, registration is helpful (though not required) so that we can gauge attendance. Participants are welcome to bring their own devices and install OpenRefine during the session. While Dr. Urban will mostly focus on these tutorials, participants are also welcome to bring datasets that they would like to discuss or explore.
We hope you can join us,