Abstract
Purpose
Libraries throughout the world use OCLC’s EZproxy software to manage access to e-resources. When cleaned, processed, visualized and enhanced, these logs paint a valuable picture of a library’s impact on researcher’s lives. The purpose of this paper is to share techniques and procedures for enhancing and de-identifying EZproxy logs using Tableau, a data analytics and visualization software, and Tableau Prep, a tool used for cleaning, combining and shaping data for analysis.
Design/methodology/approach
In February 2018, The Ohio State University Libraries established an automated daily process to extract and clean EZproxy log files. The assessment librarian created a series of procedures in Tableau and Tableau Prep to union, parse and enhance these files by adding information such as user major, user status (faculty, graduate or undergraduate) and the title of the requested resource. She last stripped the data set of identifiers and applied best practices for maintaining confidentiality to visualize the data.
Findings
The data set is currently 1.5m rows and growing. The visualizations may be filtered by date, user status and user department/major where applicable. Safeguards are in place to limit data presentation when filters might reveal a user’s identity.
Originality/value
Tableau used in concert with Tableau Prep allows an assessment librarian to clean and combine data from various sources. Once procedures for cleaning and combining data sources are established, the data driving visualizations can be set to refresh on a set schedule. This expedites the ability of librarians to derive actionable insights from EZproxy data and to share the library’s positive impact on researcher’s lives.
Subject
Library and Information Sciences
Reference14 articles.
1. Insights from Jisc & HESA analytics labs: an agile cross-institutional approach,2017
2. Dennison, C.C. and Sung, J.S. (2018), “Finding hidden treasures in the data”, paper presented at Library Assessment Conference, Houston, TX, 5–7 December, available at: https://libraryassessment.org/wp-content/uploads/2018/12/Dennison-Sung-FindingHiddenTreasures.pptx (accessed 11 July 2019).
3. Gartner (2019), “Magic quadrant for analytics and business intelligence platforms”, available at: www.gartner.com/doc/reprints?id=1-68720FP&ct=190213&st=sb (accessed 11 July 2019).
4. Analyzing EZproxy SPU logs using python data analysis tools;Code4Lib Journal,2018
5. Just because you can doesn’t mean you should;Portal: Libraries and the Academy,2019