Affiliation:
1. University of California, San Francisco, USA
Abstract
This chapter describes work by the UCSF Industry Documents Library to develop resources, programs, and initiatives to support data science work with a diverse audience in the fields of health sciences, history of medicine, public health policy, and tobacco control. The Industry Documents Library (IDL) is a digital archive of over 15 million documents created by industries impacting public health, hosted by the University of California, San Francisco (UCSF) Library. The chapter describes the public health impact of industry documents research, highlights several examples of computational projects conducted by IDL scholars, outlines the IDL's developing plans for using data science techniques to assist with large-scale digital collection appraisal and metadata enhancement, and discusses how the IDL is expanding its collaborations with the UCSF Library's Data Science Initiative and Archives and Special Collections departments to further develop impactful data science programs across the university.
Reference30 articles.
1. Disrupting the library: Digital scholarship and Big Data at the National Library of Scotland
2. Amin, K. (2017, April 17). UCSF Archives $315,000 to digitize AIDS archives. Retrieved from UCSF Library: https://www.library.ucsf.edu/news/neh-awards-ucsf-archives-315000-to-digitize-aids-archives/
3. Archival Appraisal and the Digital Record: Applying Past Tradition for Future Practice
4. BitCurator. (n.d.). BitCurator NLP. Retrieved from BitCurator: https://bitcurator.net/bitcurator-nlp/
5. California Office of the Attorney General. (2021, November 24). Master settlement agreement. Retrieved from State of California Department of Justice, Office of the Attorney General: https://oag.ca.gov/tobacco/msa