DSA 210 Spring 2025 Term Project
Public libraries serve as critical community resources, offering free access to education, technology, and cultural programs. However, library engagement—such as book checkouts or event attendance—varies widely across neighborhoods. These disparities often reflect underlying socioeconomic factors like income levels, education, and population density. By analyzing library usage alongside demographic data, we can uncover patterns that guide fair resource distribution and advocate for data-driven policy decisions.
This project focuses on answering two key questions:
- How do community demographics correlate with library engagement metrics (e.g., checkouts, program attendance)?
- Can we predict library usage patterns using socioeconomic indicators?
- Source: IMLS Public Library Survey (2022)
- Key Features:
TOTCIR: Total circulation of materialsVISITS: Total annual library visitsREGBOR: Number of registered usersTOTSTAFF: Total paid FTE employeesGPTERMS: Internet computers used by the general publicHRS_OPEN: Total annual public service hours for all service outletsTOTPRO: Total number of synchronous program sessionsTOTATTEN: Total attendance at synchronous programsTOTINCM: Total operating revenueTOTOPEXP: Total operating expendituresPOPU_UND: Unduplicated population of the legal service area for the libraryZIP_CODE: Administrative ZIP code of the library system
- Source: US Census ACS 5-Year Estimates (2022)
- Key Features:
MEDIAN_INCOME: Median household income by ZIP Code Tabulation Area (ZCTA)BACHELORS_PERCENT: Percentage of population with a bachelor’s degree or higherUNEMPLOYMENT_RATE: Unemployment rateZIP_CODE: ZIP code
-
IMLS Data:
- Downloaded the
PLS_FY22_AE_pud22i.csvfile from the IMLS website. - Filtered columns to retain
ZIP_CODE,TOTCIR,VISITS,REGBOR,TOTSTAFF,TOTATTEN,TOTPRO,GPTERMS,TOTINCM,TOTOPEXP,HRS_OPEN,POPU_UND.
- Downloaded the
-
Census Data:
- API Integration: Fetched data using the US Census Bureau API (ACS 5-Year Estimates 2022).
- Tools: Python
censuslibrary with a registered Census API key. - Features: Extracted
B19013_001E(median income),B15003_022E(% bachelor’s degree), andB23025_005E(unemployment rate). - 📎
fetch_census_data.py
- Tools: Python
- API Integration: Fetched data using the US Census Bureau API (ACS 5-Year Estimates 2022).
- Merged datasets on
ZIP_CODE: 📎merge-datasets.py