Site icon AICollaboratory

Students Explore Computational Treatments of Archival Collections

SUMMER 2024: Students Engage w. Innovative Technologies to Explore the Future Processing of Archival Collections through spatial, graph & genAI Techniques

Credits: knowledge graph in header by Andrea Svejda (1950 Census for Asheville ED58: household heads with jobs by street)

Over a 6-week period, nineteen students engaged in a series of single-credit experimental summer courses that explored the future of library and archives collection processing:

Students spanned all three courses with 15 in the G course (genAI), 12 in the C course (network), and 11 in the A course (spatial). The nature of having three overlapping cohorts taking all three, two, or single courses made this a challenging but valuable teaching experience!

Overview:

To tie these courses together, students used cultural content from the Asheville N.C. Urban Renewal Project (https://www.youtube.com/watch?v=kbjZTA0r5V8), including data that was recently contributed to the Asheville Racial Reparations Commission.

What was remarkable was that half-way through these summer courses, on June 17, 2024, the Reparations Commission voted on a cash settlement amount in the value of $148K to families/businesses negatively impacted by Urban Renewal (for Property Value loss) – essentially harmed by the displacement caused by Urban Renewal.  See proposal text at: https://drive.google.com/file/d/13ufHIjS5DAkuJLY1_YtYJvw70t93g-Ej/view.

This adopted recommendation, which made history in Asheville (see: video of the vote: https://youtu.be/38xrxZ5haMU), referenced a preliminary list of the names of individuals and businesses in the Southside neighborhood impacted and eligible for cash payments. Students using innovative techniques in these classes including genAI content extraction, graph generation & querying, and spatial analysis, were able to identify some 19 additional businesses that had not been recorded before.

Sample Analytics:

Geolocating 19 new properties w. metadata [Credits: Chelsea Clarke]
Geolocating 19 properties with labels [Credits: Meg Fletcher]
GIS toolbox spatial intersection [Credits: Jasper Nash]
Computing of spatial overlap [Credits: Lauren White]
ChatGPT4.o image analysis for 477 1/2 South French Broad Ave. [Credits: Hope Lomvardias]
Google NotebookLM summary of which renter paid the highest rent [Credits: Matthew Turner]
ChatGPT analysis of the 1950 ED58 Census dataset [Credits: Sarah Engleman]
QGIS georeferencing of 1950 Census enumeration district 58 [Credits: Liang Zhou]
Google NotebookLM query: “provide a list of the people who lived at the property, including those who may have lived there. exclude other categories of people
besides residents” [Credits: Lee Sampson]
Graph database design based on use cases [Credits: Jason Benner]
1. Who appraised a certain Parcel?
2. Who were the tenants of a certain parcel
3. Who owned a certain parcel
4. When was a parcel appraised
5. When was a parcel sold
6. Who was the purchaser of a certain parcel
Graph database design based on use cases [Credits: Tiffany Porter]
1. Which parcels were purchased by HACA?
2. Who owned the parcels?
3. What were the businesses?
4. Who were the tenants?
5. Who appraised the parcels?
Graph modeling of the urban renewal data [Credits: Andrea Tavakol]
Graph instance model of the urban renewal data [Credits: Ady Weng]
Neo4j graph query for households on Choctaw St. in 1950 [Credits: Yasmin Bromir]
Neo4j knowledge graph for Southside owners harmed by urban renewal (showing Block-Parcel nodes in blue – Owner nodes in orange) [Credits: Nick de Raet]
Querying the knowledge graph for properties owned by Sallie Argintar [Credits: Tahura Turabi]
Advanced data analysis in ChatGPT4.o to identify key trends in the data [Credits: Imdad Baloch]
Advanced data analysis in ChatGPT4.o (ages, gender, count, occupation) [Credits: Etana Laing]

Feedback (based on hands-on exploration of innovative technologies using social justice reparations data):

Lee Sampson concludes that these single-credit experimental summer courses have uncovered new paths for future courses and elicited emerging themes, and makes the following observations:

  1. GenAI tools require that appropriate guardrails be put on their output.
  2. GenAI can be a great teacher, when explaining what code it is producing. Using GenAI to learn Python has been much more effective than trying to learn Python on my own.
  3. Everyone takes different approaches to prompting, and the variations in results has been extremely useful to learn from.

-Authored by Richard Marciano (with input from students)

P.S.: Special thanks to Lori Perine, Rajesh Kumar Gnanasekaran, and Mark Conrad for co-advising and supporting these courses.

Exit mobile version