Observing the “Sidewalk Ballet”: Leveraging LMMs to Detect Social Interactions in Large-scale Urban Imagery datasets

Liu Liu, Massachusetts Institute of Technology

0000-0003-0100-9763

ACCESS Allocation Request CIS250088

CoPI: Andres Sevtsuk Massachusetts Institute of Technology
Abstract: My project aims to revolutionize our understanding of social dynamics in urban environments. Inspired by Jane Jacobs’ concept of the “Sidewalk Ballet,” we seek to capture and analyze the spontaneous interactions that make city streets vibrant and cohesive. By utilizing large multimodal models (LMMs), we will develop a new framework to detect and categorize social interactions from vast collections of geo-tagged images, including street-view imagery. To achieve this, we will leverage ACCESS computational resources, specifically allocating 120,000 CPU hours and 35,000 GPU hours. These resources will support extensive data processing, model training, and validation tasks essential for creating a benchmark dataset of over 100,000 annotated urban images. We will employ and fine-tune advanced models such as Qwen2.5-vl, LLaVA 1.6, and Janus-Pro. Additionally, we will utilize software packages tailored for large-scale image analysis and geospatial data integration. Our approach will enable large-scale mapping and analysis of urban social life, providing valuable insights for urban planners and designers to create more inclusive and vibrant public spaces. By automating the detection of social interactions, this project promises to offer scalable and actionable knowledge, paving the way for future research and practical applications in enhancing urban health and vitality.

Allocations:

2025 Indiana Jetstream2 GPU 285,549.0 SUs
2025 NCSA Delta GPU 1,000.0 GPU Hours
The estimated value of these awarded resources is $61,634.68. The allocation of these resources represents a considerable investment by the NSF in advanced computing infrastructure for the U.S. The dollar value of the allocation is estimated from the NSF awards supporting the allocated resources.
There are no other allocations for this project.

Other Titles:

There are no prior titles for this project.