Report: GroupSight Workshop at HCOMP 2016 – Human Computation for Image and Video Analysis

The GroupSight workshop hit a surprisingly resonant chord with researchers at the intersection of human computation and computer vision in its first year at HCOMP 2016. My co-organizers, Danna Gurari (UT Austin) and Steve Branson (Caltech), and I sought to bring together people from widely different areas of computer vision and computational photography to explore how CV researchers are using the crowd. Attendees included researchers and students curious about this important, emerging field at the intersection of crowdsourced human computation and image/video analysis. About 30 attendees were treated to an unexpected diversity of approaches to crowd computation from some of the most exciting researchers in CV, including talks from:

– Kristen Grumman (UT Austin) : Active and Interactive Image and Video Segmentation

– Kavita Bala (Cornell/GrokStyle) : Crowdsourcing for Material Recognition in the Wild

– Ariel Shamir (Interdisciplinary Center Israel) : Passive Human Computation

– Kotaro Hara (U Maryland, College Park) : Using Crowdsourcing, Computer Vision, and Google Street View to Collect Sidewalk Accessibility Data

– Brendan McCord (Evolve Tech) : AI + IQ: Building Best of Breed Security Systems

We had short talks from 6 students on topics ranging from geolocation to medical imaging to clustering and summarization, as well as encore-track poster presentations from HCOMP and ECCV. (Their papers can be found here.)

Best paper winnerShay Sheinfeld Best paper runner-upMehrnoosh Sameki

Best paper winner Shay Sheinfeld presented work with Yotam Gingold and Ariel Shamir demonstrating a truly inventive use of the crowd for Video Summarization using Crowdsourced Causality Graphs. Attendees marveled at state-of-the art video summarization that made it seem like our future AI video editors have finally arrived. Best paper runner-up Mehrnoosh Sameki delighted us with surreal medical videos of cells splitting and joining all the while maintaining a near perfect segmentation contour achieved by an interactive pipeline.

Our industrial sponsor Evolv Technology hosted a cozy lunch where students and and senior researchers were able to discuss how to advance novel research in this nascent area of scientific exploration.

The aim of this workshop was to promote greater interaction between the diversity of researchers and practitioners who examine how to mix human and computer efforts to convert visual data into discoveries and innovations that benefit society at large. We succeeded in fostering an in-depth discussion of technical and application issues for how to engage humans with computers to optimize cost/quality trade-offs. My big take-away was that if you are doing CV or Graphics research, there is undoubtedly a cool way to exploit human intelligence in your pipeline for unexpected and remarkable outcomes. We look forward to future iterations of GroupSight at HCOMP and possibly ICCV or a future CVPR. If you are interested in participating in a future GroupSight, please don’t hesitate to contact one of us!