{"id":1022124,"date":"2024-04-03T12:23:26","date_gmt":"2024-04-03T19:23:26","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-video&p=1022124"},"modified":"2024-04-03T12:23:43","modified_gmt":"2024-04-03T19:23:43","slug":"behind-the-label-glimpses-of-data-labelling-labours-for-ai","status":"publish","type":"msr-video","link":"https:\/\/www.microsoft.com\/en-us\/research\/video\/behind-the-label-glimpses-of-data-labelling-labours-for-ai\/","title":{"rendered":"Behind the label: Glimpses of data labelling labours for AI"},"content":{"rendered":"\n
ChatGPT is the latest of AI systems to make the headlines for its remarkable computational capabilities. Lesser known and rarely acknowledged is the human labours involved in training and supporting these celebrated AI systems. Thousands of workers, particularly in global south regions, create training datasets, validate model outcomes and mimic computational responses to sustain AI\u2019s research, development and use. Yet little is known about what their work entails. What do data labellers do when they label data for AI?<\/p>\n\n\n\n
Drawing on findings from an ethnographic study of data labelling in India, this talk offers insights into the everyday work practices of data labellers, organisational hierarchies, norms, and values that were caught in global flows of resources, rhetoric, and relations of power. We trace these practices, norms and frictions to better understand their influences on everyday annotation work as well as answer an important question, why should we, AI researchers and practitioners, concern ourselves with these seemingly distant realities?<\/p>\n\n\n\n