Invited Talk in Workshop: 1st Workshop on Multimodal Content Moderation
Data Collection for Content Moderation
Abstract:
Data collection and curation are integral, yet often overlooked, components of building content moderation systems. In this presentation we'll discuss optimizing data annotation, the effects of data quality and quantity on overall model performance, techniques for identifying and alleviating biases in models, and appropriate applications of synthetic data.