Recent public controversies regarding the collection, analysis, and publication of data sets about sensitive topics—from identity and sexuality to suicide and emotion—have helped push conversations around data ethics to the fore. In popular articles and emerging scholarly work (some of it supported by our backers at CTSP), scholars, practitioners and policymakers have begun to flesh out the longstanding conceptual and practical tensions expressed not only in the notion of “data ethics,” but in related categories such as “data science,” “big data,” and even plain old “data” itself.
Against this uncertain and controversial backdrop, what kind of ethical commitments might bind those who work with data—for example, researchers, analysts, and (of course) data scientists? One impulse might be to claim that the unprecedented size, scope, and attendant possibilities of so-called “big data” sets require a wholly new kind of ethics, one built with digital data’s particular affordances in mind from the start. Another impulse might be to suggest that even though “Big Data” seems new or even revolutionary, its ethical problems are not—after all, we’ve been dealing with issues like digital privacy for quite some time.