Not only in the journalistic field itself, but also in research, data journalism (or: “data-driven journalism”) has become a booming topic. However, so far, a long-term and international study was missing that analyzes how the new data-driven form of reporting develops. To shed some light on this issue, from 2013 to 2016 the project “When Data Become Journalism” analyzed what may be considered the “gold-standard” in data-driven reporting: the works nominated annually for the Data Journalism Awards (DJA), a prize issued by the Global Editors Network. We investigated, amongst other aspects, which kinds of media organizations produce these pieces (newspapers, TV channels etc.), the topics they address (politics, business, sports etc.), the kinds of data they use (geo-data, financial data, survey results etc.), the sources they turn to for these data (official institutions, NGOs, companies etc.), how they visualize these data (graphs, maps, tables etc.), and which opportunities they offer users to explore the data interactively (zooming into maps, filtering data by place of residence etc.). By comparing the projects from the different years, trends in the development of the new reporting style as well as its untapped potentials become apparent.
The phenomena of “big data” and an increasingly data-driven society are doubly relevant for journalism. Firstly, it is a topic that needs to be covered so that the related developments and their consequences are made understandable for the public and publicly discussed. Secondly, the datafication has already begun to affect news production and is giving rise to novel ways of identifying stories of public interest and new forms of telling them. What we witness is the emergence of a new journalistic sub-field most often called “computational journalism”, “data(-driven) journalism”, or “DDJ” for short.
The project addresses a blind spot with the research of data(-driven) journalism. Existing studies can be divided into three categories:
1. interviews with DDJ’s actors in different countries about their self-image (as journalists) and their integration in the production processes of established newsrooms;
2. various attempts to define DDJ as a distinguishable reporting style that revolves around some apparent core characteristics, which often contradict each other in their different restrictions and prioritisations;
3. case studies that empirically analyze the DDJ’s output – data-driven articles and larger publications –, but are restricted in their scope regarding time or geography.
What research lacks is a longitudinal analysis that helps us better understand how DDJ as a new form of reporting is evolving over time, that is, whether the list of key characteristics is changing or these characteristics are simply appearing in different forms. Journalists and researchers agree that data(-driven) journalism is still in its infancy and in continuous transition. Moreover, it is repeatedly seen as the future of journalism. However, we don’t even know much about how it is currently developing. Against this backdrop, our content analysis seeks to give answers to the following research questions:
RQ 1: What structural elements and forms of presentation are data-driven pieces composed of and how is this composition evolving?
RQ 2: How are the topics covered in data-driven projects changing over time?
RQ 3: How is the field of actors who produce data journalism (media organisations, external partners etc.) developing over time?
The selection of data-driven pieces for the analysis follows an inductive and pragmatic approach: our sample consists of projects nominated for the Data Journalism Awards (DJA), a prize issued annually by the Global Editors Network. So, instead of starting with a predetermined definition of what a “data-journalistic article” is, and thus creating a definition that could be too narrow or too broad, we use what is already identified as data journalism within the field itself. However, this advantage leads to a doubly biased sample. First, the analysed pieces are based on self-selection; as any data journalist must submit her/his work herself/himself to be considered for nomination by the organising committee. Second, nominees for a data journalism award do not represent ‘everyday’ data journalism. The field has already diversified and our sample very likely consists of what Borges-Rey (2016: 9) calls ‘an extensive, thoroughly researched, investigative form of data journalism’ rather than of the ‘daily, quick turnaround, generally visualised, brief form of data journalism’. At the same time, the sample has two particular advantages. First, systematic differences between “only nominated” and awarded projects can be identified. Second, we can assume that the analysed cases, being nominees for a DJA, fulfil a certain quality threshold and are considered best practice examples in the field itself.
Our codebook for the content analysis comprises, amongst others, the (presumed) key characteristics of DDJ that have been identified by scholars. These include a (usually larger) data set as a basis of the project, the visualisation of the data in diagrams, charts etc., and interactive elements. Most variables and their values, that is the different forms a variable can take, were developed inductively based on an explorative analysis of a subsample from 2013 (see Table 1).
|authorship||medium; type of medium; external partners; number of people involved mentioned by name|
|story properties||headline; topic; reference to a specific eventI; question(s) posed to data; number of related articlesII; length of article; language; winner of the DJA (or not)|
|data||data source(s); type(s) of data source(s); access to data; kind of data; additional information on dataI; geographical reference; changeability of datasetIII; time period covered; unit of analysis|
|analysis and journalistic editing of content||personalised case exampleIV; call for public intervention or criticismII; purpose of data analysisV; visualisation|
|interactive features||interactive functions; online access to the databaseII; opportunities of communication (for users)|
I Suggested by data journalist Lorenz Matzat.
II Adopted from Parasie and Dagiral (2013: 5–14).
III Suggested by (data) journalism researcher Julian Ausserhofer.
IV Inspired by Holtermann (2011)
V Inspired by Gray et al. (2012: n.p.)
Borges-Rey, Eddy (2016): Unravelling data journalism. A study of data journalism practice in British newsrooms. In: Journalism Practice, 10(7), 833–843. Available online at: http://dx.doi.org/10.1080/17512786.2016.1159921.
Gray, Jonathan; Bounegru, Liliana; Chambers, Lucy (Hg.) (2012): The data journalism handbook. How journalists can use data to improve the news. (Early release). Sebastopol: O’Reilly. Available online at: http://datajournalismhandbook.org/.
Holtermann, Hannes (2011): Datenjournalismus: eine neue Form der journalistischen Wertschöpfung aus Daten. Unpublished: Master’s Thesis. Hamburg.
Lombard, Matthew; Snyder-Duch, Jennifer; Bracken, Cheryl Campanella (2002): Content analysis in mass communication. Assessment and reporting of intercoder reliability. In: Human Communication Research, 28(4), 587–604. Available online at: http://dx.doi.org/10.1111/j.1468-2958.2002.tb00826.x.
Parasie, Sylvain; Dagiral, Eric (2013): Data-driven journalism and the public good. “Computer-assisted-reporters” and “programmer-journalists” in Chicago. In: New Media & Society, 15(6), 853–871. Available online at: http://dx.doi.org/ 10.1177/1461444812463345.
A more detailed description of the method, a short overview of research on DDJ in general and an analysis of the data from 2013 to 2015 has been published as a working paper of the Hans Bredow Institute (Arbeitspapiere des HBI) and made available as a download.
The project was funded by Seed-Money of the HIIG and the HBI in the years 2013/2014.
The project was supported externally by Fenja de Silva-Schmidt, a former student worker at the HBI and now research associate at the Institute of Journalism and Communication Studies (IJK) at the University of Hamburg.
RP1 - Transformation of Public Communication
2013/2014: Seed-Money HIIG / Hans-Bredow-Institut
Das Projekt wird unterstützt von Fenja de Silva-Schmidt, ehemals studentische Mitarbeiterin am Institut und nun Junior Researcher am Institut für Journalistik und Kommunikationswissenschaft der Universität Hamburg.
The relationship between journalism and its audience is changing, yielding consequences for what journalists cover and how. The project, which is funded by the German Research Foundation (DFG), examines the breadth, depth, and diversity of this re-figuration and its consequences.
The project, funded by the German Research Foundation (DFG), investigates how journalistic pioneers and pioneer communities envision an ideal future for journalism and, by that, actually shape the future of the field.
For the first time in history, Facebook grants science access to its data records. As part of the SHARENEWS project, Dr. Cornelius Puschmann is investigating the criteria according to which Facebook users share messages on the platform.
What role do health-related offers play in the app repertoire of young people? And how are the offers with regard to their health-promoting potential, their possible risks and their quality to be evaluated?
To what extent can algorithmic recommendation systems be part of the own journalistic activity of public service media providers and take the side of (or take the place of) journalistic selection and compilation of information? In a White Paper for the MDR (Central German Broadcasting), the main cha...
Kettemann, M. (2019): Rezension zu Caldarola/Schrey (2019), Big Data und Recht. Einführung für die Praxis [Review of Caldarola/Schrey (2019), Big Data and Law. Introduction to the Practice], 1st edition, C.H. Beck. On: "dierezensenten": http://dierezensenten.blogspot.com/2019/02/...
Loosen, W. (2018): Data-Driven Gold-Standards: What the Field Values as Award-Worthy Data Journalism and How Journalism Co-Evolves with the Datafication of Society. In: J. Gray, L. Bounegru (eds.), The Data Journalism Handbook 2. Towards a Critical Data Practice. European Journalism Centre and Googl...
In his current contribution to the journal 'Medien & Kommunikationswissenschaft' 2/2018, Dr. Sascha Hölig writes about the personal characteristics of active Twitter users. The article is available at the eLibrary of Nomos. S. Hölig (2018): Eine meinungsstarke Minderheit als S...
Hoffmann-Riem, W. (ed.) (2018): Big Data – Regulative Herausforderungen [Regulative Challenges]. Baden-Baden: Nomos Verlagsgesellschafft. Edited by Prof. Dr. Wolfgang Hoffmann-Riem, LL.M. With contributions by Wolfgang Hoffmann-Riem, Gerrit Hornung, Yoan Hermstrüwer, An...
Dreyer, S. (2018): Predictive Analytics aus der Perspektive von Menschenwürde und Autonomie [Predicitve Analytics from the Perspective of Human Dignity and Autonomy]. In: W. Hoffmann-Riem (ed.), Big Data - Regulative Herausforderungen [Big Data - Regulatory Challenges]. Baden-Baden: Nomos, pp. 135-...
On 17 November, 600 colleagues will discuss the trends in the industry at the Journalists’ Day of the DJV under the title “Kollege Algorithmus, Claudia Roth und die Zukunft des Radiomarktes in NRW [Colleague Algorithm, Claudia Roth and the Future of the Radio Market in North Rhine-Westphalia].”...
"Datafizierter Journalismus – die Antwort des Journalismus auf die datafizierte Gesellschaft [Datafied Journalism - Journalism's Response to the Datafication of Society]", WZB Big Data Brown Bag seminar with W. Loosen at the Berlin Social Science Center (WZB) on 23 November 2017.
"Wenn aus Daten Journalismus wird – eine wissenschaftliche Auswertung nominierter Datenjournalismus-Projekte" [When Data become Journalism – A Scientific Evaluation of Nominated Data Journalism Projects], speech by J. Reimer at the event WHERE SCIENCE MEETS COMPUTER...