Improving the usefulness of research data with better paradata

Author

Abstract

Considerable investments have been made in Europe and worldwide for developing research data infrastructures. Instead of a general lack of data about data, it has become apparent that a pivotal factor that drastically constrains data use is the absence of contextual knowledge about how data was created and how it has been curated and used. This applies especially to many branches of social science and humanities research, where data is highly heterogeneous, both by its kind (e.g. being qualitative, quantitative, naturalistic, purposefully created) and origins (e.g. being historical/contemporary, from different contexts and geographical places). The problem is that there may be enough metadata (data about data) but there is too little paradata (data on the processes of its creation, curation and use). The aim of this position paper is to draw attention 1) to the need for a better and more systematic understanding and documentation of the contexts of creation, curation and use of research data to make it useful and usable for researchers and other potential users in the future, and 2) to specific obstacles that make the capturing of this particular type of metadata, known as paradata, especially difficult. Failing to understand what information about the creation, curation and use of research data is needed and how to capture enough of that information risks that the currently collected vast amounts of research data become useless in the future.

Year of Publication

2022

Journal

Open Information Science

Volume

Start Page

Issue

ISSN Number

2451-1781

DOI

10.1515/opis-2022-0129

Taxonomy terms

paradata

CAPTURE

research data

File attachment

Huvila2022b.pdf (789.16 KB)

Latest Publications

What a Standard Makes out of a Process? Data-documentation Standards and Their Consequences to Process Documentation

Huvila, I., Sköld, O., Zengenene, D., & Andersson, L. (2026). What a Standard Makes out of a Process? Data-documentation Standards and Their Consequences to Process Documentation. Journal of Documentation, 82, 289–314. http://doi.org/10.1108/JD-10-2025-0324 (Original work published 2026)

Habitats of Archaeological Knowledge: From Information Ecologies to Information-in-Ecologies

Huvila, I. (2026). Habitats of Archaeological Knowledge: From Information Ecologies to Information-in-Ecologies. In N. Solhjoo (Ed.), Multispecies Information Science (pp. 201–220). London: Routledge. http://doi.org/10.4324/9781003583424-15

Documenting AI Use in Humanities Research

Huvila, I. (2025). Documenting AI Use in Humanities Research. In H. Verhagen, S. Tienken, A. Widholm, M. Fridlund, M. Nermo, & A. Blåder (Eds.), Huminfra 2025 (pp. 57–62). Stockholm: Stockholm University.

Letting AI Loose in an Archive: Technology to Manage or to Manage With

Huvila, I. (2025). Letting AI Loose in an Archive: Technology to Manage or to Manage With. Archiv, Theorie & Praxis, 75, 12–15.

Researchers Data Processing Descriptions–Understanding Paradata Creation Practices and Their Underpinning Instrumentalities

Huvila, I., Andersson, L., & Sköld, O. (2025). Researchers Data Processing Descriptions–Understanding Paradata Creation Practices and Their Underpinning Instrumentalities. Journal of the Association for Information Science and Technology, 76(11), 1570–1590. http://doi.org/10.1002/asi.70003 (Original work published 2026)

Improving the usefulness of research data with better paradata

Forthcoming presentations

Latest Publications

Latest toots

Isto Huvila