From Text to Events: Turning Freedom House Reports into Evidence of Democratization and Democratic Backsliding

Matthew Wilson; Kelsey Martin-Morales; Gregory Nelson

doi:10.33774/apsa-2026-hgrxk

Methodology

Search within Methodology

From Text to Events: Turning Freedom House Reports into Evidence of Democratization and Democratic Backsliding

31 March 2026, Version 1

Working Paper

Show author details

This content is an early or alternative research output and has not been peer-reviewed at the time of posting.

Abstract

A key limitation in debates about democracy measurement and whether democratic backsliding is occurring is the mismatch between theoretical claims — about specific institutional actions and processes — and available data. Political events are the appropriate unit of analysis: they denote discrete actions by identifiable actors with consequences for democratic institutions, but are difficult to produce at scale. This paper presents an LLM-based pipeline that transforms Freedom House's annual country reports (1990–2024) into structured event data. Our four-stage process produces nearly 200,000 annotated events across 228 countries, each tied to textual evidence. Validation against human coders and existing datasets demonstrates high construct validity. We apply the data to compare democratic backsliding in Hungary and Poland. The primary contribution is auditable interpretation as a standard for LLM-assisted measurement — producing outputs traceable to verbatim source material. Such event-level data enables analyses of institutional sequencing where indices and case studies fall short.

Keywords

Democracy

Democratic backsliding

Qualitative data

Freedom House

Large Language Models

Supplementary materials

Title

Description

Actions

Title

Supplementary Materials for "From Text to Events"

Description

This supplementary document accompanies "From Text to Events: Turning Freedom House Reports into Evidence of Democratization and Democratic Backsliding" (Wilson, Martin-Morales, and Nelson, 2026). It provides methodological detail supporting the paper's core pipeline, which uses large language models to extract structured political events from Freedom House country reports spanning 1990–2024. The appendices include diagnostic figures on sentence and highlight extraction rates, inter-coder reliability comparisons between human coders and GPT, stability tests across repeated model runs, and validation results for eight case-study countries. Additional appendices document the complete prompt templates used for highlight selection, event extraction, thematic tagging, and democratic sentiment classification, along with configuration parameters for each pipeline stage. A detailed de-duplication procedure is described, and country-year coverage and formatting changes in Freedom House reports over time are documented. Replication files and the final annotated event dataset will be made available for online access.

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Mar 31, 2026 Version 1

Metrics

336

172

Views

Downloads

Citations

License

DOI

10.33774/apsa-2026-hgrxk

Funding

Institute for Humane Studies, George Mason University

IHS018727

University of South Carolina

25R-4001

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Conference

2026 APSA Virtual Research Meeting

From Text to Events: Turning Freedom House Reports into Evidence of Democratization and Democratic Backsliding

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Conference

Share