DeuParl v2

Plenary protocols from the German Reichstag (1867-1945) and Bundestag (1949-2022).

We obtain data from two sources: (i) Open Data, where the German parliament publishes all plenary protocols of the Bundestag (en.: federal diet); and (ii) Reichstagsprotokolle, which contains all Reichstag (en.: imperial diet) protocols and is distributed by the Bayerische Staatsbibliothek. For the Reichstag protocols, we use the OCR-scanned version from Walter et al. (2021).

For the Reichstag data, we apply preprocessing steps similar to those of Walter et al. (2021), but keep German umlauts. We then automatically split the data into individual sittings and collect metadata such as the date, period and session number of each sitting; we manually checked and corrected this metadata. The Reichstag data is subdivided into the seven time periods listed in the table below. The abbreviation in brackets is the `type` value used for that period in our dataset.

Period	Years
Konstituierender Reichstag [`k`]	1867
Reichstag (Norddeutscher Bund) [`ndb`]	1867-1870
Zollparlament [`zp`]	1868-1870
Reichstag (Deutsches Kaiserreich) [`dkr`]	1871-1918
Nationalversammlung [`nv`]	1919-1920
Reichstag (Weimarer Republik) [`wr`]	1919-1933
Reichstag (Nationalsozialismus) [`ns`]	1933-1945
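
As a minimal sketch, the period table can be written as a Python lookup. The names `REICHSTAG_PERIODS` and `types_for_year` are our own illustration and are not part of the dataset. Because some periods overlap in time, a year can match more than one type code.

```python
# Type codes, names and year ranges of the seven Reichstag periods.
# The dictionary and function names are illustrative, not part of the dataset.
REICHSTAG_PERIODS = {
    "k": ("Konstituierender Reichstag", 1867, 1867),
    "ndb": ("Reichstag (Norddeutscher Bund)", 1867, 1870),
    "zp": ("Zollparlament", 1868, 1870),
    "dkr": ("Reichstag (Deutsches Kaiserreich)", 1871, 1918),
    "nv": ("Nationalversammlung", 1919, 1920),
    "wr": ("Reichstag (Weimarer Republik)", 1919, 1933),
    "ns": ("Reichstag (Nationalsozialismus)", 1933, 1945),
}


def types_for_year(year):
    """Return every type code whose year range includes `year`."""
    return [
        code
        for code, (_name, start, end) in REICHSTAG_PERIODS.items()
        if start <= year <= end
    ]


print(types_for_year(1900))  # a year inside a single period
print(types_for_year(1919))  # a year shared by two periods
```

A year alone does not always identify the period, so the `type` value stored with each sitting remains the reliable source.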

Both data directories (`Bundestag Data` and `Reichstag Data`) contain a `details.json` file with the following information for each sitting:

Key	Value	Description
`era`	string	`rt` for Reichstag or `bt` for Bundestag
`type`	string	only for Reichstag data, see table above
`period`	number	election period
`no`	number	number of the sitting
`year`	number	year of the sitting
`month`	number	month of the sitting
`day`	number	day of the sitting
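
The following sketch shows one way to work with these fields in Python. It assumes that the metadata can be read as a list of per-sitting objects; the actual top-level layout of `details.json` is not described here and may differ. The sample records and the helper names `sitting_date` and `sitting_label` are made up for illustration.

```python
import json
from datetime import date

# Hypothetical sample records using the keys from the table above.
# The real details.json may be laid out differently (e.g. keyed by file name).
SAMPLE = """
[
  {"era": "rt", "type": "dkr", "period": 1, "no": 1,
   "year": 1871, "month": 3, "day": 21},
  {"era": "bt", "period": 1, "no": 1,
   "year": 1949, "month": 9, "day": 7}
]
"""


def sitting_date(record):
    """Combine the year, month and day fields into a datetime.date."""
    return date(record["year"], record["month"], record["day"])


def sitting_label(record):
    """Build a short label; `type` is only present for Reichstag data."""
    parts = [record["era"]]
    if "type" in record:
        parts.append(record["type"])
    parts.append(f"period{record['period']}")
    parts.append(f"no{record['no']}")
    return "-".join(parts)


records = json.loads(SAMPLE)
for record in records:
    print(sitting_label(record), sitting_date(record).isoformat())
```

To use the real metadata, replace `json.loads(SAMPLE)` with `json.load` on the opened `details.json` file and adapt the iteration to its actual structure.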

Citation

@inproceedings{kostikova-etal-2024-fine,
	title = {Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of {G}erman Parliamentary Debates},
	author = {Kostikova, Aida and Beese, Dominik and Paassen, Benjamin and P{\"u}tz, Ole and Wiedemann, Gregor and Eger, Steffen},
	year = 2024,
	month = 11,
	booktitle = {Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing},
	publisher = {Association for Computational Linguistics},
	address = {Miami, Florida, USA},
	pages = {5884--5907},
	doi = {10.18653/v1/2024.emnlp-main.337},
	url = {https://aclanthology.org/2024.emnlp-main.337/},
	editor = {Al-Onaizan, Yaser and Bansal, Mohit and Chen, Yun-Nung},
}

Abstract: Solidarity is a crucial concept to understand social relations in societies. In this study, we investigate the frequency of (anti-)solidarity towards women and migrants in German parliamentary debates between 1867 and 2022. Using 2,864 manually annotated text snippets, we evaluate large language models (LLMs) like Llama 3, GPT-3.5, and GPT-4. We find that GPT-4 outperforms other models, approaching human annotation accuracy. Using GPT-4, we automatically annotate 18,300 further instances and find that solidarity with migrants outweighs anti-solidarity but that frequencies and solidarity types shift over time. Most importantly, group-based notions of (anti-)solidarity fade in favor of compassionate solidarity, focusing on the vulnerability of migrant groups, and exchange-based anti-solidarity, focusing on the lack of (economic) contribution. This study highlights the interplay of historical events, socio-economic needs, and political ideologies in shaping migration discourse and social cohesion.
