What happens when public data infrastructures aren’t save? A look at past and current developments in the US

Lab Life
Research
Author
Affiliation

Laura Rothfritz

Published

February 5, 2025

In my PhD research, I look at DataRescue or Data Refuge movements, for example through the work of the Environmental Data and Governance Initiative (EDGI) (“Environmental Data & Governance Initiative n.d.) to understand how people respond when they fear data is at risk. I’m particularly interested in what these efforts reveal about how public data infrastructures actually work—not just as systems of information storage, but as objects of political struggle.

This research started quite some time ago with an overarching topic of trust and distrust in public data infrastructures (Rothfritz 2024a). I planned for the first iteration of the DataRescue movement to serve as a case study for this research (Rothfritz 2024b). Of course, I did not anticipate that there would be another series of events which would lead to the loss of access to public information and data in the U.S. so soon.

Rescuing Data

In 2016, a group of librarians, archivists, and environmental activists saw the writing on the wall: the incoming U.S. administration might threaten access to crucial federal climate data. What followed was a series of DataRescue events: community-led efforts to archive and preserve environmental datasets that many feared would be deleted, altered, or buried in bureaucratic obscurity (Janz 2017; Currie and Paris 2018)

Fast forward to 2025. Another major political shift is unfolding in the U.S., and with it, new concerns about the integrity and accessibility of public data. This time around, large amounts of public federal information such as websites (Singer 2025) and data (Koebler 2025) are disappearing from public access again. The scale and breadth of which kind of information is disappearing is arguably larger though - it concerns not just environmental data and climate change mentions on websites, but also for example health data and information from the Centers for Disease and Control Prevention (CDC) (Sun, Keating, and Nirappil 2025; Cox et al. 2025). Efforts seem to be more strategic and orchestrated.

The good news is that at least to a pretty far extent preparations were taking place before the Trump administration took office in January. For example, all federal websites were archived through the End-Of-Term Web Archive, a project which routinely has been doing this for administration changes in the U.S. since 2008 (“End of Term Web Archive n.d.). A dataset of CDC data uploaded before January 28th, 2025 is available on the Internet Archive. EDGI and the Public Environmental Data Project have worked ahead to archive crucial environmental data (EDGI 2025).

There are quite a few other initiatives taking place at the moment. A good overview resource is collected through IASSIST on this Google Doc, which is constantly being updated.

Why This Matters

Public data infrastructures—the systems that store and provide access to government data—are often taken for granted. We assume that scientific reports, climate models, and environmental monitoring data will always be there when we need them. But these infrastructures are not just technical systems; they are deeply political. Decisions about what gets collected, what remains accessible, and what disappears are shaped by power, policy, and priorities.

The Data Rescue events of 2016–2017 brought together scientists, librarians, and civic tech communities to back up environmental datasets they feared could vanish under the Trump administration. At first glance, the movement looked like a technical project—downloading datasets, copying them to safer locations. But it was just as much an act of political resistance (Paris 2017).

Looking back, one of the most fascinating aspects of Data Rescue is that very little data actually disappeared. Yet the movement wasn’t a failure per se, even though most of the rescued data was not usable and the collection was far from complete (Lamdan 2017). It revealed something crucial: it wasn’t just about saving data; it was about making government accountability visible (Dillon et al. 2017, 2019). One theory might be that the fact that people felt the need to take matters into their own hands showed a loss of trust in official assurances represented by federal infrastructures.

Now, nearly a decade later, similar concerns are emerging. As the political landscape in the U.S. shifts once again. This time around,

I am currently speaking with people who were in one way or another involved in the first DataRescue events. Some of them have continued to monitor federal data policies and website changes over the years. Their work is a reminder that data loss isn’t always dramatic—sometimes, it’s a slow erosion of accessibility, funding cuts, or subtle changes to the language used in official reports.

If there’s one thing the past decade has taught us, it’s that data isn’t neutral and neither are the infrastructures they are connected to. Public data infrastructures are shaped by politics, and in turn, they shape what we know about the world. Understanding how these systems work—and how they can be protected—will be critical in the years ahead.

Further information about the research group can be found on our official website.

This text – excluding quotes and otherwise labelled parts – is licensed under the CC BY 4.0 DEED.

References

CDC Datasets Uploaded Before January 28th, 2025 : Centers for Disease Control and Prevention : Free Download, Borrow, and Streaming.” n.d. Internet Archive. Accessed February 5, 2025. https://archive.org/details/20250128-cdc-datasets.
Cox, Cynthia, Matthew Rae, Jennifer Kates, Emma Wager, Jared Ortaliza, and Lindsey Dawson Published. 2025. “A Look at Federal Health Data Taken Offline.” KFF.
Currie, Morgan E., and Britt S. Paris. 2018. “Back-Ups for the Future: Archival Practices for Data Activism.” Archives and Manuscripts 46 (2): 124–42. https://doi.org/10.1080/01576895.2018.1468273.
“Data + Screening Tools.” n.d. Data + Screening Tools. Accessed February 5, 2025. https://screening-tools.com.
Dillon, Lindsey, Rebecca Lave, Becky Mansfield, Sara Wylie, Nicholas Shapiro, Anita Say Chan, and Michelle Murphy. 2019. “Situating Data in a Trumpian Era: The Environmental Data and Governance Initiative.” Annals of the American Association of Geographers 109 (2): 545–55. https://doi.org/10.1080/24694452.2018.1511410.
Dillon, Lindsey, Dawn Walker, Nicholas Shapiro, Vivian Underhill, Megan Martenyi, Sara Wylie, Rebecca Lave, Michelle Murphy, and Phil Brown. 2017. “Environmental Data Justice and the Trump Administration: Reflections from the Environmental Data and Governance Initiative.” Environmental Justice 10 (6): 186–92. https://doi.org/10.1089/env.2017.0020.
EDGI. 2025. “Why EDGI Is Archiving Public Environmental Data.” Environmental Data and Governance Initiative. https://envirodatagov.org/why-edgi-is-archiving-public-environmental-data/.
“End of Term Web Archive.” n.d. End of Term Web Archive. Accessed February 5, 2025. https://eotarchive.org/.
“Environmental Data & Governance Initiative.” n.d. Environmental Data and Governance Initiative. Accessed February 5, 2025. https://envirodatagov.org/.
Janz, Margaret Miller. 2017. “Maintaining Access to Public Data: Lessons from Data Refuge.” Against the Grain 29 (6). https://doi.org/10.7771/2380-176X.7875.
Koebler, Jason. 2025. “Archivists Work to Identify and Save the Thousands of Datasets Disappearing From Data.gov.” 404 Media. https://www.404media.co/archivists-work-to-identify-and-save-the-thousands-of-datasets-disappearing-from-data-gov/.
Lamdan, Sarah. 2017. “Lessons from Datarescue: The Limitations of Grassroots Climate Change Data Preservation and the Need for Federal Records Law Reform Essay.” University of Pennsylvania Law Review Online 166: 231–48.
Paris, Britt. 2017. “Data-Mirroring as Archival Activism.” Datapower Conference Proceedings, 1.2.4–4. https://doi.org/10.22215/datapower.v2017i0.22.
Rothfritz, Laura. 2024a. “Understanding Mistrust and Distrust in Public Data Infrastructures.” https://doi.org/10.5281/zenodo.10663566.
———. 2024b. “Understanding Trust and Distrust in Public Data Infrastructures as Socio-Technical Systems.” https://doi.org/10.5281/zenodo.11142410.
Singer, Ethan. 2025. “Thousands of U.S. Government Web Pages Have Been Taken Down Since Friday - The New York Times.” https://www.nytimes.com/2025/02/02/upshot/trump-government-websites-missing-pages.html.
Sun, Lena H., Dan Keating, and Fenit Nirappil. 2025. CDC Removes Gender, Equity References in Public Health Material.” Washington Post, February.

Citation

BibTeX citation:
@online{rothfritz2025,
  author = {Rothfritz, Laura},
  title = {What Happens When Public Data Infrastructures Aren’t Save?
    {A} Look at Past and Current Developments in the {US}},
  date = {2025-02-05},
  url = {https://doi.org/10.59350/6g3km-9w70},
  langid = {en}
}
For attribution, please cite this work as:
Rothfritz, Laura. 2025. “What Happens When Public Data Infrastructures Aren’t Save? A Look at Past and Current Developments in the US.” February 5, 2025. https://doi.org/10.59350/6g3km-9w70.