Tutorials

Read all latest tutorial posts

WebSweep: Collecting Website Text for Research

March 24, 2026

WebSweep helps researchers capture what was publicly visible on a given date, preserve the raw HTML as a reproducible archive, and turn those pages into analysis-ready text. Use WebSweep when you have a list of public websites or domains, want a repeatable workflow for many domains, and mainly need HTML text and metadata from public pages. In this tutorial, we use the example of FIRMBACKBONE, the Dutch research infrastructure that provides secure, FAIR access to comprehensive data on all registered organizations in the Netherlands, including web-based data.

Collecting online platforms data for science: an example using WhatsApp

September 8, 2023

In this tutorial, we use data donation and the Port software to access WhatsApp group-chat data in a way that completely preserves the privacy of research participants.

ArtScraper: A Python library to scrape online artworks

October 4, 2022

ArtScraper is a Python library to download images and metadata for artworks available on WikiArt and Google Arts & Culture.

How to share your research code

September 5, 2022

What are the best ways to create an understandable, openly accessible, findable, citable, and stable archive of your code?
