Tutorials

Read all latest tutorial posts

WebSweep: Collecting Website Text for Research
March 24, 2026

WebSweep: Collecting Website Text for Research

WebSweep helps researchers capture what was publicly visible on a given date, preserve the raw HTML as a reproducible archive, and turn those pages into analysis-ready text. Use WebSweep when you: have a list of public websites or domains want a repeatable workflow for many domains mainly need HTML text and metadata from public pages In this tutorial, we use the example of FIRMBACKBONE. It is the Dutch research infrastructure to provides secure, FAIR access to comprehensive data on all registered organizations in the Netherlands, including web-based data.

read more
Collecting online platforms data for science: an example using WhatsApp
September 8, 2023

Collecting online platforms data for science: an example using WhatsApp

In this tutorial, we use data donation and the Port software to get access to WhatsApp group-chat data in a way that completely preserves privacy of research participants.

read more
ArtScraper: A Python library to scrape online artworks
October 4, 2022

ArtScraper: A Python library to scrape online artworks

ArtScraper is a Python library to download images and metadata for artworks available on WikiArt and Google Arts & Culture.

read more
How to share your research code
September 5, 2022

How to share your research code

What are the best ways to create an understandable, openly accessible, findable, citable, and stable archive of your code?

read more