Tutorials

Read all latest tutorial posts

WebSweep: Collecting Website Text for Research
March 24, 2026

WebSweep: Collecting Website Text for Research

WebSweep helps researchers capture what was publicly visible on a given date, preserve the raw HTML as a reproducible archive, and turn those pages into analysis-ready text. Use WebSweep when you: have a list of public websites or domains want a repeatable workflow for many domains mainly need HTML text and metadata from public pages In this tutorial, we use the example of FIRMBACKBONE. It is the Dutch research infrastructure to provides secure, FAIR access to comprehensive data on all registered organizations in the Netherlands, including web-based data.

read more
Map explorer
March 3, 2025

Map explorer

Ever wanted to create stunning, interactive maps without dealing with servers, databases, or complicated backends?

read more
Streamlining Data Anonymisation for Research
March 3, 2025

Streamlining Data Anonymisation for Research

Whether you’re dealing with interview transcripts, survey responses, or scraped online content, failing to properly anonymise your data can have serious ethical consequences for researchers.

read more
The Best of Both Worlds: Saving Costs and Time When Using OpenAI's API
March 3, 2025

The Best of Both Worlds: Saving Costs and Time When Using OpenAI's API

When working with OpenAI’s API for large language models (LLMs) in research projects, there are two techniques that can help save both time and costs.

read more
NetCBS: creating network measures using CBS networks (POPNET) in the RA
October 28, 2024

NetCBS: creating network measures using CBS networks (POPNET) in the RA

The netcbs library provides an easy-to-use query language for networks at Statistics Netherlands.

read more
Detecting communities in signed networks with Python
May 15, 2024

Detecting communities in signed networks with Python

In this tutorial, we will guide you through applying two popular community detection algorithms to signed networks, using Python.

read more
Generating synthetic data in a safe way with metasyn
April 26, 2024

Generating synthetic data in a safe way with metasyn

In this post, we show how to create synthetic data in an auditable, transparent way with the software package metasyn.

read more
How to manage your IP address in python
February 27, 2024

How to manage your IP address in python

If you have ever tried web scraping, you may have run into your IP address being blocked by the website you scrape.

read more
Training a fastText model from scratch using Python
January 22, 2024

Training a fastText model from scratch using Python

In this tutorial, we explain how to train a natural language processing model using fastText

read more
Visualizing international flows with Geoflow visualizer
December 11, 2023

Visualizing international flows with Geoflow visualizer

A new free and open-source tool designed to visualize international flows in an interactive way.

read more
Wrangling interval data using lubridate
September 29, 2023

Wrangling interval data using lubridate

Using time interval objects to robustly extract data from transactions.

read more