Tutorials
Read all latest tutorial posts
WebSweep: Collecting Website Text for Research
WebSweep helps researchers capture what was publicly visible on a given date, preserve the raw HTML as a reproducible archive, and turn those pages into analysis-ready text. Use WebSweep when you have a list of public websites or domains, want a repeatable workflow for many domains, and mainly need HTML text and metadata from public pages. In this tutorial, we use the example of FIRMBACKBONE, the Dutch research infrastructure that provides secure, FAIR access to comprehensive data on all registered organizations in the Netherlands, including web-based data.
Collecting online platforms data for science: an example using WhatsApp
In this tutorial, we use data donation and the Port software to access WhatsApp group-chat data in a way that completely preserves the privacy of research participants.
ArtScraper: A Python library to scrape online artworks
ArtScraper is a Python library to download images and metadata for artworks available on WikiArt and Google Arts & Culture.
How to share your research code
What are the best ways to create an understandable, openly accessible, findable, citable, and stable archive of your code?