Johann Rolschewski has been working at the Berlin State Library since 2006. He is in charge of the system support of the Zeitschriftendatenbank (ZDB) with a main focus on APIs and data formats. Johann has studied Biology at the FU Berlin and has a Master degree in Computer Science from TFH Berlin. He is an active contributor to the Catmandu data toolkit.
Participants should be familiar with command line interfaces (CLI) and have a basic knowledge of library metadata.
Systems librarians, metadata librarians and data managers.
No Programming experience expected
In 2002 Roy Tennant declared “”MARC Must Die”” . Today the MARC 21 format  is still the workhorse of library metadata. Even our “”Next Generation Library Systems”” heavily rely on this standard from the ‘60s. Since we will continue to work with MARC 21 in the coming years, this bootcamp will give an introduction with the following topics:
* Structure of MARC 21 records and their different serializations (MARCXML, MARCMaker, MARC-in-JSON, ALEPHSEQ)
* Validation of MARC 21 records and common errors
* Statistical analysis of MARC 21 data sets
* Conversion of MARC 21 records
* Metadata extraction from MARC 21 records
This bootcamp is aimed at systems librarians, metadata librarians and data managers. For most of the tasks we will use command line tools like `yaz-marcdump`, `marcstats`, `marcvalidate` and `catmandu`, so participants should be familiar with command line interfaces (CLI) and have a basic knowledge of library metadata. For exercises a notebook is required. Please install VirtualBox  and the VirtualBox image  or all required tools  beforehand. Participants could bring their own MARC 21 datasets to work with.
Beginner to Intermediate knowledge of software development
Installation requirements to be sent out
Programming experience: Beginners welcome, some knowledge of coding helps
The tutorial will go over the basic structure of a vue app, the main concepts featured in the framework, the available tooling that facilitates vue development, and will build a small application featuring most of these concepts during the workshop. Also covered will be how to connect to a backend to fetch data from an JSON API using axios.js.
Participants are instructed to use a computer with at least 8 GB of RAM and at least 20 GB free disk space to complete the exercises. The organizers will provide the software as a preconfigured VirtualBox virtual machine; alternative options are a Docker image or native Linux installation.
• No programming experience required, but useful in optional advanced exercises
• Familiarity with subject vocabularies (subject headings, thesauri, classifications)
In this bootcamp, participants will be introduced to the multilingual automated subject indexing tool Annif (https//annif.org) as a potential component in a library’s metadata generation system. By completing exercises, participants will get practical experience on setting up Annif, training algorithms using example data, and using Annif to produce subject suggestions for new documents using the command line interface, the web user interface and REST API provided by the tool. We will also introduce the corpus formats supported by Annif so that participants will be able to apply the tool to their own vocabularies and documents.
Instructional videos and written exercises are already available in the Annif-tutorial GitHub repository (https://github.com/NatLibFi/Annif-tutorial/). The participants have the opportunity to familiarize themselves with it beforehand and are encouraged to attempt to complete some of the exercises on their own time before the event. During the actual event we will go through some of the exercises. It’s also an opportunity to discuss, solve problems, ask questions and get a feeling of the community around Annif.