DHCenter - logo
Photo by Alex Kotliarskyi on Unsplash

The first DTS hackathon is organised by the DTS community and funded by the dhCenter UNIL-EPFL and the École Nationale des Chartes. The goal of this hackathon is to raise awareness about the DTS specifications, encourage their use and, ultimately, to improve the ecosystem of datasets and tools around DTS.



The hackathon will take place fully online between 27 September – 8 October 2021.

Two types of hacks will be proposed :

  1. data hacks, aimed at exposing new or already existing textual resources via a DTS-compliant API with the goal of enlarging the variety of domains, languages and periods covered by DTS. Participants proposing a data hack may be affiliated with the institution owning the data, but this is not strictly necessary (e.g. in case of open data).
  2. tool hacks, aimed at enlarging the offer of DTS-compatible tools by extending existing software or developing new ones. Ideas for tool hacks include:
  • development of DTS client libraries in any programming language (e.g., Javascript, Python, Go, Java, Ruby, etc.);
  • development of generic viewers for DTS texts collections;
  • extending existing annotation tools (e.g. Recogito, INCEpTION) to support DTS as a way of importing TEI documents to annotate;
  • adding integration with tools/libraries that perform linguistic and/or semantic enrichment of textual data such as NLTK , CLTK or DHTK;
  • integration with digital editing tools such as collation tools (e.g., CollateX) or publication tools (e.g., eXistDB / TEI Publisher, EFES, ediarum)

As far as code licensing is concerned, anything produced by hackathon participants is property of its author(s), yet it must be made available to the community under an open license (including non-commercial licenses) by the end of the hackathon.



The contributions developed during the hackathon will be evaluated by an international jury which will award prizes to winners in the following categories:

  1. best tool I (limited to tools that consume DTS-ready corpora, where consumption means enrichment, annotation, collation, etc.)
  2. best tool II (limited to tools for the publication of DTS-compliant corpora);
  3. largest DTS corpus
  4. most inclusive DTS corpus (i.e. a corpus of texts whose language, time period or genre is currently underrepresented in the DTS offer).

(Prizes will have an individual value in the range of 300-400 CHF).


Important dates

1 July: hackathon registration opens

24 September (midnight CEST): hackathon registration closes

27 September: hackathon begins

28 September (midnight CEST): deadline to communicate teams and hacks. Note that:

-> each team member needs to be individually registered for the hackathon

-> team information needed: team name, names of participants, hack description (200 words max.)

8 October: hackathon ends

(a more detailed schedule of the hackathon will be added soon)



Participation to the hackathon is free, but participants need to send an email to the organizers (Matteo Romanello, matteo.romanello@unil.ch or Pietro Liuzzo, pietro.liuzzo@uni-hamburg.de).


21 Oct 2021
Online conference

Save The Date : Unconference ‘Digital Critique’ 21- 22 October 2021

03 Oct 2021
In-person conference

United Nations World Data Forum 2021 in Bern

27 Sep 2021
Online talk

Save the dates: EPFL Campus Lectures

14 Sep 2021
Online conference

Study day on Image & Data Processing in the Cultural Heritage Sector