The first DTS hackathon is organised by the DTS community and funded by the dhCenter UNIL-EPFL and the École Nationale des Chartes. The goal of this hackathon is to raise awareness about the DTS specifications, encourage their use and, ultimately, to improve the ecosystem of datasets and tools around DTS.
The hackathon will take place fully online from 27 September to 8 October 2021.
Two types of hacks will be proposed:
- data hacks, aimed at exposing new or already existing textual resources via a DTS-compliant API with the goal of enlarging the variety of domains, languages and periods covered by DTS. Participants proposing a data hack may be affiliated with the institution owning the data, but this is not strictly necessary (e.g. in case of open data).
- tool hacks, aimed at broadening the range of DTS-compatible tools by extending existing software or developing new tools. Ideas for tool hacks include:
  - development of generic viewers for DTS text collections;
  - extending existing annotation tools (e.g. Recogito, INCEpTION) to support DTS as a way of importing TEI documents to annotate;
  - adding integration with tools/libraries that perform linguistic and/or semantic enrichment of textual data, such as NLTK, CLTK or DHTK;
  - integration with digital editing tools such as collation tools (e.g. CollateX) or publication tools (e.g. eXistDB / TEI Publisher, EFES, ediarum).
As far as code licensing is concerned, anything produced by hackathon participants remains the property of its author(s), but it must be made available to the community under an open license (non-commercial licenses are acceptable) by the end of the hackathon.
The contributions developed during the hackathon will be evaluated by an international jury, which will award prizes to winners in the following categories:
- best tool I (limited to tools that consume DTS-ready corpora, where consumption means enrichment, annotation, collation, etc.);
- best tool II (limited to tools for the publication of DTS-compliant corpora);
- largest DTS corpus;
- most inclusive DTS corpus (i.e. a corpus of texts whose language, time period or genre is currently underrepresented in the DTS offer).
Each prize will be worth between 300 and 400 CHF.
1 July: hackathon registration opens
24 September (midnight CEST): hackathon registration closes
27 September: hackathon begins
28 September (midnight CEST): deadline for communicating teams and hacks. Note that:
- each team member needs to be individually registered for the hackathon;
- the team information required is: team name, names of participants, hack description (200 words max.).
8 October: hackathon ends
(A more detailed schedule of the hackathon will be added soon.)