Skip to content. | Skip to navigation

Personal tools

You are here: Home / kulturarvscluster

Cultural Heritage Cluster

DeIC (Danish e-Infrastructure Cooperation) has been charged with spreading High-Performance Computing (HPC) to new research areas, such as the humanities and social science areas. In order to respond to this, DeIC and the Royal Danish Library have agreed to establish the DeIC National Cultural Heritage Cluster, Royal Danish Library.

The cultural heritage cluster applies state-of-the-art technologies within data science, and for the first time ever facilitates quantitative research projects on the digital Danish cultural heritage – e.g. radio and TV programmes, websites and historical newspapers.

In recent years, the Royal Danish Library has participated in national and international research and research infrastructure projects based on Danish digital cultural heritage. The library has expanded both knowledge and competences about what it takes to offer, for instance, data mining – the search for structures and patterns in large data sets.

The agreement between DeIC and the Royal Danish Library has a total financial framework of DKK 7.2 million.

Collections available to research projects

The Royal Danish Library is responsible for collecting and preserving large parts of the Danish cultural heritage, including the digital cultural heritage. This digital cultural heritage is divided into numerous collections, each with its own properties, formats and possibilities. Examples of collections that are now made available to researchers include radio/TV, the Netarchive and the Danish Newspaper Collection.

The radio/TV collection contains more than 1 million hours of TV broadcasts and more than 1.5 million hours of radio programmes broadcast on Danish channels from the 1980s until today. The collec-tion's data are made accessible as audio and video files. The collection also contains large amounts of metadata, such as programme titles, broadcast times and subtitles, depending on the epoch from which the material originates. Read more at

The Netarchive contains more than 800 TB data, corresponding to more than 25 billion objects gath-ered from the Danish part of the Internet from 2005 until today. This archive also contains both data and metadata, and both are made available to research projects. You can read more at

The digital newspaper collection contains 35 million newspaper pages from the 1700s until today. All of these pages are stored as image files along with a large amount of metadata and optical character recognition data (OCR).

In addition to these large collections, the Royal Danish Library also has smaller special collections.

All in all, more than 4 PB, corresponding to approx. 4,000,000 gigabytes, are made available to new and existing research projects.


The Cultural Heritage Cluster is to support new areas, particularly within digital humanities. It was therefore decided to design a system that would make it easier easy to conduct well-established analyses without having to compromise in relation to advanced and be-spoke methods.

The Cultural Heritage Cluster is making Hortonworks Data Platform available to research projects. This platform is developed within the framework of the Open Data Platform (ODPi) and on top of that there is installed web based tools to easy the access to the cluster.

The Open Data Platform is a new initiative from the largest Hadoop distributors, and it features many of the current Hadoop technologies. You can read about ODPi at, and from this site, it is possible to download a virtual and fully functional ODPi server, which can be run on an ordinary desk-top PC so that the techniques can be tested in a small setup.

The web based access tools will include Jupyter Notebooks and RStudio

Pilot projects

In 2018 several planned pilot projects will utilise the system's new facilities. The Royal Danish Library in collaboration with the DeIC eScience center of competence will make facilities available and offer training in use of the system to the researchers working on these projects free of charge. DeIC and the Royal Danish Library will also offer further, fully financed pilot projects through open project invitations.

Later on, it will also be possible to buy calculation time and consultancy assistance under a transparent price model, which will be developed in connection with the first pilot projects.

Further information

Future project invitations will be distributed through national channels for all relevant fields. If you are interested in being notified directly, please contact us. See to the right for contact information.

Also see this page in Danish with details on the process of becoming a pilot project.




Cordinator and general contact
8946 2177


8946 2100



8946 2100



8946 2301