// Archives

Better OCR of Research Papers and Newspaper Articles

Optical Character Recognition (OCR) is a technique that lets computers read printed text. OCR of newspapers, magazines and research papers has always been challenging because of the unconventional formatting of the material. Text boxes, images and tables are often overlaid and are difficult (for a computer) to distinguish. Conventional, out-of-the-box OCR software is …

Learn how to train a Simpsons character classifier at home using Ludwig

Researchers may be wondering what tools are available for working on their datasets from home. Here we highlight an open-source toolkit that allows researchers to implement machine learning architectures and deep learning models on their home computers with little to no coding. In this blog post you will learn about the …

DRHack: The results of our Digital Research Hackathon

On the afternoon of Friday 27 April, at the same time as the Digital Research Week’s closing ceremony, DRHack kicked off. We co-hosted this event with HackSoc, a student society dedicated to hacking. We invited students to come and try their hand at designing and creating new ways of engaging with research data. Hackathon participants …