{"id":1235,"date":"2023-05-07T10:42:59","date_gmt":"2023-05-07T10:42:59","guid":{"rendered":"https:\/\/innovationisrael.org.il\/en\/?post_type=success_story&p=1235"},"modified":"2023-10-18T09:35:46","modified_gmt":"2023-10-18T09:35:46","slug":"impact","status":"publish","type":"success_story","link":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/","title":{"rendered":"Digitising the past"},"content":{"rendered":"\n

Background<\/h2>\n\n\n\n

Project acronym: <\/strong>IMPACT<\/p>\n\n\n\n


Although millions of books are scanned and put online every year, making old documents and texts available on the web is a difficult and painstaking process.<\/strong>

Project IMPACT \u2013 which stands for Improving Access to Text – is focused on the making the process easier.

Project IMPACT director Hildelies Balk explained: \u201cThe problem with turning an historic document into a machine readable text is that it is so very old, everything is different from a modern document, it has old fonts, old words and a very difficult layout.\u201c

Once scanned they are left full of errors, because computers struggle to read old texts with strange layouts, fonts and spellings.

Clemens Neudecker, technical manager for European projects at Koninklijke Bibliotheek, showed us one example: \u201cThis is the Principia Mathematica by Isaac Newton. You see actually what we call shine through, that is ink from the opposite page which is just shining through the paper, you see that the paper is warped, and you can also see here there is this long \u2018s\u2019 also in use, which can very easily be confused with an \u2018f\u2019.\u201d

Researchers at the National Library of the Netherlands have spent four years in a European project to improve software tools to read old books.

Researcher Hildelies Balk said: \u201cWe improved software for image enhancement, optical character recognition, post-correction of the document and language technology to make it more accessible.\u201c

That know-how has already been integrated into the market-leader digitisation software \u2013 and the results are much improved.

Clemens Neudecker talked us through one project: \u201cHere we have an example of the image being straightened. And the next thing is that these borders also need to be cropped. The next step is to transform that into a black and white image in order to enhance the contrast background and foreground.

\u201cAt the very end of the process the user gets the recognised full text, and there\u2019s also the structural features of this text – for example paragraphs, headlines and the like are also detected.\u201c

The project claims at least a 15 percent improvement in the accuracy of scanned text.

It means precious archives should be much more available.

Hildelies Balk concluded: \u201cText that is not fully digital, it is virtually invisible. Everyone is used to going into a search engine, and looking for a word, and if they don\u2019t find this it basically isn\u2019t there for them.\u201d

This innovation was made possible by Israel\u2019s continued participation in the official Horizon 2020 fund, managed in Israel by ISERD part of The Israel Innovation Authority (Formerly the Office of the Chief Scientist and MATIMOP). The initiative has taken Israeli R&D to the next level with the help of ground-breaking collaboration between scientists in Israel and Europe, as well as essential funding and support.<\/p>\n\n\n\n


Project details<\/strong>
Project acronym: <\/strong>IMPACT
Participants:<\/strong> Netherlands (Coordinator), Poland, UK, Slovenia, Bulgaria, Israel, Germany
FP7 Project N\u00b0<\/strong> 215064
Total costs:<\/strong> \u20ac15 503 509
EU contribution:<\/strong> \u20ac11 500 000
Duration:<\/strong> January 2008 – June 2012<\/p>\n","protected":false},"excerpt":{"rendered":"

Background Project acronym: IMPACT Although millions of books are scanned and put online every year, making old documents and texts available on the web is a difficult and painstaking process. Project IMPACT \u2013 which stands for Improving Access to Text – is focused on the making the process easier. Project IMPACT director Hildelies Balk explained: \u201cThe […]<\/p>\n","protected":false},"featured_media":0,"parent":0,"template":"","geographic_location":[96],"collaboration_opportunities":[100],"technologies":[],"acf":[],"yoast_head":"\nDigitising the past - English Innovation Site<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Digitising the past - English Innovation Site\" \/>\n<meta property=\"og:description\" content=\"Background Project acronym: IMPACT Although millions of books are scanned and put online every year, making old documents and texts available on the web is a difficult and painstaking process. Project IMPACT \u2013 which stands for Improving Access to Text – is focused on the making the process easier. Project IMPACT director Hildelies Balk explained: \u201cThe […]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/\" \/>\n<meta property=\"og:site_name\" content=\"English Innovation Site\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-18T09:35:46+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/\",\"url\":\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/\",\"name\":\"Digitising the past - English Innovation Site\",\"isPartOf\":{\"@id\":\"https:\/\/innovationisrael.org.il\/en\/#website\"},\"datePublished\":\"2023-05-07T10:42:59+00:00\",\"dateModified\":\"2023-10-18T09:35:46+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/innovationisrael.org.il\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Success stories\",\"item\":\"https:\/\/innovationisrael.org.il\/en\/success_story\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Digitising the past\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/innovationisrael.org.il\/en\/#website\",\"url\":\"https:\/\/innovationisrael.org.il\/en\/\",\"name\":\"English Innovation Site\",\"description\":\"Just another Innovation Authority Sites site\",\"publisher\":{\"@id\":\"https:\/\/innovationisrael.org.il\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/innovationisrael.org.il\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/innovationisrael.org.il\/en\/#organization\",\"name\":\"English Innovation Site\",\"url\":\"https:\/\/innovationisrael.org.il\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/innovationisrael.org.il\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/innovationisrael.org.il\/en\/wp-content\/uploads\/sites\/3\/2022\/11\/cropped-menu-logo.png\",\"contentUrl\":\"https:\/\/innovationisrael.org.il\/en\/wp-content\/uploads\/sites\/3\/2022\/11\/cropped-menu-logo.png\",\"width\":129,\"height\":36,\"caption\":\"English Innovation Site\"},\"image\":{\"@id\":\"https:\/\/innovationisrael.org.il\/en\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Digitising the past - English Innovation Site","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/","og_locale":"en_US","og_type":"article","og_title":"Digitising the past - English Innovation Site","og_description":"Background Project acronym: IMPACT Although millions of books are scanned and put online every year, making old documents and texts available on the web is a difficult and painstaking process. Project IMPACT \u2013 which stands for Improving Access to Text – is focused on the making the process easier. Project IMPACT director Hildelies Balk explained: \u201cThe […]","og_url":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/","og_site_name":"English Innovation Site","article_modified_time":"2023-10-18T09:35:46+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/","url":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/","name":"Digitising the past - English Innovation Site","isPartOf":{"@id":"https:\/\/innovationisrael.org.il\/en\/#website"},"datePublished":"2023-05-07T10:42:59+00:00","dateModified":"2023-10-18T09:35:46+00:00","breadcrumb":{"@id":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/innovationisrael.org.il\/en\/success_story\/impact\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/innovationisrael.org.il\/en\/"},{"@type":"ListItem","position":2,"name":"Success stories","item":"https:\/\/innovationisrael.org.il\/en\/success_story\/"},{"@type":"ListItem","position":3,"name":"Digitising the past"}]},{"@type":"WebSite","@id":"https:\/\/innovationisrael.org.il\/en\/#website","url":"https:\/\/innovationisrael.org.il\/en\/","name":"English Innovation Site","description":"Just another Innovation Authority Sites site","publisher":{"@id":"https:\/\/innovationisrael.org.il\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/innovationisrael.org.il\/en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/innovationisrael.org.il\/en\/#organization","name":"English Innovation Site","url":"https:\/\/innovationisrael.org.il\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/innovationisrael.org.il\/en\/#\/schema\/logo\/image\/","url":"https:\/\/innovationisrael.org.il\/en\/wp-content\/uploads\/sites\/3\/2022\/11\/cropped-menu-logo.png","contentUrl":"https:\/\/innovationisrael.org.il\/en\/wp-content\/uploads\/sites\/3\/2022\/11\/cropped-menu-logo.png","width":129,"height":36,"caption":"English Innovation Site"},"image":{"@id":"https:\/\/innovationisrael.org.il\/en\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/success_story\/1235"}],"collection":[{"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/success_story"}],"about":[{"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/types\/success_story"}],"wp:attachment":[{"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/media?parent=1235"}],"wp:term":[{"taxonomy":"geographic_location","embeddable":true,"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/geographic_location?post=1235"},{"taxonomy":"collaboration_opportunities","embeddable":true,"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/collaboration_opportunities?post=1235"},{"taxonomy":"technologies","embeddable":true,"href":"https:\/\/innovationisrael.org.il\/en\/wp-json\/wp\/v2\/technologies?post=1235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}