Next step in Artificial Intelligence applications in Hebrew and Arabic

20/09/2023

Approximately 30 million NIS in 17 projects focused on NLP models in Hebrew and Arabic

Next step in Artificial Intelligence applications in Hebrew and Arabic

Israel Innovation Authority to invest approximately 30 million NIS in 17 projects focused on creating and providing access to data and language models in Hebrew and Arabic

Projects to serve as infrastructure for R&D in various fields.

Dror Bin, CEO of the Israel Innovation Authority: “In order to promote the integration of Artificial Intelligence into the Israeli high-tech industry and enable Israeli citizens to benefit from the fruits of this technology, we are keen to promote and advance activity in this area. This program is crucial for realizing Israel’s unique data potential in Hebrew and Arabic, facilitating the adoption of Artificial Intelligence in the industry, and advancing innovation in Israel. These infrastructures will lead to significant and open applications for the benefit of Israeli residents and enable companies developing products in this field to sell both domestically and internationally.”

As part of the implementation of the National Artificial Intelligence Plan and the initiative to create research and development infrastructure and advanced capabilities in the field of Natural Language Processing (NLP) in spoken Hebrew and Arabic, the Innovation Authority today announced its approval of 17 different projects in industry and academia, with a total budget of approximately 30 million NIS, which will facilitate the leap in AI applications in Hebrew and Arabic, requiring natural language processing as part of the solution.

New solutions and applications in the field of artificial intelligence are emerging almost on a daily basis. However, the gap between existing capabilities for Hebrew and Arabic and more common languages (such as English) is significant.

To narrow this gap, a wide range of infrastructures are required for application developers dependent on language. This call for proposals has promoted infrastructure activity to implement natural language-based artificial intelligence technologies and called on researchers, developers, academia, and industry to rise to this challenge.

Funding for approved projects will provide infrastructures and data repositories in general and specific areas, enabling a wide range of tasks from academia and industry.

Here are the approved projects:

  1. Verbit will develop and provide a transcription and summarization model for clinical and business applications.
  2. Dr. Amir David Nissan HaCohen and Dr. Avi Katschulero will develop and provide a model that enables chatbot creation and semantic retrieval of documents based on queries.
  3. Binat will develop and provide a speech-to-text service with additional capabilities such as speaker separation, real-time alerts, and more.
  4. Clalit Health Services will create a very large and unique international-level database that combines visual data with written interpretation of the image while meticulously preserving data integrity.
  5. Briya will develop and provide a language model for analyzing medical texts in Hebrew combined with English. Project partners include Assuta Ashdod Hospital, Galilee Medical Center, Sourasky Medical Center Tel Aviv, and Shaare Zedek Medical Center.
  6. Sheba Impact will develop and provide Clinical Bert, a language model trained on clinical textual data from Sheba Hospital (over 100 million medical records).
  7. Prof. Ruth Apter will develop a model that allows the generation of structured information from unstructured data.
  8. Dr. Amos Azaria (Ariel University) will develop and provide a translation algorithm that allows the use of pre-trained models in English for queries in Hebrew.
  9. Dr. Haya Libskind (Lev Academic Center) will develop and provide a model for identifying harmful content in texts.
  10. Dr. Amos Azaria (Ariel University) will develop and provide an algorithm for detecting hallucinations (false positives) in Hebrew language models.
  11. Dr. Arnon Shetrom (Ben-Gurion University of the Negev) will develop and provide an entity database in the legal field and an automatic tagging model.
  12. Tel Aviv University and Ichilov Hospital will develop and provide a medical model in Hebrew based on medical summaries and time relationship classification between two events.
  13. Ben-Gurion University of the Negev and Sami Shamoon School of Engineering will develop and provide tools for certifying service-oriented language models.
  14. Prof. Ido Dagan (Bar Ilan University) will develop and provide infrastructure for dynamic execution of queries based on semantic relationships (similar to the use of structured data and knowledge).
  15. Prof. Lior Rokach (Bar Ilan University) will develop and provide bidirectional translation models for Hebrew and Arabic languages.
  16. Prof. Koby Gell (Ben-Gurion University of the Negev) will develop and provide a model for detecting distress.
  17. Dr. Shai Fine (Reichman University) will develop and provide a model that allows extraction of information from recorded audio.

These projects will work to reduce existing gaps in the capabilities of Hebrew and Arabic language processing and some of them even challenge the global State of The Art in the field.