Over 10,000 physical typewritten documents from 1932 to 1941 had to be digitised, structured, and connected in order to create a single, centralised source of knowledge, for enabling the analysis of historical processes.
Federica Ventruto and Alessia Melania Lonoce are Junior Data Scientists at GraphAware who spoke at NODES2022. Natural language processing is an indispensable toolkit to build knowledge graphs from unstructured data. However, it comes with a price. Keywords and entities in unstructured texts are ambiguous - the same concept can be expressed by many different linguistic variations. The resulting knowledge graph would thus be polluted with many nodes representing the same entity without any order. In this session, we show how the semantic similarity based on transformer embeddings and agglomerative clustering can help in the domain of academic disciplines and research fields and how Neo4j improves the browsing experience of this knowledge graph.
Vlasta Kůs is Lead Data Scientist at GraphAware and presented at NODES2022. Public archives contain incredible amount of knowledge. In this session, we’ll cover a real use case of building a knowledge graph for the archive of a major foundation to help empower researchers (or business analysts) to access previously unavailable levels of insights. This archive, going up to a century back, contains detailed information about funded projects and conversations preceding them, budgets, research endeavors, and outcomes, as well as priceless knowledge about influence networks of foundation representatives, researchers, and students. A particular challenge was that the same events were described in multiple sources. The only way to leverage all of this knowledge was through the use of advanced analytics and machine learning. We will explore the technologies (including OCR, NLP, and graph data science) and complex pipelines employed to create this major knowledge graph.
Vlasta Kůs takes us through converting a corpus of research papers through Natural Language Processing, entity (relation) extraction and graph algorithms to highly informative connected insights organized in a knowledge graph.
Christophe Willemsen, CTO, GraphAware, explains how to apply NLP to extract entities and key phrases to build and search knowledge graphs
Dr. Alessandro Negro, Chief Scientist at GraphAware, presents on knowledge graphs at GraphTour DC.
Knowledge Graphs are becoming the de-facto solution for managing complex aggregated knowledge, and Neo4j is the leading platform for storing and querying connected data. In this talk, Christophe will describe a graph-centric cognitive computing pipeline and detail the process from the ingestion of unstructured text up to the generation of a knowledge graph, queryable using natural language through chatbots built with IBM Watson Conversation.
In this talk, Christophe will describe a graph-centric cognitive computing pipeline and detail the process from the ingestion of unstructured text up to the generation of a knowledge graph, queryable using natural language through chatbots built with IBM Watson Conversation.
A great part of the world’s knowledge is stored using text in natural language, but using it in an effective way is still a major challenge. Natural Language Processing (NLP) techniques provide the basis for harnessing this huge amount of data and converting it into a useful source of knowledge for further processing. By Alessandro Negro, Chief Scientist, GraphAware.