Resources - KG

Videos, Slides, Case Studies and other GraphAware related resources

NODES2022 - Temporal Graph Analysis

25 Nov 2022 videos KG ML

Fabio Montagna is Lead Machine Learning Engineer at GraphAware and presented Temporal Graph Analysis at NODES2022. In this session, we’ll share our experience with horizon scanning over a graph of medical research papers. By leveraging the author keywords from scientific publications, it’s possible to build a cooccurrence graph with a temporal component provided by the paper publication date. We’ll show how we can analyze trends and evolution patterns using an unsupervised algorithm that assigns roles to author keyword.

NODES2022 - Keyword Disambiguation Using Transformers and Clustering to Build Cleaner Knowledge

25 Nov 2022 videos KG NLP

Federica Ventruto and Alessia Melania Lonoce are Junior Data Scientists at GraphAware who spoke at NODES2022. Natural language processing is an indispensable toolkit to build knowledge graphs from unstructured data. However, it comes with a price. Keywords and entities in unstructured texts are ambiguous - the same concept can be expressed by many different linguistic variations. The resulting knowledge graph would thus be polluted with many nodes representing the same entity without any order. In this session, we show how the semantic similarity based on transformer embeddings and agglomerative clustering can help in the domain of academic disciplines and research fields and how Neo4j improves the browsing experience of this knowledge graph.

NODES2022 - Data Management with Knowledge Graphs Bringing Archives to Life

25 Nov 2022 videos KG NLP

Vlasta Kůs is Lead Data Scientist at GraphAware and presented at NODES2022. Public archives contain incredible amount of knowledge. In this session, we’ll cover a real use case of building a knowledge graph for the archive of a major foundation to help empower researchers (or business analysts) to access previously unavailable levels of insights. This archive, going up to a century back, contains detailed information about funded projects and conversations preceding them, budgets, research endeavors, and outcomes, as well as priceless knowledge about influence networks of foundation representatives, researchers, and students. A particular challenge was that the same events were described in multiple sources. The only way to leverage all of this knowledge was through the use of advanced analytics and machine learning. We will explore the technologies (including OCR, NLP, and graph data science) and complex pipelines employed to create this major knowledge graph.