Inspired by the International Consortium of Investigative Journalists (ICIJ) use of Neo4j in their highly popular Panama Papers exposé, the European Space Agency decided to try new methods to support their goal of monitoring the satellite technology market.
The challenge
The European Space Agency (ESA) is dedicated to the peaceful exploration and use of space for the benefit of humankind. Established in 1975, the ESA works together with 22 Member States to further its mission to shape the development of Europe’s space capability and ensure that investment in space continues to deliver benefits to the citizens of Europe and the world.
An independent, internal exercise to survey a portion of the market produced encouraging results and clearly demonstrated the potential benefit of graph databases.
However, the amount of human effort required to ingest and map data into the graph made it unrealistic to proceed. ESA envisioned an automated Machine Learning (ML) powered enrichment workflow to create and grow their knowledge graph from structured and unstructured (textual) data.
The solution
ESA approached GraphAware to learn more about the Natural Language Processing (NLP) capabilities of Hume, GraphAware’s flagship product, a graph-powered intelligence analysis platform. Following a valuable demo of Hume by the ML experts at GraphAware, the ESA decided to strengthen its understanding of GraphAware capabilities through a proof-of-concept.
First, GraphAware consultants configured GraphAware Hume to automate the ingestion of unstructured data. This data was comprised of PDFs, CSVs, RSS feeds and spreadsheets containing URLs of web pages. The data gateway either extracted text content directly from file types such as PDFs, or crawled the web pages to extract their content. This unstructured text was then picked up for processing by Hume Orchestra.
Hume Orchestra is a key component of the platform, and manages the operational workflow for the storage, analysis, and post-processing of unstructured text. For the ESA, it was configured to include named entity recognition, entity relationship extractionm and post-processing of results, including materialisation of results including named entities, to align the resulting NLP graph with the domain model.
The ESA team worked with GraphAware to train machine learning algorithms that leverage knowledge of domain experts. This was then used to enhance the recognition of key entity types and relationships between them, and produce a knowledge graph from the unstructured text input. GraphAware Hume also provided an intuitive and visual mechanism connect and style the knowledge graph’s schema.
Finally, the knowledge graph visualisation generated much excitement at ESA, demonstrating that custom insights could be delivered and visualised easily through configuration of Hume Actions.
The result
ESA initially engaged with GraphAware for their NLP capabilities, but were delighted to receive an end-to-end solution that mapped a segment of the space ecosystem, right from automated ingestion of data to NLP and machine learning models with an enrichment workflow, and delivering business value by making insights accessible through an actionable graph visualisation.
“GraphAware Hume being a complete product is a key factor for ESA. The ability to customise Hume Actions via Cypher queries provided ESA with flexibility to cover a range of use cases and customers”.
Having met the brief by speeding up and simplifying the creation of ESA’s knowledge graph, and receiving unexpected added value in the form of insights surfaced in the visualisation, the ESA team expressed their appreciation for the professionalism of the GraphAware consultants that propelled them beyond their initial goal.
The European Space Agency are now preparing to build on this successful initial demonstration and drive more usage through the organisation in order to help it best focus its support efforts.
The ability to customise Hume Actions via Cypher queries provided ESA with flexibility to cover a range of use cases and customers.
— Data Science Lead
