Perspectives

10 things we’ve learned about building knowledge graphs

Knowledge Graphs
Murray Callander
July 5, 2023

There is definitely something intuitive about building knowledge graphs, after all, they mirror the way we as humans acquire knowledge and retain it through connections and context.

Over the past five years building industrial scale knowledge graphs, we have learnt many lessons to build them faster – and focused on delivering value.

Here’s our top 10.

1. Start small, don’t boil the ocean

The temptation is to find a really meaty domain, one used by many people to deliver massive impact. The bigger the domain, the more stakeholders, the more nuances and the slower the process. Often, it’s better to pick a narrow domain, with a small number of users and build small solutions that can scale.

2. Build use case by use case

This is our main learning, always start with real-world problems, ideally with substantive value attached to their solution. Never try to build for what-if or you will waste time and risk user engagement. Build incrementally from each use case, adding more data and context into the knowledge graph.

3. The model serves the use case, never the other way round

This complements the previous lesson. As with building any digital solution there are moments when it’s tempting to add features and design ideas that perfect the product but don’t add value to the user or solve a problem. A laser focus on use case is vital.

4. Pick a graph technology vendor and stick to it

There are many great graph database vendors out there now, such is the demand – Stardog, Dgraph, ArangoDB etc. We did some testing a few years back and landed on Neo4j. We like the speed, the functionality, and the tools, but more importantly we’ve now invested time in understanding how it works – swapping now would slow our pace of delivery and learning. Stick with one – don’t get tempted by offers and features!

5. Use technology that allows you to access data at-the-source

When data is replicated to a data lake, there is inevitable loss of fidelity, integrity and context, not to mention waiting for the data lake project to finish the migration. Use a technology stack that allows working with data kept at the source. Eigen Ingenuity has been doing this for years; it can also access data already in a cloud-based data lake.

6. Visualisation of the knowledge graph is key to engage stakeholders

One of the things we really like about Neo4j is the way we can build a knowledge graph in minutes and share it with a client. A simple and intuitive front-end that masks considerable complexity below helps engage users and visualise what they are getting.

7. Whiteboard it first

If you can draw your outline knowledge graph on a whiteboard, you can model it in a tool like Neo4j. Drawing it out forces you to clarify your thinking before you start building, enabling you to identify opportunities to link with existing data sources, to surface any connection or logic challenges as well as to explore other use cases. Your whiteboard version is also a great communication tool to engage stakeholders so they understand - and can input to - what you plan to build.

8. Don’t constrain yourself to relational database thinking

Most of us have grown up with databases and so it’s natural to transfer this thinking to knowledge graph design, but the latter are more flexible and scalable; they don’t have predefined boundaries or the same constraints. Encourage free thinking in your design.

9. Don’t be afraid to iterate

Experimentation and learning are key to building knowledge graphs; we almost always build them in agile projects, with two or three week sprints to add in focused design, development and testing. By involving product owners at the end of each sprint to provide feedback and direction, we quickly learn how close the knowledge graph-based solution is to resolving the client’s problem and where we need to iterate further to close gaps.

10. Stay up to date

Knowledge graph technology is evolving fast. Keep informed about advances in technologies, standards and best practices or work with a partner who makes this a priority.

Have you any other lessons about building knowledge graphs? What’s worked for you – and what is still proving a challenge? Drop me a line at murray.callander@eigen.co I’d be happy to share any of our insights. 

Written by
Murray Callander
Posted on
July 5, 2023
July 6, 2023

Knowledge graph to manage safety risk and asset barrier health

November 10, 2021

Knowledge graphs: a bridge

You may also like

DCOM

Distributed Component Object Model (DCOM) is a Microsoft remote communication protocol at the heart of many real time data links.

Knowledge Graphs

Context and clarity through dynamic monitoring and visualisation

Digital Transformation

The trend in many mature basins, just like the UK and Norwegian continental shelves, has been one of steady divestment by the supermajors and a new generation of smaller oil and gas operators.

Electric cars

The running cost of an electric car depends on where you charge it and can vary from 3p to over 35p per mile.
In short, electricity prices vary hugely compared to petrol or diesel; you can pay anywhere from 5p/kWh to 70p/kWh so it costs me anywhere from 3p/mile to 35p/mile depending on where I charge.

Electric cars

The range of an electric car is less important than how quickly you can recharge it
Would I recommend an electric car? Yes, definitely. They are nicer to drive, much cheaper to run and need less maintenance. Range isn’t the issue, maybe the real question is “are you prepared to change your habits?”.

Electric cars

reliable destination charging is key to making electric car ownership work for most people.
When you say you own an electric car everyone’s first question is about that magic word “Range” – “How far can it go on a charge?”. Having owned an electric car for over six years now I can tell you that range is much less important than the ability to recharge at your destination. Let me explain why.

Safety Barriers

At Eigen, we build and deploy live data models as part of a digital twin for oil and gas facilities. One of the challenges in doing this is how to combine the physical and the abstract into a working data model. And a great example of this is the bow tie – a diagram showing all the protective measures against hazardous events and their potential consequences.

Knowledge Graphs

The automobile and industrial designer Freeman Thomas said “We were promised a simpler life, and technology has only complicated our lives.” In complex domains, like oil and gas, technology may well have simplified some things but it has certainly complicated others, particularly where data is concerned.

Decision making

Death and taxes may historically have been the only two certainties in life, but we might consider a third – increasing volumes of data. With ubiquitous sensing, unlimited cloud storage and insatiable demands from every walk of life, data is growing exponentially.

Digital twin

Much of the focus of digital transformation is given to big data, clever systems and game-changing innovations, like robotics, AI and IoT. But what is often missing is the focus on people, who have been irrevocably transformed by digital.

Digital twin

We formed Eigen almost fifteen years ago, although the original founding team had all been working in oil and gas for the previous decade. As individuals, then later as Eigen, we worked with bp to deploy its breakthrough Field of the Future programme in Azerbaijan, and have since been part of many oil and gas operators’ digital transformation journeys.

Digital twin

There’s a lot of hype about digital twins in oil and gas. Much is well-founded, since the integration of the underlying technologies that make digital twins possible have potential to accelerate much needed digital transformation in the sector. But you can’t believe everything you read.

Digital twin

The concept of digital twin has a futuristic quality at a time when robotics and artificial intelligence also are coming of age. But whilst there’s no disputing the transformative impact of digital twins, in industries as wide-ranging as construction, marine, agriculture and oil and gas, what makes up a digital twin is technology of now.

Decision making

One of the problems of asset-heavy industries, including Oil & Gas and Petrochemical operations, is decision-making for maximum impact, under resource constraint conditions. These industries typically manage an array of highly interconnected assets and relationships which bear on the impact of the decision.

Digital twin

When it comes to digital transformation and learning how to discover insights hidden in data, one question that regularly arises is whether we should copy all of our data into a data warehouse or data lake, or alternatively connect directly to the source systems.

Digital twin

In this blog series we’re looking at how you turn a bowtie diagram into an online Digital Twin showing live barrier health information. The previous part looked at the level of detail required to make a working data model in practice. In this instalment we’re looking at the challenge of assessing the health of protective functions not directly related to the physical SECEs (Safety and Environmentally Critical Equipment).

Digital twin

In this blog series we’re looking at how you turn a bowtie diagram into an online Digital Twin showing live barrier health information. In this instalment we’re considering the level of detail required to make a working data model in practice.

Digital twin

How do you turn a bowtie diagram into an online Digital Twin showing live barrier health information?

Complex Subsea Hydraulic System

Imagine the situation: Hydraulic fluid pressure is dropping fast; supply tank levels are also dropping. You are pretty sure there is a big leak in the system, but you have no idea where it is. Pretty soon the operation will be compromised: The pressure will be below the minimum required and you may be forced to shut down the field. It is a scary picture.

What makes a good datamodel?

Wireframe rendering of turbojet engine and mirrored physical body on black background. Digital twin concept. 3D rendering image.
There is a lot of buzz nowadays about “Digital Twins”, and every Digitalisation Director out there is thinking they need to have a Digital Twin of their assets. It makes sense: If you have a digital representation of your assets; you can simulate, test scenarios, plan operations and visualise data within the context afforded by the twin, without having to visit or “be present” in the real asset, especially if these assets are remote.

Agile

Agile development software business web computer agility nimble quick fast start up concept
Ah, the old ones still the best ????. But such questions can sometimes be worth considering. For example, when is Agile not Agile?

Open Source

Two months ago I switched my phone from an Apple iPhone 7 to an Android based POCO X3 NFC. At the same time one of my daughters upgraded their phone to an iPhone XR so we’ve been comparing experiences. So what’s it been like??

Uncategorized

2021 futuristic tehcnology trend concept, hand man pointing in concept futuristic in 2021 coming year the year of artificial intelligence ,big data, iot, augmented reality, machine learning
As the COVID-19 pandemic hit us round about March 2020 and the Oil prices crashed, all of us working for the Oil & Gas industry panicked and prepared for the worse. In previous market downturns, budgets had been slashed and projects cancelled, but this time something quite different happened, at least in the Digitalisation space.

Agile

What has Agile ever done for us?
In an earlier blog, I wrote about what Agile is. And that’s all well and good, but what has Agile ever done for us?

Open Source

Using Open Source to avoid lock-in to expensive proprietary technologies
I’ve been using Apple products since 2007. Every smart phone I’ve had has been an iPhone but yesterday I ordered a new Android phone. I’m so fed up with shelling out around £1000 every couple of years but I’ve been trapped because all my music is in iTunes, all my family have iPhones and we run Macs*. Man, I hate being locked in! I wouldn’t mind if it was still value for money, but it’s not – I feel like I’m being taken for a ride now.

Perspective

In November (this month as I write this) we have not long closed out possible two of the best custom software projects we have done. Sure there were bugs in the testing and there will be more found by the users I’m sure, but what made them so good? We hit the schedule to within a week on both of them and there were no scope changes over the entire project, so we hit the budget as well. This is particularly impressive for one of them because it was a completely new piece of software, written from scratch; taken from powerpoint to in production in 5 months.

Uncategorized

percent sign on screen. interest rate
This picture makes me laugh and if it makes you laugh too then you’ve probably been involved in purchasing or project delivery at some point.

Is your Oil field being exploited at its best?

A man holding a flask of oil. Oil rigs in the background. Oil production in Russia. Sunset. Oil pumping
Production Well Testing is an essential part of managing an Oil & Gas field if the field has more than one well and production measurement resources are limited, as is generally the case.

Agile

Agile Software Development Business Internet Techology Concept
You have probably heard software developers talk about being Agile, and maybe wondered what they mean. Perhaps they do star jumps and touch their toes before breakfast, or write code while limboing under desks? Well, it’s nothing like that… read on to find out more!

Uncategorized

Eigen and Wintershall Dea partnership
I’m really excited about the long term partnership we have recently signed with Wintershall Dea for a number of reasons not least because it’s a great team and we have a lot of fun working together!

Decision Bridge Technology

eigendbt2
Back in July I asked myself the question “Why are we still in business after 11 years?”. A small specialist company in a big competitive industry. We’ve been through 3 downturns now and we continue to develop longstanding relationships with some of the most innovative operators in the business. Clearly we are doing something right but what is it?

Decision Bridge Technology

Young musician playing acoustic guitar and singing,
Did you ever learn to play “Sweet child of mine” on the guitar? Like almost every other teenager who had a guitar I learned the opening riff to “Sweet child of mine” and I remember playing it in a guitar shop when I was testing out an amp, only to have one of the staff point to a sign on the wall that said anyone playing “sweet child of mine” would be thrown out the shop and told never to return!

Is your Oil field being exploited at its best?

Smart city and abstract dot point connect with gradient line and aesthetic Intricate wave line design , big data connection technology concept .

In our previous blog, the discussion was centred on assessing how well an asset is exploited, as a key concern for the field manager and the asset team. The Integrated Choke Model (ICM) was proposed as a model to identify opportunities for improving the free cash flow as the fundamental objective during periods when the oil price is low.

Uncategorized

Businessman holding tablet and showing holographic graphs and stock market statistics gain profits. Concept of growth planning and business strategy. Display of good economy form digital screen.
The world was shocked to learn about the massive explosion in the port of Beirut: This has been a terrible tragedy with an astronomical cost in terms of lives lost and damage to property; one could even argue, to the whole country, as even the strategic grain reserves were hit.

Is your Oil field being exploited at its best?

Is your Oil field being exploited at its best?
A typical question for a Petroleum Field Manager is how well the asset is being exploited and if this exploitation is efficient and generates enough value for Operator and Partners. This article explores some answers based on our understanding of the challenges that appear throughout the life of the field, and to how to be prepared to transform them into opportunities.

What makes a good datamodel?

What makes a good datamodel?
Now we are potentially getting right in amongst the weeds and many a data model gets stuck in the weeds because this issue is not understood.

What makes a good datamodel?

What makes a good data model? | Eigen
Once upon a time, in a land far far away, hard disks and memory were small and really expensive. In 1969 the computer that landed man on the moon only had 4k of RAM and 72k of ‘disk’ (ROM) storage.

What makes a good datamodel?

What makes a good data model? | Eigen
Having the best datamodel in the world won’t help you grow your digital capabilities if the only way to access it is through proprietary drivers.

What makes a good datamodel?

What makes a good data model? | Eigen
Google search is not a validated datamodel. Nor is it comprehensive, but it’s good enough for most of us.

What makes a good datamodel?

What makes a good data model? | Eigen
“A well structured class library allows you to build applications that are portable across different assets. They overcome the problems of different naming conventions across assets or sites.”

What makes a good datamodel?

What makes a good data model? | Eigen
Trying to model humans is probably the easiest way I know to break an inflexible model. Why? Because humans are recursive – that means they keep repeating the same thing and also they keep changing the rules! Humans start off as a child and become parents, they form relationships with each other, they marry each other and they can now change gender. If you designed your database schema in the 1500’s you would have had to rebuild it several times since!

What makes a good datamodel?

What makes a good datamodel?
The problem with hierarchies is they are subjective and only relevant to a certain viewpoint or problem. For example, what is the hierarchy of equipment on an industrial facility? Should it be based on area or on equipment size, or on position in the process?

What makes a good datamodel?

What makes a good datamodel?
Every Digital Twin needs a data model somewhere inside it. The Data Model is the digital map of where everything is; it’s the central reference for how everything is connected; it’s the search engine for your information.