Knowledge graphs for the IT Crowd

Addressing challenges of IT partners about knowledge graph technology

Knowledge graphs
March 10, 2023
by Joel Chacon
Eigen's Chief Commercial Officer and Chief Information Security Officer
30+ years' experience in oil and gas at bp, PDVSA, Honeywell and Matrikon

“We need this technology.” It’s a common refrain from the business when they stumble on the latest buzzword or silver bullet. And IT often comes across as the bad cop, having to explain why it won’t work or how it doesn’t conform to company standards.

As a responsible IT professional, you might be thinking the same about knowledge graph technology. Or maybe you haven’t been able to get past the hype to get your questions answered properly.

Here are a few questions I get from the IT crowd that you might have too:

  • What’s the underlying architecture?
  • What do we need to have to make this work?
  • How can you get away without defining a schema?
  • Is it really just a big database?
  • How does it handle unstructured, scattered data?
  • Where is the knowledge graph hosted?
  • What kind of support is there?
  • How will it comply with Company security standards?
  • How is data governance managed?
  • What do we need to do in IT, to make this work?

Over the past few years, we have really tried to listen to the usually quite legitimate challenges and concerns IT departments have raised. We have tried to see knowledge graph technology from their perspective, faced with their objectives and needs, so we can improve our offer and help our clients.

Here are some of the questions I often get asked, and the answers I give:

What’s the underlying architecture?

We know there’s no such thing as typical in oil and gas, but here’s a pretty simple diagram that shows how the knowledge graph fits into an oil and gas systems domain. It’s based on a real architecture we worked with, connecting multiple internal and external data sources and enabling monitoring, reporting, alerts and visualisation for different user groups.

We can deploy knowledge graph technology to any cloud that supports Kubernetes or virtual machines. The containers that comprise the solution can be deployed to a VM (or VM cluster) within a resource group assigned to Eigen.

Eigen knowledge graphs are built using Neo4j. They can be accessed using native Cypher queries or via the Eigen Python Library from user-generated Python code.
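As a rough illustration of the native Cypher route, the sketch below builds a parameterised query and runs it through the official Neo4j Python driver. The Asset and Tag labels, the HAS_TAG relationship, the property names and the connection details are all hypothetical, not Eigen's actual model, and the Eigen Python Library's own interface is not shown here.

```python
"""Sketch: querying a Neo4j-backed knowledge graph with Cypher from Python.

The Asset/Tag labels, HAS_TAG relationship and property names are
hypothetical; substitute whatever your own graph uses.
"""


def build_tag_query(asset_name):
    """Return a parameterised Cypher query and its parameters.

    Passing values as parameters keeps user input out of the query text.
    """
    query = (
        "MATCH (a:Asset {name: $asset_name})-[:HAS_TAG]->(t:Tag) "
        "RETURN t.name AS tag, t.source AS source "
        "ORDER BY tag"
    )
    return query, {"asset_name": asset_name}


def fetch_tags(uri, user, password, asset_name):
    """Run the query against a live Neo4j instance.

    Needs the official `neo4j` driver package and a reachable server,
    so the import is kept local to this function.
    """
    from neo4j import GraphDatabase

    query, params = build_tag_query(asset_name)
    with GraphDatabase.driver(uri, auth=(user, password)) as driver:
        with driver.session() as session:
            result = session.run(query, params)
            return [record.data() for record in result]


if __name__ == "__main__":
    query, params = build_tag_query("P-101")
    print(query)
    print(params)
```

Only `fetch_tags` touches the database; `build_tag_query` can be inspected or unit tested without a server.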

How can you get away without defining a schema?

The biggest thing that sets the knowledge graph apart from the database or data lake is the fact the former points to source data; it doesn’t ingest or copy data.

So, there’s no need to define a schema or the boundaries, as required by a database or data lake.

Building use-case by use-case allows us to continually evolve the knowledge graph based on the most important business needs – rather than needing to predict these potentially years in advance. And by following an Agile methodology the graph can evolve by improving on previous graph structures.
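To make the use-case-by-use-case idea concrete, here is a minimal in-memory sketch of a property graph whose nodes hold pointers to source systems instead of copies of the data. It is not how Neo4j or the Eigen platform is implemented; the class, node ids, relationship names and URIs are invented for illustration.

```python
"""Sketch: a property graph that grows use case by use case.

Nodes carry free-form properties and pointers to source systems, so a new
use case adds nodes and relationships without a schema migration. All
names and URIs below are made up for illustration.
"""


class Graph:
    def __init__(self):
        self.nodes = {}   # node id -> dict of properties
        self.edges = []   # (source id, relationship type, target id)

    def add_node(self, node_id, **properties):
        # Merge rather than overwrite, so later use cases can enrich a node.
        self.nodes.setdefault(node_id, {}).update(properties)

    def relate(self, source, rel_type, target):
        self.edges.append((source, rel_type, target))

    def neighbours(self, node_id, rel_type):
        return [t for s, r, t in self.edges if s == node_id and r == rel_type]


graph = Graph()

# Use case 1: monitoring. The node stores a pointer to the historian tag,
# not a copy of the time series itself.
graph.add_node("pump-101", kind="Pump")
graph.add_node("tag-PI-101", kind="Timeseries", source="historian://site-a/PI-101")
graph.relate("pump-101", "MEASURED_BY", "tag-PI-101")

# Use case 2, added later: maintenance. A new node type and a new
# relationship type appear; nothing defined earlier has to change.
graph.add_node("wo-5521", kind="WorkOrder", source="cmms://orders/5521")
graph.relate("pump-101", "HAS_WORK_ORDER", "wo-5521")
graph.add_node("pump-101", criticality="high")

print(graph.neighbours("pump-101", "MEASURED_BY"))
print(graph.nodes["pump-101"])
```

The second use case introduces a node kind and a relationship type that did not exist when the first was built, which is the flexibility a predefined schema makes expensive.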

Is it really just a big database?

Not really, because a database is by its nature predefined. If business requirements change, modifying the underlying database is both complex and costly. Databases also require a high level of standardisation in nomenclature and format, which often creates bottlenecks and silos as new systems, tools and even assets are onboarded.

Knowledge graph technology, by contrast, is more flexible and dynamic and can handle, for example, a single asset with multiple names and multiple asset hierarchies.
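The multiple-names, multiple-hierarchies case can be sketched in a few lines of Python. This is only a toy model under assumed names (the asset ids, aliases and hierarchy labels are hypothetical), but it shows one asset answering to several names and sitting in two hierarchies at once.

```python
"""Sketch: one asset, several names, several hierarchies.

Asset ids, aliases and hierarchy names are hypothetical examples.
"""
from collections import defaultdict

aliases = {}                      # lower-cased alias -> canonical asset id
hierarchies = defaultdict(dict)   # hierarchy name -> {child id: parent id}


def register(asset_id, *names):
    """Record every name by which different systems know the asset."""
    for name in names:
        aliases[name.lower()] = asset_id


def resolve(name):
    """Map any known name to the canonical asset id."""
    return aliases[name.lower()]


def place(hierarchy, child, parent):
    """Attach a child to a parent within one named hierarchy."""
    hierarchies[hierarchy][child] = parent


def path(hierarchy, asset_id):
    """Walk up one hierarchy and return the route from the root down."""
    parents = hierarchies[hierarchy]
    trail = [asset_id]
    while trail[-1] in parents:
        trail.append(parents[trail[-1]])
    return trail[::-1]


# The same pump as named by the historian, the maintenance system and operators.
register("pump-101", "P-101", "10-PA-101A", "Export Pump A")

# Two independent views of where that pump sits.
place("by_area", "pump-101", "area-10")
place("by_area", "area-10", "platform-alpha")
place("by_process", "pump-101", "oil-export")
place("by_process", "oil-export", "platform-alpha")

asset = resolve("export pump a")
print(path("by_area", asset))
print(path("by_process", asset))
```

Neither hierarchy is privileged, and adding a third view later does not disturb the first two.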

How does it handle unstructured, scattered data?

Moving to a new database or data lake often requires considerable data cleansing during migration to fit with the new structure.

Knowledge graphs point to source data, rather than moving a copy of it. They do not require the data to be structured and they do not care where the data sits.

Knowledge graphs can point to documents, blueprints and text files, and the analytics applications that use the graph know how to interpret each reference, e.g. trending a time series or opening a file.
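One way an application might interpret such references is to dispatch on the kind of pointer it finds. The sketch below assumes references are stored as URI strings and uses a made-up `historian://` scheme alongside `file://`; the handler names and return values are placeholders, not part of any Eigen API.

```python
"""Sketch: interpreting graph references by their type.

Assumes references are stored as URIs. The `historian` scheme and both
handlers are hypothetical stand-ins for real application behaviour.
"""
from urllib.parse import urlparse


def trend_timeseries(ref):
    # A real application would query the historian and draw a trend.
    return f"trend {ref.netloc}{ref.path}"


def open_document(ref):
    # A real application would hand the path to a document viewer.
    return f"open {ref.path}"


# One handler per reference scheme; new data sources add an entry here.
HANDLERS = {
    "historian": trend_timeseries,
    "file": open_document,
}


def interpret(reference):
    """Pick the right action for a reference held on a graph node."""
    parsed = urlparse(reference)
    handler = HANDLERS.get(parsed.scheme)
    if handler is None:
        raise ValueError(f"no handler for reference type: {parsed.scheme!r}")
    return handler(parsed)


print(interpret("historian://site-a/PI-101"))
print(interpret("file:///docs/pid-0042.pdf"))
```

The graph itself stays agnostic: it stores the pointer, and the consuming application decides what to do with it.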

Where is the knowledge graph hosted?

The knowledge graph is part of the multi-container Eigen Analytics Platform, which is typically hosted within the operator’s own cloud environment.

The knowledge graph can also be deployed on its own and can be queried for useful insights.

What kind of support is there?

The knowledge graph technology we use at Eigen is based on Neo4j, a graph database with an open source Community Edition and a large community of major corporate users.

The knowledge graph and Neo4j are part of the Eigen framework and are supported by Eigen as part of its application support contract. Different SLAs can be offered based on the operator’s need and budget. Eigen provides first-level support for several of our clients, although several others have chosen to keep first-level support in-house.

How will it comply with Company security standards?

We recommend a thorough investigation of knowledge graph technology, as with any external application, to assure digital security. Neo4j’s client list of major banks, insurers and other IT providers may offer some reassurance about its security standards and practices.

At Eigen, we have worked hard to achieve accreditation in ISO 9001 (Quality) and ISO 27001 (Information Security) to ensure we design, build and operate solutions that comply with standards at the highest level for quality and digital security.

How is data governance managed?

The honest truth is that, like any poorly run IT project, a knowledge graph project can go wrong and get out of control. Knowledge graphs require rules, processes and standards. Eigen has experience and tools to help operators define and operate such governance, but ultimately these are the responsibility of the operator.

If you have any other questions about knowledge graphs, feel free to visit our user community or ask your question here.

If you would like a demo of an Eigen knowledge graph, tailored to your needs as an IT specialist, please book a convenient time here.

What do we need to do in IT, to make this work?

At Eigen, we always engage with our customers’ IT departments to define the steps required to deploy our technology. A secure cloud infrastructure design is generally the first step.

Once that is approved, all the IT department needs to do is assign appropriate resources within its cloud provider and implement the secure channels and firewall rules agreed in the design. For example, in Microsoft Azure this would be a Standard_D8s_v3 VM, or an AKS cluster of three Standard_D8s_v3 VMs for full container orchestration through Kubernetes. Alternatives are always possible and Eigen experts are available to answer any questions.

If you have other IT-related questions or want to discuss any of these points further, drop me a line in the form below, or contact Joel Chacon, Chief Information Security Officer.