‘Behind every great technology lies a genius’ — or in our case, several geniuses! This series of articles will introduce you to our Founders, three Professors from the University of Oxford and an accomplished Venture Capitalist. For the first article in our series, we welcome Professor Ian Horrocks to the hot seat.
So Ian, you are well known for your work in Semantic Reasoning, Description Logics, Ontologies and Rule Languages and you have many published papers (for links see the end). How did you end up in this space, and as a founder of Oxford Semantic Technologies?
“I had a slightly non-standard career for an academic. I did a Bachelor degree at the University of Manchester in computer science. I then worked there as a researcher for a few years in desktop publishing, which then got commercialised and I went off in a startup. I worked with the startup for several years before deciding that I wanted to do a PhD. I wanted to really get into some deeper research where I could spend a bit longer thinking about things.
Friends of mine from Manchester by this point were academics working on medical infomatics. The medical domain had been building organised vocabularies for a long time; prior to computer systems you can find medical texts where knowledge is categorised. The group at Manchester were building software systems to support this development of structured vocabularies. Nowadays we call them ontologies and knowledge graphs, but then those terms didn’t exist and they weren’t really thinking about semantics. It all sounded quite interesting, although I knew little or nothing about it at that point.
When I was working on my PhD I had the space to think about the underlying problems that we were trying to solve. I identified logic and description logic as being an appropriate underlying semantics. So I developed different algorithms that were based on logical reasoning rather than ad hoc processing and built a logic-based reasoning system, FaCT: ‘Fast Classification of Terminologies’. That’s how I got interested in logic-based semantics, ontologies and knowledge graph reasoning systems.
As Bernardo mentioned in his interview on SWRL, you have spent a lot of time working on languages. Where did OWL come into the picture?
“Around the time of FaCT, various groups were looking to design/standardise an ontology language for use on the web. FaCT demonstrated that it was possible to design a language with rigorous logic-based semantics and at the same time provide practical reasoning services. The logic supported by FaCT was therefore taken as the formal basis for OWL, and the rest is history.”
In 2007, Ian moved to the University of Oxford with Boris and Bernardo, where they began working on RDFox, a high-performance knowledge graph and semantic reasoning engine. The term knowledge graph gets thrown around, but those commercialised vary greatly in performance, capabilities and structure.
So Ian, what do you love most about RDFox?
“I think that the raison d’être is query answering over very large knowledge graphs. RDFox is this amalgam of formal rigour and performance.You don’t have to trust the system on the basis that ‘I tested it quite a lot and it never went wrong (so far)’; the algorithms used by RDFox have been formally proven to compute correct answers regardless of the query and regardless of the knowledge graph that you throw at it. On the other hand, a formal guarantee of correctness isn’t very useful if the performance is bad. Fortunately, RDFox also benefits from Boris’s design insights and brilliant software engineering, which allows it to be correct while at the same time incredibly fast — which is really hard to achieve!”
Considering there is a formal guarantee of correctness and impressive performance, where do you see RDFox and Oxford Semantic Technologies in 5 years time?
“Oh well, we will have a lovely office in the Bay Area, a hugely successful business!”
Are there any use cases of RDFox and knowledge graphs more broadly, which you love?
“I really like the configuration use cases. They combine a lot of the different features that we support: they often use large and complex vocabularies, for example parts catalogues, and they also require complex rules capturing component interactions, aggregated values, design constraints, etc. I think that this is a very convincing application of RDFox.”
Many people without a computer science background might not understand why or when they would use knowledge graphs and how these differ from relational databases. Do you think there should be a shift to graph databases from relational?
“I think it is situational, and depends on how pressing the need is for the extra things that graphs give you. Graphs have a bunch of advantages but there’s inertia to switch. Graph is probably how relational databases should have been designed, but the technology wasn’t there. Graphs are more intuitive methods for storing and querying data. Rather than matching strings, everything is already connected. Graph is superior but requires fancy query processing technology to make it all work.
The graph name itself could be confusing for a lot of people who will think of a graph as an X and Y axis, whereas in this context it is a computer science term for a network. They are circle and arrow diagrams basically, which are a much more natural way of capturing data and knowledge than a relational database. To try and explain how the tables within relational databases join and how queries work is much harder and more complicated than explaining how graphs work, as they are more intuitive.”
What do you think are the biggest challenges for semantic reasoning which are yet to be solved?
“I guess scalability in the sense of going beyond a single machine, using cluster technology or offline storage. At the moment performance is limited across all systems which implement one of these methods. There is really no system that can bridge that gap effectively at the moment. We are working on it and from an academic perspective this poses a great challenge to ensuring a method for distribution which maintains the correctness and speed.”
Do you have any tips for businesses who are interested in using knowledge graphs?
“Do it now!!
It’s easier and more intuitive than you think and the move from relational programming shouldn’t be too grating. The SPARQL query language is not so different from SQL, so if you are familiar with SQL then the move shouldn’t be too hard.”
Thank you Ian for spending time in our hot seat, and providing us with an insight into your journey and thoughts on semantic technologies. The next article in the series will place the spotlight on Professor Boris Motik and reveal his passion for software engineering and all things RDFox.
If you are interested in Professor Ian Horrocks work on Semantic Reasoning, Description Logics, Ontologies and Rule Languages, here are some articles:
Learn more about RDFox here, or request a free trial to get hands on!
The team behind Oxford Semantic Technologies started working on RDFox in 2011 at the Computer Science Department of the University of Oxford with the conviction that flexible and high-performance reasoning was a possibility for data-intensive applications without jeopardising the correctness of the results. RDFox is the first market-ready knowledge graph designed from the ground up with reasoning in mind. Oxford Semantic Technologies is a spin-out of the University of Oxford and is backed by leading investors including Samsung Venture Investment Corporation (SVIC), Oxford Sciences Enterprises (OSE) and Oxford University Innovation (OUI).