Processing Complex Logical Information or Structured Knowledge
Saturday, April 22, 2017 at 06:28AM
Charlie in Becoming an XBRL Master Craftsman

(Please consider this a draft at the moment, a work-in-progress. I am trying to get this 100% precise.  That is doable, but this is a painstaking task because it is so detailed and there are parts that I am pulling together and learning as I write this information.)

Artificial intelligence is coming perhaps sooner than you might realize. For example, the article The Use of AI in Banking is Set to Explode says that 32% of all banks use some sort of predictive technology. This Wired article starts off,

"IT'S HARD TO think of a single technology that will shape our world more in the next 50 years than artificial intelligence."

And this article, Top 5 Jobs Robots Will Take First, points out that accountants are high on the list of those who will be impacted by AI.  My personal view is that the best way to protect your job is to learn as much as possible about AI and how it works.

A financial report is complex logical information.

Before XBRL, a financial report was unstructured information and therefore the only way you could interact with that financial report was to have a human who understands financial reports read the financial report and pull information out.  Or, possibly, you would write a computer algorithm that would parse the unstructured text to try to glean information from the financial report.

With XBRL, information reported in a financial report is structured and can still be read by humans using renderings generated from the structured information; but the information can also be read by machine-based processes directly.  (See the video How XBRL Works for more information about the difference between structured and unstructured information.)
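To make that contrast concrete, here is a minimal Python sketch. The sample sentence, the regular expression, and the dictionary field names (concept, period, unit, value) are illustrative assumptions on my part; they are not actual XBRL syntax or element names from any taxonomy.

```python
import re

# Unstructured: the value is buried in prose, so software has to guess at it.
narrative = "Total assets at December 31, 2016 were $3,500 thousand."

def extract_total_assets(text):
    """Fragile text parsing: breaks as soon as the wording changes."""
    match = re.search(r"Total assets .*? were \$([\d,]+)", text)
    return int(match.group(1).replace(",", "")) if match else None

# Structured: each fact carries explicit, machine-readable context.
# (Hypothetical field names, not real XBRL.)
facts = [
    {"concept": "Assets", "period": "2016-12-31", "unit": "USD-thousands", "value": 3500},
    {"concept": "Liabilities", "period": "2016-12-31", "unit": "USD-thousands", "value": 1500},
]

def get_fact(facts, concept, period):
    """Direct lookup: no guessing about wording or layout."""
    for fact in facts:
        if fact["concept"] == concept and fact["period"] == period:
            return fact["value"]
    return None

parsed_value = extract_total_assets(narrative)
structured_value = get_fact(facts, "Assets", "2016-12-31")
# The same information, worded differently, defeats the text parser.
reworded_value = extract_total_assets("Assets totaled $3,500 thousand at year end.")
```

The text parser only works while the wording matches the pattern it was written for, whereas the structured lookup depends only on the agreed concept and period.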

So how do you process complex logical information, or structured knowledge, such as the information found in an XBRL-based intelligent digital financial report?  How do those creating such reports create them correctly, so that the reports mean what the creator intended and the correct information is conveyed to users of the financial report?  How do analysts, investors, regulators and others know that the report they are using has been created correctly?  How do standard setters such as the FASB know that they created the US GAAP XBRL Taxonomy correctly to enable the creator and users to interact with harmony, minimizing dissonance?

Well, people (meta-engineers) like Benjamin Grosof, Ian Horrocks, John Sowa and others have been working to solve that problem for 25+ years.  Today if you try to look for an answer to the question, "How do you process complex logical information?" the answer exists, but that answer looks like a bit of a convoluted mess if you don't understand what has been going on over the past 25 years.  But it is not really a mess.  There are different opinions because there are different "camps", and there are different camps because there are different needs, different target audiences, and different approaches that have been taken to solve the same problem.  There is a solution and I will get to that.

But the problem itself, processing complex logical information, has its roots in artificial intelligence. That is one of the problems the artificial intelligence community had to solve in order to get artificial intelligence to work. Does artificial intelligence work?  Well, the article The Use of AI in Banking is Set to Explode seems to think so. There are other clues that it is working.

So how did they make it work?  Who made it work?  To process complex logical information you had to represent that complex logical information in a form that a computer could read, understand, and effectively work with.  Three general approaches were used to solve that problem: ontologies, business rules, and schemas.

One more approach to representing complex knowledge is worth mentioning. Description logic is another method of representing complex knowledge, but in the past description logic was not machine-readable.  That is changing.  OWL 2 DL has a description logic style syntax and supports SROIQ description logic as I understand it.

There are two approaches to specifying knowledge: axiom based (used by ontologies) and model based (used by business rules and schemas).

So who got it right? Is an ontology, business rules, or a schema the best way to represent complex logical information?  Is it better to use axioms or a model to represent knowledge?  There has to be only one best way, right?  Well...no.

In an article, The Semantic Web and the Business Rules Approach ~ Differences and Consequences, Silvie Spreeuwenberg answers that question in this way:

"Fundamentalism for one position undermines collaboration between the two communities."

I agree with Ms. Spreeuwenberg.  Too many people tend to exist in silos and believe everything in their silo is right and every other silo is wrong.  Another way to say this is, "If the only tool you have is a hammer, then everything looks like a nail." 

Some people, like the meta-engineers I mentioned, crossed the silos.

In another blog post I pointed to a paper by John Sowa that explained the problems caused by fads, trends, misinformation, politics, arbitrary preferences, and competing standards.

There is something that is common to all three knowledge representation approaches: ontology, business rules, and schema.  That common thing is logic. As I mentioned in the blog post Describing Systems Formally, Aristotle created logic in about 350 B.C. Logic is a discipline of philosophy. Logic has been around a long time and is useful for many things.

Logic is the study of the principles of correct reasoning. Formal logic helps identify patterns of good reasoning and patterns of bad reasoning. These logic systems can be used to describe how things work so you can understand if they are working as expected.

Another definition of logic from the Book of Proof is as follows.  Logic is a systematic way of thinking that allows us to deduce new information from old information and to parse the meanings of sentences. There are different definitions of logic and you could likely have interesting philosophical and theoretical debates about logic.  Or, you can use logic as a useful tool.

Business professionals use logic and reasoning in everyday life.  Logic and reasoning are not hard to understand; in fact, humans tend to have an innate understanding of logic and reasoning.  Some people tend to use logic and reasoning more than others but that is a different story.

Business professionals care about correct reasoning.  As I explained in the 15 XBRL-based Digital Financial Report Principles, fundamentally a financial report is a system and that system needs to work.  There needs to be harmony between all the stakeholders that play a role in the process of working with financial reports: standards setters, report creators, data aggregators, analysts, regulators. Logic can serve as a communications tool that helps maximize harmony.

Each of these stakeholders tends to have a natural understanding of logic. People who have no formal training in logic still tend to understand logic.  Sure, perhaps a bit of additional training in logic would help business professionals work with complex logical information even better.

OK, so let us assume that you buy the argument that logic is a good tool to describe complex logical information.  Let us assume we want to use logic. Which logic is the best logic to use for the task?  There are lots of different logic systems. 

In his presentation, Survey of Knowledge Representations for Rules and Ontologies, Benjamin Grosof answers that question.  That presentation is very technical.  I have tried to distill the essence of what Mr. Grosof is saying into this explanation below that should be both accurate and understandable by business professionals.

Here is an overview of some logical systems.  What I am concerned with is picking the logical system that can be used by a software application so that the software provides results that are reliable, predictable, repeatable and otherwise safe to use (i.e., software that blows up on us or is not reliable is not very useful to business professionals).

The following is my best attempt to describe a deductive system using terms which a business professional might be familiar with.  

A properly functioning deductive system must be sound, complete, and effective.  A fundamental principle of logic is that a fact (or declarative statement or proposition) can be a logical consequence of one or more other facts.  A deductive system is sound if any fact that can be derived in the system is a logically valid fact. A deductive system is complete if every logically valid fact is derivable.  A deductive system is effective if it is possible to effectively verify that a purportedly valid deduction is actually a valid deduction.
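The following is a minimal Python sketch of what sound and complete mean for a very small deductive system. Everything in it is an assumption made for illustration: the fact names, the two rules, and the choice of simple if-then (Horn) rules with forward chaining. The brute-force truth table check is only workable for a toy example like this one; it is not how a production reasoner establishes these properties.

```python
from itertools import product

# Hypothetical knowledge base: rules of the form (premises, conclusion).
ATOMS = ["assets_reported", "liabilities_reported", "equity_reported",
         "balance_sheet_complete", "audited"]
FACTS = {"assets_reported", "liabilities_reported"}
RULES = [
    ({"assets_reported", "liabilities_reported"}, "equity_reported"),
    ({"assets_reported", "liabilities_reported", "equity_reported"},
     "balance_sheet_complete"),
]

def derive(facts, rules):
    """Forward chaining: apply rules until nothing new can be derived."""
    derived = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if premises <= derived and conclusion not in derived:
                derived.add(conclusion)
                changed = True
    return derived

def entailed(facts, rules, atoms):
    """Logically valid atoms: true in every truth assignment that
    satisfies all of the facts and all of the rules."""
    models = []
    for values in product([False, True], repeat=len(atoms)):
        world = dict(zip(atoms, values))
        facts_hold = all(world[f] for f in facts)
        rules_hold = all(world[c] or not all(world[p] for p in ps)
                         for ps, c in rules)
        if facts_hold and rules_hold:
            models.append(world)
    return {a for a in atoms if all(m[a] for m in models)}

derived = derive(FACTS, RULES)
valid = entailed(FACTS, RULES, ATOMS)
is_sound = derived <= valid      # everything derived is logically valid
is_complete = valid <= derived   # everything logically valid is derived
```

In this sketch "audited" is neither derived nor logically valid, which is the expected outcome because nothing in the knowledge base supports it. Each step taken by derive can also be checked mechanically against the rules, which is the sense in which such a system is effective.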

A deductive system can be extended.  While a fact might not be directly derivable, it may still be defensible. Probabilistic reasoning or non-monotonic logics provide the feature whereby non-derivable but defensible inferences can be made.  It is crucial that derived knowledge and non-derived but defensible knowledge be distinguishable.
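One way to picture the distinction described above is to label every conclusion with how it was reached. The Python sketch below is only an illustration; the fact names and both rules are hypothetical and are not taken from any taxonomy or rule language.

```python
def conclusions(facts):
    """Return {statement: status}, where status is 'derived' or 'defensible'."""
    results = {}
    # Strict rule (hypothetical): a filer that reports revenue is an operating entity.
    if "reports_revenue" in facts:
        results["is_operating_entity"] = "derived"
    # Default rule (hypothetical): assume a calendar fiscal year unless the
    # report states its fiscal year end explicitly.
    if "fiscal_year_end_stated" not in facts:
        results["fiscal_year_ends_dec_31"] = "defensible"
    return results

before = conclusions({"reports_revenue"})
after = conclusions({"reports_revenue", "fiscal_year_end_stated"})
```

Adding the fact "fiscal_year_end_stated" withdraws the defensible conclusion while leaving the derived conclusion untouched. That withdrawal is the non-monotonic behavior, and the status label is what keeps the two kinds of knowledge distinguishable.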

Systems are not homogeneous; they tend to be heterogeneous even within one organization.  As such, being able to use ontology-based, rule-based, and/or schema-based approaches to describe complex logical information has advantages.  But to exchange information between these different systems, the systems must agree on a logic.

The following graphic shows somewhat of a hierarchy of logics. Business professionals need to be conscious of the capabilities of the problem solving logic they are using, the expressive power of the problem solving logic, and the propensity of the problem solving logic to "blow up" or have some sort of catastrophic failure (i.e. not be safe to use).  If this information is laid out appropriately then business professionals can make good choices.


There are two logics that I left off the graphic above that are very important. I did not add them because I did not want to clutter that fairly straightforward graphic and because I do not know their relative problem solving power.  The two logics are ISO/IEC Common Logic and OMG Semantics of Business Vocabulary and Business Rules (SBVR). Common Logic and SBVR are logically equivalent as I understand it.  As I understand it, they are a subset of RuleLog.  I would also like to understand where OWL 2 DL fits into that graphic.  All things considered, the safest and most expressive problem solving logic is RuleLog.

The graphic below shows a complete knowledge based system. To process complex logical information you do need some sort of problem solving logic, but it does not matter whether you use an ontology, business rules, or a schema as the delivery mechanism for that logic.  You also need to be conscious of the expressive power of the problem solving logic.


But it is important that your system is provably sound, complete, and effective. Your system needs to work.  Your complex logical information needs to be correct.  Business professionals need to know that information is correct and that systems are reliable, operate predictably, and are otherwise safe.

A question that you might have is, "What syntax should you use to represent the logic you select?" I will answer that question in a blog post in the very near future.

Article originally appeared on XBRL-based structured digital financial reporting (http://xbrl.squarespace.com/).
See website for complete article licensing information.