It’s only a problem if someone notices? Why Data Teams need to recalibrate
You miss 100% of the shots you don’t take
Introduction
There are two things I observe consistently in organisations large and small:
Data Teams are viewed as a cost-centre, and are almost always included in budget cuts
People working in Data Teams often lament the inability of the rest of the organisation to be data-driven
Combined with the clear desire from executives to be “data-driven”, it seems odd that organisations and data teams are consistently getting it so wrong.
In this article, I want to offer an explanation for this that draws on the parallels (or lack thereof) between software engineering. Software engineering standards are extremely high. First off, there is a bias towards releasing code to production to quickly garner customer feedback.
This does not mean releasing broken or buggy code, however. Indeed, software engineers rely on a huge variety of methodologies like dev tooling, CI/CD and environments to catch bugs while shipping fast. Devs spend between 25–50% of their time writing tests.
By contrast, Data Practitioners struggle relatively more with CI/CD. Dev, Staging and Production environments are not **as** standard. Unit testing is done in a variety of ways, but there is no common practice. The amount of time spent writing tests is generally a lot lower than 25–50%.
If the level of testing in both fields is optimal, then it must follow that the acceptable level of bugginess or flakiness in the two fields is different if the level of testing in both fields is different.
If not, and the generally-acceptable level of testing in software is optimal, then it must follow that data teams in general do not test egenerally acceptable nough.
Let’s dive in.
The motivation for testing in Software Engineering
Testing in Software Engineering has a clear and tangible benefit.
We can consider what would happen without testing. In the most simple cases, a lack of testing causes bugs to enter software. This adversely impacts end users of it. This diminished experience means customers my not find the software fit for purpose, and may churn. This costs the software company money.
In other examples, where automation is key, without testing software can have very real and hard-hitting effects. Self-driving cars, and software that can detect cancer are obvious examples.
The motivation for testing in Data Engineering
Taken at a high-level, we could define the aim of Data Engineering generally as “to improve decision-making, automated or otherwise”.
The one key difference is that often, there is no transaction at the end of the process; the end user of a data engineering pipeline is someone internal to an organisation vs. external. There are many and frequent exceptions to this, like businesses where data is surfaced back to the end consumer (see embedded BI) or Hedge Funds — let’s restrict the discussion to Internal Business Intelligence (BI) use-cases.
You could argue that because internal decisions are less important than external ones, the quality of the data or data engineering processes can be lower. This would be a weak argument.
Decisions require trust
Generally humans exhibit varying degrees of efficacy at making decisions on data they believe to be partially correct (c.f. Bounded Rationality). BI is one such example. “Can you please check this number?” becomes the bane of many data teams’ existence, often because a single missing or odd-looking data point is enough to cast the entire dataset into doubt.
Indeed, the goal of software fundamentally is to reassure humans that decision-making can be made automatically, or to present said users with data that helps them more optimally make decisions.
At a human-level, this is exactly the same as Business Intelligence.
Therefore, the data quality / SLA requirements for a human to reliably and consistently action decisions using data in BI is actually just as high as that for commercially produced software.
This has nothing to do with the fact that transactions may or may not be present, but that the fundamental behaviour the end product is trying to instill in the user is the same with data engineering as it is with software.
It matters not to the user if they are looking at a Lead-Generation Tool or a Looker Dashboard with some data in it — their goal in both cases is the same; irrespective of if the Lead-Generation Tool has been purchased by the company for the user or if a data engineer has been hired to build a dashboard.
It appears likely that because both data and software engineering aim to create the same behaviour in users in similar situations, that the standards required to reliably produce it in both cases is similar. Why is this the case?
Cognitive Dissonance in Data Engineering
I expect many of us Data Engineers feel this all the time. There has to be something odd about having to set KPIs you feel don’t really have any bearing on end-user experience.
For example, a data team might have a KPI for their uptime. If a pipeline fails for one hour in an entire month, the success rate for that pipeline is over 99%. However, if that moment is when a stakeholder is looking at it, the Data Team is 0 from 1 (0%). The result will be Cognitive Dissonance, as the Data Team feels they have done a good job and grow to resent ad-hoc requests (“Why did they have to look at it right at that moment?”).
This Information Asymmetry is problematic for Data Teams since it means incentives are not aligned; if the Data Team’s KPI is uptime, this does not benefit end-users since uptime is not really what they care about (unlike in software engineering).
The blind desire to turn Data Teams into Software Engineering Teams in BI means this Information Asymmetry Persists. The DORA metrics do not really work for Data Teams. However the insistence by some that “Data Engineering is Software Engineering”, paradoxically, damages the efficacy of BI teams by forcing metrics upon them that do not better their end-users.
Consequence: Ineffective Data Teams serving minimal BI use-cases
The consequence of all this is that Data Teams do not meet the right standards for effectively serving BI use-cases.
This is because of incorrect targets but also due to very high data quality / reliability requirements from end stakeholders (which may not always be acknowledged).
This in turn results in a damaging view of the Data Team, who fail to meet the (binary) goal of enabling users to consistently and reliably make decisions using data. Of course, Data Teams may make progress towards their goals — a single team, or set of Power Users may become sufficiently comfortable with Data.
This may be acknowledged, that the Data Team are able in part to improve how people do things.
However for the most part, without a high “hit rate” this damages the reputation of the Data Team. The attitude of “It’s only a problem if someone notices” results in a lot of long-term pain for the Data Team.
Conclusion — Data Teams must hold themselves to higher standards
Data Engineering is a highly technical profession. Unfortunately, for those of us serving BI use-cases it is also a question of reputation and trust — things that cannot be engineered.
The degree to which these are required to effectively enable end-users to reliably and consistenly make decisions based on data is higher than we think.
This means the standard, quality, and SLA of Data Products that is acceptable is likely higher than we believe.
Given Data Practitioners, in general, spend significantly less time writing tests than their software counterparts, there are strong arguments that we should spend more time gathering requirements, ensuring robust infrastructure for logging, monitoring etc. and writing unit / similar tests.
The “Gold-Rush Paradox” in Data: Why Your KPIs Need a Rethink
You’re not doing as good a job as you think you are
As I write in the “Gold Rush paradox”, Data Teams also need to better align their KPIs with end stakeholders of the business.
It is becoming apparent to many Data Teams that rather than improving averages, minimising risk and maximising the number of “wins” while keeping losses to virtually zero is the way to run things really well. I am excited to see how these ideas develop and how Data Teams start holding themselves to even higher standards to drive Data and AI Adoption