Tuesday, March 2, 2010

The ABS Data Quality Framework

One of the first things to take into account when starting on your long road to a governed information quality environment is your Data Quality Framework (DQF). With the ABS being the foremost central data and statistical organisation in the country, most of the frameworks and procedures they adopt tend to turn into a national standard. The DQF is no exception, with a plethora of Australian government organisations adopting this as their own.

It is essential to learn how to adapt this framework to your organisation. Initially I had a few gripes with a how to apply a missing dimension, however depending on the context, just about everything can be applied through sub categories of this framework. The seven dimensions of the ABS framework include Institutional Environment, Accuracy, Relevance, Timeliness, Accessibility, Coherence, and Interpretability. In ensuring your organisation has a common view of each dimensions importance to them, you must build a hierarchy. Traditionally Institutional Environment, Accuracy, and Timeliness are probably the most essential elements to what most people would traditionally associate with 'quality'. This is something to keep in mind.

Frameworks, Strategies, Policies… it seems its easy to get lost in all this documented beaurocracy. No matter how you approach your overall data quality initiative there are a few other essential accompaniments to your framework. At the forefront would be Data Quality Statements or Declarations, a Data Quality Assessment Framework or method, Roles & Definitions, and a Data Quality Maturity model. Sometimes the maturity model can be more associated with a Data Governance Framework, however DQ is a cornerstone in the Data Governance Framework - so it is a case of "Which came first, the chicken or the egg?". Essentially these all make up an overarching strategy, and I'll briefly outline a bit more about each of these components to a good Data Quality Strategy:

Data Quality Statements or Declarations:
Essentially these are a document to accompany any submission or delivery of a dataset internally or externally. They can be outlined by DQ dimensions and include a set of contextual questions to accompany each dimension. This is essential for someone to make a risk assessment around their data need. Does this data cover the right timeframe they need to research? What were the accuracy issues that were encountered during this dataset and how will it effect the outcome of the data need? All sorts of imperative questions can be answered by these quality statements prior to delving into the data and seeing what will arise.

Data Quality Assessment Framework:
This is a framework for how you would like people to approach the assessment of data collections or outputs in their area. The format and questions usually correlate pretty closely with your Data Quality Statements, and your output essentially feeds in to that document. This is where you will uncover all the nitty gritty to allow a dataset user to assess the fit for purpose for their data need.

Roles & Definitions:
There are a thousand different definitions around on the internet and in varying organisations. Essentially what you need to do here is build a common language for roles and responsibilities, as well as any data quality terminology and jargon. It's all about getting everyone onto the same page and promoting a culture right? Part of a culture is language, so you will need to determine how the language will suit your current organisational terminology.

Data Quality Maturity Model:
This part is easily confused. A lot of Executives will want to know what the data quality of a report is like, or how trustworthy it is. They want you to put it into some simple terms… what gets more simple than a number rating of 1-5? Essentially this is what a Data Quality Maturity Model boils down to. It is not a perfect representation of the quality of the data, however the score is associated with how governed and processed the data is. You will have to tailor this to your organisation and their processes. An example range would be level 1 would be data that is straight from the source system with only local processes applied. The range would then extend up to the highest level, where 5 represents the data being governed, modelled, entered in the data dictionary, validated, cleansed, warehoused, and all the other data quality processes of your organisation have been applied. You may feel that a rating of 1-5 is too broad and you need to further define it. This is where tailoring to your organisation is imperative and you must make a process that will work for you. This is also a vital component to building quality through Data Governance.

The ABS are also very good at supplying extra information and helping you apply the framework to your organisation. Their services include many different classes and workshops, as well as Information Management Strategy consultants to help your organisation build and implement a strategy that will suit you. Ultimately not one size will always fit all, and there will be more information on Data Quality Strategy and Framework to come in due course….



For more information on the ABS Data Quality Framework click HERE.


No comments:

Post a Comment