Last week I wrote a post on the Raw Data Vault that got some good insightful comments. This post is a joint effort of me and Dan Linstedt regarding this subject.
In his book – published this week - Dan mentions a Raw Data Vault as well. We have discussed this and came to the conclusion that the Raw Data Vault as mentioned by Dan in his book is in fact the actual DV (integrated on the HUB’s, using business keys). He used the term ‘Raw’ to distinguish from the Business Data Vault.
Let us be clear; the Raw Data Vault as described in my blog post “Data Vault Schools” is not the same Raw Data Vault as described in Dan’s book. In fact it’s a fundamental difference with regard to DV methodology as Dan intented it. This is in line with my blog post "the case against the Raw Data Vault".
We both agree that there is no way to generate everything, because identification of the business keys has to happen. We do however acknowledge the possibility that, if you can specify the business keys, there are options to generate the model.
Ronald Damhof & Dan Linstedt
The first two comments on this post are valid and we felt to be more precise on the terminology.
1) Raw Data Vault = A term that should no longer be used in DV methodology. If it is used in formal writings, communications, blogs or whatever, then it resembles a Data Vault (integrated on the HUB’s, using business keys etc..) as defined by Dan Linstedt.
I will also update my other 2 posts to reflect this terminology.