Master Data Management Tips: When is Different Not Different?
Consider the following two addresses. There are many differences between the two records, but none represents a true conflict in the business information. A key to successful MDM is to be able to distinguish between conflicts, omissions and standards.
Many legacy systems literally shout I’M OLD MAINFRAME DATA in uppercase only. Modern systems prefer the more pleasing Title Case. Case differences don’t represent a difference in business data so we treat fields like Street, City and State as identical values.
The Name field is clearly different but means the same thing. The mainframe system holds a contracted version of the university name to squeeze this name into 30 bytes. This contraction is typical of mainframe data where each byte was valuable. If we want to compare the source record to the Hub we need first to transform the mainframe data by expanding contractions. The expanded Name1 data can be considered equivalent to the hub data but not identical.
Street, City and State
Street and city are identical between the two systems (ignoring case). The equivalent Name and the identical Street and City data are enough for us to consider these two records to be good match pairs.
It is not uncommon for older systems to limit Zip codes to 5 digits. The “plus-four” digits were only introduced in 1983 and have never been considered mandatory for address validation. In the MDM process, an address should be validated against the USPS database to validate the address components including the Zip+4 code. While MDM survivorship should prefer the CRM Zip+4 address for the golden record, this does not mean that the Legacy system is wrong.
Data does not need to be identical to be the same. Ensure that your MDM strategy is tolerant of differences in representing business data to avoid sending needless changes to connected systems.
Opt-in with Gaine for More Insight
Keep ahead of the rest with critical insight into Healthcare and Life Sciences MDM and interoperability technique, best practices, and the latest solutions.