
What is Data profiling?

As Wikipedia says:
Data profiling is the process of examining the data available in an existing data source (e.g. a database or a file) and collecting statistics and information about that data. The purpose of these statistics may be to:
  1. Find out whether existing data can easily be used for other purposes
  2. Give metrics on data quality including whether the data conforms to company standards
  3. Assess the risk involved in integrating data for new applications, including the challenges of joins
  4. Track data quality
  5. Assess whether metadata accurately describes the actual values in the source database
  6. Understand data challenges early in any data-intensive project, so that late project surprises are avoided. Finding data problems late in the project can cause delays and cost overruns.
  7. Have an enterprise view of all data, for uses such as Master Data Management where key data is needed, or Data governance for improving data quality
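To make "collecting statistics and information about that data" concrete, here is a minimal sketch of profiling a single column in Python. The function name `profile_column`, the sample values, and the rule that `None` and blank strings count as missing are all assumptions made for illustration; a real profiling tool would collect far more than this.

```python
from collections import Counter


def profile_column(values):
    """Collect a few basic statistics for one column of raw values."""
    total = len(values)
    # Assumption for this sketch: None and blank strings count as missing.
    present = [v for v in values if v is not None and str(v).strip() != ""]
    counts = Counter(present)
    return {
        "rows": total,
        "missing": total - len(present),
        "distinct": len(counts),
        # Most frequent value and its count, or None for an empty column.
        "most_common": counts.most_common(1)[0] if counts else None,
        "min": min(present) if present else None,
        "max": max(present) if present else None,
    }


# Hypothetical sample: a country-code column with inconsistent casing and gaps.
countries = ["US", "us", "IN", "", None, "US"]
profile = profile_column(countries)
print(profile)
```

Even this small profile surfaces typical quality issues: two of the six rows are missing, and "US" and "us" are counted as different values, which hints at a standardization problem.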

That much is the textbook definition. The real questions are practical: How do you get the profiling done correctly? What kinds of companies or consultants are available in the market? What services do they provide? Do they fit my budget? What is my ROI for doing this?

So we really need answers on:

Why - should I do it?

Who - will provide these services?

What - are my options, costs and ROI for doing this?

How - should this be done, and how long will it take?

Once this is done, what comes next? What will I get?

Watch this space next Monday to have all your questions answered here.
