As Wikepdia says :
Data profiling is the process of examining the data available in an existing data source (e.g. a database or a file) and collecting statistics and information about that data. The purpose of these statistics may be to:
Data profiling is the process of examining the data available in an existing data source (e.g. a database or a file) and collecting statistics and information about that data. The purpose of these statistics may be to:
- find out whether existing data can easily be used for other purposes
- Give metrics on data quality including whether the data conforms to company standards
- Assess the risk involved in integrating data for new applications, including the challenges of joins
- Track data quality
- Assess whether metadata accurately describes the actual values in the source database
- Understanding data challenges early in any data intensive project, so that late project surprises are avoided. Finding data problems late in the project can incur time delays and project cost overruns.
- Have an enterprise view of all data, for uses such as Master Data Management where key data is needed, or Data governance for improving data quality
Now this is all we know. The real question though is - How to get the profiling done correctly? What kind of companies or consultants available in the market? What kind of services they provide? Are they in my budget ? What is my ROI for doing this?
So We really need answer on :
Why - should I do it?
Who - will provide me this services
What - are my options, cost and ROI of doing this?
How - this needs to be done and how much is the time.
Once this is done what will be the next thing? What will I get ?
Watch this space next monday to have all your questions answered here.
Comments
Post a Comment